Introduction

Robustly seen in paleo proxy records1,2, historical observations3,4,5, and model simulations6,7,8,9, the temperature response to climate change is amplified in the Arctic relative to the rest of Earth. This so-called Arctic Amplification (AA) not only affects the Arctic cryosphere and ecosystem but also influences global circulation and climate by modulating the meridional temperature gradient10,11,12,13,14,15,16. However, the degree of AA, commonly defined as the ratio in surface temperature changes between the Arctic and global mean, varies widely by a factor of two among climate models17. This large uncertainty is not unexpected, considering the complex processes and feedbacks contributing to AA17,18,19. Locally, surface warming in the Arctic is amplified by the positive albedo feedback due to sea ice loss20,21, the positive lapse-rate feedback due to surface-trapped warming18,22, and the less negative Planck feedback due to colder climate18. Remotely, the increase in meridional atmospheric heat transport (AHT), implied by enhanced meridional moisture gradient from global-scale warming, must be constrained by amplified Arctic warming so that it does not exceed the demand from the meridional gradient in radiative forcing23,24,25,26,27,28. Furthermore, the above effects are not independently additive but can interact with each other29,30,31.

The relation between AA and its contributing factors has been studied through diagnostic and modelling methods. For example, the contributions of individual factors to Arctic and global warming have been diagnosed based on energy budget equations32,33,34,35,36. The dependences of AA on regional feedbacks, ocean heat flux, and radiative forcing have been investigated using both the diffusive energy balance model (EBM)37,38,39,40 and comprehensive climate models41. For AA in a diffusive EBM with uniform feedbacks, analytic estimates have been proposed39. Despite these achievements, our understanding of how the degree of AA is determined remains limited as several outstanding questions remain: What are the essential physics that determine the degree of AA? What is the functional relationship between the degree of AA and the key physical parameters? How to understand the range of AA (i.e., why it is 1.5–3.5 instead of 10)? What causes the variation in the degree of AA among models?

Here, we answer the above questions by establishing an analytic theory for the degree of AA. Specifically, we build a two-box energy-balance model of AA and reveal that the degree of AA is a simple nonlinear function of five key physical parameters. The theoretical function conveys a concise picture of how the degree of AA emerges from essential physics and explains the varying degree of AA in individual climate models. Furthermore, it clearly interprets the intricate role of AHT in AA and reveals the close compensation between the effects of the lapse-rate and water-vapor feedbacks on AA.

Results

The degree of AA as a simple nonlinear function of key physical parameters

We start with the energy balance equations for the Arctic and global mean under climate change,

$${Arctic}:{F}^{A}+\Delta {O}^{A}+\Delta {AHT}+{{\sum}_{i}}{\lambda }_{i}^{A}{\Delta T}^{A}=0,$$
(1)
$${Global}:{F}^{G}+\Delta {O}^{G}+{{\sum}_{i}}{\lambda }_{i}^{G}{\Delta T}^{G}=0,$$
(2)

where the superscripts A and G denote the Arctic and global mean respectively, F is the effective radiative forcing, ∆ denotes the response to climate change, T is the surface temperature, \({\lambda }_{i}\) represents individual feedback parameters (defined respectively with respect to the Arctic and global mean surface warming) from Planck, lapse rate, water vapor, albedo and clouds, ∆O refers to the change in oceanic heat flux, and ∆AHT represents the change in AHT into the Arctic.

The widely-used diagnostic method32,33,34,35,36 utilizes Eqs. (1, 2) to partition the total warming into contributions of individual factors as

$$\Delta {T}^{A}=\frac{{F}^{A}}{-{\lambda }_{P}^{G}}+\frac{\Delta {O}^{A}}{-{\lambda }_{P}^{G}}+\frac{\Delta {AHT}}{-{\lambda }_{P}^{G}}+\frac{({\lambda }_{{alb}}^{A}+{\lambda }_{{LR}}^{A}+{\lambda }_{{WV}}^{A}{+{\lambda }_{{cld}}^{A}+\lambda }_{P}^{{\prime} A}){\Delta T}^{A}}{-{\lambda }_{P}^{G}},$$
(3)
$$\Delta {T}^{G}=\frac{{F}^{G}}{-{\lambda }_{P}^{G}}+\frac{\Delta {O}^{G}}{-{\lambda }_{P}^{G}}+\frac{({\lambda }_{{alb}}^{G}+{\lambda }_{{LR}}^{G}+{\lambda }_{{WV}}^{G}+{\lambda }_{{cld}}^{G}){\Delta T}^{G}}{-{\lambda }_{P}^{G}},$$
(4)

where \({\lambda }_{P}^{G}\) is the global mean Planck feedback and \({\lambda }_{P}^{{\prime} A}={\lambda }_{P}^{A}-{\lambda }_{P}^{G}\). The feedback \({\lambda }_{i}\) is diagnosed as contributing to the total Arctic warming (∆TA) by \(-\frac{{\lambda }_{i}^{A}{\Delta T}^{A}}{{\lambda }_{P}^{G}}\) and to the total global warming (\({\Delta T}^{G}\)) by \(-\frac{{\lambda }_{i}^{G}{\Delta T}^{G}}{{\lambda }_{P}^{G}}\). Such partition is helpful for understanding the relative importance of individual factors. However, with \({\Delta T}^{A}\) and \({\Delta T}^{G}\) used as input, it is silent on why the degree of AA varies among models, and more generally, how the degree of AA varies with these physical factors. Furthermore, the attributed warming may not reflect the real role of a specific factor. For example, if a feedback has the same strength between the Arctic and global mean (\({\lambda }_{i}^{A}={\lambda }_{i}^{G}\)), it should not physically contribute to AA but would be diagnosed as contributing more warming to the Arctic relative to the globe given ΔTA > ΔTG. A theory, beyond the current diagnostic framework, is needed to better understand how the degree of AA is determined.

Here, we build a two-box energy-balance model of AA by deciphering the change in AHT and upon that derive an analytic formula between AA and its contributing factors (Fig. 1). The change in AHT, ∆AHT, is a unique factor involved in AA. It is neither a pure forcing (e.g., F) nor a pure feedback (e.g., \({\lambda }_{i}\)) to AA. Instead, it consists of two parts—a forcing-like part as global-scale warming enhances meridional moisture gradient (\(d{q}^{*}/{dT}\) is higher in warmer tropics) and increases AHT to amplify Arctic warming, and a negative feedback part as AA weakens meridional temperature gradient and reduces AHT. This motivates us to formulate ∆AHT as a function of global mean warming \({\Delta T}^{G}\) and enhanced Arctic warming \({\Delta T}^{A}-{\Delta T}^{G}\),

$$\Delta {AHT}\cong a{\Delta T}^{G}-b\left({\Delta T}^{A}-{\Delta T}^{G}\right),$$
(5)

and a reformulation of Eq. (5) gives

$$\frac{\Delta {AHT}}{{\Delta T}^{G}}\cong a-b\left({AA}-1\right),$$
(6)
Fig. 1: A two-box energy-balance model of Arctic Amplification (AA).
figure 1

The two boxes represent the global mean and the Arctic mean, respectively. The energy balance equations involve forcing (\(F\)), feedbacks (\({\sum}_{i}{\lambda }_{i}\Delta T\)), the change in ocean heat convergence and uptake (\(\Delta O\)), and the change in atmospheric heat transport into the Arctic (\(\Delta {AHT}\)). \(\Delta {AHT}\) can be formulated as a function of global mean warming (\({\Delta T}^{G}\)) and enhanced Arctic warming relative to the global mean (\({\Delta T}^{A}-{\Delta T}^{G}\)).

The above formula of ∆AHT is not only supported by the significant correlation between \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and AA − 1 across models (r = −0.70; Fig. 2a) but also derivable from the theoretical diffusive formulation of AHT (Methods). The intermodel regression between \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and AA − 1 estimates â ≅ 2.1 W m−2 K−1 and \(\hat{b}\cong\) 1.7 W m−2 K−1 (Fig. 2a). The diffusive theory derives a and b as functions of basic parameters of the climate systems and suggests consistent values (Methods). The hat symbol here denotes a general, constant estimate for the model ensemble instead of individual models. With constant estimates of \(\hat{a}\) and \(\hat{b}\), Eq. (5) reasonably captures ∆AHT projected by models (r = 0.71), explaining ~50% of the variance in ∆AHT (Fig. S1). The unexplained variance reflects the variations in the parameters \(a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}\) and \(b\equiv -\frac{\partial {AHT}}{\partial ({T}^{A}-{T}^{G})}\) among models, that is, the model-dependent sensitivities of AHT to global uniform warming and enhanced Arctic warming. We further estimate a and b for individual models according to the deviation of \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) from the regression line, i.e., \(\hat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}=\hat{a}-\hat{b}\left({AA}-1\right)\), so that Eq. (5) reproduces ∆AHT in individual models (Methods).

Fig. 2: Development of an analytic formula for the degree of Arctic Amplification (AA).
figure 2

a Scatterplot between AA−1 and the change in atmospheric heat transport into the Arctic normalized by the global mean warming (\(\Delta {AHT}/\Delta {T}^{G}\)) across models. b The global and Arctic feedback parameters (defined with respect to their respective surface warming) diagnosed using the radiative-kernel method. The crosses indicate the uncertainty range (±s.d.) across models and the dots indicate values in individual modes. The correlations between the global and Arctic feedbacks are denoted for the total and individual feedbacks. c. Scatterplot between the model-projected AA and the theoretical estimate Eq. (8) using constant \(\hat{a}\) and \(\hat{b}\). d Scatterplot between the model-projected AA and the theoretical estimate Eq. (8) using model-dependent \(a\) and \(b\). e The mean and uncertainty (±s.d.) of the model-projected AA, the theoretical estimate \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{b-{\lambda }^{A}}\) Eq. (8), the baseline AA \({I}_{B}^{{AA}}\equiv 1+\frac{a}{b-{\lambda }^{G}}\) Eq. (9), and the effects from differential forcing \((\gamma < 1)\), differential total feedback (\({\lambda }^{A} > {\lambda }^{G}\)) and differential individual feedbacks (\({\lambda }_{i}^{A}\) vs. \({\lambda }_{i}^{G}\)) between the global and the Arctic mean. Source data are provided as a Source Data file.

To facilitate derivation, we have also written the change in ocean heat flux, ∆O, as a feedback to the temperature change, similar to ref. 42,

$$\Delta O={\lambda }_{O}\Delta T,$$
(7)

where \({\lambda }_{O}\) is diagnosed as ∆O/∆T for the globe and the Arctic. While this is mainly a mathematical treatment, it can be physically justified as ∆O is approximately proportional to ∆T over the transient response to climate change (Fig. S2).

Equations (1, 2) and (57) form a two-box energy-balance model of AA, in which the Arctic and the globe experience their respective forcing and feedbacks and interact through AHT (see Fig. 1 for a schematic). This two-box model yields a theoretical solution for the degree of AA as,

$${AA}\equiv \frac{{\Delta T}^{A}}{{\Delta T}^{G}}=1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\equiv {I}^{{AA}},$$
(8)

where \(\gamma \equiv \frac{{F}^{A}}{{F}^{G}}\) is the ratio in radiative forcing between the Arctic and the globe, \({\lambda }^{A}\equiv \sum {\lambda }_{i}^{A}\) and \({\lambda }^{G}\equiv \sum {\lambda }_{i}^{G}\) are the sums of feedbacks (Planck, albedo, lapse rate, water vapor, cloud, and ocean) for the Arctic and the globe respectively, \(a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}\) measure the increasing rate of AHT with global uniform warming and \(b\equiv -\frac{\partial {AHT}}{\partial ({T}^{A}-{T}^{G})}\) measures the decreasing rate of AHT with enhanced Arctic warming. Physically, the numerator \(a+{\lambda }^{A}-\gamma {\lambda }^{G}\) measures the anomalous energy input into the Arctic relative to the globe because of the increasing AHT with global uniform warming and its less negative feedback, while the denominator \(b-{\lambda }^{A}\) measures the energy damping efficiency of the Arctic, reflected by the decreasing AHT with enhanced Arctic warming minus the Arctic feedback.

We proceed to validate the theory by applying it to capture the degree of AA in individual climate models (Methods). The responses to climate change (∆) are computed from the differences between the 1980−1995 period in the historical simulations and the 2085−2100 period under the Shared Socioeconomic Pathways SSP2-4.5 in 32 models from the Coupled Model Intercomparison Project Phase 6 (CMIP6)43. The feedback parameters, \({\lambda }_{i}\), are diagnosed using the radiative-kernel method44,45 for both the globe and the Arctic. As shown in Fig. 2b, the Planck, albedo, lapse-rate feedbacks are less negative or more positive in the Arctic than in the globe (above the 1:1 line) and would contribute positively to AA; in contrast, the cloud and water-vapor feedbacks contribute negatively to AA. It is difficult to accurately estimate the parameter γ for individual models, so we adopt a constant value of \(\hat{\gamma }=0.6\) based on the spatial pattern of the 2xCO2 radiative forcing46. The omission of a model-dependent γ may be justified as radiative forcing accounts for only a small portion of uncertainty in the absolute Arctic warming18,36. Using diagnosed model-dependent λA and λG but constant \(\hat{a}\), \(\hat{b}\) and \(\hat{\gamma }\), the theoretical formula reasonably captures the degree of AA in individual models, with \({I}^{{AA}}=1+\frac{\hat{a}+{\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{\hat{b}-{\lambda }^{A}}\) correlated with the model-projected AA at r = 0.71 across models (Fig. 2c). The accuracy is further improved by using the model-dependent a and b. \({I}^{{AA}}=1+\frac{a+{\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{b-{\lambda }^{A}}\) is correlated with the model-projected AA at r = 0.92 across models (Fig. 2d).

A concise picture of how the degree of AA emerges from essential physics

The theoretical formula, \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\), conveys a concise picture of how the degree of AA is mutually determined by essential physics. First, a baseline AA exists with even spatially uniform radiative forcing (γ = 1) and feedbacks (λA=λG). This baseline AA,

$${I}_{B}^{{AA}}=1+\frac{a}{b-{\lambda }^{G}},$$
(9)

reflects the fundamental role of AHT and is set by the ratio between the increasing rate of AHT with global uniform warming (a) and the decreasing rate of AHT with enhanced Arctic warming minus global climate feedback (\(b-{\lambda }^{G}\)). Its degree is estimated to be 1.67 ± 0.19 (±s.d.) (grey bars in Fig. 2e) based on the values of a, b, and λG. Then, the differential radiative forcing and feedbacks between globe and Arctic influences AA by

$${FF} \equiv {I}^{{AA}}-{I}_{B}^{{AA}} =\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}-\frac{a}{b-{\lambda }^{G}} \\ \, =\frac{a+{\lambda }^{A}-{\lambda }^{G}+\left(1-\gamma \right){\lambda }^{G}}{b-{\lambda }^{G}-\left({\lambda }^{A}-{\lambda }^{G}\right)}-\frac{a}{b-{\lambda }^{G}}.$$
(10)

Specifically, the weaker radiative forcing in the Arctic relative to the globe reduces AA (\(\gamma \equiv \frac{{F}^{A}}{{F}^{G}} < 1\) with \({\lambda }^{G} < 0\); cyan bar in Fig. 2e) but the less negative (or even positive) climate feedback in the Arctic amplifies AA (\({\lambda }^{A}-{\lambda }^{G} > 0\); golden bar in Fig. 2e). The effect of differential feedbacks dominates and the degree of AA is amplified from the baseline AA to 2.51 ± 0.49. We further quantify the contribution of individual feedback to the degree of AA by overriding its Arctic value with its global value in the formula (Methods). The Planck, albedo and ocean feedbacks increase the degree of AA respectively by 0.57 ± 0.03, 0.65 ± 0.28 and 0.35 ± 0.34, while the cloud feedback decreases AA by 0.49 ± 0.51.

The lapse-rate feedback, being negative in low latitudes with amplified upper-level warming but positive in the Arctic with surface-trapped warming, has been widely recognized as a major contributor to AA18. Furthermore, as lapse rate in the Arctic is affected by radiative forcing and AHT, the lapse-rate feedback presents dependency on other factors involved in AA29,30,31. Understanding the lapse-rate feedback and its intricacy seems to be a leading-order problem for understanding AA. On the other hand, it has been long recognized in the climate sensitivity research community that the lapse-rate feedback is compensated by the water-vapor feedback47. Here, we show that the contribution of the lapse-rate feedback to AA is fully, if not overly, compensated by that of the water-vapor feedback (Fig. 2e). As the surface-trapped warming simultaneously reduces the increase in the tropospheric water vapor (Fig. 3a, b), the more positive lapse-rate feedback in the Arctic is associated with a less positive water-vapor feedback, so the sum of these two feedbacks is nearly identical between the globe and the Arctic (Fig. 2b; Fig. 3c). Furthermore, models that simulate a larger tropospheric warming also present a larger tropospheric moistening (solid versus dashed line in Fig. 3a, b), so the lapse-rate and water-vapor feedbacks are negatively correlated across models for both the globe (r = −0.74) and the Arctic (r = -0.88), leading to small intermodel spread in their sums (Figs. 2b, 3c). Eventually, the sum of the lapse-rate and water-vapor feedbacks is 1.36 ± 0.11 W m−2 K−1 for the globe and 1.28 ± 0.07 W m−2 K−1 for the Arctic. Thus, the combined feedback contributes little to AA and accounts for little intermodel uncertainty in AA compared to other feedbacks (Fig. 2e).

Fig. 3: Compensation between effects of the lapse-rate and water-vapor feedbacks on Arctic Amplification (AA).
figure 3

a The vertical profile of the change in air temperature for the globe (red) and the Arctic (blue). b The vertical profile of the fractional change in specific humidity for the globe (red) and the Arctic (blue). In (a and b), the solid lines show the mean of the model ensemble (MME) and the dashed lines show the mean of five models with more negative lapse-rate feedback. c Intermodel scatterplot between the lapse-rate feedback (\({\lambda }_{{LP}}\)) and the water-vapor feedback (\({\lambda }_{{WV}}\)) in the globe (red) and the Arctic (blue). The mean and s.d. of \({\lambda }_{{LP}}\), \({\lambda }_{{WV}}\), \({\lambda }_{{LP}}+{\lambda }_{{WV}}\), and the intermodel correlation between \({\lambda }_{{LP}}\) and \({\lambda }_{{WV}}\) are listed for the globe and the Arctic. Source data are provided as a Source Data file.

The formula \({I}^{{AA}}=1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\) articulates the intricate role of AHT in AA. In literature, the effect of AHT on AA is often diagnosed from AHT (see Eq. (3)). AHT, however, consists of a negative feedback part that depends on the degree of AA itself. The formula here highlights the partial sensitivities of AHT to global uniform warming (\(a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}\)) and meridional warming gradient (\(b\equiv -\frac{\partial {AHT}}{\partial ({{T}^{A}-T}^{G})}\)), instead of AHT itself, as the fundamental parameters for understanding the role of AHT. Even without differential feedbacks, the physics of AHT leads to the baseline AA, \({I}_{B}^{{AA}}=1+\frac{a}{b-{\lambda }^{G}}\cong 1.67\), which forms a major part of the total AA. The skill of IAA in capturing AA is improved from r = 0.71 to r = 0.92 by using the model-dependent a and b, highlighting the important role of AHT in determining the degree of AA in individual models. The effect of AHT in individual models can be further understood from the following approximated formula (see derivation in Methods),

$${I}^{{AA}}=1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\cong 1+\frac{\hat{a}+d+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}-{\lambda }^{A}},$$
(11)

where \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) is the difference between the model-simulated\(\,\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and the prior-estimated \(\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}=\hat{a}-\hat{b}({AA}-1)\) according to \(\hat{a}\) and \(\hat{b}\) (i.e., the regression line in Fig. 2a). The effect of AHT is represented by \(\hat{a}\), \(\hat{b}\), and d. A more positive \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) implies a higher a and/or a lower b that would favor a higher AA. This further explains why directly using ∆AHT to interpret the effect of AHT may be misleading. For example, as shown in Fig. 2a, M12 (CanESM5) projects a negative AHT but has a positive d that would favor a larger AA.

The formula \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\) provides a physical explanation for the likely range of AA, specifically, why AA sits roughly between 1.5 and 3.5 in climate models. The partial sensitivities of AHT to global uniform warming (a) and enhanced Arctic warming (b) are rooted in basic parameters of the climate system (Methods) and estimated to be a = 2.1 ± 0.3 W m−2 K−1 and b = 1.7 ± 0.3 W m−2 K−1 in models. The global and Arctic climate feedbacks, as diagnosed from the radiative-kernel method, are λG = −1.7 ± 0.6 W m−2 K−1 and λA = −0.4 ± 0.6 W m−2 K−1. Furthermore, the diffusive theory derives that a/b should be around one given the climatological distribution of specific humidity (Eq. (27); Methods), and the positive correlation between λA and λG (Fig. 2b) constrains \({\lambda }^{A}-\gamma {\lambda }^{G}\) to be 0.6 ± 0.5 W m−2 K−1. Applying these ranges in IAA suggests that it is very hard for the degree of AA to be below 1.5 or above 3.5.

Understand the variation and outliers of AA among climate models

As the theory well captures the degree of AA in individual models (Fig. 2d), we further apply it to understand the causes of the variation and outliers of AA. In Fig. 4a, the theory-predicted AA, \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{b-{\lambda }^{A}}\), is decomposed into the sum of the baseline AA, \({I}_{B}^{{AA}}\) Eq. (9) and the effect of differential forcing and feedback between Arctic and globe, FF Eq. (10). Models are ranked according to IAA and the magnitude of FF is indicated by the cool-to-warm color. The outlier models, three with the lowest IAA (M7, M8, M21) and three with the highest IAA (M12, M19, M32), are denoted. The variation in IAA are contributed by both \({I}_{B}^{{AA}}\) (1.3−2.1) and FF (0−1.8), but the effect of FF is more dominating. To understand why the degree of AA varies among models, we proceed to understand the variations in \({I}_{B}^{{AA}}\) and FF.

Fig. 4: Attribution of the spread and outliers of Arctic Amplification (AA) to physical parameters.
figure 4

a The theoretical estimate, \({I}^{{AA}}\), is ranked from low to high and decomposed into the baseline AA, \({I}_{B}^{{AA}}\) (grey bars) and the effect of differential forcing and feedbacks, \({FF}\) (bars with cool-to-warm color indicating the magnitude). The degree of the model-projected AA is indicated by the black dots. The six outlier models are denoted by the blue and red boxes. The model number is shown in the x-axis. b. The intermodel variation of \({I}_{B}^{{AA}}=1+\frac{a}{b-{\lambda }^{G}}\) Eq. (9) explained by the variations in \({\lambda }^{G}\) and \(d=\frac{\Delta {AHT}}{\Delta {T}^{G}}-\hat{\frac{\Delta {AHT}}{\Delta {T}^{G}}}\). A higher \(d\) is equivalent to a larger \(a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}\) and/or a smaller \(b\equiv -\frac{\partial {AHT}}{\partial ({{T}^{A}-T}^{G})}\). The magnitudes of \({I}_{B}^{{AA}}\) in individual models are indicated by the cool-to-warm color. The shading reflects the value of \(1+\frac{\hat{a}+d}{\hat{b}-{\lambda }^{G}}\). c The intermodel variations of \({FF}=\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}-\frac{a}{b-{\lambda }^{G}}\) Eq. (10) explained by the variations in \({\lambda }^{G}\) and \({{\lambda }^{A}-\lambda }^{G}\). The magnitudes of \({FF}\) in individual models are indicated by the cool-to-warm color. The shading reflects the value of \(\frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}-{\lambda }^{A}}-\frac{\hat{a}}{\hat{b}-{\lambda }^{G}}\). In (b and c), the outlier models are denoted by the enlarged diamonds. d The values of \(a\), \(b\), \(d\), \({\lambda }^{G}\), \({\lambda }^{A}-{\lambda }^{G}\), and individual feedbacks in the model ensemble (boxplots indicating the 10%, 25%, mean, 75%, 90% percentiles) and in the six outlier models (colored dots). Source data are provided as a Source Data file.

According to Eq. (9), \({I}_{B}^{{AA}}\) is larger if \(a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}\) is larger, \(b\equiv -\frac{\partial {AHT}}{\partial ({{T}^{A}-T}^{G})}\) is smaller, and λG is weaker (less negative). A model that simulates a higher \(\frac{\Delta {AHT}}{{\Delta T}^{G}}=a-b({AA}-1)\) than the prior-estimated \(\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}=\hat{a}-\hat{b}\left({AA}-1\right),\) should have a larger a and/or a smaller b. Thus, \({I}_{B}^{{AA}}\) would be larger if \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) is higher and λG is weaker. Indeed, as shown in Fig. 4b, the intermodel variation of \({I}_{B}^{{AA}}\) (indicated by cool-to-warm color of the diamond symbols) is consistently explained by its increases with d and λG. For the outlier models, M7 presents a small \({I}_{B}^{{AA}}\) due to its abnormally small d; M21 presents a small \({I}_{B}^{{AA}}\) due to its excessively strong λG (<−3 W m−2); on the other hand, M12,19,32 present a large \({I}_{B}^{{AA}}\) due to their weak λG (>−1 W m−2).

According to Eq. (10), FF would be larger when λG is weaker (less negative) but \({\lambda }^{A}-{\lambda }^{G}\) is higher (more positive). We relate FF to λG and λA − λG instead to λA and λG, as there is a significant correlation between λA and λG (r = 0.6; black dots in Fig. 2b). As shown in Fig. 4c, the intermodel variation of FF (indicated by cool-to-warm color of the diamond symbols) is consistently explained by its increases with λG and \({\lambda }^{A}-{\lambda }^{G}\). For the outlier models, M7 and M8 present a very small FF as their λA − λG is abnormally low; M21 presents a very small FF despite normal λA − λG as its λG is excessively strong; on the other hand, M12, 19, 32 present a strong FF as their λG are abnormally weak.

The variations in λA − λG and λG are further attributed to the effects of individual feedbacks (\({\lambda }_{i}^{A}\) and \({\lambda }_{i}^{G}\)). Based on their baseline AA, FF, and d, \({\lambda }_{i}^{A},{\lambda }_{i}^{G}\) (Fig. 4d), we conclude the causes of the abnormal AA in the six outlier models as follows.

  • M7 (CESM2): Small baseline AA due to abnormally low d; Small FF due to low \({\lambda }^{A}-{\lambda }^{G}\) which is further attributed to the effects of λCld and λOcn.

  • M8 (CIESM): Small FF due to abnormally low \({\lambda }^{A}-{\lambda }^{G}\), which is further attributed to the effects of λAlb and λOcn.

  • M21 (INM-CM4-8): Small baseline AA and FF due to excessively strong λG, which is further attributed to the effect of λCld.

  • M12 (CanESM5), M19 (HadGEM3-GC31-LL), M32 (UKESM1-0-LL): Large baseline AA and FF due to abnormally weak λG, which is further attributed to the effect of λCld.

Thus, the intermodel spread in the degree of AA can be understood from three key parameters, the dynamics of AHT (d), the differential feedbacks between Arctic and globe (\({\lambda }^{A}-{\lambda }^{G}\)), and the global mean feedback (λG). Given the formula \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\), one might expect AA to increase with a stronger (more negative) λG. However, as λA and λG are correlated, a stronger λG implies a stronger λA and reduces IAA. By influencing λG, global mean cloud feedback strongly affects the degree of AA, as seen in the outlier models M21 and M12,19 and 32.

Application to explaining previous numerical results of AA

The formula, \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\), can analytically explain the degree of AA in previous numerical studies that manipulate feedbacks and radiative forcing. Reference 39 imposed uniform forcing and feedbacks in a diffusive EBM. The resultant AA is ~1.6, which is consistent with the baseline AA here. Reference 41 investigated AA under regionally confined radiative forcing in a comprehensive climate model. They found a strong AA with polar forcing (poleward of 60o) but a very weak AA with tropical or midlatitudinal forcing. These results can be analytically explained by our formula. Specifically, \({I}^{{AA}}\cong \frac{a+b-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\) predicts \({I}^{{AA}}\cong 6\) for polar forcing with γ \(\cong\) 4 and IAA\(1.7\) for tropical or midlatitudinal forcing with γ = 0. Reference 40 conducted feedback-removing experiments in a diffusive EBM and showed that the presence of the water-vapor feedback warms the Arctic more than the globe although the water-vapor feedback is weaker in the Arctic. This puzzling result can be analytically explained by our formula. Specifically, given \({I}^{{AA}}=\frac{a+b-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\), the presence of the water-vapor feedback warms the global mean by \({T}_{{wv}}^{G}={T}^{G}-{T}_{-{wv}}^{G}=\frac{{F}^{G}}{{-\lambda }^{G}}-\frac{{F}^{G}}{-{\lambda }_{-{wv}}^{G}}\cong 0.31{F}^{G}\) and warms the Arctic by \({T}_{{wv}}^{A}={T}^{A}-{T}_{-{wv}}^{A}={T}^{G}{I}^{{AA}}-{T}_{-{wv}}^{G}{I}_{-{wv}}^{{AA}}=\frac{{F}^{G}}{{-\lambda }^{G}}\frac{a+b-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}-\frac{{F}^{G}}{-{\lambda }_{-{wv}}^{G}}\frac{a+b-\gamma {\lambda }_{-{wv}}^{G}}{b-{\lambda }_{-{wv}}^{A}}\cong 0.7{F}^{G}\) (the subscript \(-{wv}\) denotes the experiment with the water-vapor feedback set to zero). These successes highlight that, the degree of AA, emerging from controls of multiple factors, is well captured by our simple nonlinear formula.

Discussion

Our results suggest that the degree of AA, despite governed by effects of multiple feedbacks and intricate interactions among feedbacks and AHT, can be analytically understood. By deciphering atmospheric heat transport (AHT) and establishing a two-box EBM of AA, we show that the degree of AA is a simple nonlinear function of five key parameters as \({I}^{{AA}}\equiv 1+\frac{a+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\), where a is the increasing rate of AHT with global uniform warming, b is the decreasing rate of AHT with enhanced warming in the Arctic, γ is the ratio between the Arctic and global mean radiative forcing, and \({\lambda }^{A}\) and \({\lambda }^{G}\) are the sum of climate feedbacks in the Arctic and the globe respectively. The theory conveys a concise picture of how the degree of AA emerges from essential physics as a combination of a baseline AA (which arises from the fundamental physics of AHT and exists even with uniform forcing and feedbacks) and the effect of differential forcing and feedbacks between Arctic and globe. The formula accurately captures the degree of AA in individual climate models and attributes the variation to specific physical factors that can be further targeted by modelling centers. The formula analytically explains the degree of AA in multiple numerical studies that manipulate feedbacks and radiative forcing. More generally, the analytic theory allows us to predictively understand how the degree of AA would change if certain physical factors (e.g., feedbacks, forcing, and atmospheric diffusivity) change.

The analytic theory works beyond the existing diagnostic framework of AA which partitions the total global warming linearly into contributions of individual factors (Eqs. (3) and (4)). The diagnostic partition cannot answer how AA would change with physical factors and may misinterpret the real role of a factor (see our discussion in the introduction). Here, the analytic form provides a direct, clear understanding of how individual physical factors influence the degree of AA. The lapse-rate feedback has been widely recognized as a leading-order contributor to AA. We show here that the sum of the water-vapor and lapse-rate feedback is nearly identical between the Arctic and global mean, so the combined feedback contributes little to AA. Such compensation was not recognized by the diagnostic framework which would diagnose the combined feedback to contribute more to Arctic warming given ΔTA  > ΔTG. The role of AHT in AA was often interpreted based on its total change ∆AHT. We show here that ∆AHT consists of a forcing-like part from global-scale warming and a negative feedback part that depends on AA. Instead of by its total change, the fundamental role of AHT in AA is represented by its partial sensitivities to global uniform warming (a; the forcing part) and enhanced Arctic warming (b; the negative feedback part).

We have focused on the intermediate emission scenario SSP2-4.5, but the theory also works well for low and high emission scenarios. \({I}^{{AA}}\) is correlated with the model-projected AA at r = 0.88 for SSP1−2.6 and r = 0.84 For SSP5-8.5 (Fig. S3). Besides the application to understanding the intermodel variation, the theory may also be applied to understand the response of AA to changing physical parameters under paleo and future long-term climate change. Furthermore, the theory can be readily applied to the Antarctic and to understanding the hemispheric asymmetry in polar amplification. Finally, the theory provides a basis for the physical constraints on the degree of AA. Specifically, if certain parameters (a, b, λG, λA) can be observationally estimated or constrained, as shown in previous work for the global cloud feedback48 and the Arctic summer albedo feedback49, the uncertainty range of the degree of AA may be narrowed by substituting the observationally constrained values into the formula.

One limitation of application of the theory is that we have used a constant γ, which however could be model dependent. We note that the formula with a constant γ tends to overestimate the high AA and underestimate the low AA in models, leading to a larger variation in AA (Fig. 4a; black dots vs. bars). One possible explanation could be that the variation in γ may partially offset the variation in the formula with a constant γ and potentially leads to a better match between the theory and the models.

Methods

Global climate models

Outputs of 32 global climate models in the Coupled Model Intercomparison Project Phase 6 (CMIP6) are used. Responses to climate change are computed from the differences in the climatology between the 1980−1995 period in the historical simulations and the 2085−2100 period in the Shared Socioeconomic Pathways (SSP) simulations. We have considered the low (SSP1−2.6), intermediate (SSP2-4.5) and high (SSP5-8.5) emission scenarios. To estimate the forced response to anthropogenic climate change, we take the mean of multiple realizations of each model, considering the effect of climate variability approximately canceled out among different realizations. The names of the 32 models and the numbers of realizations used are summarized in Table S1.

Feedback parameters quantified by the radiative-kernel method

The method of radiative kernel is applied to calculate the feedback parameters, λi, as,

$${\lambda }_{i}=\frac{\Delta {R}_{i}^{{TOA}}}{\Delta T},$$
(12)

where \(\Delta {R}_{i}^{{TOA}}\) is the annual-mean top-of-atmosphere (TOA) radiation anomaly induced by the change in the feedback variable and ΔT is the change in the annual-mean surface temperature. The radiative kernel contains the monthly and regionally dependent response of the TOA radiative fluxes to incremental changes in the feedback variables (e.g., albedo, air temperature, and specific humidity). \(\Delta {R}_{i}^{{TOA}}\) is estimated by multiplying the anomaly of a feedback variable with its corresponding radiative kernel and integrating the product vertically throughout the troposphere. Specifically, the Planck feedback is computed from the change in the TOA radiation due to surface temperature changes that propagate throughout the troposphere; the lapse-rate feedback is computed from the effect of temperature departures from the vertically uniform change; the water-vapor feedback is computed from the effect of changes in air specific humidity. The albedo feedback is computed from the effect of changes in albedo, which is diagnosed from the surface downward and upward shortwave radiation. The cloud feedback is computed from the change in cloud radiative effect (ΔCRE) minus the effect of non-cloud variables on ΔCRE (i.e. the difference in \(\Delta {R}_{i}^{{TOA}}\) using the all-sky and clear-sky kernels).

Change in atmospheric heat transport into the Arctic

The energy budget in the Arctic is influenced by the change in the atmospheric heat flux into the Arctic, termed as ∆AHT. Considering the short timescale of the atmospheric energy balance, ∆AHT can be computed from the area-weighted integration of the atmospheric energy convergence (i.e., the difference between the net surface and TOA energy fluxes) as,

$$\Delta {AHT}=-\mathop{\iint }_{{Arctic}}\,\Delta \left({R}^{{TOA}}-{R}^{{SFC}}+{LH}+{SH}\right){dA}.$$
(13)

∆AHT is formulated as a function of the changes in global mean surface temperature \({\Delta T}^{G}\) and meridional temperature gradient \({\Delta T}^{A}-{\Delta T}^{G}\) as

$$\Delta {AHT}\cong a{\Delta T}^{G}-b\left({\Delta T}^{A}-{\Delta T}^{G}\right).$$
(14)

A reformulation of Eq. (14) gives

$$\frac{\Delta {AHT}}{{\Delta T}^{G}}\cong a-b({AA}-1).$$
(15)

This formula is supported by the significant correlation between \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and AA - 1 across models (Fig. 2a; r = -0.70). The intermodel regression estimate \(\hat{a}\cong\) 2.1 W m−2 K−1 and \(\hat{b}\cong\) 1.7 W m−2 K−1. The hat symbol denotes a constant estimate that works generally for the model ensemble instead of individual models. By using \(\hat{a}\) and \(\hat{b}\), Eq. (14) reasonably captures \(\Delta {{AHT}}^{A}\) in individual models (r = 0.71), capturing ~50% of the variance in \(\Delta {AHT}\) among models (Fig. S1). The deviation of ∆AHT from this estimate with constant \(\hat{a}\) and \(\hat{b}\) reflects the variations of a and b among models. It is however challenging to accurately estimate a and b for individual models. The spread of the climate responses among different SSPs (SSP1−2.6, SSP2-4.5, SSP5-8.5) do not always yield a reasonable regression line between \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and AA−1 for estimating a and b (Fig. S4). This cloud be due to either the varied a and b among different SSPs or insufficient realizations to cleanly remove the contamination from internal variability.

We approximately estimate a and b in individual models as deviations from \(\hat{a}\) and \(\hat{b}\) so that Eq. (15) reproduces the model-projected \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\). Specifically, the difference between the model-projected \(\frac{\Delta {AHT}}{{\Delta T}^{G}}\) and the prior estimate \(\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}=\,\hat{a}-\hat{b}({AA}-1)\), that is, \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) is attributed to \(a\) and \(b\) as

$$a=\hat{a}+(1-\delta )d,$$
(16)
$$b=\hat{b}-\delta d/({AA}-1).$$
(17)

In the main text, we use \(\delta=0.5\), that is, attributing \(d\) equally to \(a\) and \(b\). With the model-dependent \(a\) and \(b\), Eq. (14) fully reproduces \(\Delta {AHT}\) (Fig. S1).

We show below that the value of \({I}^{{AA}}\) is insensitive to the exact attribution of \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) to \(a\) and \(b\).

If \(d\) is all attributed to \(a,\) \(a=\hat{a}+d\) and \(b=\hat{b}\), then we have,

$${I}^{{AA}}-1\equiv \frac{\hat{a}+d+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}-{\lambda }^{A}}.$$
(18)

If \(d\) is all attributed to \(b,\) \(a=\hat{a}\) and \(b=\hat{b}-d/({AA}-1)\), then we have,

$$\begin{array}{c}{I}^{{AA}}-1\equiv \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}\, -\frac{d}{{AA}-1}\, -{\lambda }^{A}}\\ \begin{array}{c}\begin{array}{cc}\cong \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}-{\lambda }^{A}} & \left(1+\frac{d}{\left({AA}-1\right)\left(\hat{b}-{\lambda }^{A}\right)}+\frac{{d}^{2}}{{\left({AA}-1\right)}^{2}{\left(\hat{b}-{\lambda }^{A}\right)}^{2}}\right.\end{array}\\ \left.+\ldots \frac{{d}^{n}}{{\left({AA}-1\right)}^{n}{\left(\hat{b}-{\lambda }^{A}\right)}^{n}}\right)\\ \begin{array}{c}\begin{array}{ccc}\cong \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}+d}{\hat{b}-{\lambda }^{A}} & -\frac{{d}^{2}}{\left({AA}-1\right){\left(\hat{b}-{\lambda }^{A}\right)}^{2}} &+\frac{{d}^{2}}{\left({AA}-1\right){\left(\hat{b}-{\lambda }^{A}\right)}^{2}}\frac{b-{\lambda }^{A}}{\hat{b}-{\lambda }^{A}}\end{array}\\+\ldots \frac{{d}^{n}}{{\left({AA}-1\right)}^{n-1}{\left(\hat{b}-{\lambda }^{A}\right)}^{n}}\frac{b-{\lambda }^{A}}{\hat{b}-{\lambda }^{A}}\\ \begin{array}{c}\cong \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}+d}{\hat{b}-{\lambda }^{A}}-\frac{{d}^{n+1}}{{\left({AA}-1\right)}^{n}{\left(\hat{b}-{\lambda }^{A}\right)}^{n+1}}\\ \begin{array}{cc}\cong \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}+d}{\hat{b}-{\lambda }^{A}}-\left({AA}-1\right){\left(\frac{\hat{b}-b}{\hat{b}-{\lambda }^{A}}\right)}^{n+1} & \cong \frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}+d}{\hat{b}-{\lambda }^{A}}.\end{array}\end{array}\end{array}\end{array}\end{array}$$
(19)

The above derivation uses the relation \({AA}-1\cong {I}^{{AA}}-1=\frac{\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}}{b-{\lambda }^{A}}\) and assumes the deviation of \({{{{{\rm{b}}}}}}\) from \(\hat{{{{{{\rm{b}}}}}}}\) \(\left|\hat{{{{{{\rm{b}}}}}}}-{{{{{\rm{b}}}}}}\right| < \hat{{{{{{\rm{b}}}}}}}-{{{{{{\rm{\lambda }}}}}}}^{{{{{{\rm{A}}}}}}}\cong 2.1\) W m−2 K−1. So \({I}^{{AA}}\) is expected to be nearly identical when we attribute \(d=\frac{\Delta {AHT}}{{\Delta T}^{G}}-\widehat{\frac{\Delta {AHT}}{{\Delta T}^{G}}}\) all to \(a\) or \(b\). Indeed, \({I}^{{AA}}\) is largely invariant with either attribution and consistently correlated with the model-projected AA at r = 0.91 (Fig. S5).

The invariance of \({I}^{{AA}}\) with the attribution can be mathematically derived by considering arbitrary attribution, \(a=\hat{a}+{a}^{{\prime} }\), \(b=\hat{b}+{b}^{{\prime} }\) and their relation \({a}^{{\prime} }-{b}^{{\prime} }\left({AA}-1\right)=d\). Let \(\hat{\alpha }=\hat{a}+{\lambda }^{A}-\gamma {\lambda }^{G}\), \(\hat{\beta }=\hat{b}-{\lambda }^{A}\), \(\alpha=a+{\lambda }^{A}-\gamma {\lambda }^{G}\), \(\beta=b-{\lambda }^{A}\), we have

$$\begin{array}{c}{I}^{{AA}}-1\equiv \frac{\hat{\alpha }+{a}^{{\prime} }}{\hat{\beta }+{b}^{{\prime} }}\cong \left(\frac{\hat{\alpha }}{\hat{\beta }}+\frac{d+{b}^{{\prime} }\left({AA}-1\right)}{\hat{\beta }}\right)\left(1-\frac{{b}^{{\prime} }}{\hat{\beta }}\right)\\=\frac{\hat{\alpha }+d}{\hat{\beta }}+\frac{{b}^{{\prime} }\left({AA}-1\right)}{\hat{\beta }}-\frac{\hat{\alpha }{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{d{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{{{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{2}}\left({AA}-1\right)\end{array}$$

Substituting \({AA}-1\cong {I}^{{AA}}-1=\frac{\hat{\alpha }+{a}^{{\prime} }}{\hat{\beta }+{b}^{{\prime} }}=\frac{\hat{\alpha }}{\hat{\beta }}\left(1+\frac{{a}^{{\prime} }}{\hat{\alpha }}\right)\left(1-\frac{{b}^{{\prime} }}{\hat{\beta }}\right)=\frac{\hat{\alpha }}{\hat{\beta }}(1+\frac{{a}^{{\prime} }}{\hat{\alpha }}-\frac{{b}^{{\prime} }}{\hat{\beta }})\) leads to

$$\small \normalsize \begin{array}{ccc}{I}^{{AA}}-1 &\!\!\!\! \cong &\!\!\!\!\frac{\hat{\alpha }+d}{\hat{\beta }}+\frac{{b}^{{\prime} }}{\hat{\beta }}\frac{\hat{\alpha }}{\hat{\beta }}\left(1+\frac{{a}^{{\prime} }}{\hat{\alpha }}-\frac{{b}^{{\prime} }}{\hat{\beta }}\right)-\frac{\hat{\alpha }{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{d{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{{{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{2}}\left({AA}-1\right)\hfill\\ &\!\!\!\!=& \!\!\!\!\frac{\hat{\alpha }+d}{\hat{\beta }}+\frac{{{b}^{{\prime} }a}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{{\hat{\alpha }{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{3}}-\frac{d{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{{{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{2}}\left({AA}-1\right)\hfill\\ &\!\!\!\!=& \!\!\!\!\frac{\hat{\alpha }+d}{\hat{\beta }}+\frac{{b}^{{\prime} }}{{\hat{\beta }}^{2}}\left(d+{b}^{{\prime} }\left({AA}-1\right)\right)-\frac{{\hat{\alpha }{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{3}}-\frac{d{b}^{{\prime} }}{{\hat{\beta }}^{2}}-\frac{{{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{2}}\left({AA}-1\right)=\frac{\hat{\alpha }+d}{\hat{\beta }}-\frac{{\hat{\alpha }{b}^{{\prime} }}^{2}}{{\hat{\beta }}^{3}}\hfill\\ &\!\!\!\! \cong & \!\!\!\!\frac{\hat{\alpha }+d}{\hat{\beta }}=\frac{\hat{a}+d+{\lambda }^{A}-\gamma {\lambda }^{G}}{\hat{b}-{\lambda }^{A}}.\hfill\end{array}$$
(20)

Theoretical derivation of the formula of \(\Delta {AHT}\) Eq. (5)

The formula of \(\Delta {AHT}\cong a{\Delta T}^{G}-b\left({\Delta T}^{A}-{\Delta T}^{G}\right)\) can also be theoretically derived from the diffusive formulation of AHT. Specifically, according to the diffusive theory, AHT into the Arctic can be written as,

$${{{{{\rm{AHT}}}}}}=-\beta {D}_{T}{C}_{p}\frac{\partial T}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}-\beta {D}_{q}{L}_{v}\frac{\partial q}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}},$$
(21)

where \(T\) and \(q\) are surface temperature and specific humidity respectively, \({D}_{q}\) and \({D}_{T}\) are the diffusivity for moisture and temperature respectively, \(\beta=\frac{\cos {\theta }_{o}}{1-\sin {\theta }_{o}}\frac{{P}_{s}}{{R}^{2}\,g}\) scales AHT to the unit of W m−2 K−1, \({P}_{s}\) is the surface pressure, \(R\) is the earth radius, \(\theta\) is the latitude radian, and \({\theta }_{o}=65\)oN defines the Arctic boundary, and \({|}_{{\theta }_{o}}\) indicates values around \({\theta }_{o}\). Under climate change, \(\frac{\partial q}{\partial \theta }{|}_{{\theta }_{o}}\) and \(\frac{\partial T}{\partial \theta }{|}_{{\theta }_{o}}\) changes as,

$$\Delta \frac{\partial T}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}={{{{{\rm{\zeta }}}}}}\left(\Delta {T}^{A}-\Delta {T}^{G}\right),$$
(22)
$$\Delta \frac{\partial q}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}= \frac{\partial \left(\frac{{dq}}{{dT}}\right)}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}\Delta {T}^{G}+{\widetilde{\frac{{dq}}{{dT}}}}{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}\frac{\partial \Delta T}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}=\,\frac{{L}_{v}}{{R}_{v}{T}^{2}}\frac{\partial q}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}\Delta {T}^{G}\\ +\frac{{L}_{v}}{{R}_{v}{T}^{2}}{\widetilde{q}}{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}{{{{{\rm{\zeta }}}}}}\left(\Delta {T}^{A}-\Delta {T}^{G}\right),$$
(23)

where the tilde symbol ̃ indicates the warming-weighted mean and \({{{{{\rm{\zeta }}}}}}\equiv \frac{\frac{\partial T}{\partial \theta }{|}_{{\theta }_{o}}}{\Delta {T}^{A}-\Delta {T}^{G}}\). In Eq. (23), we have used the relation \(\frac{{dq}}{{dT}}=\frac{{L}_{v}q}{{R}_{v}{T}^{2}}\), which is derived from the Clausius-Clapeyron relation and the assumption that relative humidity stays roughly constant under climate change. Substituting Eqs. (22) and (23) into Eq. (21) leads to the same formula of \(\Delta {{{{{\rm{AHT}}}}}}\) as Eq. (5),

$$\Delta {{{{{\rm{AHT}}}}}}=a\Delta {T}^{G}-b\left(\Delta {T}^{A}-\Delta {T}^{G}\right),$$
(24)

with

$$a\equiv \frac{\partial {AHT}}{\partial {T}^{G}}=-\beta {D}_{q}\frac{{L}_{v}^{2}}{{R}_{v}{T}^{2}}\frac{\partial q}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}},$$
(25)
$$b\equiv \frac{\partial {AHT}}{\partial \left({T}^{A}-{T}^{G}\right)}=\beta {{{{{\rm{\zeta }}}}}}\left({D}_{q}\frac{{L}_{v}^{2}}{{R}_{v}{T}^{2}}\widetilde{q}{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}+{{D}_{T}C}_{p}\right).$$
(26)

The above derivation shows that the formula \(\Delta {{{{{\rm{AHT}}}}}}=a\Delta {T}^{G}-b(\Delta {T}^{A}-\Delta {T}^{G})\) can be theoretically derived from the diffusive formulation of AHT and the parameter \(a\) and \(b\) are functions of basic parameters of the climate system.

Approximate values of \(a\) and \(b\) ca be estimated using Eqs. (25) and (26). According to the climatological distribution of surface (2 m) specific humidity (Fig. S6a), \(\frac{\partial q}{\partial \theta }{|}_{{\theta }_{o}}\cong 7{{{{{\rm{g}}}}}}\) kg−1, \(\widetilde{q}{|}_{{\theta }_{o}}\cong 2{{{{{\rm{g}}}}}}{{{{{{\rm{kg}}}}}}}^{-1}\). According to the meridional structure of the warming pattern (Fig. S6b), \({{{{{\rm{\zeta }}}}}}\equiv \frac{\frac{\partial T}{\partial \theta }{|}_{{\theta }_{o}}}{\Delta {T}^{A}-\Delta {T}^{G}}\cong 1.\) For diffusivity, Ref. 50 estimated \({D}_{q}\cong 1.8\) km2 s−1 and \({D}_{T}\cong 1.2\) km2 s−1 based on an aquaplanet simulation, while Ref. 51 used a fixed diffusivity \({D}_{T,q}\cong 1.06\) km2 s−1 that best fits their results. The estimates from the intermodel regression, \(\hat{a}\cong\) 2.1 W m−2 K−1 and \(\hat{b}\cong\) 1.7 W m−2 K−1, are reproduced by Eqs. (25) and (26) if \({D}_{q}\cong 1.6\) km2 s−1 and \({D}_{T}\cong 1.0\) km2 s−1. We note that the ratio between \(a\) and \(b\) is a function of the climatological distribution of specific humidity and \({D}_{T}/{D}_{q}\) as

$$\frac{a}{b}=\frac{\frac{\partial q}{\partial \theta }{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}}{\widetilde{q}{{{{{{\rm{|}}}}}}}_{{\theta }_{o}}+{{D}_{T}C}_{p}{R}_{v}{T}^{2}/{D}_{q}{L}_{v}^{2}}\cong \frac{7}{2+5.7{D}_{T}/{D}_{q}},$$
(27)

whose value should be around (or more likely slightly above) 1.

Ocean feedback

The energy budgets in the Arctic and the globe are influenced by the changes in the ocean heat flux, termed as \(\Delta O\). Considering the short timescale of the ocean-mixed-layer energy balance, \(\Delta O\) can be estimated from the change in the net surface fluxes as,

$$\Delta O=\Delta ({SH}+{LH}-{R}^{{SFC}}),$$
(28)

where \({SH}\) and \({LH}\) are the surface (upward) sensible and latent heat flux respectively and \({R}^{{SFC}}\) is the net downward radiation flux at the surface. To facilitate the derivation and simplify interpretation, we have written \(\Delta O\) as a feedback,

$$\Delta O={\lambda }_{O}\Delta T,$$
(29)

where \({\lambda }_{O}\) is diagnosed as \(\Delta O/\Delta T\) for the globe and the Arctic. While this is just a mathematical treatment, it can be physically justified as \(\Delta O\) is approximately proportional to \(\Delta T\) in individual models (Fig. S2). For the globe, \({\lambda }_{o}^{G}=-0.41\pm 0.10\) W m−2 K−1; For the Arctic, \({\lambda }_{o}^{A}=-0.14\pm 0.37\) W m−2 K−1, where \(\pm\) indicates the s.d. among models. We note that \({\lambda }_{O}\) is similar to the ocean heat exchange parameter κ in ref. 42 but subtly different from (and thus smaller in magnitude than) the parameter \(\eta\) in Ref. 52 which further considered a deep-ocean layer.

Decompose the degree of AA into contributions of individual physical factors

\({I}^{{AA}}\) is decomposed into a baseline AA and the net effect of differential radiative forcing and feedbacks between Arctic and globe, that is,

$${I}^{{AA}}=1+\frac{a{+\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{b-{\lambda }^{A}}=\underbrace{{1+\frac{a}{b-{\lambda }^{G}}}}_{\begin{array}{c}b{{{{{\rm{aseline\; AA}}}}}}\\ {I}_{B}^{{AA}}\end{array}}+\underbrace{{\frac{a+{\lambda }^{A}-\hat{\gamma }{\lambda }^{G}}{b-{\lambda }^{A}}-\frac{a}{b-{\lambda }^{G}}}}_{\begin{array}{c}{{{{{\rm{effect}}}}}}\; {{{{{\rm{of}}}}}}\; {{{{{\rm{differential}}}}}}\; {{{{{\rm{forcing}}}}}}\; {{{{{\rm{and}}}}}}\; {{{{{\rm{feedbacks}}}}}}\\ {FF}\end{array}}.$$
(30)

The effects of differential forcing and feedbacks, \({FF}\), can be further separated. However, we should note that, due to the nonlinear format of \({FF}\), the separate effects of differential forcing and feedbacks depend on each other. For example, the effect of differential forcing may be estimated as \(\frac{(1-\hat{\gamma }){\lambda }^{G}}{b-{\lambda }^{G}}\) or \(\frac{(1-\hat{\gamma }){\lambda }^{G}}{b-{\lambda }^{A}}\) depending on whether the effect of differential feedbacks has already been included. Both estimates are negative but they differ by ~0.1. In Fig. 2e, the effect of differential forcing is computed with the effect of feedbacks already included as \(\frac{(1-\hat{\gamma }){\lambda }^{G}}{b-{\lambda }^{A}}\).

The effect of differential feedbacks is estimated by replacing the total Arctic feedback with the total global feedback, that is,

$$\frac{a+{\lambda }^{A}-{\hat{\gamma }\lambda }^{G}}{b-{\lambda }^{A}}-\frac{a+{\lambda }^{G}-{\hat{\gamma }\lambda }^{G}}{b-{\lambda }^{G}}.$$
(31)

The contribution of individual feedback \({\lambda }_{i}\) is then estimated by replacing its Arctic value with its global value, that is,

$$\frac{a+{\lambda }^{A}-{\hat{\gamma }\lambda }^{G}}{b-{\lambda }^{A}}-\frac{a+{\lambda }^{A^{\prime}}+{\hat{\gamma }\lambda }^{G}}{b-{\lambda }^{A^{\prime}} }.$$
(32)

with \({{\lambda }^{A}}^{{\prime} }={\lambda }^{A}-{\lambda }_{i}^{A}+{\lambda }_{i}^{G}\).