Coupled climate response to Atlantic Multidecadal Variability in a multi-model multi-resolution ensemble

North Atlantic sea surface temperatures (SSTs) underwent pronounced multidecadal variability during the twentieth and early twenty-first century. We examine the impacts of this Atlantic Multidecadal Variability (AMV), also referred to as the Atlantic Multidecadal Oscillation (AMO), on climate in an ensemble of five coupled climate models at both low and high spatial resolution. We use a SST nudging scheme specified by the Coupled Model Intercomparision Project’s Decadal Climate Prediction Project Component C (CMIP6 DCPP-C) to impose a persistent positive/negative phase of the AMV in the North Atlantic in coupled model simulations; SSTs are free to evolve outside this region. The large-scale seasonal mean response to the positive AMV involves widespread warming over Eurasia and the Americas, with a pattern of cooling over the Pacific Ocean similar to the Pacific Decadal Oscillation (PDO), together with a northward displacement of the inter-tropical convergence zone (ITCZ). The accompanying changes in global atmospheric circulation lead to widespread changes in precipitation. We use Analysis of Variance (ANOVA) to demonstrate that this large-scale climate response is accompanied by significant differences between models in how they respond to the common AMV forcing, particularly in the tropics. These differences may arise from variations in North Atlantic air-sea heat fluxes between models despite a common North Atlantic SST forcing pattern. We cannot detect a widespread effect of increased model horizontal resolution in this climate response, with the exception of the ITCZ, which shifts further northwards in the positive phase of the AMV in the higher resolution configurations


Introduction
Over the twentieth century, Sea Surface Temperatures (SSTs) in the North Atlantic underwent periods of warming and cooling.These SST variations are now commonly referred to as the Atlantic Multidecadal Variability (AMV).Earlier studies use the term Atlantic Multidecadal Oscillation (AMO: Kerr 2000), but doubts about the oscillatory nature, given the short observational record led to the revised, more general name.
Observational studies suggest that the AMV has multiple impacts on climate (Gastineau and Frankignoul 2014;O'Reilly et al. 2017;Peings and Magnusdottir 2014;Nigam et al. 2018), including the following: changes in rainfall on both sides of Atlantic (Folland et al. 1986;Hoerling et al. 2006;Zhang and Delworth 2006;Uvo et al. 1998;Folland et al. 2001;Zhou and Lau 2001;Knight et al. 2006), warming and drying over North America (Sutton andHodson 2005, 2007;Hodson et al. 2009), changes in European summer climate (Sutton and Dong 2012), hurricane formation (Shapiro and Goldenberg 1998;Enfield et al. 2001;Goldenberg et al. 2001) and Atlantic winter atmospheric blocking (Kwon et al. 2020).Modelling studies also suggest the AMV drives changes outside the Atlantic region; the Asian and Indian monsoon (Zhang andDelworth 2006, 2005), Siberian rainfall (Sun et al. 2015), Antarctic sea ice (Li et al. 2014) and modulation of the ENSO (Dong and Sutton 2007).A number of these links between the AMV and impacts have also been seen in historical and pre-industrial coupled climate model integrations (Ting et al. 2011;Lyu and Yu 2017).
The ultimate driver of the AMV remains a matter of debate.Patterns similar to the observed AMV (Fig. 1) are associated with multidecadal variations in the Atlantic Meridional Overturning Circulation (AMOC) in coupled climate models (Delworth et al. 1993;Vellinga and Wood 2002;Zhang and Delworth 2005;Dong and Sutton 2007;Knight et al. 2006;Hodson and Sutton 2012;Ruprich-Robert and Cassou 2015) suggesting that historical variations in the AMOC may have played a role in the observed AMV.However, the 20th century also saw considerable variation in natural and anthropogenic forcings (Hodson et al. 2014); anthropogenic aerosols rose and then declined over the region, volcanic activity reduced mid-century (Sato et al. 1993) whilst total solar irradiance (TSI) underwent considerable variability in the solar cycle around an upward trend (Lean 2018).The historical variations of anthropogenic aerosols in coupled climate models modulate Atlantic SST and reproduce the observed phasing and magnitude of the AMV (Booth et al. 2012;Watanabe and Tatebe 2019;Undorf et al. 2018).Others studies suggest that volcanic stratospheric aerosols played a key role (Otterå et al. 2010;Birkel et al. 2018), possibly by exciting multidecadal coupled modes of variability in the subpolar gyre and Labrador Sea (Swingedouw et al. 2015).The possibility also remains that the AMV may arise simply as a consequence of the ocean mixed layer integrating high-frequency atmospheric noise (Clement et al. 2015).
Whatever the ultimate origin of the AMV, the question remains: what is the climate response to the AMV?Previous studies have demonstrated significant global climate impacts using atmosphere-only models forced by AMV patterns (Sutton andHodson 2005, 2007;Hodson et al. 2009;Davini et al. 2015;Vigaud et al. 2018;Omrani et al. 2014Omrani et al. , 2016;;Peings and Magnusdottir 2014;Mohino et al. 2011;Elsbury et al. 2019).However, such experiments do not capture the response of the ocean to this AMV forcing, and any consequent feedback on global climate.Early AMV studies using coupled atmosphere-ocean models (e.g.Zhang and Delworth 2006;Dong et al. 2006) suggested that such coupled feedbacks might be significant.Recent studies have begun to examine these feedbacks in more detail by imposing AMV forcing patterns in coupled atmosphere-ocean climate models (Kucharski et al. 2016(Kucharski et al. , 2016b;;Ruprich-Robert et al. 2017, 2018;Levine -et al. 2018;Qasmi et al. 2017).
These coupled model studies find widespread global impacts beyond those found in the atmosphere-only studies and hence warrant further examination.A coordinated coupled AMV experiment was therefore proposed to assess the robustness of these responses across models and was subsequently included in the Decadal Climate Prediction Project MIP (DCPP-C Boer et al. 2016), as part of CMIP6.However, one key question that arises, is how dependent are these responses on model spatial resolution?The EU Horizon 2020 PRIMAVERA project (https:// www.prima vera-h2020.eu/) aimed to assess the impact of increased model resolution on a wide range of climate processes.As part of PRIMAVERA we examined the global climate impact of the AMV in five coupled climate-models in both low and high spatial resolution configurations.This allows us to assess how the modelled climate impact of the AMV is dependent on model design and model horizontal resolution.
In this paper we focus on these three key questions: • What is the global coupled climate response to the AMV across these five models?• How consistent is this response across the models?• Does increasing model resolution alter this response?
This paper is set out as follows.Section 2 outlines the climate models, data and methodology used in the AMV experiments.Section 3 outlines the ANOVA technique (we provide a more detailed explanation in Appendix A).Section 4 presents the results of the analysis of these experiments.We Fig. 1 A) 2 * AMV + anomaly pattern used in model experiments.The AMV pattern is derived from observed Sea Surface Temperatures (SSTs), as described by the CMIP6 DCPP-C protocol (Boer et al. 2016) https:// www.wcrp-clima te.org/ wgsip/ docum ents/ Tech-Note-1.pdf.Here we double this pattern-see Sect.2.3 for more details.Units: K (per standard deviation).Contours are 0.2K.B) Associated AMV timeseries (low-passed and standardized) then discuss these results in Sect. 5 and present our conclusions in Sect.6.Additional analysis supporting this analysis can be found in the Supplementary Information.

Models
We employed five coupled climate models in this study (CNRM-CM6-1, EC-Earth, ECMWF-IFS, MetUM-GOML2 and MPI-ESM1.2).The AMV experiments were performed at two groups of horizontal resolutions for each atmosphere model: low resolution (LR: 250-100 km) and high resolution (HR 100-50 km) for each model (summarized in Table 1) .Brief descriptions of the five climate models and their formulations are given below.

CNRM-CM6-1
CNRM-CM6-1 is coupled climate model consisting of the ARPEGE-Climat (Déqué et al. 1994) (2019).CNRM-CM6-1-LR uses a spectral model atmosphere with 91 vertical levels and a horizontal truncation of T127, resulting in a resolution at the equator of about 1.4 • .The ocean has 75 vertical levels and a horizontal resolution of about 1 • , reducing to 1/3 • in the tropics.CNRM-CM6-1-HR uses a spectral model with 91 vertical levels and a horizontal truncation of T359, resulting in a resolution at the equator of about 0.5 • .The ocean has 75 vertical levels and a horizontal resolution of about 0.25 • .

EC-Earth
EC-Earth3P and EC-Earth3P-HR (Haarsma et al. 2020) are coupled climate models consisting of an atmospheric component based on the IFS (cycle 36r4 of the Integrated Forecast System (IFS) atmosphere-land-wave model of the European Centre for Medium Range Weather Forecasts (ECMWF)) coupled to NEMO (v3.6 Madec et al. 2017).
The H-TESSEL model is used for the land surface (Balsamo et al. 2009) and is an integral part of IFS: for more details see Hazeleger and Bintanja (2012).The atmosphere and ocean/sea ice parts are coupled through the OASIS (Ocean, Atmosphere, Sea Ice, Soil) coupler (Valcke 2013).The NEMO ocean has 75 vertical levels and a horizontal resolution of about 1 • , reducing to 1/3 • in the tropics.The higher resolution EC-Earth3P-HR uses a T511 horizontal truncation/N256 grid resolution ( ∼50 km) IFS atmosphere, together with a 0.25 • NEMO ocean and 75 vertical levels.

ECMWF-IFS
ECMWF-IFS is a global Earth system model that includes dynamic representations of the atmosphere, sea ice, ocean, land surface, and ocean waves.A detailed description of the ECMWF-IFS-HR and ECMWF-IFS-LR configurations used in this study, including scientific assessment of the coupled model performance, is provided in Roberts et al. (2018).ECMWF-IFS is based on the IFS atmosphere-landwave model (cycle 43r1) coupled to NEMO (v3.4 Madec et al. 2017) and the Louvain-la-Neuve Sea Ice Model (LIM2; Bouillon et al. 2009;Fichefet and Maqueda 1997).ECMWF-IFS-LR uses a reduced Gaussian octahedral grid (Tco199 ∼100 km) in the atmosphere and NEMO ORCA1 grid ( ∼100 km) for ocean-sea ice.ECMWF-IFS-HR uses (Tco399∼50 km) in the atmosphere and NEMO ORCA025 grid ( ∼25 km) for ocean-sea ice.One of the significant dif- ferences between these configurations is the use of the Gent and Mcwilliams (1990) parameterization for the effect of mesoscale eddies with the ORCA1 grid, which is disabled when using the ORCA025 grid.Both ocean configurations use the same vertical discretization, which consists of 75 z-levels and partial cells at the ocean floor.

MetUM-GOML2
MetUM-GOML2 is an ocean mixed-layer coupled configuration of the Met Office Unified Model (MetUM-GOML2;  (Hirons et al. 2015) via the Ocean Atmosphere Sea Ice Soil (OASIS) coupler (Valcke 2013) ).For full details of MetUM-GOML2 see Hirons et al. (2015).The atmosphere and ocean have a horizontal resolution of either 1.25 × 1.87 • ( ∼250 km, N96-LR) or 0.833 × 0.55 • ( ∼100 km, N216- HR).The Atmosphere has 85 vertical levels whilst the ocean mixed-layer component extends to 1 km depth with 100 vertical levels.Sea ice fraction is prescribed from 1976-2005 mean climatology, as is Sea Surface Temperature in regions that are not ice-free all year.Although there is vertical ocean mixing, there is no horizontal advection or mixing in the model; these terms are replaced by seasonally-varying 3d temperature and salinity flux corrections, diagnosed from seasonal climatologies.Consequently, MetUM-GOML2 has small sea surface temperature biases and small model drifts (Hirons et al. 2015).In this paper, a 1976-2005 mean ocean temperature and salinity reference climatology is used, derived from the Met Office ocean analysis (Smith and Murphy 2007).Anthropogenic greenhouse gases concentration, aerosol emissions, volcanic activity are imposed and kept constant to their mean value of the period 1976-2005.

Regridding
Each model and resolution (Table 1) uses a different horizontal model grid.In order to compare a model variable between models and resolutions on a grid-point basis, the variables are regridded to a common grid using bilinear interpolation.For all variables we regrid to a 1 • × 1 • longitude latitude grid.This will potentially result in the loss of some small-scale differences between high and low resolutions, but our primary focus in this study is the impact of increased resolution on the large-scale climate response to the AMV, rather than such smaller scale impacts.

Experimental design
The goal of this study is to assess the global impact of the AMV using coupled climate models.Previous atmosphereonly studies (e.g.Sutton and Hodson 2005;Hodson et al. 2009;Davini et al. 2015;Vigaud et al. 2018) used AMV SST anomalies (derived from observations) to drive atmosphere models.In a coupled ocean-atmosphere model, a balance must be preserved between maintaining the AMV SST pattern (on a background SST climatology), and permitting the ocean to respond to anomalous fluxes from the atmosphere.To achieve this, we followed a modified form of the experimental design proposed for the CMIP6 DCPP-C AMV experiments.Full details of this design are given in Boer et al. (2016).We briefly outline this experimental design and the necessary modifications that were required for the experiments presented here.
The AMV forcing pattern used in these experiments is shown in Fig. 1.This pattern was generated from observed SSTs (ERSST4 Huang et al. 2015) after removing an estimate of the global forced trend (from CMIP5, by regression).(Full details of this method is given in https:// www.wcrp-clima te.org/ wgsip/ docum ents/ Tech-Note-1.pdf).This AMV pattern is similar to those diagnosed in other studies; the horseshoe structure with higher values in the subpolar gyre (SPG) (e.g.Sutton and Hodson 2005;Deser et al. 2010;Buckley and Marshall 2016).Figure 1 does display a more prominent small maximum in the Labrador Sea than these other studies, this may arise due to the different methods used to remove the global mean SST trend.
For each model experiment, SSTs within the North Atlantic (using a predefined mask) are nudged towards the target SST field constructed from an SST climatology plus or minus the 2*AMV spatial pattern in Fig. 1: SST target = Climatology ± 2 * AMV .Climatology is a model climatology derived from a control run (or observations, in the case of MetUM-GOML2-see section 2.1.4).SST nudging is achieved through an additional surface heat flux term hfcorr, defined as: The coefficient of −40 ( W∕m 2 ∕K ) was chosen based on a range of sensitivity studies (for details see http:// www.wcrpclima te.org/ wgsip/ docum ents/ Tech-Note-2.pdf).
Each model is then integrated multiple times, each starting from atmosphere and ocean initial conditions taken from different points in a control integration.Each realization is integrated for a maximum of 10 years, to prevent hfcorr from generating significant anomalies in the unconstrained subsurface ocean, which could significantly impact on deep ocean circulations and hence overwhelm the AMV forced signal.The resulting difference between the 2AMV + and 2AMV − experiments can then be used to assess the impact of AMV forcing on global climate.We assume that each year in a realization is statistically independent, hence we consider each year to be an ensemble member.(We examine this assumption further in SI section 9.) One of the goals of this study (and a focus of the PRIMA-VERA project), is to assess the impact of increased model resolution.Integrating a climate model at a higher resolution requires extra computational resources.In order to meet this goal within the available computing resources, we used a modified form of the DCPP-C AMV experimental design by doubling the originally specified AMV forcing pattern (hence 2AMV).This enhanced forcing reduces the number of realizations required to produce a detectable response.For consistency, we use this enhanced forcing (2AMV) for both LR and HR experiments.The DCPP-C protocol defined the use of a pre-industrial control climatology as the background climatology.However, none of the models had (or were likely to have) high resolution pre-industrial control integrations performed, as required by the DCPP-C protocol.Instead, each model (at both resolutions) used a 1950s constant-forcing control that had already been produced as part of the PRIMAVERA project.One model (MetUM-GOML2), used a later climatology , derived from observations, due to the nature of the model design (see Sect. 2.1.4).

Data availability
All the model data used in this study, together with the code used to analyse and plot the results are available to download via https:// doi.org/ 10. 5281/ zenodo.58842 27.The full experimental dataset is available at https:// prima-dm1.jasmin.ac.uk/ and at the ESGF https:// esgf-index1.ceda.ac.uk/ search/ esgf-ceda/ (search for primwp5 or dcpp-for MPI-ESM).Please contact the authors for further help accessing these datasets.

ANOVA
In the following section we use Analysis of Variance (ANOVA) to examine the global climate response to the AMV and how this response varies across the models and between resolutions.ANOVA is a common statistical technique that simultaneously examines the influence of a set of predictors or factors on a dependent variable.It is closely related to multiple linear regression, but can also be used with categorical factors, such as choice of climate model.It is also a generalization of the two-sample t test (see the appendix (A)) and is better at detecting significant impacts from multiple factors than the application of multiple t tests, as it simultaneously accounts for all sources of variance.
ANOVA has widespread use in many fields, but has only had limited application to climate science to date (e.g.Hodson and Sutton 2008;Yip et al. 2011;Christensen and Kjellström 2020).We provide a brief outline of the basis for ANOVA and its relation to t-tests in the Appendix (A).For a more detailed explanation and the application of ANOVA to climate models see Storch and Zwiers (1999), Zwiers (1996), Hodson and Sutton (2008), Yip et al. (2011) or Wilks (2019).
Much like linear regression, ANOVA begins by proposing a statistical model, consisting of a predictor, or combination of predictors, to explain the variance in a given variable.For example, a model variable X ej (e.g.mean sea level pressure-where e is the experiment, and j is the ensemble member) can be represented by: Here is the mean (over both e and j), and ej is a residual noise term, which we assume is independently and normally distributed with a variance 2 (i.e.ej ∼ N(0, 2 ) ). Hence a predictor ( e ) can then be assessed to see if it explains a significant fraction of the variance of variable X ej , with respect to the residual noise term ej .The similarity with linear regression is clear (e.g.y = c + mx + ).
In this study our dataset X emrj , consists of two experi- ments (e: 2AMV + ,2AMV − ), five models (m), two resolu- tions (r: LR, HR), and multiple ensemble members (j).We can propose the following statistical model for this dataset combining these predictors and their interactions: Here e represents the variation in X emrj due to the experi- ments (e: 2AMV + , 2AMV − ) averaged across all models (m), resolutions (r) and ensemble members (j).A em is an interac- tion term, and represents the variation in X emrj due to both the experiments (e) and models (m).In this way we can examine the impact of the AMV ( e ), and also both how (2) (3) this impact varies across the models (m: A em ), and changes with model resolution (r: G er ).The remaining terms describe other aspects of the dataset, for example, m describes the spread of climatologies between models (m) (see the appendix (A)).We do not consider these further terms in the remainder of this study, but focus on the impact of the experiment ( e ), the impact of the choice of model on the experiment ( A em ), and the impact of the choice of resolution on the experiment ( G er ).

Results
We now present the results of the AMV experiments.Rather than presenting the results for each model at each resolution, we use the above ANOVA analysis to examine the climate response to the AMV for each season and the influence of resolution and model on this response.We begin by examining the multi-model, multi-resolution mean responses ( e ) for each season.We then examine where the models significantly differ in their AMV response ( A em ).Finally we examine whether the AMV response changes with increased resolution ( G er ).To ensure the analysis is not biased towards any single model, we use the same number of ensemble members for each experiment, model and resolution in the analysis-100 (10 realizations × 10 years) members for each season (90 for DJF, as most models start on the 1st January, only 9 consecutive winters are available), e.g. a total of 1000 (DJF: 900) across models, resolution and members for each experiment.When a model has more that 100 (DJF: 90) ensemble members (see Table 2), we randomly subsample 100 (DJF: 90) from the full ensemble.The analysis results are mostly the same for different random subsamples (we examine this further in the Supplementary Information: SI Sect.1-Figures S1-S9).There is some sensitivity to subsampling when examining the impact of resolution, which we discuss below.
We also attempt to assess the field significance of these results (Storch and Zwiers 1999;Livezey and Chen 1983) by counting all grid points where a result is significant ( p < 0.05 ) and dividing this by the total number of grid- points.This ratio is shown in each panel of the subsequent spatial field figures.For data drawn from the same random distribution we would expect this fraction to be 0.05 for a gridpoint threshold of p < 0.05 .In the absence of covari- ances between neighbouring gridpoints, we could therefore classify any field where this fraction is > 0.05 to be field significant, hence reject the null hypothesis of it occurring purely by chance.In practice, there will be such covariances, and this will lead to an elevated threshold for field significance (i.e.greater than 0.05).There is no obvious objective method of assessing the resulting reduction of the degrees of freedom in this set of experiments, therefore we use this threshold as a guide; if a result falls below the field significance threshold, we can reject it, if it far exceeds the threshold, we can accept it.We should be more cautious, however if a result only just exceeds the threshold.

Multi-model multi-resolution mean response
We first examine the seasonal mean climate response to the AMV for three key climate variables, surface air temperature (tas), precipitation (pr) and mean sea-level pressure (psl).For each variable, we average over models, resolutions and ensemble members and compute the seasonal mean difference between the two experiments ( 2 * AMV + − 2 * AMV − ).These differences between the experiments considered are significant where e is significant in the ANOVA ( p < 0.05 , Eq. ( 3)).The following results are robust to subsampling (see SI section 1.1-Figures S1-S3).

Surface air temperature
Figure 2 shows the seasonal mean surface air temperature response to the AMV ( e , 2 * AMV + − 2 * AMV − ).The imposed time-invariant AMV SST pattern (Fig. 1a) is generally well maintained in the North Atlantic throughout the year (Fig. 2), although there is some seasonal variation.(A very similar pattern is seen in the model SSTs-SI section 2-Figure S10).The northern subpolar maximum is weakest in spring (MAM), perhaps due to deeper mixed layer depths that occur during late winter and early spring (Montégut et al. 2004); the larger heat capacity of the deeper mixed ocean column leading to a longer adjustment timescale for a given nudging heat flux (Eq.1).
The AMV drives a downstream warming of surface air temperatures to the east of the Atlantic basin; much of Eurasia and northern Africa is warmer throughout the year.Over Europe, this warming is largely confined to the western and southern edges.Central Europe shows a much weaker warming, particularly in summer (JJA, Fig. 2c).This weaker warming is not what would be expected from a simple advection by the mean flow of the warm North Atlantic anomalies (e.g.VT ′ ), and may arise due to colder polar air being advected into the region by the atmospheric circulation response to the AMV (e.g.V ′ T (Fig. 3c)).The balance between such dynamic ( V ′ T ) and thermodynamic ( VT ′ ) con- tributions to the climate response to the AMV have been examined in both observations (O'Reilly et al. 2017) and model studies (Qasmi et al. 2017) using analog reconstructions to separate the two processes.O' Reilly et al. (2017) demonstrates that the circulation response to the AMV acts to reinforce the warming of summer temperatures in Europe in observations.This difference may arise due to multiple confounders present in the observations with respect to the AMV forcing experiments, or climate model deficiencies, but highlights the potential regional sensitivities of the response to the detailed pattern of the circulation response.
North and South America show large regions of warming throughout the year-peaking in summer (JJA) in the United States (Fig. 2c), consistent with previous studies (e.g.Sutton and Hodson 2005;Ruprich-Robert et al. 2017).Over North America, the summer warming may arise as a response to the increased descent (and hence reduced cloud cover and increased insolation) in the atmosphere associated with the Gill-type response in mean sea level pressure (psl-Fig.3-see Sect.4.1.3),as described by Ruprich-Robert et al. (2018).In South America, the warming peaks in spring (SON, Fig. 2d).Warming of northeast Asia peaks in autumn (SON) and may be driven by the changes in atmospheric circulation as described in Monerie et al. (2020).
Whilst most impacts over land are warmings, the AMV drives a cooling over some land regions: northern South America, western Canada and Alaska during winter and spring (Fig. 2 a, b) and India, northern sub-Saharan Africa during summer and autumn (Fig. 2 c, d).These coolings may be due to reduced net surface shortwave radiation from increased cloud cover associated with the enhanced precipitation (northern South America, India, northern sub-Saharan Africa-see Fig. 4 and Sects.4.1.2and 5) or strong changes in circulation (western Canada and Alaska-see Fig. 3 and Sect.4.1.3).
The AMV also drives significant surface air temperature changes over the oceans beyond the North Atlantic (the same pattern is seen in SSTs (tos) SI section 2-Figure S10).The Aleutian low region in the north Pacific is warm throughout the year and the eastern and central Pacific experiences a widespread cooling, peaking in winter (DJF, Fig. 2a) and autumn (SON, Fig. 2d).This pattern of warm and cool anomalies is close to the Pacific Decadal Oscillation (PDO: Zhang and Delworth 2015) and is a common Pacific response in Atlantic-forced coupled models (Ruprich-Robert et al. 2017;McGregor et al. 2014;Kucharski et al. 2011;Li et al. 2016); we discuss this further in Sect.4.3.

Precipitation
The global precipitation response (Fig. 4) is dominated by dipolar anomalies within the tropics, which imply a northward shift in the Inter Tropical Convergence Zone (ITCZ).In the North Atlantic, the ITCZ undergoes a northward shift in all seasons.This is likely due to the enhanced heating within the northern hemisphere modifying the northward global energy transports, a modified Hadley circulation, and a consequent northward displacement of the ITCZ (see Kang et al. 2008).
This ITCZ displacement response is accompanied by an enhanced African monsoon in summer (JJA, Fig. 4c), increased precipitation in northern South America in winter (DJF, Fig. 4a) and spring (MAM, Fig. 4b), followed by reduced precipitation throughout South America during the remainder of the year.
Precipitation is reduced over North America in all seasons, with summer (JJA, Fig. 4c) and autumn (SON, Fig. 4d) showing the most widespread reductions.
Such reductions may be driven by enhanced subsidence over the region associated with the circulation response over North America (Fig. 3) as noted above, and described by Ruprich-Robert et al. (2018).There are small increases in precipitation over Europe (Fig. 4), mainly over southern and western Europe, but peaking over northern Europe and Scandinavia in summer (JJA).These increases may be the result of the circulation response driving a northward displacement of the Atlantic jet (Figs. 3 and SI section 3-Figure S12).
The AMV drives an increased Indian monsoon (JJA, SON; Fig. 4c, d), possibly driven by the increased landsea temperature contrast between the Indian Ocean and the Tibetan Plateau (Zhang et al. 2004) (Fig. 2), together with an increased rainfall over eastern China in summer (JJA, Fig. 4c) followed by a reduction in autumn (SON, Fig. 4 d).
The impacts over eastern China are consistent with the circulation response (JJA, Fig. 3c) modifying seasonal rainfall patterns and are consistent with an earlier studies using a single model (Monerie et al. 2019(Monerie et al. , 2020)).
There are widespread precipitation anomalies across the Pacific Ocean.These are predominantly characterized as a northwards shift of the ITCZ and a southward shift of the South Pacific convergence zone (SPCZ).The reduced precipitation over the South and tropical Pacific is likely related to the cooler surface temperatures over the region (Fig. 2)-most likely this is a response to increased subsidence over the Pacific (Monerie et al. 2020).We discuss the link between this displacement of the ITCZ and the Pacific cooling further in section 4.3.These widespread monsoonal changes in rainfall are examined further in detail using a precipitation budget analysis (for a single model in the ensemble) by Monerie et al. (2019).

Mean sea level pressure
Figure 3 shows the widespread global impact of the AMV on global circulation.The large scale response is a low pressure anomaly over much of the AMV forcing region in the North Atlantic and high pressure anomalies over large parts of the Pacific ocean; the most notable feature of which is the large high pressure anomaly in the Aleutian low region of the north Pacific.This feature persists throughout the year, peaking in winter (DJF, Fig. 3a) with a minimum in summer (JJA, Fig. 3c).Similar, but weaker, pressure anomalies over the north Pacific have been seen in previous atmosphereonly studies (e.g.Hodson et al. 2009;Sutton and Hodson 2007), suggesting the ocean may play a role in intensifying this response.Ruprich-Robert et al. (2017) and Lyu et al. (2017) demonstrate that that ocean atmosphere coupled feedbacks over the Tropical Pacific do enhance this AMV Pacific response There are widespread circulation impacts over land.The circulation changes over Europe arising from the low pressure response (Fig. 3) as discussed above, may partly explain the weak response in surface air temperature in the region during summer and autumn (Fig. 2c, d); the increased northerly flow bringing colder polar air in the region, partly counteracting any local warming.The large Atlantic low pressure anomaly at 40 • N during winter (DJF, Fig. 3a) leads to an equator-wards displacement of adjustment of the winter jet (SI section 2-Figure S12).Ruggieri et al. (2021) see a similar displacement of the jet in AMV-forced experiments; although they find the extratropical Atlantic circulation response is a weak North Atlantic Oscillation (NAO-) pattern, whereas Fig. 3a projects more onto the East Atlantic Ridge Pattern (Cassou et al. 2004).
Over North America the circulation response peaks in summer (JJA, Fig. 3c).This response, a low pressure anomaly extending over much of the southern US, with a maximum over the Gulf of Mexico, is characteristic of the Gill off-equatorial pressure response to off-equatorial heating (Gill 1980); where a latent heat anomaly (due to increased precipitation) north of the equator drives a stationary Rossby Wave, a surface low pressure anomaly, northwest of the heating source.Such a summertime circulation response has been seen in previous atmosphere-only studies (Sutton and Hodson 2005;Hodson et al. 2009).Whilst, the intensification of the western Pacific subtropical high (Zhou et al. 2009) near 20 • N in the Northwest Pacific during summer (JJA, Fig. 3c) is likely to drive significant changes in East Asian climate (Monerie et al. 2020).

Surface heat fluxes
The heat flux nudging used to maintain the 2AMV +∕− forcing pattern (Sect.2.3) adds heat to the top ocean model layer at each timestep.Some of this additional heat will be mixed down into the ocean layers below, but the majority will be released into the atmosphere above the Atlantic Ocean.Across the models, the AMV drives a strong surface upward latent heat flux from the Atlantic (Fig. 5), with a strong seasonal cycle and a maximum in the winter months (DJF).The net surface upward heat fluxes from all fluxes (Fig. 6-note, land values have been multiplied by 10), which also peaks in the winter months, are generally greater in magnitude than the surface latent heat fluxes, although the latent heat fluxes makes the largest contribution in most regions (SI section 4-Figure S13).
The amplitude of net upward heat fluxes over land are smaller than over the ocean, but generally consistent with the pattern of surface air temperature changes (Fig. 2); suggesting that changes in surface heat fluxes drive the surface air temperature response.However, in western Eurasia, Alaska, and eastern Canada (Fig. 2a, b), where the surface flux contribution is weak, horizontal heat flux convergences within the atmosphere must play a greater role.
Changing surface shortwave fluxes drive surface air temperature responses in some regions.The summer and autumn cooling over sub-Saharan Africa and the Indian subcontinent, together with the northern South American cooling during winter and spring (Fig. 2) are all associated with a reduction in downwelling surface shortwave (Fig. 7).Similarly, the year-round warming response of North America and the spring (SON) warming over South America (Fig. 2) are both accompanied by increased surface shortwave.Both these responses are co-located with corresponding responses in precipitation (Fig. 4) and hence the positive (negative), surface air temperature anomalies are likely driven by reduced (increased) precipitation and cloud cover, leading to an increase (decrease) in downwelling shortwave, together with reduced (increased) upward surface latent heat fluxes due to reduced (increased) soil moisture which follow the circulation response to the AMV (see Ruprich-Robert et al. 2018).

Pacific ocean response
Over much of the Pacific Ocean (especially the tropical region), there is a net downward flux into the Pacific Ocean (Fig. 6), demonstrating that the heat flux forcing from the Atlantic is largely being absorbed by the Pacific Ocean (stronger in winter (DJF), weaker in summer (JJA)), with the western Indian Ocean also contributing in winter (DJF).Comparing the surface air temperature (Fig. 2-or SST SI section 2-Figure S10) and net heat flux in the region, it is clear that the eastern and central Pacific cooling is not a direct response to net surface heat fluxes-which are generally acting to warm the ocean surface.The cooling must partly arise from the ocean response-for example, increased upwelling of colder subsurface waters, driven by changes in the surface winds (via Ekman pumping).
This widespread east Pacific cooling (Fig. 2) is a robust feature of many coupled model studies examining the impact of a warmer Atlantic (McGregor et al. 2018;Kucharski et al. 2011;Li et al. 2016;Ruprich-Robert et al. 2021;Meehl et al. 2021).Overall, the Pacific response to the AMV can be understood as follows.Warmer North Atlantic SSTs (Fig. 2) increase the latitudinal SST gradient, driving the ITCZ further north (Fig. 4).This displacement results in anomalies in convection and latent heating release aloft over the Tropical Atlantic.Such anomalies will drive changes in the Tropical Walker Circulation (Rodríguez-Fonseca et al. 2009;Kucharski et al. 2011), leading to enhanced descent, increased surface pressure (Fig. 3) and surface easterly wind anomalies over the eastern Pacific Ocean (Li et al. 2016;Meehl et al. 2021).These surface easterly wind anomalies drive surface cooling and increased Ekman upwelling of colder subsurface waters, leading to cooler SSTs (England et al. 2014;Li et al. 2016;Ruprich-Robert et al. 2021;Meehl et al. 2021).The resulting east-west temperature gradient may then be intensified further via the Bjerknes feedback (Bjerknes 1969;Wang 2018).Such a widespread Pacific cooling could be viewed as a global negative feedback to the imposed SST forcing; attempting to return the climate system to its mean state.
The Pacific cooling intensification will be accompanied by changes in subsidence and convection, which may drive mid-latitude changes.Further examination of the upper level circulation (Fig. 8) reveals a wavetrain originating from the Tropical Pacific with a path extending into the north Pacific and over North America; reminiscent of the Pacific-North American (PNA) pattern (Wallace and Gutzler 1981), and mirrored to a certain extent in the Southern Hemisphere.This upper level response is partly reflected in the surface pressure response in winter and spring (strongly over the north Pacific, weakly over Canada and Greenland-Fig.3).Similar wavetrain responses were found in both CM2.1 and CESM1 by Ruprich-Robert et al. ( 2017), but there was a notable differences in the two models responses.

Model dependence
We now examine how the AMV response varies across the five models (averaging over resolution and ensemble members)-where do they significantly disagree in their response to the AMV?We must acknowledge that our five models are not a representative subsample of all current global climate models (e.g.Knutti et al. 2013), and so our assessment here is not a precise estimate of the model uncertainty in the AMV response, but more an assessment of which regions and aspects of the response show significant model uncertainty.We examine this dependence using the A em term in the ANOVA model (Sect.3).If A em > 0 , this implies that the spread of the individual model responses around the multi-model mean response (Figs. 2, 3, 4) is greater than the internal variability ( -Eq. 3).Since there are more than two models, we examine the fraction of the total variance ( FVE A -see Appendix A ) in X emrj that is explained by A em , rather than the mean difference between 2AMV + and 2AMV − as we examined above.When this fraction is large compared to the fraction of the total variance explained ( FVE ) by the experiments ( e : 2AMV + , 2AMV − ) we can conclude that there is considerable model spread around the multi-model mean response.We therefore show the ratio of the two ( FVE A ∕FVE ) in the following figures.We plot this ratio where A em is a significant effect in the ANOVA ( p < 0.05 ).Additionally, we stipple shade where A em is a significant ( p < 0.05 ), but e is not ( p ≥ 0.05 ).This allows us to examine where the significant responses in Figs.(2,3,4) are highly consistent across models, or subject to greater model spread.The following results are also robust to subsampling (see SI section 1.2).

Surface air temperature
Figure 9 shows variation of the surface air temperature (tas) responses to the AMV across the five models as discussed above, larger values highlight where the model spread in the AMV response is large compared to the model mean response.
Over the North Atlantic, model spread is low-implying that the SST nudging methodology (Sect.2.3) is generally successful in applying a consistent SST anomaly across the models (assuming that SST and surface air temperature are closely coupled, which they appear to be (SI section 2-Figures S10)).
The tropics see significant model divergence in the response, particularly over the oceans.The Central Pacific and South Atlantic coolings responses (Fig. 2) show notable variation across models.
Over northern hemisphere land, models generally agree on the temperature (tas) response, with some exceptions over North America in winter (Fig. 9a) and Europe and northern Africa in winter and spring (Fig. 9a, b).
There is larger model spread over tropical and southern Africa, and northern South America.These may be the result of variations in the extent of the northward displacement of the ITCZ across the models in response to the AMV.This northward displacement (Fig. 4) is associated with a band of cooling (Fig. 2)-a latitudinal spread in these cooling regions across the models will result in considerable spread in the model temperatures in the affected regions.The Arctic shows considerable spread in summer and autumn, suggesting a divergent response of the seasonal sea ice melting across the models.
Part of the large spread in the Pacific and South Atlantic temperature response may be due to a larger cooling response in MPI-ESM 1.2-excluding this model from the ensemble results in a smaller model spread and reduced Pacific cooling (SI section 6-Figures S29 and S32).
Overall, the spread in temperature responses suggests the models generally agree on the northern hemisphere extratropical response to the AMV ( A em is not significant-hence the spread in the responses across models is smaller than the internal variability in 3-see Appendix Eq. 10), however there is significantly less agreement over the tropical response ( A em is significant-the spread in the responses across models is larger than the internal variability .)

Precipitation
Outside the tropics, and particularly over land, the precipitation response to the AMV is generally consistent across the models (Fig. 10).Over the US, there is some notable model spread, particularly during summer (JJA).Within the tropics, model spread is greatest over the Tropical Atlantic throughout the year.This spread extends over Sub-Saharan northern Africa in summer (JJA) and autumn (SON) (Fig. 10c, d) and throughout the year over northern South America.This is likely due to variations in the northward displacement of the ITCZ across the models (Fig. 4), a common feature in AMV studies (e.g.Sutton and Hodson 2005;Mohino et al. 2011;Zhang and Delworth 2006).As ITCZ displacements are driven by changes in the cross-equator SST gradient, model spread in the ITCZ may be driven by the spread in the South Atlantic SST response (Fig. 2).
The model spread over the Tropical Pacific and the Maritime continent, key regions of subsidence and ascent, suggest a spread in the models tropical Walker circulation responses to the AMV.Excluding the MPI-ESM1.2model from the ensemble reduces this spread, but the overall pattern of spread is similar (SI section 6-Figures S31 and  S34).

Mean sea level pressure
The local response to the AMV forcing over the North Atlantic is generally consistent between models (Fig. 11), but consistency is weaker over the Tropical Atlantic, particularly in summer and autumn (Fig. 11c & d); this may be related to the spread of the ITCZ response across the models (see above).
There is significant diversity across the models over the Maritime Continent, possibly as a result of varying ascent in the region due to a divergence in the response of the Walker Circulation to the AMV across models.
Summer (JJA, Fig. 11) is the season showing the greatest model diversity in the response, this season also shows the greatest model diversity in the circulation over the Indian Fig. 2 Seasonal Surface Air Temperature (tas) mean (averaged over all models, resolutions and ensemble members) AMV response ( 2AMV + -2AMV − ).A) Winter (Dec-Jan)).B) Spring (Mar-May).C) Summer (Jun-Aug).D) Autumn (Sep-Nov).Regions where the differ-ence is significant (see Sect. 3, Eq.3: e , p < 0.05 ) are shaded.Units: K. Contours are 0.2K.Top right of each panel: fraction of the total number of gridpoints that are significant ( p < 0.05) Ocean, which may imply a diversity in the South Asian monsoon response.

Extratropical Wavetrain
The extratropical wavetrain response in our ensemble (Fig. 8) is generally consistent across models, as shown by the ANOVA for geopotential height in Fig. 12, but there are some inter-model variations in the strength and position of the negative Canadian and positive Icelandic 500 hpa height lobes in winter.These variations may arise from the differences in the strength of the Pacific response to the AMV across the models (Fig. 9), or also differences between models in their mean state upper level flow, which may modify the path of atmospheric Rossby wave propagation from the Tropical Pacific (Scaife et al. 2017).The impact of the AMV on the Pacific in these and other AMV experiments is discussed in depth in Ruprich-Robert et al. (2021).

Impact of resolution
Finally, we examine the impact that model resolution has on the modelled response to the AMV.
As horizontal model resolution is increased, both in the atmosphere and ocean, more small-scale processes are better resolved, sharp gradients associated with frontal systems and topography are better represented; models generally begin to better represent the observed climate.Increasing atmosphere resolution improves model climate; leading to reduced tropical biases (Jung et al. 2012), better representation of northern hemisphere blocking (Berckmans et al. 2013;Schiemann et al. 2017), leading to changes in eddy feedbacks and the representation of frontal structures (which can influence the position of the jet (Czaja et al. 2019)) and leading to enhanced moisture transport from ocean to land (Demory et al. 2014).Increasing the ocean resolution also improves multiple aspects of the ocean mean state compared to observations (Hewitt et al. 2017).The improved representation of ocean eddies and resolution of topography (Hurlburt et al. 2008) Fig. 3 As Fig. 2, but for mean sea level pressure (psl).Units: Pa.Contours show the mean sea level pressure difference for all regions-contour interval: 20 Pa can lead to improved position and representation of the western boundary currents (Marzocchi et al. 2015) leading to reduced SST biases.The resulting sharper SST gradients can significantly influence the overlying atmosphere (Minobe et al. 2008;Parfitt et al. 2016).The increase of model resolution can also lead to changes in the variability of ocean models (Hodson and Sutton 2012;Jackson et al. 2020).
All these aspect demonstrate that increasing horizontal resolution in climate model studies can lead to changes in the mean climate state.For some processes, model climate may change or improve continuously with increasing resolution (Demory et al. 2014), for other processes changes may occur as a critical resolution threshold is passed after which key processes are explicitly resolved, for example atmospheric convection (Fosser et al. 2015), ocean eddies (He et al. 2018), or Rossby radius (Hewitt et al. 2016).These impacts of increasing climate on model resolution suggest that the response of the climate to the AMV may also vary with resolution.We attempt to address this question in the following section.
Experiments were performed at both high and low atmosphere resolution, but examination of Table 1 shows that comparing model resolution between models is dependent on how resolution is defined, due to the variation in grid geometries.It is also clear that there is no resolution threshold that divides the models into low and high resolution.This spread of model resolutions presents a challenge to assessing the impact of model resolution on the AMV response.
The change in the modelled AMV response due to resolution can be best expressed as a quadrature: that is, the difference between the AMV response in the higher resolution models, and the AMV response in the lower resolution models.We can then propose two hypotheses: 1. D m is proportional to, or monotonic with the change in resolution R m (=R H − R L ) 2. D m is zero unless the low and high resolution models span a critical resolution threshold, R c Fig. 4 As Fig. 2, but for precipitation (pr).Units: mm/day We can examine these hypotheses using the ANOVA test for G er (Eq. 3).If G er is not a significant factor ( p < 0.05 ), then we are unable to reject the null hypotheses that the model mean of D m is zero.This could arise because none of the D m are greater than zero (the AMV response does not change as resolution is increased), or a threshold ( R c ) is spanned by a subset of the models ( m ′ ), but the result- ing D m ′ are too small to be detected when meaned over all models.
Unlike in the previous sections, the significance of these results for resolution do appear to be somewhat sensitive to resampling (see SI section 1.3).Consequently, we only comment on the features that are robust to resampling in the following discussion.Additionally, we note that the field significance of many of these results is marginal.
Figures 13, 14 and 15 show that the impact of increasing model resolution is generally small; that is, we are unable to detect the impact of increased model resolution on the AMV response, although there are some regions where there are notable differences.
There are small significant changes in surface air temperature (tas) in the northern North Atlantic and Arctic in all seasons (Fig. 13), most notably in the Labrador Sea which sees colder (warmer) temperatures during DJF (JJA) at higher resolution, with a small cooling in the Barents Sea during MAM.These changes may arise due to resolution sensitivities in the mixed layer depth or sea ice, but could also arise from differences in the mean state of the sea ice cover in HR and LR controls-small differences in the mean sea ice extent could lead to large differences in the surface air temperature response between the resolutions.These changes are only marginally field significant, however.
The impact of resolution on the large scale circulation response is very weak and small compared to the mean response, with perhaps a slight positive mean sea level pressure response over southern South America in HR (Fig. 14 and SI section 1.3-Figure S8).But in none of the seasons are these results field significant.
The strongest impacts of increasing model resolution appears in the tropical precipitation response (Fig. 15).The AMV drives a northward displacement in the Atlantic ITCZ, represented by the dipole in Fig. 4.This northwards displacement is stronger in HR, resulting in a precipitation tripole (difference between two displaced dipoles) in the Tropical Atlantic.This tripole is strongest in summer (JJA, Fig. 15c).This could arise due to an enhanced northern Fig. 5 As Fig. 2, but for surface upward latent heat flux.Units: W∕m 2 (Note: values over land have not been multiplied by 10-unlike Fig. 6) hemisphere warming response in the HR models during summer (Fig. 13c) (e.g.Frierson et al. 2013), which could drive a more northward shift in the ITCZ and the Hadley circulation.
There is also a small increase in precipitation over the west Pacific Ocean in winter and autumn (Fig. 15a, d), consistent with an enhanced ascent and a strengthening of the tropical Walker Circulation.
Because of the difficulty of distinguishing high and low resolution models across the ensemble, as discussed above, we can also examine the impact of resolution in individual models (see SI section 5-Figures S14-S28).This analysis shows that the models generally agree on the weak impacts seen across the full ensemble, but that the MPI-ESM 1.2 model shows a much stronger impact of resolution, with warmer temperatures over the wider Atlantic subpolar gyre, and a stronger Atlantic ITCZ displacement (SI section 5-Figures S18 and S28).
Despite this diversity, we have not been able to detect a large scale change in the climate response to the AMV after an increase of horizontal resolution (Figs. 2,3,4).Small consistent impacts of increased resolution are small regional variations in surface air temperature (tas) over the Arctic together with a northward shift in the ITCZ.

Discussion
The global scale climate response to the AMV in the multimodel multi-resolution ensemble mean (Figs. 2, 3, 4) is in broad agreement with the findings of previous coupled AMV experiments (Ruprich-Robert et al. 2017;Dong et al. 2006;Levine -et al. 2018).Key features of the surface air temperature, pressure and rainfall responses identified above are also seen in these studies.This similarity suggests that the pattern of the climate response to the AMV is broadly consistent across coupled climate models, but the details and the magnitudes of the model responses may differ (McGregor et al. 2018;Kajtar et al. 2018;Ruprich-Robert et al. 2021).This climate response is also broadly similar to that found in previous atmosphere-only AMV studies (Hodson et al. 2009;Sutton and Hodson 2007;Mohino et al. 2011; Davini et al.  2006), however the response over the fixed-SST oceans in those experiments is generally weaker than seen in the coupled studies, suggesting that ocean-atmosphere coupling enhances the climate response to the AMV.
We have examined the difference between the 2AMV + and 2AMV − experiments, but does the climate respond dif- ferently to 2AMV + and 2AMV − ?In other words, how linear is the climate response to the AMV around the model climatology?We can examine this question by comparing both AMV responses to the model climatology (SI section 8-Figures S37-43).We conclude that the large-scale climate response is mostly linear-that is the 2 * AMV + − Clim and Clim − 2 * AMV − responses have the same spatial structure as the 2 * AMV + − 2 * AMV − response (Figs. 2-4).We dis- cuss this further in the SI (section 8).
Each model realization was integrated for 10 years.In our previous analysis we assumed that each of these years is statistically independent.If the climate response to the AMV forcing evolved over time (i.e.drifted), then this would be an incorrect assumption.To test this independence we can examine the influence of the year of the realization in a similar way to the influence of the models (Sect.4.4-Figs.9, 10), and ask the question: does this factor significantly affect the AMV response?Figures S44-S46 (SI section 9) demonstrate that there are only small regional impacts of this factor.In other words, the climate response to the AMV is largely constant across the 10 years of simulation (see SI section 9 for more details).

Comparison with observations
How does the modelled response to the AMV compare with estimates of the observed response?The observed climate evolved in response to multiple sources of forcings, not just the AMV, hence it is challenging to derived a robust estimate of the true observed response to the AMV.Previous observational studies have attempted this, and some show a similar European warming (Fig. 2) in observations (Gastineau and Frankignoul 2014;O'Reilly et al. 2017;Sutton and Dong 2012).The modelled circulation response (Fig. 3) is less consistent and does not show the negative NAO response seen in Gastineau and Frankignoul (2014) or Peings and Magnusdottir (2014).The pattern of the precipitation response (Fig. 4) is broadly consistent with the increase over Europe seen by O'Reilly et al. ( 2017) and Sutton and Fig. 7 As Fig. 2, but for downwelling surface shortwave (rsds).Units: W∕m 2 .Contours: 2 W∕m 2 Dong (2012), and with the observed increases over the Sahel (Folland et al. 1986;Zhang and Delworth 2006), northeast Brazil (Uvo et al. 1998;Folland et al. 2001) and the reductions over North America (Sutton and Hodson 2005;Hodson et al. 2009).
Another approach to estimate the observed forced AMV response is to follow the method used to estimate the AMV forcing patterns (https:// www.wcrp-clima te.org/ wgsip/ docum ents/ Tech-Note-1.pdf), by removing an estimate of the forced historical warming trend (from a historical forced multimodel ensemble mean-Figure 1a from Technical Note) from observed SSTs.Averaging the resulting residuals over the North Atlantic gives the (detrended) AMV index (Fig. 1b).
We can follow the same approach with any observed field: removing the estimate of the forced trend (from Technical Note Figure 1a, above) and considering the residuals as an AMV response.We can then form a composite difference from residuals between a high-AMV period (1930:1959) and a low-AMV period (1960:1959) (Fig. 1b).Figures 16 and 17 show these differences for surface air temperature (HadCRUT4-see Sect.2.4.1) and mean sea level pressure (HadSLP2-see Sect.2.4.1); the observed record of precipitation is too short for this approach.(We have multiplied these composites by 2 to aid comparison with the 2*AMV model responses previously shown.)An alternative approach is to composite based on when the AMV index (Fig. 1b) exceeds (falls below) plus (minus) one standard deviation.This approach produces similar results (SI section 7: Figures S35 and S36).Figures 16 and 2 show some consistencies between the modelled and estimated observed surface temperature response to the AMV: the warm anomaly of North America in DJF, extending to western Europe in JJA, and the cool Sahel anomaly band in JJA.Over the oceans, the cool Southern Ocean is consistent with the modelled response in most seasons, and the signal of eastern tropical Pacific cooling seen in the models is detectable in most seasons.The consistency is much lower for mean sea level pressure (psl) (Figs. 17 and 3).The greatest consistency appears in JJA, with a low pressure anomaly over North America, extending eastwards over the Atlantic to Europe.There is a hint of the modelled Aleutian high pressure response in DJF and JJA.Alternatively, we can examine the amplitude of the modelled AMV response compared to an observed estimate.This approach is motivated by the signal-to-noise paradox observed in seasonal forecasts; where the forecast amplitude of the North Atlantic Oscillation is about one-third of the observed amplitude (Scaife et al. 2014;Scaife and Smith 2018).If we project the observation residuals (i.e.detrended as above) onto the modelled response (Figs. 2 etc), we can estimate a timeseries ( (t) ) of the observed response to the AMV for any variable (see SI section 7 for further details).If the real climate responds to the AMV with the same spatial pattern as the model, then (t) would match the AMV index exactly in both shape and amplitude.SI section 7-Figures S35 and S36 show that whilst the model responses do capture some of the multidecadal variability in the observations, this is somewhat weaker and out-of-phase with the observed AMV.The latter part of the observed record (1960 onwards) is generally much better captured.This suggests that the weaker response seen in the model, compared to observations, may be due to the greater uncertainties in the earlier part of the observational record.Overall, there is some evidence that the modelled response to the AMV is weaker than the observed response, but the limited observational data makes it hard to be definitive.

AMV forcing spread
Whilst the AMV SST anomalies are consistently maintained across models (Fig. 9, Figure S11), the resulting net oceanatmosphere surface fluxes may differ across models.Examining this spread across models, the net surface latent heat fluxes released into the atmosphere over the North Atlantic AMV forcing region varies significantly across models in winter and summer, peaking in winter and autumn (Fig. 18).The spread increases when we consider the total net surface flux (Fig. 18).Hence, although the atmosphere in each model sees closely similar North Atlantic SST anomalies, there is a significant model spread in the resulting heat flux forcing of the atmosphere (perhaps due to a spread in conditions at the air-sea interface).This spread may arise from Fig. 9 Impact of model choice on the seasonal mean Surface Air Temperature (tas) AMV response.For each season we plot the fraction of the total variance in the ensemble explained by the model-experiment term ( A em , Sect. 3 eq.3) divided by the fraction of the total variance explained by the experiment term ( e ).Dark red regions hence show where the models disagree on the AMV response.Shading shows where A em is significant ( p < 0.05 ).Stippling show where A em is significant, but e is not.A Winter (Dec-Jan).B Spring (Mar-May).C Summer (Jun-Aug).D Autumn (Sep-Nov).Top right of each panel: fraction of the total number of gridpoints that are significant ( p < 0.05) model formulation differences, or perhaps differences in the climatologies across the models (for example, the extent of sea ice cover over the Arctic).Such variation in forcing may be a significant factor in the model spread in the AMV climate response across models (Figs.9-10).The model spread in surface air temperature (Fig. 9) over the Tropical Pacific appear to be partly related to this spread in AMV forcing (SI section 10-Figure S49) for part of the year.Ruprich-Robert et al. (2021) demonstrate that the latitude of the ITCZ varies across the model climatologies and that this explains a significant proportion of the spread in the Tropical Pacific response.

P-E
The AMV drives global changes in the hydrological cycle via changes in precipitation (Fig. 4).The AMV also drives changes in surface evaporation (Fig. 5).These changes result in the net surface moisture fluxes (p-e: precipitation-evaporation) (Fig. 19).Whilst the AMV drives a reduction in precipitation across the US in summer (Fig. 4c) the net impact on p-e is positive for the central and western US; due to a widespread reduction in surface evaporation (Fig. 5).In contrast, the AMV drives a strong seasonal cycle of p-e over northern South America; with increased downward moisture fluxes in DJF and net upward fluxes in JJA.The enhanced South Asian monsoon (Fig. 4c) results in a mixed net downward moisture flux, the increased rainfall being balanced somewhat by enhanced evaporation.Over Europe, there is notable seasonality, with increased downward moisture flux in winter, and a drier summer and autumn.

Summary
In broad agreement with other AMV impact studies (Sutton and Hodson 2005;Hodson et al. 2009;Dong et al. 2006;Ruprich-Robert et al. 2018;Levine -et al. 2018), we can summarize the global response to the AMV as a northward shift in tropical precipitation (hence a northward shift of the ITCZ and perhaps the Hadley cell) together with an adjustment in the tropical Walker circulation.The Hadley cell changes are likely driven by the hemispheric imbalance in heating (e.g.Kang et al. 2008) and lead to global changes in precipitation following the displacement of the ITCZ.Latent heat release over the Tropical Atlantic may then drive changes in the tropical Walker circulation as Fig. 10 As Fig. 9, but for precipitation (pr) shown by Kucharski et al. (2011).The resultant surface wind changes over the Equatorial Pacific interact with the ocean driving enhanced upwelling, via a Bjerknes feedback, leading to a widespread eastern and central Pacific cooling (Li et al. 2016).This cooling increases subsidence and reduces convection over the East and Central Pacific, driving extratropical wavetrains (Scaife et al. 2017) leading to changes in extratropical circulation over the north Pacific and Atlantic.These large scale responses lead to widespread regional changes in temperature and circulation.

Conclusions
We have examined the global climate impact of the AMV in five coupled climate models, at two groups of atmospheric horizontal resolutions: low resolution (LR: 250-100 km) and high resolution (HR: 100-50 km) for each model.We have discussed the model mean climate response and where choice of model and resolution alters this response.Our key findings are: -The AMV has a global-scale impact on climate, affecting global circulation, surface air temperature and rainfall.
The positive AMV drives: -warming over much of Eurasia, northern Africa and North and South America (Fig. 2).This is accompanied by some regional cooling over land: Alaska, northern sub-Saharan Africa and India.These changes are partly due to the advection of warm (or cold) air, partly due to changing shortwave fluxes due to changes in cloud cover.Outside the Atlantic, the AMV drives widespread cold SSTs, most notably a PDO-like cooling over the Central and eastern Pacific.This cooling is likely driven by enhanced ocean upwelling.-a global shift in the hydrological cycle, characterized by a northward shift in the ITCZ over the Atlantic and Pacific and a displacement of the African Monsoon system, accompanied by reduced rainfall over the Tropical Pacific, and increased rainfall over Asia and the Maritime Continent (Fig. 4).-global-scale changes in circulation, characterized by ascent over the Atlantic and descent over the Pacific (Fig. 3).The response peaks in summertime (JJA), but there are significant impacts on the Aleutian low during winter (DJF) and spring (MAM)-these latter drive anomalous cooling over Alaska and western Canada in winter (DJF).
-There is a global multimodel-mean AMV response across multiple variables ( e -significant when compared to internal variability), but models disagree on the magnitude of this response in some regions ( A em )-most nota- bly in the Tropics (Figs. 9 and 10).Part of this model variation may arise from the differing atmosphere heat flux forcings that result from the same SST pattern forcing pattern (Fig. 18).
-We are generally unable to detect a change in the multimodel mean responses as model resolution is increased, although the extent of the northward displacement of the ITCZ has some sensitivity to resolution (Fig. 15), moving further north at higher resolution.There is also some evidence of an enhanced tropical Walker Circulation.This does not preclude the possibility that larger changes to the AMV response exist across a specific resolution threshold within our sample, or for resolutions greater than we have sampled here.
This study suggests that resolution (in the range we have sampled here) may not be a large source of uncertainty in experimental estimates of the large-scale impact of the AMV.Model variation is likely to be a more significant source of uncertainty.Resolution may play a greater role for smaller scale processes or extremes, such as hurricanes or temperature extremes.Future studies analysis will examine these impacts in these experiments.Given the widespread nature of the impacts of the AMV seen in this study, a better understanding of these model uncertainties, combined with good estimates of the future evolution of the AMV are crucial to predict near-term global climate changes.Further future analysis of the full CMIP6 DCPP-C AMV experiment ensemble will enhance our understanding and ability to do this.

A Analysis of Variance (ANOVA)
In this section we outline the basis for Analysis of Variance (ANOVA) see Storch and Zwiers (1999, p117), Zwiers (1996) or Wilks (2019) for detailed explanations).ANOVA aims to decompose the total variance of a dataset into contributions from different factors.The significance of each contribution can then be assessed.Consider a multi-model ensemble experiment with (m = 1..M) models and (e = 1..E) experiments, where each experiment was performed (j = 1..J) times.1 Suppose we wish to examine the factors influencing Mean Sea Level Pressure (MSLP) in this ensemble.Assuming the models uses a common spatial grid, we can define MSLP at a grid point across all experiments (e), models (m) and ensemble members (j) to be X emj .We can then express X emj as a linear combination of factors: Here, is the average over all experiments, models and ensemble members: (5) e is the part of MSLP that changes between experiment, but does not change between ensemble members or models.We could consider this the true experimental response (about which individual model responses will cluster).Because of the definition of ( 6), e is constrained to satisfy: m is the part of MSLP that changes between models, but does not change between experiments or ensemble members.In other words, it is the model bias of a given model m.Again, m is constrained in the same manner as e : em is often called the interaction term-it accounts for the across-experiment differences between the ensemblemeans, once the model biases ( m ) have been accounted for.Finally, emj describes the residual noise.emj is assumed to be independent and normally distributed with a zero mean and a variance of 2 , e.g.emj ∼ N(0, 2 ) .Having proposed a linear statistical model for this variable, we can form hypotheses and construct tests.For example, is there a significant model bias between the models?In other words, do the models have a spread of MSLP climatologies that is detectable above the internal variability, emj ?We can re-frame the question as: is To answer this question we first define the Total Sum of Squares, TSS Where, • implies a mean over that index-for example: On close inspection, as the noise term emj is independently distributed, the cross terms between different j vanish, hence TSS can be decomposed as follows: where This is simply the familiar idea that the total variance is just the sum of individual sources of variance.We can then construct unbiased estimators of the terms on the right hand side of Eq. 5. Consequently, it can be shown (e.g.Storch and Zwiers 1999) that SS is an unbiased estimator of: Similarly, it can be shown that SS is an unbiased estima- tor of Therefore the ratio is an unbiased estimator of If ∑ m  2 m > 0 then this expression will be greater than 1.Formally we can then pose a null hypothesis H 0 and an alternative hypothesis H 1 : In other words, if at least one of m is not equal to zero then we can reject the null hypothesis H 0 .An F-test can be used to test H 0 by assessing whether the F-statistic (10) is signifi- cantly greater than 1 (using the F-distribution F M−1,EM(J−1) ).A significant result (e.g.p < 0.05 ) implies that the factor represented by m , in this case model bias, has a detectable effect on MSLP in this multi-model ensemble experiment.Comparing ∑ m 2 m to 2 in this way, allows us to assess whether the effects associated with m are greater than the noise, emj .Similar estimators to (10), and hence similar tests, can be found for e and em .
We can also estimate the size of the effect of a given factor on X emj in (5) by computing the fraction of the variance explained (FVE).Examination of ( 8) and ( 9) shows that: is an unbiased estimator of the variance of m .Therefore: FVE is (an unbiased estimate of) the fraction of the total variance (TSS), explained by m (see 7).Similar expressions can also be found for e and em .Furthermore, it can be shown that ANOVA is a more general form of the t-test.For example, for the simpler case of one factor (e.g.one-way ANOVA) (5) becomes, For the case of only two models (M=2-i.e. two samples) the F-statistic (10) for m is exactly equal to the square of the corresponding two-sample t-statistic, and it can be shown (by integration) that the cumulative distributions of t n and  Shaded regions are significant (two-sided t-test between the two time periods, p < 0.05 ).Units K. Top right of each panel: fraction of the total number of gridpoints that are significant ( p < 0.05) F 1,n are identical.Hence, one-way ANOVA with two treat- ments (M=2) is a two-sample t-test.Equally, we can extend this model to: where C j is a factor that is the same for each m, but varies with ensemble member j.For M = 2 , this reduces to a paired t-test-the presence of C j reduces the estimate of the noise variance ( 2 ), which reduces the denominator in (10) leading to a larger F-statistic.A paired t-test similarly eliminates C j by computing pair differences between the two samples, which results in a reduced denominator in the corresponding t-statistic, and hence a more sensitive test.In both cases a source of variance is being removed to compare the remaining variance of interest with a better estimate of the noise.
We can extend the statistical model (5) to include more factors.In this paper, we consider consider the extra factor of resolution, and hence X emrj , where r is resolution.(5) can then be extended to include all possible interactions between these factors, and hence: (12) X mj = + m + C j + mj Fig. 17 As Fig. 16, but for mean sea level pressure.Units: Pa Fig. 18 Comparison of (open symbols, left) upward surface latent heat fluxes ( 2AMV + −2AMV − ) and (black, right) net upward surface heat fluxes (latent, sensible, shortwave and longwave), both averaged over the AMV region (Fig. 1), for each season.Symbols show means over all ensemble members and resolutions for each model.Circle shows mean across all models.Grey filled circle denotes the model spread is significant from the ANOVA (e.g.A em in section 3 ( eqn.3)) Here, A em replaces em .r represents the climatol- ogy changes that occur averaged across all models when resolution is changed, irrespective of the experiment.G er represents how the experimental response changes when (13) X emrj = + e + m + r + A em + G er + Z mr + W emr + emrj resolution is changed-a key question for this paper.Z mr represents how the spread of model climatologies changes when resolution is changed, irrespective of the experiment.W emr represents how the model spread between experi- ments changes when resolution is changed.In this way, we account for all possible sources of variance, which refines and reduces the estimate of the noise emrj , allowing us to detect smaller influences of the various factors than would otherwise be the case.
Acknowledgements The Authors would like to acknowledge the use of the UKRI funded JASMIN data analysis facility which was essential to the analysis and storage of PRIMAVERA project data.Ongoing curation of project data has been supported by the IS-ENES3 project that has received funding from the European Union' Horizon 2020 research and innovation programme under Grant Agreement No. 824084.Authors DH, PM, JS, PD, YRR, CDR acknowledge funding from the PRIMAVERA project (www.prima vera-h2020.eu), funded by the European Union's Horizon 2020 programme under Grant Agreement 641727.PM was supported by U.

Fig. 6
Fig. 6 As Fig. 2, but for net upward heat flux from the surface.Values over land have been multiplied by 10 to aid comparison.Units: W∕m 2

Fig. 13
Fig. 13 Seasonal Surface Air Temperature (tas)-Impact of increasing model resolution on modelled AMV response.Each panel shows the difference between the AMV High Resolution (HR) ensemble responses ( 2AMV + -2AMV − ) and the Low Resolution (LR) responses.
em is similarly constrained by the definitions of , e and m to satisfy:

Table 2
AMV experiments-number of ensemble members for the 2AMV+ (2AMV−) experiment for each model and resolution.For each model, each realization was integrated for 10 years.We consider each year of a given realization to be independent (see SI section 9), hence combine realizations*years to give the total available number of ensemble members

Table 1
Klaver et al. 2020))lution in km for each model at low and high resolution configurations.The first number is the nominal resolution (derived from the model grid definition and threshold truncation (point 3) http:// goo.gl/ v1drZl (Appendix 2) orKlaver et al. (2020)supplementary) and the following number in brackets is the effective resolution (derived from the model grid's ability to resolve the KE spectra (seeKlaver et al. 2020)