Decadal predictions of the cooling and freshening of the North Atlantic in the 1960s and the role of ocean circulation

In the 1960s North Atlantic sea surface temperatures (SST) cooled rapidly. The magnitude of the cooling was largest in the North Atlantic subpolar gyre (SPG), and was coincident with a rapid freshening of the SPG. Here we analyze hindcasts of the 1960s North Atlantic cooling made with the UK Met Office’s decadal prediction system (DePreSys), which is initialised using observations. It is shown that DePreSys captures—with a lead time of several years—the observed cooling and freshening of the North Atlantic SPG. DePreSys also captures changes in SST over the wider North Atlantic and surface climate impacts over the wider region, such as changes in atmospheric circulation in winter and sea ice extent. We show that initialisation of an anomalously weak Atlantic Meridional Overturning Circulation (AMOC), and hence weak northward heat transport, is crucial for DePreSys to predict the magnitude of the observed cooling. Such an anomalously weak AMOC is not captured when ocean observations are not assimilated (i.e. it is not a forced response in this model). The freshening of the SPG is also dominated by ocean salt transport changes in DePreSys; in particular, the simulation of advective freshwater anomalies analogous to the Great Salinity Anomaly were key. Therefore, DePreSys suggests that ocean dynamics played an important role in the cooling of the North Atlantic in the 1960s, and that this event was predictable.


Introduction
Decadal prediction, wherein climate models are initialised from the observed climate state to improve predictions of near-term climate, has recently gained much attention Keenlyside et al. 2008;Meehl et al. 2009). One region that benefits from initialisation is the North Atlantic (Smith et al. 2010;Robson 2010;Kim et al. 2012;García-Serrano et al. 2012;Hazeleger et al. 2013). Over the twentieth century North Atlantic SST exhibited substantial multi-decadal variability (Kushnir 1994;Kerr 2000;Trenberth and Shea 2006), which has been linked to a range of climate impacts from the number of Atlantic hurricanes (Goldenberg et al. 2001;Smith et al. 2010), to the surface climate over North and South America, Europe, and the Sahel (Folland et al. 1986;Sutton and Hodson 2005;Knight et al. 2006;Zhang and Delworth 2006;Hodson et al. 2010;Kushnir et al. 2010;Müller et al. 2012;Sutton and Dong 2012). Therefore, the improvement in capturing past North Atlantic SST variability is encouraging for the prospect of issuing societally relevant predictions in the future. However, to build confidence in decadal predictions it is important to understand the reasons why the SST predictions are improved by initialisation.
An important approach for building confidence in decadal predictions is the use of specific case studies of large decadal change events to test the prediction systems. Given the short observation record, case studies enable the important mechanisms to be understood by analyzing events where the signal-to-noise is relatively high. For example, analysis of the mid-1990s rapid warming of the North Atlantic has been important for understanding that dynamical ocean changes, and a subsequent increase in the northward ocean heat transport, were key for this event to be predicted (Robson et al. 2012;Yeager et al. 2012). Such case study analysis has also proven useful for revealing that predictions successfully predict the impact of the 1990s warming on surface climate (Robson et al. 2013). Case studies have also been used to understand predictions of the 1970s shift in the Pacific (Meehl and Teng 2012), and the recent global warming hiatus (Guemas et al. 2013).
A notable event that has yet to be studied in the context of initialised decadal predictions is the cooling of the North Atlantic ocean in the 1960s (see Fig. 1). The cooling was centered in the North Atlantic SPG region (Thompson et al. 2010; Fig. 1), and was also associated with a significant freshening (Fig. 1b)-the Great Salinity Anomaly (GSA; Dickson et al. 1988). Uncertainty remains over the cause of the cooling. One hypothesis is that the SST variability is related to natural fluctuations in the ocean circulation, or a response of the circulation to the GSA (Häkkinen 1999). Such a hypothesis arises because the observed SST variability is similar to natural variability related to the Atlantic Meridional Overturning Circulation (AMOC) in models (Delworth and Mann 2000;Knight et al. 2005Knight et al. , 2006Ting et al. 2009;Hodson and Sutton 2012). However, in reality, no direct link of these Atlantic SST changes to the strength of the AMOC has been made due to a lack of direct observations of the AMOC. It is also possible that external forcings, such as volcanic (Otterå et al. 2010;Swingedouw et al. 2013) or anthropogenic aerosol (Chang et al. 2011;Booth et al. 2012) may also have contributed to the cooling of the North Atlantic SST in the 1960s. However, current models do not support the idea that external influences dominate the 1960s cooling (Terray 2012), and the importance of anthropogenic aerosols in particular remains an important matter of debate (Zhang et al. 2013).
In this study we will assess the predictability of the 1960s cooling in the UK Met Office's Decadal Prediction System (DePreSys; Smith et al. 2010), and assess the important mechanisms. The article is organized as follows; section 2 describes DePreSys and the methodology, before results are presented in Sect. 3. Finally, conclusions and discussions are presented in Sect. 4 2 Data and methods

DePreSys
This study examines the perturbed physics ensemble version of DePreSys (Smith et al. 2010), which uses nine variants of the HadCM3 model (one standard, eight with perturbed parameters) to represent some of the modelrelated uncertainty in predictions (Collins et al. 2006). The parameter perturbations introduce radiative imbalances to the models, and so flux-adjustments are used to maintain a realistic climatology for SST and surface salinity (see Collins et al. 2006 for details). Nine-member ensemble hindcasts (1 member per model version) start every November 1 from 1960 to 2005 after observed anomalies are assimilated into the ocean and atmosphere. More specifically, anomalies are computed from pre-calculated gridded datasets for the ocean (3D temperature and salinity from the Met Office Ocean analysis  and atmosphere (3D winds, 3D temperature and sea level pressure (SLP) from ERA-40 (Uppala et al. 2005) and are assimilated by relaxing the model to its own climatology plus the observed anomalies. The ocean and atmosphere climatologies are defined as 1951-2006 and 1958-2001 respectively, and are calculated from each (a) SST 1966SST :1980SST -1951SST :1965 -1960s (1951-1965) and after (1966-1980) computed from HadISST data. Stippling shows where the difference is significant at p 0.05 based on a two-tailed Student's t test. b The time series of SST (black; from HadISST) and sea surface salinity [SSS (PSU), purple; from the Met Office ocean analysis] averaged over the North Atlantic subpolar gyre (50 N-65 N, 60 W-10 W). Rolling annual and five-year means are calculated from seasonal anomalies and are shown in the thin and thick lines respectively. Note that no trends are removed from the data. model's (flux-corrected) free-running transient simulation.
Hindcasts are forced with (historical) anthropogenic and (projected) natural forcings. More specifically, total solar irradiance was assumed to follow the previous 11-year solar cycle, and any volcanic aerosol present in the initial conditions was reduced to zero with an e-folding timescale of one year (i.e. volcanic eruptions are assumed to be unpredictable and so ''future'' eruptions are not included). The sulfur cycle is simulated interactively, driven by emissions of sulfur dioxide; the direct and first indirect effect of sulphate aerosols are represented (Johns et al. 2003). After 2000, the SRES A1B forcing scenario is used. Hindcasts are integrated for 9.75 years.
A control experiment is also performed (NoAssim). NoAssim is identical to DePreSys (i.e. nine member ensembles), except the initial conditions are taken from the free-running transient simulations (using the same model variants as DePreSys), which were initialized in preindustrial conditions and forced with historical anthropogenic and natural forcings. A full description of DePreSys and NoAssim is found in Smith et al. (2010).

Analysis of hindcasts and comparison with observations
Our analysis approach follows that of Robson et al. (2012) and Robson et al. (2013); we focus on the changes predicted by hindcasts initialised shortly before the North Atlantic cooling, specifically hindcasts started in 1963-1968 1 . To assess the surface climate impact of the cooling, the difference between the first 4 years of the predictions, and the last four is calculated. Specifically, for DePreSys hindcasts started between 1963 and 1968, the difference between a variable, D, averaged over years 6-9 (D 1963À1968 t¼6À9 ) and years 1-4 (D 1963À1968 t¼1À4 ) gives the predicted in-hindcast change for the 1963-1968 DePreSys hindcasts ), where the bar shows that it is the average over all starts in the superscript, and forecast lead times (t) in the subscript.
Although DePreSys is an anomaly assimilation system, hindcasts will still tend to drift (i.e. a systematic lead-time dependent error will be present) (Robson 2010). Thus, we make the in-hindcast trend of surface climate fields (i.e. DDeP 63À68 ) relative to the in-hindcast trend of all hindcasts (i.e. DDeP ¼ DDeP 63À68 À DDeP 60À05 , where ). This removes a mean drift from the hindcasts and ensures that hindcasts are not sensitive to the definition of the climatological period. 2 Finally, the (externally) forced change can be removed by comparing to the same calculation in NoAssim (i.e. DDeP À DNoA), thus revealing the impact of initialisation.
The modelled surface climate changes are compared with the observed change in SST and sea ice concentration (SIC) from HadISST (Rayner et al. 2003), SLP from HadSLP2 (Allan and Ansell 2006) and surface air temperature and precipitation (SAT & Precip) from CRU TS3.1 (Mitchell and Jones 2005). Specifically, the difference between 1972 and 1977 minus 1964-1971 is calculated (the period which validates the 1963-1968-mean inhindcast change). Removing the difference in NoAssim from the difference in DePreSys removes the forced change. Therefore, for the comparison, an estimate of the forced change is also removed from all observations at each grid point by fitting a linear regression to the rolling 3year-average global-mean (60 S À 75 N) SST over the period 1949-2009. However, note the detrending will not remove regional variations in the forced response (e.g. Anthropogenic Aerosols (Forster et al. 2007;Booth et al. 2012). We investigated sensitivity to other detrending methods (e.g. quadratic) and to changing the endpoints and found the magnitude of the SST, SAT and SIC anomalies are sensitive to the details of detrending, but, the patterns of anomalies for this period are broadly insensitive. Note that SLP and precipitation anomalies are not sensitive to the detrending. Finally, it is worth highlighting that we should not expect the model and observations to agree perfectly since we are comparing a single realization (the real world) with the mean of many model predictions.  Robson et al. 2012) and 0-500 m S (not shown), we also note that the observational uncertainty is greater in the 1960s . Therefore, we focus on the large-scale decadal trends, which we assume are better captured by the observations than interannual variability. NoAssim hindcasts of SPG heat content also cool in the mid 1960s (see Fig. 2, the blue shaded region shows the mean and the spread of all NoAssim hindcasts), but the magnitude of the cooling is much smaller than that in the observations, or that predicted by DePreSys. NoAssim also does not predict any substantial freshening of the SPG in the 1960s (see Fig. 3). Therefore, the cooling and freshening of the SPG in the 1960s in DePreSys is not simply a forced response in this model. The assimilation of observations was key for predicting the cooling.

Predictions of the cooling and
An interesting observation from Figs. 2 and 3 is that all the DePreSys hindcasts that start in the 1960s are warmer and saltier than NoAssim. Therefore, as No-Assim could be seen as the preferred model state, is it surprising that the SPG in the 1960s DePreSys hindcasts cool and freshen? Could the evolution simply be explained by the damping of the initial anomalies by surface fluxes? The next sections of this paper will examine reasons behind the successful predictions. For simplicity, and because of the observational uncertainties associated with this period, the analysis will focus on the average of the predictions started in the mid 1960s. Specifically, we will mean together the predictions started between 1963 and 1968 for DePreSys and NoAssim separately. These averages will be referred to  1964, 1965, 1966, 1967 and 1968 respectively. g Thick lines shows the mean of all DePreSys (red) and NoAssim (blue) hindcasts that start between 1963-1968 as a function of lead-time, thin lines show the 95 % confidence interval (note that the axis are different).
as the mean-hindcasts 3 , and are shown in Figs. 2g and 3g.

Subpolar gyre heat budget
To determine the important drivers of the 1960s cooling in the DePreSys predictions a simplified heat budget analysis of the SPG is calculated, similar to that used in Robson et al. (2012). The change in annual-mean heat-content integrated over the SPG region of the North Atlantic (DE, i.e. the annual-mean difference between temperature anomalies integrated over the full-depth 4 of the Atlantic Ocean between 50 and 65 N from successive years) is Here H O is the ocean heat transport convergence [defined as the meridional heat transport (MHT) at 50 N minus the MHT at 65 N], and H A is defined as the total heat loss to the atmosphere (turbulent and radiative fluxes) integrated over the subpolar North Atlantic. Note that the terms of the energy budget are calculated from seasonal-mean data, but the heat budget is largely closed (see Fig. 4b). Figure 4a shows H O (black) and H A (orange) as a function of lead time for the mean-DePreSys (solid) and mean-NoAssim (dash) hindcasts. For DePreSys, H A is larger than H O for the the first 5 years of the hindcasts, which explains the cooling over the first 5-6 years of the hindcast (see Fig. 4b). However, for NoAssim H A is only larger than H O for years 1 and 2, after which the difference reverses and the mean-NoAssim hindcast warms slowly.
Comparing the fluxes in the mean-DePreSys hindcast to the climatological values of H O and H A (the black and orange stars, which are calculated by averaging year 1 fluxes from all NoAssim hindcasts) it appears that H O and H A (which are anomalously low and high respectively) both play a role in the 1960s cooling of the SPG in DeP-reSys (see Fig. 4a). Nonetheless, H A for DePreSys is not significantly different from that in NoAssim for the majority of the length of the hindcasts (i.e. NoAssim only lies outside the 95 % confidence interval for years 2 and 3). In contrast, the difference in H O between DePreSys and NoAssim is significant for almost the entire length of the mean-hindcast. Therefore, the initialisation and persistence of weak ocean heat transport convergence is key for DePreSys to predict a sustained cooling of the SPG in the 1960s.

Ocean heat transports
We now investigate the origin of the low ocean heat transport convergence anomalies in DePreSys. Figure 5 shows latitude-time plots of the MHT in the mean-DeP-reSys (Fig. 5a) and mean-NoAssim (Fig. 5e) hindcasts. The weak H O is largely due to weak MHT across 50 N, rather than increased MHT out of the 65 N. Therefore, in DePreSys the advection of cold Arctic waters south with the GSA (Dickson et al. 1988) is not the dominant oceanic cause of the SPG cooling in the 1960s. Instead, low MHT into the southern boundary of the SPG dominates, and appears to be associated with the initialisation of a weak AMOC (see Fig. 5b).
The attribution of low MHT to the anomalously low AMOC is confirmed when the MHT is broken down further, as in Dong and Sutton (2002), into a part that is associated with anomalous velocities (v 0 ; i.e. the anomalous volume transport) and anomalous temperature (T 0 ; i.e. advection of anomalous temperature by the mean volume transport). Figure 5c shows that v 0 , associated with a weak AMOC (Fig. 5b), dominates the total low MHT anomaly in DePreSys at the latitudes of the SPG (compare Fig. 5c and 5a, for the total MHT anomaly). Beyond year 4 the advection of anomalously cool water (i.e. T 0 ; see Fig. 5d) plays an important role in suppressing the total MHT.
Interestingly, the simulation of anomalous MHT in NoAssim is very different to that seen in DePreSys. In the first 2 years there is weak MHT at 50 N (see Fig. 5e), which accounts for the anomalously low H O in Fig. 4. However, unlike DePreSys which has a weak AMOC, NoAssim predicts a significant strengthening of the AMOC. The increase in AMOC is likely due to HadCM3's response to volcanic forcing from Agung (Iwi et al. 2012), but could be related to anthropogenic aerosols (Delworth and Dixon 2006;Menary et al. 2013). Notwithstanding why, at the latitude of the SPG the increase in volume flux (v 0 ) associated with the AMOC cancels out anomalously low MHT due to cool temperature advection and leads to a small ensemble-mean warming of the SPG in NoAssim after year 3 (see Fig. 2g).

Subsurface density anomalies
But why is the AMOC initialised weak in the mean-DeP-reSys predictions? Figure 6 shows the vertically-integrated 1,200-3,000 m density anomalies, which are important for the strength of the AMOC in this model (see Robson 2010, Figure 2.7). The 1963-1968-mean density anomaly calculated from the Met Office ocean analysis (i.e. the density anomalies assimilated into DePreSys, see Fig. 6a) shows significantly low density anomalies in the Labrador Sea. Also, positive density anomalies were initialised over much of the rest of the North Atlantic, particularly on the east side of the basin. Therefore, the initialisation of a weak AMOC in the mean-DePreSys hindcasts is consistent with the initialised density anomalies (Baehr et al. 2004;Roussenov et al. 2008;Robson 2010).
Within the first year of the DePreSys predictions (see Fig. 6b) the low density signal spreads south along the western boundary, consistent with other models (Roussenov et al. 2008;Hodson and Sutton 2012;Robson et al. 2014). As the hindcast lead-time increases, the low density  . b as a but for the AMOC [Sv]. c and d are as a but for the contribution to the transport which is due to anomalous velocity (v 0 ), or temperature (T 0 ) anomalies respectively. e-h shows the same as a-d but for NoAssim hindcasts. Dashed and dotted black lines show where the anomalies are significantly different to climatology at the p 0.05 and p 0.01 level respectively, based on a two-tailed Student's t test.

Analysis 63-68
Transients 63-68 mean-DeP yr 1 mean-DeP yr 3 mean-DeP yr 5 mean-DeP yr 7 mean-NoA yr 1 mean-NoA yr 3 mean-NoA yr 5 mean-NoA yr 7 -15 -10 -5 -3 -1 1 3 5 10 15 Fig. 6 1,200-3,000 m density anomalies in the mean-hindcasts. a The vertically-integrated density anomaly, relative to 1951-2006, calculated from the Met Office ocean analysis averaged over 1963-1968 (Units: kg m À2 ). This represents the initial condition of the mean-DePreSys hindcast. b The mean density anomaly from year 1 of the mean-DePreSys hindcast. c, d and e are the same as b but now for years 3, 5 and 7 respectively. f-g The same as a-e but for the mean-NoAssim hindcast, where f now shows the 1963-1968 mean from the transient runs used to initialise NoAssim. The dotted black lines show where the anomalies are significantly different to climatology for panel a, or from all the first years from the NoAssim hindcasts (for b-j), at the p B 0.05 level based on a two-tailed Student's t test. Note that the density climatology for the hindcasts is computed from the transient runs.

(a) (b) (c) (d) (e) (j) (i) (h) (g) (f)
Decadal predictions of the 1960s North Atlantic cooling anomalies continue to propagate southward down the western boundary, but become progressively weaker, consistent with the southward, and weakening, AMOC signal (see Fig. 5b). In contrast to DePreSys, the mean-NoAssim hindcast is not initialised with significant density anomalies (see Fig. 6f), and hence no substantial AMOC anomalies are initially present. Instead, positive density anomalies, which are temperature driven (not shown), appear down-stream of the Denmark Strait in year one (Fig. 6g). These density anomalies then intensify and propagate down the western boundary (Fig. 6h-j), consistent with a strengthening AMOC (see Fig. 5f).
Interestingly, in DePreSys, the initial low density anomalies are replaced by positive density anomalies, which are also temperature driven (not shown). After 7 years, the density anomalies in the mean-DePreSys prediction look similar to those in the mean-NoAssim prediction (compare Fig. 6e, 6j), and the initially negative AMOC anomalies are replaced by positive anomalies (Fig. 5b). However, the Met Office ocean analyses suggests that the observed low density anomalies persisted, and continued to further spread into the interior of the subpolar North Atlantic (not shown). Therefore, the difference between the analysis and the DePreSys and NoAssim predictions suggests that the model is not capturing all the correct processes at this time.

Subpolar salinity budget
In this section we now quantify the main drivers of the predicted SPG freshening in DePreSys. A simplified salt budget is calculated for the full column (surface to bottom) for the subpolar North Atlantic (50 N-65 N) similar to the heat budget in Sect. 3.3. Unfortunately, the data for the evaporation or runoff from DePreSys and NoAssim is not available. We therefore define the change in salt content as  Figure 7a shows the subpolar gyre MST convergence averaged for the mean-DePreSys and mean-NoAssim hindcasts. S O is anomalously weak in both DePreSys and NoAssim, compared to S O , indicating a freshening. However, S O is significantly weaker in the mean-DePreSys hindcast than that in the mean-NoAssim hindcast. Therefore, Fig. 7a again highlights the importance of the initialisation of the ocean in predicting the observed changes in the 1960s. Note that the assumption that MST convergence is the main driver of the SPG salt content is found to be largely reasonable, especially for DePreSys (see Fig. 7b which shows the comparison of the DS that is calculated using S O as above (solid), and from the actual salinity changes in the model (dash) for the mean-DePreSys and mean-NoAssim hindcasts).

Ocean salinity transports
We now analyze what aspect of the MST was important for DePreSys to be able to predict the freshening of the SPG. Figure 8 shows latitude-time plots of the MST anomalies. In DePreSys, there is anomalously weak MST into the southern edge of the SPG, but there is also anomalously strong MST at the northern edge (Fig. 8a). The anomalous salt transport out of the northern edge of the SPG The disagreement between solid and dash shows that ocean salt convergence is not the only driver of the salinity anomalies, but the general agreement suggests that MST dominates the change in salinity in DePreSys and NoAssim.
dominates the change in salt content accounting for 76 % of the total subpolar salt content change over the first 5 years of the integrations (not shown). Unlike the heat transport (see Fig. 4), the anomalous salt transport does not project on to the anomalous strength of the AMOC (see Fig. 8b, c) but instead is largely explained by the advection of anomalous salinity anomalies (see Fig 8d). Close inspection of the salinity advection in DePreSys shows that anomalously strong MST at northern edge of the SPG is actually due to anomalously fresh (i.e. negative) salinity anomalies propagating south over the Denmark strait (not shown), similar to what is argued have occurred in reality (Dickson et al. 1988). For the 1963-1968 DePreSys hindcasts, low salinity anomalies are initialised in the Greenland-Iceland-Norwegian Sea (not shown). Thus, the fresh salinity anomalies moving south across the Denmark strait are consistent with the advection of the initialised fresh anomalies with the mean flow. However, as the runoff or sea ice melt is not available for analysis it is difficult to fully attribute the freshening in DePreSys.
In comparison with DePreSys, the mean-NoAssim hindcast does not have a significantly strong salt export out of the northern edge of the SPG (see Fig. 8e). Therefore, NoAssim is not able to predict the freshening of the SPG. Figure 9 shows the predicted and observed climate impacts calculated as described in Sect. 2.2. As well as a cooling of the SPG heat content, DePreSys also predicts a general cooling of the wider North Atlantic SST (see Fig. 9a). The largest cooling of SSTs is in the SPG region, with SSTs cooling by $ 1 C in the west SPG. However, the Atlantic doesn't cool uniformly; warm SST anomalies are predicted in the Greenland-Iceland-Norwegian (GIN) Sea, and in the Gulf Stream extension region. Weak SAT anomalies are predicted over land, but many are not significant. These predicted changes in surface temperature are similar to what was observed (see Fig. 9b), especially the pattern of SST anomalies across the North Atlantic, but also the SAT anomalies over land (e.g. warm Northern-Europe, cold anomalies in Southern Europe, Northern Africa, and across Northern South-America), although the magnitude of the anomalies are stronger in the observations (although note that we are comparing an ensemble-mean prediction with one single realization of the real-world; thus, we can not expect total agreement).

Climate impacts
As well as predicting SST changes, DePreSys also predicts shifts in other surface variables. For example, a weak trend in annual-mean SLP towards positive North Atlantic Oscillation (NAO)-like conditions ( fig. 9c), and a  . b as a but for the AMOC [Sv]. c and d are as a but for the contribution to the transport which is due to anomalous velocity (v 0 ), or salinity (S 0 ) anomalies respectively. eh The same as a-d but for NoAssim hindcasts. Dashed and dotted black lines show where the anomalies are significantly different to climatology at the p 0.05 and p B 0.01 level respectively, based on a two-tailed Student's t test.
southward shift of the inter-tropical convergence zone (ITCZ; see fig. 9e). DePreSys also predicts a reduction of annual-mean SIC in the GIN-sea, and an increase in the Labrador sea (see fig. 9i). The predicted patterns are also somewhat similar to those observed for this period (see fig. 9d, f and j).
Although there is some agreement for annual-mean rainfall anomalies, the agreement is not as good as for surface temperature or SIC. However, HadCM3 is known to simulate the link between Atlantic SSTs and Sahel rainfall poorly (Knight et al. 2006). Thus, it is not surprising that the observed drying over Africa at this time  ( fig. 9f) is not well predicted by the DePreSys. However, further examination of the 20th Century reanalysis (Compo et al. 2011) reveals a significant drying over the tropical North Atlantic ocean in this period (not shown, but see Dunstone et al. (2013) for rainfall anomalies in the main hurricane development region). Although there is uncertainty related to precipitation in the 20th Century reanalysis, the similarity with DePreSys suggests that the 1960s cooling of the North Atlantic may have contributed to a southward shift of the ITCZ. The agreement for annual-mean SLP anomalies is also not as good as surface temperature. However, it is well known that Atmospheric surface pressure variability in the North Atlantic is dominated by the winter NAO (Hurrell 1995). Interestingly, DePreSys predicts a stronger shift in SLP for December-February (DJF), which also dominates the observed annual mean SLP trend (see fig. 9h) in the mean-DePreSys hindcast.
The similarities between the observed and predicted shift in surface climate variables suggests that some of these changes arose in response to the changes in the ocean and were predictable, as in the 1990s warming (Robson et al. 2012). Further experiments are needed to elucidate the exact reasons why DePreSys is able to capture the wider climate transition in the late 1960s.

Conclusions
This study has explored predictions of the 1960s North Atlantic cooling using the perturbed physics version of the UK Met Office Decadal Prediction System (DePreSys). By comparing DePreSys predictions started in the mid-1960s just before the cooling event (specifically 1963-1968) to predictions that do not assimilate the observed information (NoAssim), it is shown that:--The cooling and freshening of the North Atlantic SPG (SPG) in the mid to late 1960s could have been predicted in advance. However, only DePreSys predictions, i.e. those initialised using observations, are able to capture the magnitude of the changes. -In DePreSys, the sustained cooling of the SPG over 5-6 years is dominated by anomalously weak ocean heat transport convergence. Increased heat loss to the atmosphere does contribute to the cooling, but it plays a minor role. -The anomalously weak ocean heat transport convergence into the SPG in DePreSys was largely due to the initialisation of a weak ocean circulation. In particular, the initialisation of a weak AMOC was key. -The sustained freshening of the SPG in DePreSys is due to ocean salinity transports. However, unlike the ocean heat transport changes, the freshening was not due to dynamical changes, but instead due to the advection of freshwater anomalies. In particular, a large pulse of freshwater moving south over the Denmark Strait into the SPG, similar to the observed GSA (Dickson et al. 1988). -Associated with the cooling of the SPG, DePreSys also predicts significant changes in the surface climate. In particular, DePreSys predicts a cooling of SST over most of the North Atlantic, which agrees well with the observed changes. DePreSys also predicts a significant strengthening of the winter NAO and a southward shift of the inter tropical convergence zone. Finally, DeP-reSys also predicts a reduction of SIC in the Greenland-Iceland-Norwegian (GIN) seas, and an increase in the Labrador sea. Many of these features, particularly the winter NAO signal, are similar to the observed changes.
The successful predictions by DePreSys highlights the 1960s cooling of the North Atlantic as another important case study to test decadal prediction systems. Along with the successful predictions of the mid-1990s warming (Robson et al. 2012;Yeager et al. 2012), these results bring confidence in the ability of initialised decadal (or nearterm) predictions to successfully capture future shifts in the North Atlantic. The analysis also presents compelling evidence that the ocean circulation, and in particular the AMOC, played an important role in the cooling. Moreover, given that DePreSys predictions also capture many key aspects of the transition to a cool North Atlantic, the predictions suggest that a similar mechanism played a key role in the observed changes. Therefore, these experiments do not support the idea that contemporaneous changes in forcing were the primary driver of the 1960s cooling; in fact, changes in forcing from volcanic and anthropogenic aerosol played a relatively minor role in this model. Although DePreSys does successfully capture the 1960s cooling of the North Atlantic, it is important to state that it did so only after large AMOC anomalies were initialised. Whether DePreSys could have predicted the development of these large AMOC anomalies is unclear (and would be a valuable subject for future research). However, the main caveats of this study is the quality of the initial conditions used for the predictions and the quality of the model, including its response to the specified forcings. Indeed, there were far fewer observations of the sub-surface ocean in the 1960s than today . Thus, the assimilated anomalies rely heavily on the modelderived covariances to create the initial state. The aerosol effects in HadCM3, especially those involving cloud-aerosol interactions, are also rather crude (Johns et al. 2003). Therefore, we also recognize that the assimilation of observations could be correcting for model deficiencies, for example, an incorrect forced response. In particular, the difference between the predicted and observed density changes in the deep North Atlantic warrants further investigation in future work. Hence, we can not rule out the possibility that the radiative forcing that preceded the mid-1960s influenced the AMOC in reality. Given the uncertainty in the initial conditions and the forced response it would be useful to also examine hindcasts of the 1960s cooling from other decadal prediction systems.