1 Introduction

The ocean circulation, and in particular the Atlantic Meridional Overturning Circulation (AMOC), is an important driver of climate variability in the North Atlantic (Delworth et al. 1993; McCarthy et al. 2015) that can modulate regionally the global warming signal. Rapid and large decadal fluctuations in the AMOC strength emerge frequently in models as a result of internal climate variability, showing also important widespread climate impacts (Allison et al. 2014). However, most of these results refer to the model world. Our current knowledge of the real AMOC variability is rather limited due to the lack of long adequate observations. Continuous measurements available since 2004 thanks to the RAPID monitoring array (Cunningham et al. 2007), located along the 26°N latitudinal transect, report a slight decreasing trend during the last 10 years (Smeed et al. 2014). A complementary altimetry-based reconstruction of the AMOC along the OVIDE section (from Greenland to Portugal) also shows a decline in its strength between 1993 and 2010 (Mercier et al. 2015). However, both records are still too short to determine if this decadal trend emerges in response to global warming, or if it is compatible with natural ocean fluctuations.

It is therefore of paramount importance to explore different ways of reconstructing AMOC variability back in time, and use them to make inferences about its characteristic time scales and variability drivers. One can distinguish four major ways of estimating past AMOC variability. First, the use of proxy records from sedimentary sources, usually sensitive to the deep ocean currents (e.g. McManus et al. 2004; Kissel et al. 2013; Thornalley et al. 2013). These estimates are, however, localised and challenging to interpret. Second, reconstructions can be based on well-established physical equations (Duchez et al. 2014; Frajka-Williams 2015; McCarthy et al. 2015) linking AMOC variability to that of other variables more easily observed. For example, the thermal wind relationship allows to compute the meridional geostrophic transport from zonal density gradients (Marotzke et al. 1999), and constitutes the conceptual framework for the RAPID operational array. The most important shortcoming of this approach is the necessity of long and widespread observations from hardly accessible regions like the deep ocean. The third method relies on the identification of significant statistical relationships between the AMOC and other oceanic and atmospheric variables, such as the sea surface temperature and height (Latif et al. 2004; Zhang 2008; Willis 2010; Duchez et al. 2016), the deep Labrador sea water densities (Robson et al. 2014), or the cumulative effect of heat fluxes in the deep convection regions (Ortega et al. 2011). However, these covariances have been mostly established using models, which have inevitable biases that may compromise the reality of these inferred relationships. Besides, direct observations of the AMOC are still too short and can only be used to identify significant relationships at the monthly timescale (Duchez et al. 2016), which do not necessarily hold at decadal and longer timescales. Finally, climate models can also be used to constrain the past AMOC evolution, by assimilating the observed variability of relevant climate quantities (e.g. Huck et al. 2008; Swingedouw et al. 2013; Shaffrey et al. 2016). Yet, while products assimilating surface data (Reichler et al. 2012) show encouraging agreement in their representation of AMOC variability, other ocean reanalyses including subsurface data reveal large discrepancies (Karspeck et al. 2015), thus casting doubt on the efficiency of current assimilation techniques, which are not specifically designed to reconstruct AMOC variability.

This paper deals with this latter approach. It explores new ways of assimilating observations in the models and investigates their relevance to reproduce the associated AMOC variability. Such reconstructions are also essential for decadal prediction purposes as they provide the necessary initial ocean conditions for hindcasts and forecasts.

To date, different modelling groups have implemented and explored a variety of reconstruction protocols, following various techniques and model setups (Meehl et al. 2014). One standard approach is to apply a simple Newtonian relaxation (or nudging) towards one or several observational fields, as opposed to more complex assimilation techniques used, e.g., in ocean reanalyses (Balmaseda et al. 2015). In both cases, one important choice to be made is the vertical extent of the variables that are initialised—involving either surface (e.g. Ray et al. 2015; Servonnat et al. 2015) or three-dimensionnal (3D) ocean fields (e.g. Pohlmann et al. 2009; Tatebe et al. 2012). Subsurface nudging is probably more efficient in constraining the zonal and meridional density gradients which, through geostrophic balance, influence the ocean circulation (e.g. Huck et al. 2008; Sévellec and Fedorov 2013). Widespread reliable observations at these levels are nevertheless only globally available since 2000 AD, with the deployment of ARGO profiling floats. Before this year, observations of the deep ocean are subject to large uncertainties that affect the reliability of assimilation in regions of key importance for the AMOC like the Gulf Stream Extension or the Labrador Sea (Turpin et al. 2016). This loss of reliability is in line with the reported lack of consistency among ocean reanalyses regarding the temperature evolution below 500 m (Ray et al. 2015), the AMOC (Karspeck et al. 2015) and the subpolar gyre (Born et al. 2015). These limitations are a strong motivation to use surface restoring, for which quality records extend further back in time.

Reconstructing the annual AMOC variability through surface restoring relies on the assumption that the ocean interior can be initialised through a proper representation of relevant processes forced by the ocean surface, like deep ocean convection and ventilation, which show important variability at shorter timescales (monthly and seasonal). Ocean surface restoring is however sensitive to the quality of the model, and in particular to its correct simulation of internal variability. Yet, this approach has the potential to exploit the highest reliability and best temporal and spatial coverage of surface observations. Surface relaxation is classically done by adding a restoring heat flux component γT (SST model  − SST target ) to the SST conservation equation. Thus, a key choice is the strength of the relaxation term γT to be applied. While measurements of the oceanic heat flux feedback estimate γT to be close to −40 W/m2/K (Barnier et al. 1995; Frankignoul and Kestenare 2002), many nudging studies have considered substantially larger restoring terms (6–60 times larger; e.g. Luo et al. 2005; Pohlmann et al. 2009; Dunstone and Smith 2010) in order to constrain the simulated temperatures more closely to observations. However, there are several drawbacks to using large γT values. One is the constant forcing of overly high restoring heat fluxes in regions with strong model biases (|SSTmodel − SSTtarget| ≫ 0), such as in the Gulf Stream, creating spurious water masses that can interfere with the internal ocean variability. Other significant issues include the tampering with key high-frequency ocean–atmosphere interactions (Christophe Cassou pers. comm.) as well as with subsurface stratification. Some initial tests within our group also suggest that excessively large restoring coefficients can also overwhelm the salt-advection feedback first described by Stommel (1961), which is key to guarantee AMOC stability. It is therefore of great importance to explore new methodologies that could overcome or minimise these effects. Initialized experiments can additionally or alternatively assimilate salinity (Servonnat et al. 2015), sea ice (e.g. Caya et al. 2010), land soil moisture (e.g. Han et al. 2012), as well as different atmospheric fields, such as wind-stress (Ding et al. 2013; Thoma et al. 2015) or the 3D winds (Stewart and Haine 2013). Likewise, assimilation can be applied in ocean-only (e.g. Yeager et al. 2012) or atmosphere–ocean coupled model (e.g. Yang et al. 2013; Servonnat et al. 2015) configurations.

Wind-nudging alone can also potentially reconstruct ocean dynamics, and in particular the AMOC variability, both by directly reproducing the wind-driven circulation, and also indirectly via the wind contribution to turbulent fluxes (Large and Yeager 2009). Thoma et al. (2015) have shown that wind-stress anomaly assimilation in partially-coupled simulations is able to reconstruct most of the observed AMOC variability at 26°N in the 2005–2012 period, although it is not clear if the skill goes beyond the reconstruction of the Ekman transport component. Matei et al. (2012) also report multiyear predictive skill for the monthly AMOC at 26°N in a set of experiments with the ECHAM5/MPI-OM, initialized from an ensemble of MPI-OM ocean-only simulations forced with reanalyzed atmospheric data. However, the validity of these results has been called into question, given the limited forecast evaluation period considered and the inclusion of the annual cycle in their assessment (Vecchi et al. 2012). Indeed, robust forecast skill estimates would require of long reference observational records that are still not available.

A first step to evaluate the best methods to reconstruct past AMOC variations is the use of perfect model frameworks (Boer 2004). By using the models as a surrogate for the real world, where the climate system is perfectly known and therefore observational limitations do not apply, it is possible to test the potential predictability of the AMOC and the performance of different initialization techniques in a longer term perspective. For example, a perfect model study with the IPSL-CM5A-LR model suggests that most of the simulated AMOC variability in a target control simulation can be correctly reconstructed when both sea surface temperature (SST) and salinity (SSS) nudging are applied (Servonnat et al. 2015), despite issues in reproducing the amplitude of the largest AMOC variations. Likewise, Zhang et al. (2010) have shown that the ability to properly characterise the AMOC variability is substantially improved when both temperature and salinity are assimilated down to 2000 m.

Other perfect model studies (Dunstone and Smith 2010; Counillon et al. 2014) also suggest potential predictability of the AMOC variability for lead times of up to a decade, after initializing the ocean through subsurface temperature and salinity assimilation, or via SST-only assimilation. AMOC predictability has been shown to depend on the initial ocean state (Msadek et al. 2010), with better skill generally associated with hindcasts starting from extreme (maxima and minima) AMOC states (Collins et al. 2006; Persechino et al. 2013). This conditional skill, estimated in studies that assume perfectly known initial conditions, can be only achieved in the historical context if the initial ocean state is well constrained, in particular during the extreme AMOC states.

Using a perfect model approach, this study explores the ability of different reconstruction techniques to reproduce the simulated AMOC variability in a target period of a preindustrial control simulation, specifically selected to include an extreme positive AMOC event. Because of its potential to produce longer reconstructions, only surface nudging is used and several types of restoring coefficients are explored. This study also addresses the sensitivity of the results to the starting conditions used in the nudged experiments. The paper is organised as follows: the model, methodology and experimental setup are described in Sect. 2. Section 3 analyses the performance of the standard reconstruction techniques, and Sect. 4 investigates the reasons for the extreme AMOC peak, and its underestimation in the associated nudged experiments. Section 5 builds on the previous findings to propose and validate a novel nudging approach, based on a variable restoring coefficient that is proportional to the mixed layer depth. This new technique overcomes the major limitations of classic nudging approaches and emerges as an ideal compromise between weak and strong restoring terms. Finally, the major findings of the study are summarized and discussed in Sect. 6.

2 Model and experimental setup

2.1 Model configuration

This study is based on the Institut Pierre Simon Laplace coupled model IPSL-CM5A-LR, a standard CMIP5 configuration presented and described in details in Dufresne et al. (2013). This is the same model version used in previous perfect model studies focused both on surface nudging techniques for ocean initialisation (Servonnat et al. 2015) and potential predictability of the AMOC (Persechino et al. 2013), and has been extensively used in other studies analysing mechanisms of natural decadal variability in the North Atlantic (Escudier et al. 2013; Ortega et al. 2015), as well as evaluating historical decadal prediction skill in the ocean through surface nudging (Swingedouw et al. 2013; Mignot et al. 2015). Using IPSL-CM5A-LR thus allows us to benefit from this in-depth expertise of the model, and also from its relatively cheap computational cost, ideal to run several sets of ensemble experiments. The model is composed of the atmospheric component LMDZ5A (Hourdin et al. 2013) at the CMIP5 96 × 96 resolution (i.e. 3.75° × 1.875°), and the oceanic component NEMOv3.2 (Nucleus of European Modelling of the Ocean, Madec 2008), itself composed of the ocean model OPA9 and the sea ice component LIM2 (Fichefet and Maqueda 1997), both using an ORCA2.0 tripolar grid that has an averaged horizontal resolution of 2° × 2° refined to 0.5° in the tropics. The respective vertical resolution corresponds to 39 levels in the atmosphere and 31 in the ocean. Other components also included in IPSL-CM5A-LR are the land-surface model ORCHIDEE (Krinner et al. 2005), and the biogeochemistry module PISCES (Aumont and Bopp 2006), which is integrated in NEMO. Various studies describing the different components and several aspects of the model performance are gathered in a special issue of Climate Dynamics (http://link.springer.com/journal/382/40/9/), including an assessment of the oceanic module in the coupled model configuration (Mignot et al. 2013).

2.2 Methodology and experimental protocol

Following the Servonnat et al. (2015) approach, this analysis is carried out in the framework of the 1000-year CMIP5 preindustrial control simulation, described and analysed in Escudier et al. (2013). A subset of this simulation (CTRL from now on)—characterised by the occurrence of an extreme AMOC maximum (in the simulation year 2070)—constitutes the target trajectory and provides the surface boundary conditions (for all the model components) used to drive the nudging experiments in the present perfect model study. Sensitivity to the initial conditions is explored by considering three different initial states, corresponding to weak, moderate and strong AMOC conditions (i.e. 5.8, 9.5 and 13.4 Sv at 48°N, respectively) chosen outside of the target period in the control simulation. An estimate of the best potential initialisation achievable by each nudging technique is explored in additional experiments using exact ocean starting conditions from CTRL, representing therefore the correct initial state for each of the relevant AMOC precursors (see Fig. 1, and Sect. 4 for more details). These simulations are expected to follow a different trajectory than CTRL because they were run in a different machine with different round-off errors, and also because of the restoring terms introduced by the nudging. The bulk of the experiments is started 15 years before the AMOC target maximum, to provide sufficient time for these AMOC precursors to be initialised. Three standard surface nudging protocols are implemented and analysed.

Fig. 1
figure 1

Schematic of the AMOC precursors in the ISPL-CM5A-LR model, as identified in Escudier et al. (2013) and Ortega et al. (2015). The estimated leading times are indicated in the arrows

The first initialization approach (Wind15P) is based solely on the prescription of the wind-stress components τx and τy that are seen by the ocean, each of them according to the formula:

$$\tau = (1 - \alpha )\tau _{{{\text{model}}}} + \alpha \times \tau _{{{\text{target}}}} ,$$
(1)

where τmodel is the simulated wind-stress in the nudged experiment, τtarget the wind-stress in the target CTRL period and α is the restoring coefficient, whose value is 1 poleward of 20°N/S, zero between 10°S and 10°N, and follows a smooth step-like function in the 10°S–20°S and 10°N–20°N latitudinal bands. The tropical band is excluded to avoid interfering with the strong tropical air–sea coupling. This restoring approach is expected to reconstruct the AMOC by representing the wind-driven circulation and its effect on the transport of (i.e. climatological) sea surface temperature and salinity. The key AMOC precursor in this case is the East Greenland Current, which has an important Ekman-driven contribution in IPSL-CM5A-LR (Escudier et al. 2013, Fig. 1).

The second methodology (SSTS15P) corresponds to the nudging strategy showing the best performance in the analysis of Servonnat et al. (2015) in terms of AMOC reconstruction. It consists of applying a joint SST and SSS restoring in the sea-ice free regions. This regional constraint avoids problems related to the occurrence of spurious buoyancy forcing and convection near the sea-ice border. A standard nudging approach is applied on a daily basis by adding a Haney fixed restoring term γfix to the conservation of heat and salt, as described by the following equations:

$$\frac{{\partial SST}}{{\partial t}} = \cdots + \frac{{\gamma _{T} }}{{\rho _{0} C_{p} h}}(SST_{{{\text{model}}}} - SST_{{{\text{target}}}} ),$$
(2)
$$\frac{{\partial SSS}}{{\partial t}} = \cdots + \frac{{\gamma _{S} }}{h}(SSS_{{{\text{model}}}} - SSS_{{{\text{target}}}} ),$$
(3)

where ρ 0 is the mean ocean density, c p the specific heat of the ocean, h the depth of the associated surface layer, and γT and γS the restoring coefficients controlling the strength of the nudging in temperature and salinity. SSTtarget and SSStarget are linearly interpolated at the daily resolution from the target monthly-resolved CTRL data. As in our previous studies (Swingedouw et al. 2013; Servonnat et al. 2015), these restoring terms are fixed respectively to −40 W/m2/K and −864 mm/day, two estimates corresponding both to a relaxation timescale of about 60 days over a 50 m deep mixed layer. While the first is based on the observed temperature/heat flux relationships, the second has no physical basis and is chosen to yield the same relaxation timescale as in the first. For further discussion on the magnitude of the restoring coefficient, please refer to Servonnat et al. (2015). This nudging approach is expected to be more effective than Wind15P in initialising the AMOC, as the combined restoring of both SSS and SST enables a direct representation of the target North Atlantic deep convection region (Servonnat et al. 2015), which is largely driven in the model by SSS changes in the Labrador Sea (Fig. 1). Finally, the third set of nudging experiments includes the prescription of wind-stress fields combined with the restoring in SST and SSS (SSTSWind15P).

As indicated above, an ensemble of four experiments is performed for each of these approaches, using different initial conditions. Several additional experiments are conducted beyond this basic protocol in order to assess the consistency of our results and identify potential performance improvements. These include longer simulations, which start 25 years before the occurrence of the peak to provide a longer time for the deeper ocean to be initialised. We also develop a novel nudging strategy based on a variable restoring term γvar, detailed in Sect. 5. The complete set of model experiments performed and analysed in this study is summarised in Table 1.

Table 1 Summary of the different families of nudging experiments

2.3 Assessment of nudging performance

Several metrics are used throughout the analysis to assess different aspects of the performance of the nudging techniques. For any given variable X, the average distance to the target CTRL variability is evaluated with the ensemble mean absolute error (AE), which is computed as follows:

$$AE(X)=\frac{1}{m}\underset{i=1}{\overset{m}{\mathop \sum }}\,\left| {{X}_{CTRL}}-{{X}_{i}} \right|,$$
(4)

where i is an index that goes through the four different ensemble members.

Also, the sensitivity to the initial ste is explored using the mean intra-ensemble distance (IED), which averages the mean distance between any two ensemble members i, j as described in the following equation:

$$IED(X)=\frac{1}{12}\underset{i=1}{\overset{4}{\mathop \sum }}\,\underset{\begin{matrix} j=1 \\ j\ne i \\\end{matrix}}{\overset{4}{\mathop \sum }}\,\left| {{X}_{j}}-{{X}_{i}} \right|~.$$
(5)

To highlight extreme variability in the different indices, most figures include the thresholds of the 5th and 95th percentiles from the 1000-year long control simulation (e.g. lower and upper horizontal dashed lines in Fig. 2). Values exceeding these thresholds will be referred to as extreme. Besides, in some cases, the extreme nature of the peak in CTRL and the nudged simulations will be quantified as the number of standard deviations away from the 1000-year control climatological mean (Table 2).

Fig. 2
figure 2

Evolution of the maximum AMOC value at 48°N after the Ekman component is removed (MOI-48N) in CTRL and three ensembles of nudged simulations: a WindOnly, b SurfaceTS and c SurfaceTSWind. Each ensemble consists of four individual members initialized from weak (blue), moderate (green), strong (orange) and exact (dotted red) AMOC starting conditions taken from CTRL. Horizontal dashed lines mark the climatological mean and the associated 5th and 95th percentiles of CTRL. Grey squares at the bottom of each panel highlight the centre of a 11-year long sliding window in which CTRL and the ensemble mean of the four experiments correlate significantly at the 95% confidence level

Table 2 Summary of magnitude and timing of the peaks in the ocean precursors related to the extreme maximum in MOI-48N

3 Performance of the standard nudging techniques

The reference meridional overturning index to evaluate AMOC initialisation in the different nudged experiments is defined as the annual local vertical maximum of the AMOC streamfunction at 48°N (MOI-48N; Fig. 1). This corresponds to the latitude where the full AMOC streamfunction and the northward thermal-wind transport reach their climatological maximum in the model (Ortega et al. 2015). Since our interest lies in the thermohaline-driven component of the AMOC, the Ekman transport signal is removed from the MOI-48N index.

Figure 2 shows how the MOI-48N is reconstructed by the different standard nudging techniques. Wind-stress alone is insufficient to reproduce the main positive peak, at least consistently and at the right time, although all the Wind15P individual experiments do represent the second peak in year 2090. The poor representation of the first maximum is confirmed by the large AE values in Fig. 3a during the decade centered in the peak. These errors are slightly reduced in the SSTS15P ensemble, which also better represents the timing of the peak (Fig. 2). The amplitude is nevertheless still clearly underestimated. In both Wind15P and SSTS15P ensembles, the initial state has an impact on the reconstruction. For example, in SSTS15P, there is a stark contrast between the experiments started from weak and moderate AMOC conditions, the first showing a clear MOI-48N maximum that crosses the 95th percentile, and the second missing completely the peak. The same occurs when comparing the experiments started from moderate and strong AMOC conditions in Wind15P. Interestingly, the first peak is also misrepresented by the experiments using exact initial conditions, thereby highlighting important limitations in these standard nudging approaches to capture the extreme nature of the peak.

Fig. 3
figure 3

a Evolution of the ensemble mean absolute error (AE) in the three nudged sets of experiments: Wind15P, SSTS15P and WindSSTS15P. Each ensemble is composed of four individual experiments initialized from weak, moderate strong and exact AMOC starting conditions. b Accumulated ensemble mean absolute errors. c, d Same as in a, b but for the mean intra-ensemble distance (IED)

Adding wind-stress to SST and SSS nudging does not lead to any improvement regarding the amplitude of the first peak (Fig. 2c), but the sensitivity to initial AMOC conditions appears to be substantially reduced, most notably from year 2080 onwards (Fig. 3c). Also, this is the only configuration for which, as one would expect, the experiment starting from exact ocean conditions reproduces the AMOC peak more accurately (in terms of timing and magnitude, Fig. 2). This, together with the reduced IED in the SSTSWind15P ensemble, suggests that this combined technique is the less sensitive to the initial state. The best overall performance of SSTSWind15P is confirmed when the different metrics in Fig. 3 are accumulated in time. As compared to the other approaches, this combined nudging technique reduces by at least 20% the ensemble mean AE and by 30% the mean IED (Fig. 3d) after 45 years of integration. SSTSWind15P thus constitutes our baseline strategy for the new developments in Sect. 5, implemented to improve the representation of the MOI-48N extreme. But first, next section explores the causes of this extreme event and the reasons for its misrepresentation in the standard nudging experiments.

4 Causes of the AMOC peak and its misrepresentation

To understand why the AMOC extreme occurs and why it is not well captured by the different nudging approaches, it is important to quantify the contribution of the key precursors of AMOC variability identified in IPSL-CM5A-LR (Fig. 1) and investigate how they are initialised. We first consider the variability mechanism occurring in the upper layers of the subpolar North Atlantic described by Escudier et al. (2013). It relates to an advective mode in which Labrador salinity leads by 5 years the changes in deep convection south of Iceland (referred to as the southern convection region), and this controls the AMOC changes 6–7 years ahead. Further interactions with arctic sea-ice and sea level pressure impact the intensity of the East Greenland Current, which, 3 years later, has a negative feedback on Labrador salinity. In principle, all these surface variables can be, at least to some extent, directly reconstructed by the surface nudging, as investigated in Fig. 4. To facilitate the comparison among the different sets of nudged experiments, only the envelope and the mean of the respective ensembles are shown.

Fig. 4
figure 4

a Evolution of the MOI-48N in CTRL and the nudged ensembles Wind15P, SSTS15P and SSTSWind15P. Colored envelops represent the total ensemble spread, and thick colored lines the associated ensemble mean. Horizontal dashed lines mark the climatological mean and the associated 5th and 95th percentiles of the CTRL simulation. Symbols at the bottom of each panel highlight the centre of 11-year long sliding windows in which the free and the nudged simulation correlate significantly at the 95% confidence level. bd Same as in a but for the East Greenland Current, the Labrador Sea Surface Salinity and the mixed layer depth in the Southern Convection region, respectively

A relatively extreme minimum of the EGC strength (i.e. just below the 5th percentile from the complete 1000-year long CTRL, cf. Table 2) is detected in CTRL 15 years before the peak (Fig. 4b), its magnitude being underestimated in the three nudging ensembles. The comparatively improved representation of the magnitude of other subsequent extrema (e.g. in 2066 and 2074) suggests that this issue is likely caused by the early occurrence of the EGC minimum, preventing most of the thermohaline contribution to the EGC to be properly reconstructed. Consistently with the surface mode described above, 3 years later there is a maximum in Labrador SSS in CTRL (Fig. 4c), which is well captured by the two ensembles initialising this variable (both crossing the 95th percentile), and largely underestimated in the Wind15P ensemble. This again shows that nudging wind stress alone is insufficient to represent specific thermohaline events. It is also important to note that even if both the EGC minimum (in 2056) and Labrador SSS maximum (in 2060) are respectively below/above the 5th/95th percentiles in CTRL, they are clearly not as extreme as the maximum of the MOI-48N (in 2070, see Table 2). The exceptionality of this maximum, representing 3.1 standard deviations above the MOI-48N mean, can neither be explained by the previous indices, nor by the peak in the mixed layer depth (MLD) south of Iceland observed 6 years before (Fig. 4d). Note that the MLD maximum is reproduced in the three nudging ensembles, although with different degrees of accuracy (the peak is underestimated by 7, 20 and 35% in the SSTSWind15P, SSTS15P and Wind15P ensemble means, respectively).

In order to understand the extreme nature of the MOI-48N peak it is essential to analyse the deeper ocean. Indeed, Ortega et al. (2015) showed that the surface mechanism in IPSL-CM5A-LR is coupled to a subsurface basin mode that helps to explain how the previous convection changes impact the AMOC. Positive density anomalies formed during convection in the top 2000 m induce a geostrophic response, characterized by a cyclonic flow that leads to a westward propagation of the density anomaly resulting from its interaction with the mean meridional density gradient. In this framework, the 6–7 year lead-time between convection and the AMOC corresponds to the transit time of these density anomalies to reach the western Atlantic margin, where they contribute to establish a zonal density gradient that enhances the AMOC through thermal wind balance. This zonal gradient is characterised by positive density anomalies in 300–3000 m at the western boundary, and concomitant negative density anomalies in 300–1000 m at the eastern boundary. We now analyse the reconstruction of these relevant subsurface quantities (Fig. 5).

Fig. 5
figure 5

Same as in Fig. 4 but for a selection of indices at depth: a, b the vertically averaged top 1000 and 1000–2000 m densities over the region of convection south of Iceland, respectively, c the vertically averaged 300–3000 m densities in the western Atlantic boundary at 48°N, and d the vertically averaged 300–1000 m densities in the eastern Atlantic boundary at 48°N

In the southern convection region, the top 1000 m density anomalies in CTRL (Fig. 5a) exhibit a marked positive peak 1 year after the mixed layer is deepest, although its magnitude is less extreme than for the MOI-48N maximum (Table 2). Interestingly, a comparable extreme (i.e. 3 standard deviations above the mean) is found at deeper levels (1000–2000 m, Fig. 5b), suggesting that the characteristics of the newly formed deep waters are more important than the depth of convection itself (reasonably well represented in Fig. 4d) to explain the exceptional AMOC peak in 2070. Following the basin-mode mechanism in Ortega et al. (2015), these dense anomalies (27.92 kg/m3 during the peak) propagate westward and penetrate slightly downwards, leading to a extreme 300–3000 m density peak at the western boundary in 2071 (Fig. 5c). This is preceded by the arrival of relatively light waters at the eastern side in 2069 (Fig. 5d; Table 2). Thus, these concomitant signals contribute to the occurrence of the MOI-48N maximum in 2070 (Fig. 5c). In terms of exceptionality, the peak is explained by the contribution of particularly dense waters (27.95 kg/m3, 2.9 SDs above the mean) in the deepest levels (i.e. 2000–3000 m) of the western margin (Table 2).

We now investigate the conditions leading to the exceptional dense water formation in the region of convection 5 years before the AMOC peak. Coinciding with the appearance of the dense anomalies, we identify four severe winters with remarkably negative anomalous heat fluxes (i.e. the atmosphere cooling the ocean; Fig. 6a), all below the 5th percentile from the control simulation. Their cumulative cooling effect comes on top of the arrival of positive salinity anomalies from the Labrador Sea (Fig. 4c) associated with the so-called surface mechanism. Both SST and SSS anomalies add up to produce dense anomalies and enhance convection by reducing the ocean stratification.

Fig. 6
figure 6

a Deseasonalized monthly heat flux anomalies over the southern convection region in CTRL. Horizontal dashed lines set the 5th and 95th percentiles of the heat fluxes in the full 1000-years CTRL. The vertical grey box highlights four consecutive winters (i.e. 2063–2066) with heat fluxes below the 5th percentile. b 48-Month running means of the anomalous heat fluxes in CTRL and two SSTSWind15P and experiments starting from moderate and exact MOC ocean conditions, respectively. Anomalies are computed over the same convection region, and refer to the long-term climatological mean in the full 1000-year CTRL. Correlation coefficients between the nudged experiments and CTRL are shown in the legend. c Same as b but for the SSTSWind15Pvg experiments

The reconstruction of the key anomalous heat fluxes identified above, whose origin in CTRL is presumably stochastic, is explored in a subset of the nudged experiments. Note that, in our experimental setup, heat fluxes can only be constrained indirectly through the nudging of surface temperature. For simplicity, this analysis is exclusively focused on the SSTSWind15P approach, which shows the best overall performance. Only two of the individual experiments are considered, the one started from a moderate AMOC state (as it represents starting conditions leading to a complete misrepresentation of the AMOC peak), and the experiment using exact initial conditions from CTRL (which reproduces most accurately the peak, despite the underestimation). For both cases it is evident that the evolution of heat fluxes from CTRL is not well constrained (Fig. 6b). This is particularly true for the moderate AMOC initial conditions, which show positive instead of negative heat fluxes at the time when the dense anomalies are formed. By contrast, the simulation using the exact initial state does reproduce some part of the cooling (about 25% of the total anomaly), but not the correct intensity.

To quantify the effect of the anomalous heat fluxes on dense water formation, we perform a water mass transformation analysis for the wider North Atlantic sector (20–80°N and 80°W–40°E), following Walin (1982) and Speer and Tziperman (1992). Similarly to Servonnat et al. (2015), we concentrate on the diabatic contributions of surface buoyancy forcing during the 4 years of extreme negative heat fluxes. As expected, CTRL shows a substantial increase in the formation of the densest waters (28.0 kg/m3) during this period as compared to climatology (black line vs filled grey contour in Fig. 7a). This increase is somewhat reproduced in the SSTSWind15P simulation with exact initial conditions (blue line), but for slightly lighter waters (27.95 kg/m3). This misrepresentation mostly comes from differences in the contribution of heat fluxes, with freshwater fluxes playing a minor role (Fig. 7b).

Fig. 7
figure 7

a Water mass transformation (in Sv) in the North Atlantic during the period comprising the four severe winters highlighted with grey boxes in Fig. 6. The black line represents water mass transformation in CTRL, while the dotted blue and magenta lines correspond respectively to SSTSWind15P and SSTSWind15Pvg experiments starting from the exact initial conditions in CTRL. Filled grey contours mark the 95th percentile of water mass transformation in all the possible 4-year running windows in the full 1000-years control simulation. b Difference in the mean water mass transformation with respect to CTRL due to freshwater (solid lines) and heat (dashed lines) fluxes, during the same 4 years

Besides the important influence of surface fluxes, convection is also highly sensitive to ocean stratification. Particularly stratified conditions can prevent deep convection even if anomalous cooling occurs at the surface. Thus, the additional effect of surface salinity is essential to destabilize the water column (Swingedouw et al. 2007). One drawback of surface nudging techniques is that they require several years to initialise the subsurface of the ocean (Servonnat et al. 2015) and thus reproduce the correct stratification. This can be a potential issue in the previous ensembles of experiments, which only begin 15 years prior to the AMOC peak (i.e. 8 before the convection peak). To investigate this, additional experiments have been performed starting the nudging procedure 25 (instead of 15) years before the peak. Figure 8a shows that, overall, the vertical density profile in these new SSTSWind25P (as compared to the previous SSTSWind15P) simulations is now better constrained (i.e. closer to the target CTRL) during the four consecutive severe winters, in particular for the experiment started from adverse initial conditions (dashed lines). Since there is more time to initialise the deeper ocean, the representation of the peak in these longer simulations is also less sensitive to the choice of initial conditions (blue lines in Fig. 9a, b). However, the extreme nature of the peak is still misrepresented due to the underestimated influence of the anomalously cold heat fluxes (not shown).

Fig. 8
figure 8

Vertical profile of the mean density anomaly (sigma2) in the southern convection region during the period 2063–2066 in CTRL and different sets of nudged experiments: a SSTSWind15P and SSTSWind25P, b SSTSWind15P and SSTSWind15Pvg, and c SSTSWind25P and SSTSWind25Pvg. Full (dotted) colored lines correspond to experiments starting from moderate MOC (exact CTRL) ocean conditions. The black dashed lines represent to the long-term control mean and the thick black solid lines the CTRL mean during the selected period 2063–2066

Fig. 9
figure 9

Evolution of the MOI-48N-noek index in CTRL (thick black line) and both SSTSWind15P (blue lines) and SSTSWind15Pvg (magenta lines) experiments. Colored (dotted) lines account for experiments starting from moderate (exact) initial conditions. Horizontal grey dashed lines indicate the climatological mean and the 5th and 95th percentiles of the 1000-years CTRL. b, c Same as a but for the SSTSwind25P/SSTSwind25Pvg and SSTS25P/SSTS25Pvg experiments, respectively

The underlying problem for this underestimation is that during deep convection events the restoring coefficient of −40 W/m2/K is too weak to correctly represent the ocean adjustment to these relatively fast atmospheric influences over the whole mixed layer depth. The next section presents a novel approach to address this issue while limiting the detrimental effects associated with the application of stronger restoring terms everywhere.

5 An improved surface nudging strategy for the AMOC

Previous sections have highlighted the importance of accurately reproducing the deep density anomalies to produce the extreme AMOC strength. As noted above, imposing the correct surface fluxes is one of the key aspects to improve. Winter deep convection typically occurs on a timescale of 2 months and involves vertical mixing at great depths (~2000 m). However, the restoring coefficients previously applied (γT = −40 W/m2/K, γS = −864 W/m2/K) are equivalent to a relaxation timescale of 60 days over a much shallower ocean layer of 50 m. Indeed, extending the effect of restoring to the top 1000 m of the ocean for the same relaxation timescale would require increasing γT from −40 to −1600 W/m2/K. Applying this strong γ everywhere would most probably improve the representation of water mass transformation in some regions of the North Atlantic, but at the expense of introducing overly large correction terms in other regions where the typical mixed layer is shallower, with spurious consequences for the ocean stratification and air-sea interactions.

To overcome this limitation, we introduce a restoring term variable in time and space (γvar) defined to be proportional to the mixed layer depth. In this way, the restoring strength is effectively only increased in the regions and during the months of convection, minimising the impact of unphysical correction terms elsewhere and/or in other times of the year. To make this approach directly comparable to the one based on a fixed restoring term, γvar is defined so that the relaxation timescale coincides with that in the γfix experiments. Also, |γfix| is set as the minimum absolute value that γvar can achieve (i.e. 40 W/m2/K for temperature and 864 for salinity) to guarantee some effective nudging in the regions where the mixed layer depth is very shallow.

By considering this latter condition, and equating the respective heat flux terms in Eq. (2) (and the freshwater terms in Eq. 3), we get:

$$\frac{{{\gamma }^{fix}}~(SST-SS{{T}_{ref}})}{{{\rho }_{0}}{{C}_{p}}{{h}_{0}}}=\frac{{{\gamma }^{\operatorname{var}}}~(SST-SS{{T}_{ref}})}{{{\rho }_{0}}{{C}_{p}}MLD},$$
(6)
$$\frac{{{\gamma }^{fix}}~(SSS-SS{{S}_{ref}})}{{{h}_{0}}}=\frac{{{\gamma }^{\operatorname{var}}}~(SSS-SS{{S}_{ref}})}{MLD},$$
(7)

from where it follows that,

$${{\gamma }^{\operatorname{var}}}=\max \left( {{\gamma }^{fix}},{{\gamma }^{fix}}\frac{MLD}{{{h}_{0}}} \right),$$
(8)

where h 0 is 50 m—the depth of the surface layer in the standard nudging approach—and MLD the mixed layer depth in each particular grid-point. Note that this expression (Eq. 8) is valid both for the SST and SSS nudging. No changes are introduced to the wind-stress restoring approach.

This new nudging technique is applied for three different experimental setups (Table 1). First, under an equivalent configuration to SSTSWind15P, but using the variable gamma instead (SSTS15WindPvg). The second setup follows again the same procedure, but starts the simulations 25 years prior to the peak in MOI-48N (SSTSWind25Pvg), thus providing a longer time to initialise the deep ocean. Finally, in the third approach, the wind-stress restoring is removed (SSTS25Pvg), to find out if this has a detrimental effect on the final performance. The three different approaches are compared with analogous experiments using γfix. For each experiment family, two simulations are considered, corresponding to the moderate AMOC state and exact initial conditions from CTRL, which represent adverse and favourable conditions for the initialisation of the MOI-48N peak, respectively.

A manifest overall improvement is detected in SSTSWind15Pvg with respect to SSTSWind15P (Fig. 9a). The variable nudging allows for a better characterisation of heat flux evolution in the region of convection, evident both for the adverse and exact ocean initial conditions (Fig. 6c), the first reproducing half of the cooling anomaly in CTRL, and the second even capturing the extreme signal (i.e. below the 5th percentile) preceding the MOI-48N peak by about 5 years. Indeed, a strong correlation (R = 0.86) is found between the 48-month running means of the surface heat fluxes in CTRL and the SSTSWind15Pvg experiment using exact initial conditions, which is substantially higher than the correlation values for the equivalent SSTSWind15P experiment (R = 0.46). Encouragingly, SSTSWind15Pvg also succeeds in improving the water mass transformation when compared to SSTSWind15P (Fig. 7a, b), showing, like in CTRL, a larger contribution of denser waters (28.0 kg/m3, instead of 27.9 kg/m3) to deep water formation. This improvement of the γvar experiments with respect to the γfix ones is due to a better representation of both the anomalous surface heat fluxes and the target vertical stratification in the region of convection (Fig. 8b, c).

All of the above enhancements have a remarkable effect on AMOC initialisation. From Fig. 9a it is immediately clear that the simulations using γvar (magenta lines) are able to reproduce the AMOC peak, even for the adverse initial conditions. The magnitude of the peak in this latter configuration is comparable to that in the SSTSWind15P experiment using exact (and thus most favourable) initial conditions. In both cases the peak lies close to the 95th percentile in the long control simulation. Furthermore, this threshold is clearly exceeded when both exact initial conditions and γvar are considered. Among all configurations investigated here, this is the first one to achieve such performance, which is the ultimate goal of the study.

Nevertheless, such an exact knowledge of the ocean initial state is not accessible in the real world. Hence, the question arises as whether it is possible to capture the extreme nature of the AMOC peak with γvar experiments using arbitrary starting conditions. Further experiments show that it is possible, given enough time for the reconstruction. By extending the experiments to start 25 years before the AMOC peak, its extreme nature is consistently reconstructed. SSTSWind25Pvg experiments show comparable MOI-48N trajectories (magenta lines in Fig. 9b), both describing an extreme AMOC event regardless of the initial conditions considered. Additionally, we also test the contribution of the prescribed wind-stress, by reproducing the same analysis but for the SSTS25P and SSTS25Pvg experiments (Fig. 9c). The magnitude of the peak is again better represented when γvar is applied, although in this case the results seem to be more sensitive to the initial state. Indeed, the AMOC extreme is better captured by the experiments initialised from a moderate AMOC ocean state (full lines in Fig. 9c) than by those using exact ocean conditions (dotted lines in Fig. 9c), like for the SSTS15P experiments in Fig. 2a.

To investigate the differences in performance for the three sets of experiments considered in this section, we first look at the generation of deep dense anomalies in the region of convection (Fig. 10a, c). In all three cases, the experiments using γvar exhibit a clear density extreme (well above the 95th percentile). It is particularly remarkable how close the SSTSWind15Pvg and SSTSWind25Pvg simulations are from the target CTRL, which confirms that a precise representation of water mass transformation is achievable when γvar and wind stress restoring are combined, regardless of how close the simulation starts from the peak, or which starting conditions are considered. The remaining differences seen for the AMOC in Fig. 9 are therefore not related to the formation of deep waters by convection. Differences can emerge as these subsurface waters move westward via geostrophic self-advection (Ortega et al. 2015) and their propagation is altered by potentially unfavourable conditions in the neighbouring ocean areas—where convection is absent or shallow and the deep ocean takes longer to be reconstructed. Indeed, the dense waters reaching the western boundary and affecting the AMOC (Fig. 10d–f) are no longer as extreme as in the convection region, and show larger differences between CTRL and the nudged simulations. For example, the SSTSWind15Pvg simulations reproduce only between 60 and 70% of the western boundary positive density anomaly in CTRL. Among all the families of experiments, the largest density anomaly at this western side of the basin (87% of CTRL) corresponds to the SSTSWind25Pvg experiment simulation using moderate AMOC initial conditions. This implies that, unlike for the MOI-45N-noek index (Fig. 9b), the representation of the western boundary densities is not improved in the simulations using exact starting conditions. This effect is counterbalanced in these specific simulations by a stronger contribution of the densities at the eastern margin (not shown), the exceptional MOI-45N-noek peak thus resulting from the combination of both boundary signals.

Fig. 10
figure 10

Same as in Fig. 9 but for ac the vertically averaged 1000–2000 m densities over the region of convection south of Iceland, and df the vertically averaged 300–3000 m densities in the western Atlantic boundary at 48°N

To conclude, we have shown that the new proposed nudging technique based on a variable restoring coefficient γvar outperforms the previous standard procedures (using a fixed coefficient) for three different protocols (magenta vs blue lines in Figs. 9, 10). More importantly, we have identified one particular case (SSTSWind25Pvg) for which the target AMOC extreme is consistently reproduced, without requiring a perfect knowledge of the initial state of the ocean, suggesting that AMOC variability can be properly reconstructed through ocean surface nudging only.

6 Conclusions and discussion

Using a perfect model approach with the IPSL-CM5A-LR coupled model, we analyse how different surface nudging techniques are able to reconstruct an extreme AMOC anomaly (3 standard deviations above the long-term mean of a 1000-year long control simulation). We have shown that classical nudging protocols, using fixed physically-based restoring terms (γfix), can broadly reproduce the AMOC evolution from a target period in the control simulation (CTRL), but fail to capture the magnitude of the exceptional AMOC maximum. This AMOC peak is associated with the occurrence of four consecutive severe winters a few years earlier that have a cumulative cooling effect and affect water mass transformation in a major region of convection South of Greenland. The exceptionally dense anomalies formed this way are later propagated westward to the North American coast, where they enhance the AMOC via the thermal wind balance. Constraining the influence of the cold atmospheric signals on convection requires a fast relaxation timescale (2–3 months) over a deep mixed layer (1000–2000 m), which is not provided using the physically-based restoring coefficients, thus explaining why the AMOC peak is systematically underestimated in the previous nudged simulations.

One straightforward alternative is to increase the value of this γfix. Yet, the benefits of applying a large γ everywhere are likely outweighed by the associated disadvantages, which are especially important in experiments nudged with observations. For example, a strong γfix can introduce highly unrealistic surface fluxes in regions where the model climatology deviates strongly from the observations, and jeopardise at the same time the ocean subsurface (by creating the wrong water mass properties) and the atmosphere (by perturbing the high-frequency air–sea interactions).

A novel approach is proposed here to overcome these limitations. By implementing a time and spatially varying restoring coefficient (γvar) that is proportional to the mixed layer depth, this flexible nudging technique allows for a stronger relaxation in the areas of deep convection—which play a key role in deep water formation and thereby in the AMOC—while keeping a weak restoring term (more compatible with the observed heat-flux feedbacks) in regions and months with a shallow mixed layer. The new experiments based on this γvar show an encouraging improvement in the representation of the AMOC variability with respect to the experiments using γfix. Furthermore, for one particular setup (SSTSWind25Pvg), the timing and extreme nature of the AMOC maximum are both properly captured, irrespective of the initial conditions used. This corresponds to experiments that meet three requirements: (1) they begin 25 years before the peak, (2) nudge both sea surface temperature and salinity with γvar, and (3) simultaneously prescribe the wind stress field. The first requirement is important to provide sufficient time for the deep ocean (key for vertical stratification) to be correctly reconstructed. This relatively long time span associated with the surface nudging techniques is likely due to slow dynamical and thermodynamical adjustments taking place in the subsurface. Additionally, the combined restoring to the SST and SSS fields is key to successfully capture the general evolution of the AMOC. This choice was justified in a previous study (Servonnat et al. 2015) over the use of SST only nudging, which showed a poorer performance, in particular in the representation of the AMOC. Finally, the additional prescription of the wind-stress fields appears to be pivotal to both reduce the sensitivity to the initial conditions and reproduce the extreme nature of the AMOC peak. Equivalent experiments started 15 years before the AMOC maximum can still capture the general AMOC evolution, but fail to robustly reconstruct the extreme nature of the peak.

Ultimately, this study aims to identify a nudging protocol that can be applied in the historical context to faithfully constrain the past AMOC variability, and help reduce the large uncertainties among the current ocean reanalyses (Karspeck et al. 2015). The novel technique described above shows therefore a great potential, as it has been specifically conceived to improve the characterisation of the key relevant contributions to the AMOC. Furthermore, its exclusively reliance on surface data for which long observations and reanalysed atmospheric data are available (e.g. EN4 for the SST and SSS fields, and the NOAA-CIRES twentieth Century Reanalysis for the surface winds) opens the possibility of reconstructing the AMOC variability back to year 1900 AD. Note that the first 25 years in these simulations would be subject to larger uncertainties, due to the poorer quality of the observational products, and also to the long time required to initialise the deep ocean. Furthermore, the applicability of this approach to the real world is not straightforward. The first challenge relates to the combination of SST and SSS observational datasets. In perfect model studies, like the current one, SST and SSS fields are physically consistent. This means that when nudged together they both contribute to constrain the correct density fields. In the real world, by contrast, beyond the fact that salinity observations are far less numerous and reliable, temperature and salinity observations can present large uncertainties introducing inconsistencies between the two fields. Some preliminary tests show that these potentially conflicting signals can substantially damp AMOC variability. A second important issue relates to the fact that the AMOC is highly model-dependent, and sensitive to model biases. As a result, the drivers of the AMOC can largely vary from one model to another. Indeed, the IPSL-CM5A-LR model analysed here presents a strong negative salinity bias that prevents the simulation of deep convection in the Labrador Sea (Escudier et al. 2013), which is a major region of deep water formation in other models and also in the real world. The performance of the γvar nudging to reconstruct the AMOC therefore has to be evaluated in other models with different AMOC variability. And another key question is what will be the effect of the model biases when nudging to the observed variability. These issues are currently addressed and will be reported in a follow-up study. Further modifications to the current protocol, such as prescribing the full 3D wind structure, and the nudging towards surface temperature over the continents are also envisaged for future studies.