1 Introduction

The El Niño/Southern Oscillation (ENSO) is the strongest interannual climate fluctuation. It originates in the tropical Pacific, has a period of about 4 years, is characterized by basin-wide changes in sea surface temperature (SST) and has global impacts (Philander 1989; McPhaden et al. 2006). Coupled ocean–atmosphere interactions are at the heart of the ENSO mechanism (Bjerknes 1969). The predominantly easterly trade winds at the surface drive a zonal SST gradient across the equator by accumulating heat in the warm pool region in the west and forming the cold tongue region in the east by coastal and equatorial upwelling. An anomalously strong zonal SST gradient enhances the easterly trade winds, which in turn further strengthens the zonal SST gradient, and vice versa. This positive ocean–atmosphere feedback efficiently enhances initial SST anomalies. El Niño events, the warm extremes of ENSO, for example, go along with anomalously weak trade winds. Similarly, La Niña events, the cold extremes of ENSO, are associated with stronger than normal trade winds (McPhaden et al. 2006). The equatorial upper-ocean heat content acts as a delayed negative feedback in ENSO, and perturbations of heat content and SST are roughly in quadrature (Meinen and McPhaden 2000). Hence, the growth rate of ENSO can be theoretically determined by the total of wind dynamic coupling and the thermal damping (Jin et al. 2006).

Despite the complexity of ENSO dynamics, our understanding of ENSO has considerably improved during the recent years. The fundamental ENSO mechanism has been condensed into a conceptual model, termed the recharge oscillator paradigm (Jin 1997). However, each ENSO event is unique, and a large diversity is observed with respect to pattern, amplitude and temporal evolution (Capotondi et al. 2015; Timmermann et al. 2018). Anomalous tropical SSTs affect the deep convection over western tropical Pacific, acting as a Rossby wave source and generating global teleconnections (Trenberth et al. 1998). The teleconnections too can strongly vary from event to event. Thus, understanding ENSO and related impacts remains a major challenge in climate research.

In the past, ENSO has undergone substantial variations in amplitude, frequency and spatial pattern, as demonstrated by instrumental and proxy data (Fedorov and Philander 2000; Yeh and Kirtman 2005; Sun and Yu 2009; McPhaden et al. 2011). ENSO characteristics are believed to have experienced a great variability during the Holocene (Cole 2001; Tudhope et al. 2001; Koutavas et al. 2006; Cobb et al. 2013; McGregor et al. 2013), which is supported by climate models (Khon et al. 2018). Low-frequency modulation of ENSO behavior can be due to internal variability (Wittenberg 2009) as well as external forcing such as increasing atmospheric carbon dioxide (CO2) levels (Latif et al. 2015).

The future behavior of ENSO under anthropogenic forcing remains an open question. Although ENSO amplitude diversity has been reduced by a factor of two from the Coupled Model Intercomparison Project Phase 3 (CMIP3) to its Phase 5 (CMIP5), ENSO projections for the twenty-first century are still highly uncertain (Meehl et al. 2007; Collins et al. 2010; Stevenson 2012; Taylor et al. 2012; Bellenger et al. 2014; Zheng et al. 2016). In particular, the change in ENSO amplitude remains unclear, even with respect to its sign (Knutson et al. 1997; Solomon et al. 2007; Stocker et al. 2013; Cai et al. 2015; Wang et al. 2019). The inter-model disagreement on the ENSO amplitude change persists in CMIP6 models (Eyring et al. 2016; Brown et al. 2020).

Projecting ENSO amplitude under global warming faces several challenges. First, due to large internal ENSO variability on decadal and centennial time scales, it is hard to untangle the anthropogenic forced signal from natural variability (Wittenberg 2009; Collins et al. 2010). For example, there is an ongoing debate whether the reduction in ENSO variability since 2000 is due to natural variability or anthropogenic forcing (Hu et al. 2013, 2017; Lübbecke and Mcphaden 2013; Guan and McPhaden 2016; Maher et al. 2018; Seager et al. 2019; Xu et al. 2019). Second, due to the heterogeneity of ENSO amplitude and dynamics in state-of-the-art climate models, ENSO amplitude projection is strongly model-dependent. In historical simulations with estimates of observed external forcing, ENSO amplitude greatly differs among the CMIP5 (Bellenger et al. 2014) and CMIP6 models (Brown et al. 2020; Planton et al. 2020). Differences in the wind-SST feedback (Vijayeta and Dommenget 2018) and stochastic forcing explain part of the spread (Wengel et al. 2018). Further, model bias is still an issue. State-of-the-art climate models notoriously exhibit strong disagreements compared to observations with regard to ENSO atmospheric feedbacks and ENSO dynamics (Guilyardi et al. 2009; Lloyd et al. 2009, 2011, 2012; Dommenget et al. 2014; Kim et al. 2014b; Bayr et al. 2018, 2019). The equatorial cold SST bias, which is common to and varies among climate models, shifts the models’ mean state into a La Niña-like state with a too westward position of the rising branch of the Pacific Walker Circulation. Linked to that, many climate models underestimate the amplifying wind-SST feedback and damping heat flux-SST feedback, with error compensation between the two (Bayr et al. 2018, 2019, 2020). Moreover, in half of the CMIP5 models the shortwave-SST feedback is erroneously positive, leading to a hybrid of wind stress-driven and shortwave-driven ENSO (Dommenget 2010; Dommenget et al. 2014; Bayr et al. 2019), whereas observations indicate a largely wind stress-forced ENSO (Jin et al. 2006; Bayr et al. 2019). This, in addition to model uncertainty, also may explain part of the spread in twenty-first century ENSO projections. Third, the future greenhouse gas (GHG) emissions or concentrations are unknown. Thus different GHG scenarios need to be taken into consideration that can force quite diverse responses due to the non-linearity of ENSO (Hu et al. 2013).

The projections’ uncertainties can be separated into three main components (Hawkins and Sutton 2009). First, the uncertainty due to natural internal variability of the climate system on decadal to centennial time scales. Second, the model uncertainty due to different physics and model biases. Finally, the scenario uncertainty due to unknown future GHG emissions. This method has been applied to global and regional temperature projections (Hawkins and Sutton 2009), and to global warming projections of the Atlantic Meridional Overturning Circulation (Reintges et al. 2017).

The objective of this study is to investigate the uncertainties in twenty-first century ENSO amplitude projections using the CMIP5 and CMIP6 simulations. We quantify the above-mentioned sources of uncertainty. For this purpose, the methodology outlined in Hawkins and Sutton (2009) is used. This study is organized as follows: in Sect. 2, we describe the data and methods used in this paper. Section 3 provides an analysis of the ENSO amplitude and decadal variability in the CMIP5 and CMIP6 model ensembles. In Sect. 4, the sources of uncertainty in the ENSO amplitude projections are investigated. The causes of ENSO amplitude change are addressed in Sect. 5. The main results are summarized and discussed in Sect. 6.

2 Data and methods

2.1 Data

The data used in this study is from the World Climate Research Programme’s (WCRP’s) CMIP5 and CMIP6 ensembles. We analyze the total output of 56 climate models, 36 from CMIP5 and 20 from CMIP6. Pre-industrial control runs are used to study the natural internal variability, which are compared to Hadley Centre Sea Ice and Sea Surface Temperature v1.1 (HadISST) and NOAA Extended Reconstruction Sea Surface Temperature v3b (ERSST) data sets extending between 1870–2017 and 1854–2015, respectively (Rayner et al. 2003; Smith and Reynolds 2003). Further, models’ historical runs and projections under two different radiative forcing scenarios, known as Representative Concentration Pathways (RCP4.5 and RCP8.5) for CMIP5 and Shared Socioeconomic Pathways (SSP2-4.5 and SSP5-8.5) for CMIP6, are investigated. CMIP5 models are forced with historical forcing up to 2005, while CMIP6 models are forced by historical forcing up to 2014. For both ensembles, we define a consistent historical period of 1900–2005 and projection period of 2005–2099. The output of 35 models is available for RCP4.5 and 36 models for RCP8.5 (Table 1), and the output of 20 models for SSP2-4.5 and SSP5-8.5 (Table 2).

Table 1 Coupled Model Intercomparison Project-Phase 5 (CMIP5) data availability for two representative concentration pathway (RCP) scenarios. Models of the “Strong” sub-ensemble are shown in italics. Models are further divided into “High”, “Moderate” and “Low” interannual and decadal ENSO amplitude variability
Table 2 Coupled Model Intercomparison Project-Phase 6 (CMIP6) data availability for two shared socioeconomic pathway (SSP) scenarios. Models of the “Strong” sub-ensemble are shown in italics. Models are further divided into “High”, “Moderate” and “Low” interannual and decadal ENSO amplitude variability

As most climate models fail to simulate realistic ENSO dynamics (e.g., due to a cold equatorial SST bias and underestimated ENSO atmospheric feedbacks), we follow the study of Bayr et al. (2019) and create a sub-ensemble of models with strong ENSO atmospheric feedbacks (hereafter “Strong”). CMIP5 and CMIP6 model’s atmospheric feedbacks are compared to ERA-Interim and ERA40 data sets (Uppala et al. 2005; Dee et al. 2011). Although the ENSO atmospheric feedbacks are underestimated even in the “Strong” sub-ensemble, it is supposed to provide the most reliable results. In most climate models, error compensation is observed between an underestimated amplifying wind-SST feedback and underestimated damping heat flux-SST feedback (Bayr et al. 2018). In particular, the underestimated positive wind-SST feedback is compensated by an erroneously positive shortwave-SST feedback. As a result, ENSO in these models is a hybrid of wind-driven and shortwave-driven dynamics (Bayr et al. 2019). It is questionable if models with biased ENSO dynamics under present-day conditions can provide useful projections of ENSO amplitude under global warming. This is why the “Strong” sub-ensemble is defined (indicated in italics in Table 1 and Table 2).

2.2 Realistic ENSO dynamic models

The CMIP5 (crosses) and CMIP6 (triangles) models exhibit a large spread in the historical zonal wind stress-SST and net heat flux-SST feedbacks (Fig. 1a), which are calculated by regression of zonal wind stress anomalies in the Niño4 region (160° E–150° W, 5° N–5° S) and net heat flux anomalies in the combined Niño3 (150° W–90° W, 5° N–5° S) and Niño4 region on SST anomalies (SSTA) in the Niño3.4 region (170° W–120° W, 5° N–5° S). The wind stress and SST anomalies are computed relative to a 30-year running window. Models with strongest wind stress-SST feedback show a stronger net heat flux-SST feedback with a correlation of − 0.55. As mentioned above, we group the models according to their ENSO feedback strength, and a sub-ensemble of models with strong ENSO atmospheric feedbacks (“Strong”) is created (red crosses and triangles in Fig. 1a). In general, these also are the models with a reduced equatorial cold SST bias (Fig. 1b) and with the most non-linear ENSO behavior (Fig. 1c), in agreement with recent studies (Cai et al. 2020; Hayashi et al. 2020). All other models reside in the “Weak” sub-ensemble. This allows us to analyze separately the models with realistic and biased ENSO dynamics.

Fig. 1
figure 1

For individual CMIP5 and CMIP6 models and reanalysis products: a net heat flux feedback, defined as regression of net heat flux in Niño3 and Niño4 on SST in Niño3.4 on the y-axis, vs. the zonal wind stress feedback, defined as regression of zonal wind stress in Niño4 region on SST in Niño3.4 region on the x-axis; b atmospheric feedback strength (average of wind stress and heat flux feedback, after normalizing each by the average reanalysis value) on x-axis vs. relative SST bias in the Niño4 region (model output SST minus observed SST, after subtracting the tropical Pacific area mean SST from each); c atmospheric feedback strength (average of wind stress and heat flux feedback, after normalizing each by the average reanalysis value) on x-axis vs. ENSO non-linearity, computed as the difference between Niño3 and Niño4 SSTA skewness; crosses indicate CMIP5 models and triangles CMIP6 models; the red colored symbols indicate models with strong atmospheric feedbacks and realistic ENSO dynamics, called “Strong” sub-ensemble; the blue colored symbols indicate models with weak atmospheric feedbacks, called “Weak” sub-ensemble; the correlation with a 95% confidence level is shown

2.3 ENSO amplitude

ENSO amplitude is calculated as the running standard deviation (STD) of SSTA over the Niño3.4 region. Because our focus is the long-term ENSO amplitude evolution, we remove its short-term variability. For this purpose, 10, 20 and 30-year running STD has been tested with SSTA from ERSST and HadISST. SSTA have been calculated relative to the climatological mean of each running window. The evolution of the ENSO amplitude in HadISST when calculating either the 10 or 20-year running STD exhibits still considerable variability on intradecadal timescales (Fig. 2a, b). In contrast, the intradecadal fluctuations are much reduced when applying a 30-year running filter (Fig. 2c). Identical behavior has been obtained when using ERSST (not shown). Hence, a 30-year running filter has been used in the subsequent analysis to compute the decadal ENSO amplitude variability.

Fig. 2
figure 2

ENSO amplitude defined as the running standard deviation of Niño3.4 SSTA in HadISST obtained with: a 10-year, b 20-year and c 30-year window

2.4 Sources of uncertainty

The uncertainty analysis follows the approach of Hawkins and Sutton (2009) with the adaptations made by Reintges et al. (2017). According to Hawkins and Sutton (2009), the total uncertainty can be divided into three sources: internal variability, model and scenario uncertainty. Since a 30-year running filter is applied, internal variability is basically retained on timescales of decadal and longer.

In the following uncertainty analysis, the variable x in (1) represents the ENSO amplitude, which depends on the scenario (s), model (m), and time (t).

$$x=x\left(s,m,t\right)$$
(1)

We consider low-pass filtered values of ENSO amplitude X and decompose it into the internal long-term variability ε and the long-term trend Xf:

$$X\left(s,m,t\right)={X}_{f}\left(s,m,t\right)+\epsilon \left(s,m,t\right)$$
(2)

A common way to estimate the long-term trend Xf from each model is to compute the mean over an ensemble of simulations performed with different initial conditions but identical external forcing (Maher et al. 2018). However, only one realization from each model is available, so that instead, a polynomial fit calculated over 1900 to 2099 is used to estimate Xf. The internal variability ε is obtained by:

$$\epsilon \left(s,m,t\right)=X\left(s,m,t\right)-{X}_{f}\left(s,m,t\right)$$
(3)

The response to the anthropogenic forcing over the period 2005–2099, xf, is estimated from each model with the aid of the corresponding Xf. We subtract the historical average i taken over the period 1979–2005 to the projected long-term trend Xfp:

$${x}_{f}\left(s,m,t\right)={X}_{fp}\left(s,m,t\right)-i\left(s,m\right)$$
(4)

Which polynomial fit should be used to represent the long-term trend, and hence, the response to the anthropogenic forcing? On the one hand, choosing a too high order of the polynomial fit would artificially decrease the level of internal variability. On the other hand, the order of the fit must be high enough to adequately describe the nonlinear externally forced trend. Figure 3 depicts the polynomial fits of the 2nd, 3rd, 4th and 5th order calculated from the GFDL-ESM2M model. Under strong external forcing (RCP8.5), the different orders yield a similar pattern. However, under weak forcing (RCP4.5), only the 2nd order seems to adequately capture the forced trend. Therefore, the 2nd order fit has been chosen. We note in this context that ENSO amplitude can vary internally on multidecadal and centennial time scales (Li et al. 2013). The key results of the uncertainty analysis, however, remain very similar if a higher-order polynomial fit is used.

Fig. 3
figure 3

Historical and projected ENSO amplitude for the GFDL-ESM2M model (black) with 2nd (blue), 3rd (red), 4th (green) and 5th (cyan) order polynomial fits: in a for RCP4.5 and in b RCP8.5 scenario

Using the long-term signal anomaly xf (4), we can compute the spread between the model projections and then average it over the three scenarios. This will be our inter-model uncertainty that is time-dependent (5).

$$M\left(t\right)=\frac{1}{{N}_{s}}\cdot {\sum }_{s}{std}_{m}\left({x}_{f}\left(s,m,t\right)\right)$$
(5)

Next, by averaging xf over all models for each scenario and computing the spread within the two of them, we get the scenario uncertainty that also is time-dependent (6).

$$S\left(t\right)={std}_{s}\left(\frac{1}{{N}_{m}}\cdot {\sum }_{m}{x}_{f}\left(s,m,t\right)\right)$$
(6)

Last, by computing the spread of each model’s internal variability over time, and then averaging over all models and scenarios, we obtain the internal variability uncertainty, which is independent of time (7).

$$I=\frac{1}{{N}_{s}}\cdot {\sum }_{s}\frac{1}{{N}_{m}}\cdot {\sum }_{m}{std}_{t}\left(\epsilon \left(s,m,t\right)\right)$$
(7)

The time evolution of the internal variability has been estimated, with the conclusion that it does not show any relevant differences between different periods (Fig. 4).

Fig. 4
figure 4

Internal long-term ENSO amplitude variability for each model and scenario, defined as the difference between the ENSO amplitude and the polynomial fit: green for RCP4.5/SSP2-4.5 scenario and red for RCP8.5 /SSP5-8.5 scenario; thin dashed lines correspond to individual model simulations and thick solid lines to the scenario means; black solid line is the mean over all simulations

To test whether the global warming signal in ENSO amplitude is statistically significant, we use the signal-to-noise ratio SNR (8). The average of xf over all models and scenarios corresponds to the signal, G (9), and the noise to the 95th percentile of the standard normal distribution qc/2 multiplied by total uncertainty T (10). If the ratio is greater than unity the climate signal is considered detectable.

$$SNR\left(t\right)=\frac{G\left(t\right)}{{q}_\frac{c}{2}\cdot T\left(t\right)}$$
(8)
$$G\left(t\right)=\frac{1}{{N}_{s}}\cdot {\sum }_{s}\frac{1}{{N}_{m}}\cdot {\sum }_{m}{x}_{f}\left(s,m,t\right)$$
(9)
$$T\left(t\right)=M\left(t\right)+S\left(t\right)+I$$
(10)

Finally, in order to identify possible origins of the uncertainties, two factors have been considered: the projected mean zonal SST gradient and wind-SST feedback. The mean zonal SST gradient is defined as the temperature difference between the Niño4 and Niño1 + 2 (90° W–80° W, 0°–10° S) regions relative to the tropical Pacific (120° E–80° W, 15° N–15° S) averaged temperatures. The wind-SST feedback is defined as described above. A 30-year low-pass filter has been applied to the zonal SST gradient and the wind feedback has been computed relative to a 30-year running window. The same methodology as that used for the ENSO amplitude is applied to obtain the global warming signal of the zonal SST gradient and wind-SST feedback. Then, we relate the inter-model spread of the projected ENSO amplitude change to the two factors at the end of the twenty-first century.

3 ENSO amplitude and decadal variability

First, we investigate the ENSO amplitude and its decadal variability in the preindustrial control simulations. The Niño3.4 SSTA is homogenized to the length of 240 years. We have computed the standard deviation of the decadal ENSO amplitude and plotted it against the standard deviation of the interannual Niño3.4 SSTA (Fig. 5). There is a positive relationship with a correlation coefficient of 0.64: models with large (small) interannual Niño3.4 SSTA variability tend to show strong (weak) decadal ENSO amplitude variations. Further, there is a large spread between the models, from much lower to much higher interannual and decadal variability relative to observations. We divide the models into 3 main sub-ensembles: (1) models with high interannual and decadal ENSO variability (“High” sub-ensemble, red squares and triangles), (2) models with low interannual and decadal ENSO variability (“Low” sub-ensemble, blue squares and triangles), and (3) models with the closest variability to observations, moderate interannual and decadal ENSO variability (“Moderate” sub-ensemble, green squares and triangles). Although there is a quite strong linear relationship between interannual and decadal ENSO variability, we consider both the interannual and the decadal ENSO variability for defining the sub-ensembles. If we would only use interannual ENSO variability, we would mix models with different decadal ENSO variability: for instance, we would add several models with unrealistically low decadal ENSO variability as “Moderate”.

Fig. 5
figure 5

Pre-industrial interannual Niño3.4 SSTA standard deviation on the x-axis vs. decadal Niño3.4 SSTA standard deviation on the y-axis. Models are grouped into “High” (red), “Moderate” (green) and “Low” (blue) interannual and decadal ENSO variability models; crosses indicate CMIP5 models and triangles CMIP6 models; HadISST and ERSST data sets are shown in magenta and cyan, respectively; the correlation with a 95% confidence level is shown

4 Global warming signal of the ENSO amplitude and its uncertainties

Figure 6 depicts the global warming signal of the ENSO amplitude. Dashed thin lines in Fig. 6a) represent the individual model simulations for RCP4.5/SSP2-4.5 and RCP8.5/SSP5-8.5 in green and red, respectively. The thick green and red solid lines correspond to the scenario averages and the black solid line to the total scenario and model mean. The strong model disagreement in ENSO amplitude change towards the end of the twenty-first century is clearly visible. The strongest forcing scenario, RCP8.5/SSP5-8.5, contains most of the positive ENSO amplitude changes, while the RCP4.5/SSP2-4.5 simulations are equally balanced between the positive and negative change of the amplitude. The scenario average shows a positive global warming signal for the strongest forcing case and a signal close to zero for RCP4.5/SSP2-4.5 scenario. The total scenario and model mean (thick black line) lays between the two scenario means, showing a slight increase of the ENSO amplitude.

Fig. 6
figure 6

Global warming signal of the ENSO amplitude calculated by subtracting the historical long-term trend (1979–2005) to the projected long-term trend (2005–2099) in a, c, e and to the end of the projected long-term trend (2099) in b, d, f; in a individual simulations (dashed lines), RCP4.5/SSP2-4.5 scenario mean (solid green line), RCP8.5/SSP5-8.5 scenario mean (solid red line) and mean over all simulations (solid black line); in b) mean over “High”, “Moderate” and “Low” sub-ensembles, for RCP4.5/SSP2-4.5 (green) and RCP8.5/SSP5-8.5 (red) scenarios after dividing each model by its climate sensitivity, computed as the global mean temperature difference between 2050–2099 and 1920–1970 under RCP8.5/SSP5-8.5 scenario; error bars show the maximum and minimum value for each sub-ensemble; in c, d same as a, b, but here for the “Strong” sub-ensemble; in e, f same as a, b, but here for the “Weak” sub-ensemble

In Fig. 6b), we divide the global warming signal averages for the end of the twenty-first century (RCP4.5/SSP2-4.5 green, RCP8.5/SSP5-8.5 red) into models with high interannual and decadal ENSO variability, moderate interannual and decadal ENSO variability and low interannual and decadal ENSO variability. Vast climate sensitivity differences are present between CMIP models, which has been incremented for the latest phase, CMIP6 (Andrews et al. 2012; Meehl et al. 2020). Prior the sub-ensemble mean, each model’s ENSO amplitude change is divided by the global mean temperature difference between 2050–2099 and 1920–1970 under RCP8.5/SSP5-8.5. Under RCP4.5/SSP2-4.5, CMIP5’s and CMIP6’s forced signal in all three sub-ensembles is close to zero. Under RCP8.5/SSP5-8.5, there are noticeable differences. While CMIP5 models with high decadal ENSO variability project by the end of the twenty-first century a decrease in ENSO amplitude, the “Moderate” and “Low” sub-ensembles show an increase in ENSO amplitude. However, the error bars, representing the maximum and minimum values, show a large spread between approximately ± 0.1. The three sub-ensembles of CMIP6 models on the other hand agree on the increase of ENSO amplitude. The strongest mean ENSO amplitude change is for “High” sub-ensemble. The error bars show a wide range of positive ENSO amplitude changes. It is important to note that we show the result for 20 CMIP6 models, while CMIP5 contains 36 models. At the time of our research, the output of 20 CMIP6 models was available for the variables and scenarios we use. Considering the large spread shown by CMIP5 models, it is possible that 20 models do not represent the full inter-model spread of CMIP6 models. The combination of CMIP5 and CMIP6 leads to an average positive ENSO amplitude change for “High”, “Moderate” and “Low” sub-ensembles.

When only considering the models with strong ENSO atmospheric feedbacks (Fig. 6c, d), the strongest positive and negative ENSO amplitude changes are reduced, shifting the RCP4.5/SSP2-4.5 scenario and the total means slightly towards negative values (Fig. 6c). The RCP4.5/SSP2-4.5 scenario generally projects a decrease of the mean ENSO amplitude over the sub-ensembles (Fig. 6d). In the RCP8.5 scenario, the CMIP5 “High” sub-ensemble shows a stronger decrease of the ENSO amplitude than in RCP4.5. In contrast, CMIP6 models disagree on the sign of the ENSO amplitude change between the two scenarios. The combination of CMIP5 and CMIP6 sub-ensembles are not able to show any consistent result of global warming signal of ENSO amplitude.

Looking into models with weak ENSO atmospheric feedbacks (Fig. 6e, f), both scenario means and total mean point towards an increase of ENSO amplitude (Fig. 6e). The strongest projected ENSO amplitude change is shown by the RCP8.5/SSP5-8.5 scenario: all sub-ensembles agree on the increase of ENSO amplitude under the strongest forcing scenario for both CMIP5 and CMIP6 ensembles (Fig. 6f). In addition, when comparing to strong ENSO atmospheric feedback models, the positive ENSO amplitude change is stronger in all sub-ensembles with weak ENSO atmospheric feedbacks except for “Moderate” in CMIP5.

We next quantify and identify the main sources of uncertainty in the projections (Fig. 7). The total uncertainty increases towards the end of the twenty-first century from 0.11 °C to approximately 0.35 °C. In the first three decades, the most important source of uncertainty is the internal decadal variability (green). The internal variability uncertainty amounts to approximately 0.07 °C, corresponding to around 65% of the total uncertainty at the beginning of the projection. After 2034, the main uncertainty source is model uncertainty (blue). It exceeds 0.21 °C by 2100, which corresponds to roughly 60% of the total uncertainty. The scenario uncertainty (red) is of similar magnitude as the internal-variability uncertainty at the end of the twenty-first century. However, it is the smallest uncertainty source at all times. We note that the scenario uncertainty is the largest contribution to the total uncertainty by 2100 when analyzing projections of globally averaged surface temperature, doubling the surface temperature warming from RCP4.5 to RCP8.5, and from SSP2-4.5 to SSP5-8.5 (Knutti and Sedláček 2013; Gidden et al. 2019).

Fig. 7
figure 7

a ENSO amplitude uncertainty divided into model (blue), internal variability (green) and scenario uncertainty (red); in b relative uncertainties; solid vertical line represents where model uncertainty becomes larger than internal variability uncertainty

We repeat the uncertainty analysis for CMIP5, CMIP6 and all sub-ensembles, which have been defined above: “Strong”, “Weak”, “High”, “Low” and “Moderate”. We also use the combined selection of “Strong” with “Moderate” sub-ensembles, as the models of this sub-ensemble are closest to observed ENSO in terms of amplitude variability and atmospheric feedback strength. In Fig. 8a) we show the results of the uncertainty analysis, and in Fig. 8b) the signal-to-noise ratio, both towards the end of the twenty-first century. Model uncertainty is the largest contributor to the total uncertainty in all sub-ensembles. This result again stresses the importance of the model uncertainty in global warming projections of ENSO amplitude. From CMIP5 to CMIP6 the model uncertainty is reduced, while the scenario uncertainty is largely increased leading to an increase of total uncertainty. The smallest total and model uncertainties are observed when combining “Strong” and “Moderate” sub-ensembles. However, even restricting the models to this sub-ensemble does lower the total uncertainty only by 0.045 °C (13%) and model uncertainty by 0.05 °C (24%) in comparison to considering all models (0.35 °C and 0.21 °C, respectively). Further, the signal-to-noise ratio (Fig. 8b) does not exceed the value of unity for any sub-ensemble, which means that a global warming signal in ENSO amplitude cannot be detected with high statistical significance.

Fig. 8
figure 8

a Total (black), model (blue), internal variability (green) and scenario (red) uncertainties at the end of the projection, year 2099, and b signal-to-noise ratio for: all models, CMIP5 models, CMIP6 models, “Strong”, “Weak”, “High”, “Low” and “Moderate” sub-ensembles, and the combination of “Strong” and “Moderate” sub-ensembles

We depict the change in ENSO amplitude by the end of the twenty-first century for “High”, “Moderate” and “Low” for all models in Fig. 9 and for “Strong” in Fig. 10. In Fig. 9 the models within each sub-ensemble largely disagree. Although the projected ENSO amplitude changes in “Strong” are reduced, there is no consistency within the sub-ensembles (Fig. 10). Under the strongest scenario, models in “High” (left group in Fig. 10) agree on a reduced ENSO amplitude for CMIP5 (models 7–36), while for CMIP6 models show an increase of ENSO amplitude (41–56). On the other hand, five out six models in “Moderate” (central group in Fig. 10) point towards an increase under RCP8.5/SSP5-8.5, except for the NorESM1-M model (number 35). In “Low”, there only are three models and it is hard to derive a conclusion. In summary, although models with the most realistic ENSO dynamics and with closest ENSO amplitudes to observations generally point towards an increase of ENSO amplitude, the global warming signal is still robustly undetectable due to the large inter-model disagreements.

Fig. 9
figure 9

ENSO amplitude change between 2005 and 2099, computed as a change of the long-term trend, divided by the climate sensitivity of each model, computed as the global mean temperature difference between 2050–2099 and 1920–1970 under RCP8.5/SSP5-8.5 scenario; vertical dashed lines divide from the left to the right; “High”, “Moderate” and “Low” ENSO amplitude sub-ensembles, respectively

Fig. 10
figure 10

Same as Fig. 9, but here for the “Strong” sub-ensemble

5 ENSO amplitude inter-model uncertainty source

Several studies have shown that ENSO amplitude is strongly influenced by the background mean state (Knutson et al. 1997; McPhaden et al. 2011; Hu et al. 2013; Kim et al. 2014a) and the wind-SST feedback (Lloyd et al. 2009; Vijayeta and Dommenget 2018). The background mean state has an influence on ENSO amplitude via the strength of the surface–subsurface coupling (Hu et al. 2013). Changes on climatological trade winds, which affect zonal SST gradient, vary the response of the zonal thermocline slope to zonal wind anomalies (Kim et al. 2014a). In the framework of the recharge oscillator model, Vijayeta and Dommenget (2018) could show under present day condition that the wind-SST feedback has the strongest influence on ENSO amplitude. Although ENSO is a complex phenomenon, we only focus in the following on these two factors to get insight into origin of the inter-model spread.

In Fig. 11a, b), we show the global warming signal of the ENSO amplitude and the wind-SST feedback. A strong positive linear relationship is detected with correlation coefficients of 0.90 (RCP4.5/SSP2-4.5, Fig. 11a) and 0.84 (RCP8.5/SSP5-8.5, Fig. 11b). In “Strong” (red color), the correlation coefficients amount to 0.95 and 0.91, respectively.

Fig. 11
figure 11

Inter-model relationship between the global warming signal of the ENSO amplitude change (x-axis) and; a, b the zonal wind stress-SST feedback change; c, d the Pacific equatorial mean zonal SST gradient change (y-axis) for; a, c RCP4.5/SSP2-4.5 scenario, and b, d RCP8.5/SSP5-8.5 scenario; crosses indicate CMIP5 models and triangles CMIP6 models; red corresponds to “Strong” sub-ensemble; the correlation with a 95% confidence level is shown

The relationship between the projected ENSO amplitude change and the zonal SST gradient is not as strong. While model ensemble exhibits a large spread of ENSO amplitude change, most of them project a decrease of the zonal SST gradient (Fig. 11c, d). The correlation coefficients are for -0.36 (RCP4.5/SSP2-4.5) and -0.25 (RCP8.5/SSP5-8.5). The “Strong” sub-ensemble models show an improved correlation of -0.58 and -0.45. When calculating the SST gradient with different box averages, the results are virtually unchanged. Therefore, we conclude that the change in wind-SST feedback is an important factor of ENSO amplitude under global warming.

6 Summary and discussion

Using a CMIP5 and CMIP6 multi-model ensemble, the global warming signal in projected ENSO amplitude and the corresponding uncertainties have been quantified. The uncertainties have been split into the model uncertainty (spread of ENSO amplitude change within the ensemble), scenario uncertainty (spread of ENSO amplitude change caused by the different scenarios), and internal variability uncertainty (spread due to decadal ENSO variability). CMIP5 and CMIP6 models highly disagree with respect to future ENSO amplitude change. Projected changes range from decreasing to increasing ENSO amplitude (from − 0.4 to + 0.6 °C), with the mean global warming signal averaged over all models and scenarios close to zero.

Many state-of-the-art coupled climate models fail to simulate realistic ENSO characteristics. Therefore, models with realistic ENSO feedbacks and thus possibly realistic ENSO dynamics have been identified and grouped into the “Strong” sub-ensemble. The “Strong” sub-ensemble contains the models that are able to simulate the non-linearity of ENSO most realistically (Cai et al. 2020; Hayashi et al. 2020). We also have investigated the unforced decadal variability of the ENSO amplitude. From this latter analysis, three additional sub-ensembles have been formed: models with high and low interannual and decadal ENSO variability, termed “High” and “Low”, respectively, and models with moderate interannual and decadal ENSO variability, termed “Moderate”. The later sub-ensemble is the closest to the observed ENSO variability.

Within CMIP5 models, the “High” sub-ensemble projects a reduction of the ENSO amplitude towards the end of the twenty-first century, while “Moderate” and “Low” sub-ensembles indicate an increase. When only considering realistic ENSO dynamic models, the “Strong” sub-ensemble, the signal is intensified: the negative and positive changes of the ENSO amplitude are increased both for “High” and “Moderate”, respectively. The result is consistent between scenarios: the signal is stronger for the RCP8.5 scenario than for the RCP4.5. In contrast, most of CMIP6 models under SSP5-8.5 scenario project an increase in ENSO amplitude towards the end of the twenty-first century, in agreement with recent studies (Fredriksen et al. 2020). The strongest increase is projected by models with high interannual and decadal ENSO variability. When considering the “Strong” sub-ensemble, the positive signal of ENSO amplitude change is reduced. In this case, the result is not consistent between the scenarios: models under SSP2-4.5 scenario project a decrease of the ENSO amplitude. At this point, we must keep in mind that in this study we have been able to use 20 CMIP6 models in comparison to 36 CMIP5 models. Looking into models with weak ENSO atmospheric feedbacks, all sub-ensembles besides “Moderate” in CMIP5 show a stronger positive ENSO amplitude change than “Strong” models. In conclusion, the global warming signal of ENSO amplitude highly varies between CMIP5 and CMIP6, and the studied sub-ensembles.

The total uncertainty in the projected ENSO amplitude change obtained from all CMIP5 and CMIP6 models exhibits an increase over time: 0.11 °C at the beginning to 0.35 °C towards the end of the twenty-first century. Internal variability is the main contributor to the total uncertainty during the first three decades. The inter-model differences dominate thereafter, while scenario uncertainty is relatively small throughout the entire twenty-first century. CMIP6 models show a larger uncertainty than CMIP5 models. Although the model uncertainty is decreased, the scenario uncertainty is considerably increased (from 0.04 to 0.12 °C). This is in general agreement with previous studies indicating a greater climate sensitivity for CMIP6 models (Meehl et al. 2020). The largest uncertainty within a sub-ensemble is observed in “High”, approximating to 0.4 °C, and the smallest uncertainty when combining “Strong” and “Moderate” (about 0.3 °C). However, as shown by the signal-to-noise ratio, the global warming signal in the projected ENSO amplitude change is too small to be robustly detectable.

Finally, we have investigated two potential sources for the strong inter-model differences. The model spread is highly correlated with the spread in wind-SST feedback change, with a correlation coefficient of 0.90 and 0.84 for RCP4.5/SSP2-4.5 and RCP8.5/SSP5-8.5 scenarios, respectively. This suggests that it is important to understand the factors determining the wind-SST feedback under global warming to reduce uncertainty in ENSO-amplitude projections. However, from our analysis one cannot assure that the wind feedback is the dominant contributor to the future ENSO amplitude change, as it might partially be canceled by the change of the thermodynamic negative feedback, e.g., the shortwave feedback. A quantitative comparison between the positive and the negative feedback in terms of the ENSO amplitude change is out of scope of this paper. The correlation with the change in mean zonal SST gradient is of  − 0.36 and − 0.25. While most of the models agree on the reduction of the mean zonal SST gradient under global warming, the response of the wind feedback is extremely model dependent. This discrepancy between the mean state changes and the wind feedback changes is a puzzling question that needs to be answered in the future. A previous study has shown that there is a non-linear relation between mean-state changes and ENSO amplitude, in which ENSO amplitude increases till an optimum and then decreases again (Hu et al. 2013). Considering the large mean state biases present in climate models, this might explain why the ENSO amplitude change varies to a similar mean state changes. In fact, if we consider realistic ENSO dynamic models, which show the smallest Niño4 SST bias, the inter-model correlation with SST gradient change is increased to − 0.58. In addition, the wind-SST feedback strength is strongly linked to the rising branch of the Walker Circulation, which again highly depends on the mean state (Bayr et al. 2020). Similarly, there is an ongoing debate about how the Walker Circulation will change under global warming (Knutson et al. 1997; Vecchi and Soden 2007; DiNezio et al. 2009, 2013; Sohn and Park 2010; Yu and Zwiers 2010; Power and Kociuba 2010, 2011; Meng et al. 2012; Luo et al. 2012; L’Heureux et al. 2013; Bayr et al. 2014). Thus, it is of great importance to improve the present mean state model biases, to understand how the Walker Circulation will change under global warming, and how this will affect ENSO amplitude.