1 Introduction

The El Niño-Southern Oscillation (ENSO) dominates interannual climate variability in the tropics and has a remarkable impact on the global climate (e.g., Rasmusson and Carpenter 1982; Brönnimann 2007; Zhang et al. 2017). Over the past three decades, much attention has been given to predicting ENSO and such predictions of the ENSO signal have laid a strong foundation for short-term climate predictions. Previous studies focusing on improving ENSO predictions have made significant progress. Overall, the predictive skill for ENSO has been high when using the Niño3.4 sea surface temperature (SST) anomaly index to represent the ENSO phenomenon (Latif et al. 1998; Jin et al. 2008; Luo et al. 2008; Ren et al. 2017); this skill, however, is always relatively lower in real-time forecasts than in hindcasts and varies during epochs with considerable uncertainties (Barnston et al. 2012). The recent experiences in predicting 2014–2016 super El Niño event have challenged current capability of international ENSO prediction and given enlightenments for further researches on prediction (Mu and Ren 2017).

Many methods and techniques are being continuously developed to understand atmospheric and oceanic processes key to the formation of ENSO-related SST anomalies and to improve ENSO predictions as a result of progress with respect to ENSO dynamics, climate modeling, and oceanic data assimilation (e.g., Sun et al. 2006; Deng et al. 2012; Park et al. 2012). In general, models for predicting ENSO include simplified dynamical atmosphere–ocean coupled models, fully coupled global climate models, and climate system models, as well as physics-based statistical models (e.g., Zebiak and Cane 1987; Chen et al. 1995; Kang and Kug 2000; Kirtman 2003; Luo et al. 2005; Izumo et al. 2010; Zhang and Gao 2016). All these types of models are used in the monthly routine ENSO forecasting issued jointly by the CPC/NCEP (the Climate Prediction Center/National Centers for Environmental Prediction) and the IRI (the International Research Institute for Climate and Society).

Accompanying the increased understanding of ENSO dynamics, several intermediate atmosphere–ocean coupled models were designed to predict ENSO; such models usually involve simplified atmosphere–ocean coupling and physics with coarse resolutions (Cane et al. 1986; Zebiak and Cane 1987; Chen et al. 1995; Kang and Kug 2000; Zhang et al. 2003) and, being thought to efficiently capture the major dynamics and key physics of ENSO, are still used in operation (Barnston et al. 1999, 2012). Moreover, fully coupled climate models have been widely used to produce ENSO forecasts and have made great progress (Latif et al. 1998; Jin et al. 2008); however, it is always an issue how to accurately initialize these complicated coupled models (Luo et al. 2008; Kirtman and Min 2009). The multi-model ensemble has been proved to be an effective method to improve ENSO predictions (Wang et al. 2009; Kirtman et al. 2014). Meanwhile, due to model deficiencies, post-calibration of ENSO predictions is required via the development of empirical/statistical correction methods for model forecasts (Kug et al. 2008; Ren et al. 2014; Wang et al. 2017; Liu and Ren 2017).

Because current dynamical models are unrealistic to some degree when depicting ENSO dynamics, statistical ENSO prediction is still an alternative with advantages, such as directly reflecting the intrinsic processes, and external factors that control the variations of ENSO are worthy of being used in operations with a real-time prediction skill comparable to that of dynamical models (Barnston et al. 2012). Many statistical models have been developed and used for ENSO prediction (Xue et al. 2000; Clarke and Van Gorder 2001, 2003; Ruiz et al. 2005; Drosdowsky 2006; Lima et al. 2009; Izumo et al. 2010; Boschat et al. 2013; Tseng et al. 2017). In addition to the initial values of the SST anomaly, the equatorial Pacific warm water volume (WWV) and the surface zonal wind stress (ZWS), which together control the intrinsic dynamics of ENSO, have been widely proved to be effective indicators and the key precursors of the Niño SST indices representing the variations of ENSO (Clarke 2014). The WWV precursor represents the zonally quasi-uniform mode of the tropical Pacific thermocline depth variations, with its peak occurring several months prior to that of the central–eastern Pacific SST anomalies due to the discharge–recharge process of the upper-ocean heat content (Jin 1997a, b; Clarke et al. 2007). The ZWS precursor reflects the contribution of the equatorial zonal wind anomaly, which is phase-locked with the seasonal cycle over the Indo–Pacific areas, to the growth of the SST anomaly in the central–eastern Pacific (Clarke 2014). Other precursors that are located outside the tropical Pacific have also been, or can potentially be, used in ENSO statistical predictions by transferring their signals into ENSO preconditions. Such precursors primarily include the Indian Ocean Dipole (IOD) (Izumo et al. 2010), the North Pacific oscillation (NPO) (Chang et al. 2007; Park et al. 2013), the tropical Atlantic SST anomalies (Ham et al. 2013a, b), and the Southern Hemispheric SST modes (Terray 2011; Boschat et al. 2013). However, further effort is needed to examine the relative roles of these external factors by involving them in “benchmark” statistical prediction models that have previously been established using both the WWV and ZWS precursors.

Despite its considerable successes, ENSO prediction is still a challenging issue with considerable uncertainties, particularly because ENSO behaviors have become increasingly diverse and therefore difficult to predict since around the year 2000 (Barnston et al. 2012). The major factor that caused the decrease in the prediction skill is related to ENSO diversity. A large amount of evidence indicates the coexistence of two flavors/types of ENSO events, the eastern Pacific (EP) type and central Pacific (CP) type, in terms of the zonal positions of their equatorial SST anomaly center, which was likely first noted by Fu et al. (1986). Many studies have revealed that the CP type has become more frequent since the 1980s (Larkin and Harrison 2005a, b; Ashok et al. 2007; Kao and Yu 2009; Kug et al. 2009; Ren and Jin 2011) and will likely become more frequent in a changed climate (Yeh et al. 2009). The EP and CP types of ENSO, also called the cold-tongue (CT) and warm-pool (WP) types, correspond to two coupled ENSO modes (Bejarano and Jin 2008; Ren et al. 2013; Wang and Ren 2017; Xie and Jin 2018). Because the impact of the CP-type ENSO on the climate worldwide is distinctly different from that of the EP type (e.g., Weng et al. 2007; Kim et al. 2009; Hegyi and Deng 2012; Zhang et al. 2011, 2012; Hegyi et al. 2014; Wang and Wang 2014), large amounts of attention have been given to examining the typical features and mechanisms of the two ENSO types (Yeh et al. 2014, and references therein); however, less attention has been given to their predictability.

Recent studies have assessed the performance of dynamical models in predicting the two ENSO types (Hendon et al. 2009; Lim et al. 2009; Jeong et al. 2012, 2015; Yang and Jiang 2014; Imada et al. 2015; Zhu et al. 2015; Ren et al. 2017). These studies show that the EP type has a relatively higher prediction skill than the CP type, even though the latter tends to possess a better persistence or weaker persistence barrier than the former in terms of the Niño indices (Kim et al. 2009; Ren et al. 2016a). It remains unclear whether the CP type is dynamically less predictable than the EP type because current models are not capable of sufficiently reproducing the observed differences in the two ENSO types (Yu and Kim 2010; Ham and Kug 2012). Using an analogue-based correction method for reducing model prediction errors which can be diagnosed from hindcasts at historical analogue states (Ren et al. 2014), forecast skills of the Niño indices representing the two ENSO types can be significantly improved in the CFSv2 predictions (Liu and Ren 2017).

Studies of the statistical predictability of the two ENSO types are necessary, in addition to those based on dynamical models, because many studies have revealed the physical mechanisms for motivation and maintenance of the two ENSO types. Both the equatorial Pacific thermocline and the zonal wind variations play important roles in motivating and maintaining the two ENSO types (Ashok et al. 2007; Ren and Jin 2013), and the recharge oscillator mechanism proposed by Jin (1997a, b) occurs in both ENSO types (Ren and Jin 2013), suggesting that both the WWV and ZWS precursors are expected to be applicable to statistical predictions of the two types of Niño indices. Moreover, some external motivators outside the tropical Pacific could be potential precursors of the two ENSO types (e.g., Boschat et al. 2013; Xie et al. 2013). For example, atmospheric signals from the Pacific subtropics and extratropics tend to trigger the CP El Niño via the seasonal footprinting mechanism (Yu et al. 2010) and the SST anomalies in the northern tropical Atlantic also favor the CP type (Ham et al. 2013a). Conversely, the equatorial Atlantic SST anomalies tend to enhance the EP type (Ham et al. 2013b; Keenlyside et al. 2013). Recently, a North American dipole (NAD) mode was shown to serve as a unique precursor of a CP El Niño (Ding et al. 2017). Until now, it is still lack of comprehensive examining of statistical predictability of the two ENSO types and hence developing specifically effective statistical models for their respective Niño indices. It also remains unclear whether and to how large a degree these predictors can contribute to the statistical predictability in terms of the two ENSO types because some of these precursors are not generally suitable to ENSO prediction but are strongly locked to the seasonal cycle.

In this study, we focus on the development of statistical prediction models for the different Niño indices that represent the two types of ENSO and examine the statistical predictability of these two types. To do this, we quantitatively demonstrate the relative importance of the key precursors that have been suggested in previous studies with different definitions and then set up an advanced statistical model with relatively high prediction skill for the respective Niño indices of the two ENSO types. Following the introduction, the data and methods are introduced in Sect. 2. The benchmark models for statistically predicting the different Niño indices are set up in Sect. 3. The statistical predictabilities of the two ENSO types are examined in Sect. 4 by incorporating the external precursors into the benchmark models. The establishment and evaluations of the eventual statistical prediction models for the Niño indices of the two ENSO types are given in Sect. 5. A discussion and summary are given in Sect. 6.

2 Data and methods

Traditional Niño3.4, Niño3, and Niño4 indices are calculated as SST anomalies averaged over the Niño3.4 (5°S–5°N, 170°–120°W), Niño3 (5°S–5°N, 150°–90°W), and Niño4 (5°S–5°N, 160°E–150°W) regions, respectively. The Niño warm-pool index (WPI) and the Niño cold-tongue index (CTI) proposed by Ren and Jin (2011) are used in this study to represent the CP and EP types of ENSO, respectively. We also use the El Niño Modoki index (EMI) of Ashok et al. (2007) for comparison. In this study, the EP-type ENSO indices include both the CTI and the Niño3 index, and the CP-type indices include the WPI, EMI as well as the Niño4 index. Definitions of these SST-based indices are given in Table 1. All indices are calculated using monthly optimum interpolation SST (OISST) data from the National Oceanic and Atmospheric Administration (Reynolds et al. 2002), which have a horizontal resolution of 1° × 1°.

Table 1 Definitions of the Niño indices and their precursors from the tropical Indian and Atlantic oceans and the extratropics

We use the monthly potential temperature and the zonal momentum flux (zonal wind stress) from the NCEP Global Ocean Data Assimilation System (Behringer and Xue 2004; Huang et al. 2008). These datasets have a horizontal resolution of 0.333° in latitude and 1.0° in longitude. The WWV is represented by the 20 °C isotherm depth. Monthly global atmospheric circulation data are derived from the NCEP/National Center for Atmospheric Research Reanalysis data, which have a horizontal resolution of 2.5° × 2.5° (Kalnay et al. 1996). The study is conducted during the period of 1982–2016, when all these datasets were available. Monthly anomalies are calculated relative to the climatology of the entire period, and a 3-month running average is applied to the data to reduce the noise. Linear trends were removed prior to the analysis.

The prediction skills of the Niño indices are evaluated using the temporal correlation coefficients (TCCs) between the observations and the predictions. Both the leave-one-out cross validation and the independent validation are used to assess the statistical predictabilities of the Niño indices. For the independent validation, the prediction is performed during the period of 1997–2016 using observations for the period from 1982 to the target year. Because the skill scores obtained from the cross validation are very similar to those obtained from the independent validation, we only show the results of the cross validation. The statistical significance of the regression coefficient and the correlation coefficient is assessed using a two-tailed Student’s t test, and the difference between two correlation coefficients is tested using the Steiger’s z test (Meng et al. 1992).

3 Benchmark models using the equatorial Pacific heat content and zonal wind stress

3.1 Relationships of the Niño indices with the heat content and zonal wind stress

In previous statistical models that did not consider the two ENSO types, the equatorial Pacific upper-ocean heat content (i.e., the WWV) and the ZWS precursors have been widely used. Due to their importance in the ENSO evolution, Figs. 1 and 2 examine the lead–lag correlations of different Niño indices with the equatorial (5°S–5°N) WWV and ZWS anomalies, respectively. It is clearly seen in Fig. 1 that significant WWV signals appear in nearly the entire equatorial Pacific prior to the SST peak, and in particular, the zonally quasi-uniform WWV signal leads the SST peak by more than half a year for all the Niño indices. It is noteworthy that the WWV signal associated with the EP-type indices (Fig. 1a–b) is centered more eastward compared to that for the CP-type indices (Fig. 1d–f). Moreover, the lag relationships of the WPI and EMI with the equatorial Pacific WWV anomalies are obviously weaker compared to those for the other Niño indices.

Fig. 1
figure 1

Lead–lag correlations between the a Niño3, b CT, c Niño3.4, d Niño4, e WP, and f EMI indices and the equatorial (5°S–5°N) mean depth of the 20 °C isotherm (WWV) for all months combined during the period of 1982–2016. White contours indicate significance at 95% confidence level. Lead times with negative values indicate that the WWV is leading

Fig. 2
figure 2

The same as Fig. 1, but for correlations with the equatorial (5°S–5°N) zonal wind stress (ZWS) anomalies

In Fig. 2, the most significant positive ZWS anomalies are observed over the equatorial central–western Pacific leading the SST peak by up to 10 months for all the Niño indices. In addition, note that remarkably negative ZWS anomaly signals appear in the equatorial eastern Pacific prior to the CP ENSO peak, whereas prior to the EP ENSO peak, there are much weaker signals there as well as apparent signals in the equatorial eastern Indian Ocean. Overall, the lead time of the ZWS signal with respect to the SST signal is relatively shorter than that of the WWV signal for both the EP and CP ENSO indices, and the lag correlation of the Niño3.4 index with both the WWV and ZWS precursors is relatively high compared to the other Niño indices.

3.2 Benchmark models and prediction skills for different Niño indices

In this study, we first build benchmark models that use only the initial values of the Niño indices and the equatorial Pacific WWV and ZWS as precursors. Here, we give a multi-element linear regression equation to represent the benchmark model:

$${{{\rm Ni}\tilde {\rm n}{\rm o}}}\left( t \right)=\alpha {{{\rm Ni}\tilde {\rm n}{\rm o}}}(t){\text{ }}+\beta {\text{WWV}}(t){\text{ }}+\gamma {\text{ZWS}}(t),$$
(1)

where α, β, and γ are parameters that change with calendar month and lead month, and can be obtained by applying a linear fit to the observations, t denotes the target month for the prediction, and τ denotes the initial month.

For the model of each Niño index, the definitions of the WWV and ZWS indices refer to the correlation domains shown in Figs. 1 and 2. Table 2 lists the spatial domains used to define the WWV and ZWS indices as precursors for the different Niño indices, which are generally grouped for the EP and CP ENSO types. According to the results shown in Fig. 2, the ZWS anomalies in both the equatorial central–western and eastern Pacific are combined to define the precursors in the models of the CP ENSO indices, whereas only those in the equatorial central–western Pacific are used to define the precursors of the EP ENSO indices. Additional attempts were made to define the WWV and ZWS indices by changing the zonal range of the domains, and for each Niño index, we have chosen the domains that offer the best prediction of that Niño index. Note that the definition of the WWV index for the CP type is slightly different from that for the EP type, while the definitions of the ZWS index differ significantly for the two types.

Table 2 Domains used to calculate the WWV and ZWS precursors for each Niño index

As Clarke and Van Gorder (2003) revealed, the westerly (easterly) wind anomalies, which are phase-locked to the seasonal cycle, usually propagate from the equatorial eastern Indian Ocean into the western Pacific with the development of an El Niño (La Niña) event. They demonstrated that a space–time integration of the Indo–Pacific ZWS signal could improve the prediction skill of the Niño3.4 index for some leads in the range of 10–15 months. However, we did not find an apparent improvement in the prediction of the Niño indices for leads in the range of 0–11 months using a scheme involving both the equatorial Indian and Pacific ZWS signals compared to our scheme, which considers only the equatorial Pacific ZWS (figures omitted). This result reflects the fact that both the EP and CP ENSO indices have major correlations with the simultaneous or short-lead ZWS signals in the equatorial Indian Ocean (Fig. 2). Therefore, only the equatorial Pacific ZWS signals are used in our benchmark models.

To evaluate the prediction skill of the benchmark model, the cross-validated TCC skill scores of the Niño indices predicted by the benchmark model and their differences relative to the persistence forecast for all the months combined during the period of 1982–2016 are computed, as shown in Fig. 3. It is indicated that the skill scores of the benchmark model feature a gradual decline with increasing lead months for all the Niño indices (Fig. 3a). Compared to the skill scores of persistence forecast, the WWV and ZWS precursors can significantly contribute to the skills of the CTI and the Niño3, Niño4 and Niño3.4 indices for leads of 1–11 months (Fig. 3b). For the EMI (WPI), its skill in the benchmark model is much higher than that of the persistence forecast for leads longer than 6 (4) months. Note that the Niño3.4 index has the highest skill and that the Niño4 index has a slightly higher skill than the Niño3 index. Moreover, both the CTI and the WPI, defined as a transformation of the Niño3 and Niño4 indices (Ren and Jin 2011), show relatively lower skills compared to the latter two, respectively. This may be due to their definitions, which use the differences of the two domain-averaged Niño indices, and is consistent with the results shown in Figs. 1 and 2, where the correlations for the Niño3 and Niño4 indices are for the most part higher than those for the CTI and WPI.

Fig. 3
figure 3

a Cross-validated correlation skill scores (solid line) of the Niño indices predicted by the benchmark model and b their differences relative to the persistence forecast (dashed line in panel a) for all months combined during the period of 1982–2016. Markers indicate that the difference is significant at the 95% confidence level

We further examine the dependence of the skills of the benchmark models on the target calendar months, as shown in Fig. 4. The cross-validated skill scores (see the contours) of the three Niño indices that are usually used to represent the traditional or EP ENSO are higher during boreal autumn and winter but much lower during spring and summer (Fig. 4a–c), whereas those of the CP ENSO indices appear to be higher during winter and spring but lower during late summer and autumn (Fig. 4d–f). This is one of the most significant features in the differences of the statistical predictability between the two ENSO types. In Fig. 4c, the skill pattern of the Niño3.4 index is similar to those in previous studies (e.g., Barnston et al. 2012). It is clear that the predictions of the CTI and the Niño3 and Niño3.4 indices have a higher skill score for most of the target and lead months, except the target months of January‒April with short leads, compared to those of the persistence forecast. However, the predictions of the CP-type indices are less skillful in the benchmark model than those of the EP-type indices for some of the target and lead months. In particular, the EMI only has superior skill scores in the boreal autumn and winter with long leads in the model, compared to the persistence forecast.

Fig. 4
figure 4

Cross-validated correlation skill scores (contours) as a function of target calendar month (y axis) and lead month (x axis) for the benchmark model and their differences (shading) relative to the persistence forecast for the a Niño3, b CT, c Niño3.4, d Niño4, e WP, and f EMI indices during the period of 1982–2016. Crosses indicate that the difference is significant at the 95% confidence level

Figure 4 indicates that the WWV and ZWS precursors in the benchmark models can significantly contribute to the prediction skills of most of the Niño indices with leads longer than ~ 1 (6) month(s) for target months during the period of June–December (January–February). A useful skill score (of more than 0.6) can be obtained across the spring predictability barrier (SPB) where the prediction skills decline most rapidly. The SPB is closely related to the persistence barrier of ENSO, which usually occurs in late spring to early summer (Ren et al. 2016a), as shown in Fig. 5. In the study of Ren et al., the persistence barrier is quantitatively shown to be distinct for the EP and CP types of ENSO; the persistence barrier tends to occur in late spring with a stronger intensity for the EP type and in summer with a weaker intensity for the CP type (see the shading in Fig. 6). Based on Fig. 4, similar conclusions can be obtained for the SPB. Here, we can examine the timings and intensity of the SPB by identifying the maximum decline rate of the prediction skills using a method similar to the definition of the ENSO persistence barrier (Ren et al. 2016a). It is shown that the SPB of the EP type appears to be earlier and stronger compared to that of the CP type (see the contours in Fig. 6). In fact, the predictability barrier of the EP type typically occurs during boreal spring, whereas that of the CP type tends to be more visible during boreal summer. It is also clear that the evident skill superiority provided by the WWV and ZWS precursors, compared to the persistence forecast, primarily occurs during the SPB and afterwards. However, the prediction skills for the CP ENSO indices have much less skill superiority, partly due to their own better persistence (Ren et al. 2016a). Indeed, involving new precursors outside the tropical Pacific is likely a potential way to further improve the CP ENSO prediction.

Fig. 5
figure 5

Autocorrelations of the a Niño3, b CT, c Niño3.4, d Niño4, e WP, and f EMI indices during the period of 1982–2016. Crosses indicate significance at the 95% confidence level

Fig. 6
figure 6

Decline rate of the autocorrelations (shading) and cross-validated correlation skill scores for the benchmark model (contours) of the a Niño3, b CT, c Niño3.4, d Niño4, e WP, and f EMI indices during the period of 1982–2016. Contours with negative values are solid

4 Predictability from other precursors in the tropics and extratropics

4.1 Contributions to the prediction skills of Niño indices

Previous studies have demonstrated that there are several external signals from outside the tropical Pacific that can lead the ENSO peak (Ding et al. 2017 and references therein). Impacts of these external signals on the prediction skills of the Niño indices are examined here. The examined precursors include the IOD in Sept–Oct–Nov (SON), the northern tropical Atlantic (NTA) SST anomalies in Feb–Mar–Apr (FMA), the southern Indian Ocean Dipole (SIOD) in FMA, the southern Atlantic Ocean Dipole (SAOD) in FMA, and the NAD and NPO in Nov–Dec–Jan (NDJ), which have been demonstrated to have an important impact on the ENSO variability (Izumo et al. 2010; Boschat et al. 2013; Ham et al. 2013a, b; Ding et al. 2017). Definitions of these precursors are given in Table 1, where the IOD, NTA, SIOD, and SAOD indices are defined using SST anomalies while the NAD and NPO indices are defined using sea level pressure (SLP) anomalies.

These selected precursors have distinct ways to affect ENSO, as referred to these aforementioned papers. The IOD related signal in SON can influence the sequent winter ENSO in phase and the second sequent winter ENSO out of phase through exciting the surface westerly wind anomalies over the western Pacific and then the ocean dynamics (Izumo et al. 2010). The NTA SST cooling in FMA contributes to the CP warming at the sequent winter via the air-sea interaction over the tropical Pacific (Ham et al. 2013a). The NAD in NDJ, as an atmospheric signal, can induce SST anomalies in the NTA and then subsequently influence the development of the CP El Niño (Ding et al. 2017). The NPO-related atmospheric signals from the Pacific subtropics and extratropics tend to trigger the CP El Niño via the seasonal footprinting mechanism (Chang et al. 2007; Park et al. 2013). The SIOD and SAOD in FMA both can affect ENSO development by modulating the southeast trades over the South Pacific in spring (Terray 2011; Boschat et al. 2013).

To investigate the impact of precursors from outside the tropical Pacific on the prediction skills of the Niño indices, the benchmark skill score is first obtained from a benchmark model built with the WWV and ZWS precursors and then the TCC skill scores are calculated for each Niño index by individually adding other precursor (e.g., IOD) to the benchmark model. Figure 7 shows the lag correlations of the Niño indices with the aforementioned precursors at a given season from which the impacts of Niño3.4 SST signal in the previous Dec–Jan–Feb (DJF) season have been linearly removed by using a regression, and Fig. 8 displays cross-validated TCC skill scores for the Niño indices predicted by individually adding these precursors into the benchmark model and their differences relative to the benchmark skills. Note that the IOD and other precursors are used at all initial months as predictors in the statistical models and not just locked to certain seasons as discussed previously.

Fig. 7
figure 7

Lag correlations of the Niño indices with the a IOD in Sep–Oct–Nov (SON), b NTA in Feb–Mar–Apr (FMA), c SIOD in FMA, d SAOD in FMA, e NAD in Nov–Dec–Jan (NDJ), and f NPO in NDJ after excluding the Niño3.4 SST signal in the previous Dec–Jan–Feb (DJF) season during the period of 1982–2016. The dotted (dashed) line indicates significance at the 95% (90%) confidence level. The lead time indicates that the Niño indices are lagged

Fig. 8
figure 8

Cross-validated correlation skill scores (contours) for the a Niño3, b CT, c Niño4, d WP, and e Niño3.4 indices predicted when adding the IOD precursor to the benchmark model and their differences (shading) relative to the benchmark skill scores during the period of 1982–2016. Crosses indicate that the difference is significant at the 95% confidence level. fy and ae are the same as af but for the NTA, SIOD, SAOD, NAD, and NPO precursors, respectively

Figure 7a demonstrates that the Niño3.4 index has a significant in-phase relationship with the SON IOD index for short lags but an out-of-phase relationship for long lags, with a maximum correlation at a lag of ~ 15 months. This agrees well with Izumo et al. (2010), who revealed that a negative (positive) phase of the SON IOD leads the El Niño (La Niña) peak by approximately 14 months. Similar results of a lag relationship are obtained for the CTI and the Niño3 and Niño4 indices; however, a relatively weak and still significant correlation is found between the WPI and the SON IOD index. As expected, the TCC skill scores of the Niño indices, except WPI, are improved by introducing the IOD signal when the predictions are initiated during the boreal autumn and this limited improvement is not significant and is only visible after 4–5 forecast months (Fig. 8a–e).

The NTA SST anomalies during the boreal spring (FMA) could be a candidate predictor of the CP ENSO (Ham et al. 2013a, b). This is supported by Fig. 7b, which indicates that the NTA index in FMA has a significant and negative correlation with both the WPI and the Niño4 index in the following months, with a maximum correlation for a lead of ~ 10 months. Consistently, significant improvements are observed in the boreal autumn–winter predictions of WPI when starting from January–February (Fig. 8i), while similar but relatively weak improvements are found for the Niño4 index (Fig. 8h). Conversely, the correlations of the NTA index in FMA with the CTI and the Niño3 index in the following months are relatively weak (Fig. 7b); therefore, the impact of the former on the skill scores of the two EP ENSO indices is also weak (Fig. 8f, g). However, note that the NTA precursor during the late summer‒early autumn can significantly improve the predictions of the EP ENSO indices, as well as the Niño3.4 index, as opposed to the CP ENSO indices, as seen in Fig. 8f, g, j. Such a distinct impact of the boreal spring and summer‒autumn NTA SST anomalies on the two types of ENSO may be related to the seasonal evolutions of the mean state (Rodríguez-Fonseca et al. 2009; Ham et al. 2013a, b), which requires further examination but lies beyond the scope of the present study.

The subtropical dipole-like SST variability in the southern Indian and Atlantic oceans may provide another source of predictability for ENSO during recent decades (Terray 2011; Boschat et al. 2013). Here, we define the SST-based SIOD and SAOD indices, where the spatial domains are determined following Terray (2011) and Boschat et al. (2013). The SIOD index in FMA is significantly and positively correlated with the CTI and the Niño3 and Niño3.4 indices when the former leads over a range of 1–14 months (Fig. 7c), and similar correlation patterns are observed for the SAOD index in FMA with even higher correlations (Fig. 7d). Both the SIOD and SAOD indices in FMA have the highest positive correlations with the Niño4 index for a lead of approximately 3 months and then decrease gradually with increasing lead time. Much weaker lag correlations are seen for the WPI. However, no significant improvement can be observed in the prediction skills of any of the Niño indices when adding either the SIOD or SAOD precursor to the benchmark model (Fig. 8k–t).

Both the boreal winter NAD and NPO in the extratropical Northern Hemisphere play roles in influencing the ENSO variability in the following winter (Ding et al. 2017). As indicated in Fig. 7e, the NAD index in NDJ has a significant in-phase relationship with both the WPI and the Niño4 index for a lead of approximately 8 months or longer but a very weak relationship with both the CTI and the Niño3 index. This is consistent with Ding et al. (2017), who revealed that the NAD signal is more closely related to the CP ENSO than to the EP ENSO. Consistently, the NAD precursor in early winter can increase the prediction skills of the WPI and the Niño4 index in the following autumn‒early winter seasons, even though a significant improvement is only observed when the NAD leads the WPI by approximately 10 months (Fig. 8w–s). Similarly, the NPO index in NDJ has a significant in-phase relationship with all the Niño indices except WPI for a lead of approximately 6 months or longer (Fig. 7f); however, no significant improvement is obtained in the prediction of any of the Niño indices when adding the NPO precursor to the benchmark model (Fig. 8a–e).

4.2 Dependence on the equatorial Pacific WWV and ZWS indices

According to the recharge oscillator mechanism (Jin 1997a, b), the evolutions of the two types of ENSO are intimately related to the equatorial Pacific WWV and ZWS anomalies (Ren and Jin 2013). Even though the physical processes responsible for the linkage of the IOD, NTA SST anomalies, and extratropical precursors to the two types of ENSO are different, all these ENSO precursors have signatures in the equatorial Pacific WWV and/or ZWS (e.g., Izumo et al. 2010; Terray 2011; Boschat et al. 2013; Ham et al. 2013a, b; Ding et al. 2017).

To examine whether the impacts of the aforementioned precursors on the prediction skills of the Niño indices are related to the WWV and/or ZWS anomalies that might be remotely initialized by affecting the equatorial Pacific atmosphere–ocean conditions, we perform two sensitivity experiments. That is, we exclude either the ZWS or WWV precursor from the benchmark model and then recalculate the prediction skills of the Niño indices by individually adding other precursors into this new benchmark model. Figures 9 and 10 show the corresponding cross-validated TCC skill scores of the Niño indices and their differences relative to the benchmark skill scores.

Fig. 9
figure 9

The same as Fig. 8, but for the skill scores excluding the ZWS precursor from the benchmark model

Fig. 10
figure 10

The same as Fig. 8, but for the skill scores excluding the WWV precursor from the benchmark model

A comparison of Fig. 9 with Fig. 8 indicates that the improvement in the prediction skills of the Niño4 index and the WPI caused by the use of the NTA and NAD precursors becomes more obvious when the ZWS precursor is excluded from the benchmark model. In particular, the NTA precursor in January–April can significantly improve the prediction skill of the WPI in the following autumn–winter seasons and that in late summer‒autumn can significantly contribute to improvement of prediction of the two EP ENSO indices during the next spring when excluding the ZWS precursor from the benchmark model (Fig. 9i). However, the improvement in predicting the Niño indices by other precursors based on a benchmark model excluding the ZWS signal is similar to that in the benchmark model including both the WWV and ZWS signals. This clearly suggests that the NTA SST and NAD act to inject remote signals into the equatorial Pacific ZWS anomalies that tend to initialize the CP ENSO, and the NTA SST also contribute to improving predictions of the EP ENSO in the following spring, whereas the information from other precursors is weakly involved in the ZWS signals.

Conversely, the comparison of Fig. 10 with Fig. 8 shows a pronounced improvement in the prediction skills of the Niño indices except WPI caused by the use of the IOD, NTA, SAOD, and NPO precursors across the SPB domain when the WWV precursor is excluded from the benchmark model; such an improvement is also seen for the SIOD and NAD precursors but covers much smaller domains. For the WPI, the improvement in the prediction skills in the autumn–winter seasons caused by the use of the NAD precursor in the previous winter becomes significant and more pronounced (Figs. 8s, 10s) and the improvement caused by the use of the NTA precursor in the previous winter becomes slightly higher in this case (Figs. 8h, 10h). These results indicate that the influences of most of the external precursors on ENSO are realized via the initialization of the WWV anomalies for the Niño indices. In other words, external precursors do not increase the ENSO prediction skill unless they introduce extra effective signals into the benchmark model in which both the WWV and ZWS precursors are included.

These results suggest that the ENSO precursors from outside the tropical Pacific that are focused on in this study all have an impact on the prediction skills of the Niño indices. The pronounced impacts of the IOD, SAOD, and NPO anomalies primarily occur across the SPB only when excluding the WWV precursor from the benchmark model. Conversely, the NTA SST and NAD anomalies during the winter–spring seasons have a significant impact on the prediction skills of the Niño4 index and the WPI during the following autumn–winter seasons, which becomes more significant when excluding either the WWV or ZWS precursor from the benchmark model. Note that the NTA SST anomalies also have a pronounced impact on the prediction skills of the CTI and the Niño3 and Niño3.4 indices across the SPB, which can only be detected when excluding the WWV precursor from the benchmark model. Based on Figs. 8, 9 and 10, it can be inferred that the NTA SST anomalies in the previous winter‒spring seasons supply extra signals compared to the WWV and ZWS for the following autumn‒winter generation of both types of ENSO.

To support the above conclusions, Fig. 11 further shows the correlations of the SON IOD, NDJ SAOD, FMA NTA, and NDJ NAD indices with the WWV and ZWS indices from which the impacts of Niño3.4 SST signal in the previous DJF season have been linearly removed. These precursors (i.e., IOD, SAOD, NTA, and NAD) at the corresponding specific seasons are chosen according to the results shown in Figs. 8, 9 and 10. That is, the impact of the IOD in SON and SAOD in NDJ on the prediction skills of the Niño3 index and the CTI primarily depends on the equatorial Pacific WWV anomalies, while the impact of the NTA SST in FMA and NAD in NDJ on the prediction skills of the Niño4 index and the WPI is related to both the equatorial Pacific WWV and ZWS anomalies.

Fig. 11
figure 11

Lag correlations of the WWV and ZWS indices with the a SON IOD, b NDJ SAOD, c FMA NTA, and d NDJ NAD indices after excluding the Niño3.4 SST signal in the previous DJF season during the period of 1982–2016. The dotted (dashed) line indicates significance at the 95% (90%) confidence level. The lead time indicates that the WWV and ZWS indices are lagged

As seen in Fig. 11a, b, both the IOD index in SON and the SAOD index in NDJ are highly correlated with the WWV index when the former two lead by approximately 2–3 months or longer; however, their correlations with the ZWS index are relatively weak when they lead by 3–12 months. Conversely, the NTA index in FMA has a significant correlation with the ZWS (WWV) index in the following 0–12 (3–8) months, even though the lag correlations for the WWV index are somewhat weaker than those for the ZWS index over a lag time range of 3–8 months (Fig. 11c). The correlations of the NAD index in NDJ with the WWV index are comparable to those of the ZWS index in the following months and become significant when the former leads the latter two by approximately 6 months or longer (Fig. 11d). These consistent results confirm our conclusions regarding the relationship of the precursors from outside the tropical Pacific with the equatorial Pacific WWV and ZWS anomalies and therefore explain their effects on the prediction skills of the Niño indices.

5 Establishment and evaluations of models for the two ENSO types

The analysis has demonstrated that the NTA SST during the middle winter‒early spring (late summer‒early autumn) can significantly improve the predictions of the WP (EP) ENSO indices in the following months, while the other precursors from outside the tropical Pacific that are focused on in this study have limited contributions to improving the prediction skills of both types of ENSO relative to the benchmark model built with the equatorial Pacific WWV and ZWS precursors. We further attempt to improve the statistical prediction skills of the Niño indices for the two types of ENSO by conditionally introducing the NTA precursor into the benchmark model as follows:

$${{{\rm Ni}\tilde {\rm n}{\rm o}}}\left( t \right)=\alpha {{{\rm Ni}\tilde {\rm n}{\rm o}}}(t)+\beta {\text{WWV}}(t)+\gamma {\text{ZWS}}(t)+\delta {\text{NTA}}(t).$$
(2)

Here, δ is a new parameter corresponding to the new term relative to Eq. (1). In the equation, the NTA precursor to be added to the benchmark model depends on the initial calendar months for the EP and CP ENSO indices. For the EP-type (CP-type) indices, including the CTI and Niño3 (WPI and Niño4) indices, the NTA precursor is added to the benchmark model for the initial months from July to December (from January to June), while for the Niño3.4 index the NTA precursor is added for all the initial months.

Figure 12 presents the cross-validated TCC skill scores of the Niño indices predicted when conditionally adding the NTA precursor to the benchmark model and their differences relative to the benchmark skills for all the months combined. The TCC skill scores are higher than 0.6 for all the Niño indices for a lead of 6 months (Fig. 12a), and significant improvements are observed for most leads in the range of 3–11 months when conditionally adding the NTA precursor to the benchmark model (Fig. 12b). The Niño3.4 and Niño4 indices have the highest TCC skill scores compared to the other Niño indices for leads in the range of 0–11 months, with values of 0.79 and 0.77 for a lead of 6 months, respectively. Conversely, the WPI and EMI have the lowest skill scores for leads shorter than 7 months, with values of 0.63 and 0.61 for a lead of 6 months, respectively. For the Niño3 index and the WPI (CTI), the TCC skill score is still higher than 0.6 for a lead of 8 (7) months. In short, these results suggest that the prediction skills of both the EP-type and CP-type indices of ENSO can be further improved by conditionally considering the NTA SST signal in a statistical model that includes the equatorial Pacific WWV and ZWS precursors.

Fig. 12
figure 12

The same as Fig. 3, but for a the skill scores when conditionally adding the NTA precursor to the benchmark model and b their improvements relative to the benchmark skills

We further demonstrate that the improvement in the prediction skills depends on the initial/target months, as shown in Fig. 13. It is clear that the skill scores at nearly all the target and lead months increased after conditionally introducing the NTA precursor to the benchmark model and that all the apparent skill score increases appear over the domains where the benchmark skill scores are relatively low. The patterns of the skill increments are different for the EP and CP ENSO types due to the conditionally involved NTA SST precursors. Further, it is found that introducing the NTA precursor to the benchmark model weakened the intensity of the predictability barriers of the Niño indices for either type of ENSO compared to the benchmark model, as seen by comparing Fig. 14 with Fig. 6, where the maxima of the decline rate of the skill scores is weakened along the barriers of the predictability.

Fig. 13
figure 13

The same as Fig. 4, but for the skill scores when conditionally adding the NTA precursor to the benchmark model (contours) and their differences relative to the benchmark skills (shading)

Fig. 14
figure 14

Difference in the decline rate of the cross-validated correlation skill scores between the scheme conditionally adding the NTA precursor and the benchmark model (former minus latter) for the a Niño3, b CT, c Niño3.4, d Niño4, e WP, and f EMI indices during the period of 1982–2016

6 Summary and discussions

The two observed types of ENSO, as a manifestation of the diversity of ENSO, have drawn much attention in recent decades. Due to their distinct impacts on global climate, research concerning the prediction and predictability of the two ENSO types has become increasingly important. Real-time operational prediction of the two types of ENSO has been routinely performed since the 2014–2016 El Niño event (Ren et al. 2016b). However, as mentioned in the introduction, the current generation of climate models has a limited ability to reproduce the typical observed features of the two ENSO types; therefore, it is difficult to distinguish between them in most operational dynamical forecasting systems. In this study, we examined the predictability of the two types of ENSO by building up statistical models that include both the equatorial Pacific WWV and ZWS precursors as a benchmark version and attempted to involve several external precursors from outside the tropical Pacific to further improve the statistical models.

Our results show that the preceding variations of the equatorial Pacific WWV and ZWS, which represent the ENSO-related zonally uniform and contrasted thermocline variability patterns, respectively, contribute to the main statistical predictability of the Niño indices that are used to represent ENSO in terms of the two types. Note that the ZWS precursor has been distinctly defined in the models for the EP and CP ENSO indices. The benchmark statistical model has a higher skill score for both the EP and CP ENSO indices than the persistence forecast, and the improvement in skill score is more pronounced for the EP type than that for the CP type. As a comparison, both the traditional Niño3 and Niño4 indices, which combine signals from both types, are relatively more statistically predictable than the Niño indices that individually represent the two ENSO types. Moreover, the prediction skill scores of the EP ENSO indices in the benchmark model are higher during boreal autumn and winter but much lower during spring and summer, whereas those of the CP ENSO indices appear to be higher during winter and spring but lower during late summer and autumn. The evident skill superiority provided by the WWV and ZWS precursors, compared to the persistence forecast, primarily occurs during the predictability barrier and afterwards.

We further examined the impacts of the precursors from the tropical Indian and Atlantic oceans, as well as the extratropics, including the IOD, NTA SST, SIOD, SAOD, NAD, and NPO, which could be considered to be ENSO predictor candidates as revealed in previous studies (e.g., Izumo et al. 2010; Boschat et al. 2013; Ham et al. 2013a; Ding et al. 2017), on the prediction skills of the Niño indices. The results show that all the external precursors that are focused on in this study, except the NTA SST, only make limited contributions to improving the prediction skills of the two ENSO types at specific initial months and leads, compared to a benchmark model built using both the WWV and ZWS precursors. It is found that the NTA SST anomalies can reliably provide significant increases in the prediction skill scores for the different types of Niño indices relative to the benchmark skill score. We further demonstrate that the signals of some ENSO precursors, such as the IOD, SIOD, SAOD, NAD, and NPO, which are known to predict ENSO at specific initial or target months, have been already transferred most of their signals into variations in the equatorial Pacific WWV and ZWS anomalies. Note that these external precursors are used at all initial/target months as predictors in the statistical models in this study, as opposed to being locked to certain seasons as mentioned in previous studies. If focusing on the ENSO prediction at some specific initial/target month, it is possible to use these particular external precursors to set up the statistical models. Indeed, for operational use, relatively general statistical models that involve stably effective, highly independent, and high-performance precursors are needed.

Finally, we configured a new statistical model for the Niño indices of the two ENSO types by conditionally adding the NTA precursor to the benchmark model. It is demonstrated that the Niño3.4 index has the highest prediction skill, compared to the other Niño indices. The statistical predictabilities of the Niño indices representing the two types of ENSO are very different; this is the Niño indices of EP ENSO (i.e., the Niño3 index and CTI) have an overall higher prediction skill than those of CP ENSO (i.e., WPI and EMI). However, such skills usually differ between different initial months and are subject to different predictability barrier properties, such as timing and intensity. The different Niño indices tend to have similar predictability barriers, and the EP ENSO indices show a stronger barrier than the CP ENSO indices but with similar timing. Introducing the NTA precursor tends to weaken the intensity of the barriers for both ENSO types, compared to that of the benchmark model.

Moreover, our results also showed that features in the statistical predictability and prediction skill of the Niño3.4 index are more similar with those of the Niño3 index and CTI, compared to the Niño4 index and WPI, indicating that the Niño3.4 index more reflects the variation in the EP ENSO type than the CP type. To further apply our results to real forecast, we suggest that the Niño3 index can be preferentially used for EP ENSO predictions and that WPI is still preferential for CP ENSO predictions because the Niño3 index is highly correlated with CTI and has a higher prediction skill.

A statistical model is still a good method to predict the two ENSO types by simply inputting a few easily calculated indices that are available in real time. This is an advantage of statistical model compared to climate models whose prediction performance usually degrades as they are operated in real time. In future, more studies need to be conducted to better predict the two types of ENSO and studies concerning their predictability and prediction methodology also need to be developed.