1 Introduction

Projections of fewer tropical cyclones (TC) globally in a warming world have been consistent for more than a decade now (e.g., Knutson et al. 2010, 2019; Walsh et al. 2015). However, most global coupled ocean–atmosphere climate models continue to struggle to reproduce realistic TC climatologies in some basins, including the very active North Atlantic basin, which casts doubt on any projections made for these specific basins. Confidence in projections can be improved if it can be established how realistic model TC formation is, and if we can understand why TC formation climatologies change between future and present-day scenarios (Tory et al. 2018, hereafter T18). In order to answer these questions T18 sought to identify a set of basic parameters that define where TCs can and cannot form (TC formation boundaries) and identify fundamental atmospheric and oceanic climate circulations that determine the global distribution of these parameters. In this paper the T18 TC formation boundary parameters are applied to a selection of Phase 5 of the Coupled Model Intercomparison Project (CMIP5, Taylor et al. 2012) models during the peak summer TC seasons (i.e., January to March, southern hemisphere, July to September, northern hemisphere), to provide insight into why the models produce TCs where they do. The results are used to determine subjectively which models, and basins within models, can be used to make projections of plausible future changes in peak-season TC formation regions. For brevity this paper only provides a general overview of global results.

In the next section the TC detection and TC formation boundary methodologies are described, and the choice of CMIP5 models investigated explained. In Sect. 3, model performance for each TC basin is presented and discussed. In Sect. 4, plausible future scenarios are proposed for most TC basins based on model basins that were reported to perform well in Sect. 3. A summary is provided in Sect. 5.

2 Methods

The tropical depression (TD) and tropical Storm (TS) detection algorithm, and the TC formation boundary analysis of T18, are applied to twelve CMIP5 models forced under the historical (years 1970—2000) and Representative Concentration Pathway 8.5 (RCP8.5, years 2070—2100Footnote 1) conditions. The development and application of the Okubo-Weiss Zeta Paramater (OWZP) TS detection algorithm has been documented in a series of papers (Tory et al. 2013a, b, c, d; Tory and Dare 2015) and summarised most recently in T18, where an adjustment to the methodology to detect TDs is also described. T18 also describes the development of the TC formation boundary diagnostic in detail. Brief summaries are provided in the following subsections.

T18 used 34 years of ERA-Interim data to identify TC formation boundaries during the most climatically stable months of January—March and July—September for the southern and northern hemispheres, respectively. The boundaries were "tuned" to a TC formation "truth" provided by TS and TD detections for the same period. Detections rather than observed TCs were chosen for the "truth" to ensure the formation boundaries were globally consistent (the inclusion of weaker TCs in observed databases are not consistent in time or between basins), and to maximise consistency between TCs and formation boundaries when applied to climate models.

2.1 TD and TS detections

The OWZP tropical cyclone detection algorithm identifies atmospheric circulations that are ripe for TC formation. If the specified conditions remain favourable for a continuous 24 (48) hours a TD (TS) is assumed to have formed. Note, by definition all TSs were once TDs, so only those TDs that failed to develop further to TS intensity are described as TDs in this paper. The conditions require a rotating, moist environment extending from the low- to mid-troposphere, with low to moderate shear. The conditions are determined by a set of parameter thresholds that must be met on at least two neighbouring grid-points after the model native grid has been interpolated to a standard 1° by 1° latitude, longitude grid. A variant of the Okubo–Weiss parameter (OWZ) on 850 and 500 hPa pressure levels is used to determine sufficient rotation, and 950 and 700 hPa relative humidity and 950 specific humidity thresholds determine sufficient moisture. A threshold of the difference in vector winds between 850 and 200 hPa eliminates environments with excessive shear.

2.2 Formation boundary thresholds

T18 tested many dynamic and thermodynamic parameters with the aim to define global regions of TC formation using a minimum number of threshold quantities. This is illustrated for the summer months in Fig. 1. The equatorial boundary was largely defined by a composite parameter representing the ratio of 850 hPa absolute vorticity (\({\eta }_{850}\)) to the meridional gradient of absolute vorticity at 700 hPa (\({\beta }_{700}^{*}=\frac{\partial \eta }{\partial y}\)),

Fig. 1
figure 1

a ERA-Interim global TC formation regions (shaded) for the summer months (July—September, northern hemisphere, and January—March, southern hemisphere) as defined by the TC formation boundaries: \(\xi =2\times {10}^{-5}{{\rm s}}^{-1}\) (orange), \({V}_{PI}=40 {\rm ms}^{-1}\) (red), \({RH}_{700}=40\%\) (blue) and \({V}_{sh}=20 {\rm ms}^{-1}\) (green). OWZP detected TCs exceeding TS intensity (blue dots) and TDs that did not become TSs (red dots) are overlaid. See text for acronym and symbol definitions. b Blue lines mark the intersection of the TC formation basins, labelled with basin name abbreviations listed in Table 1. The Australian region, discussed in the text, lies between the yellow dashed lines. The ERA-Interim TC formation regions are outlined by red dashed lines

$$\xi =\frac{{\left|\eta \right|}_{850}}{{\beta }_{700}^{*}\left(R/2\Omega \right)}=2\times {10}^{-5}{s}^{-1}$$
(1)

Here \(R\) and \(\Omega \), the earth radius and rotation frequency, normalise \({\beta }^{*}\) to give \(\xi \) the same units as \(\eta \). The \(\eta \) magnitude is used to ensure \(\xi \) is positive in both hemispheres. No physical explanation was provided in T18 for why \(\xi \) should provide such a good equatorward TC formation boundary. Equation 1 is similar to an index introduced in Brunet and Haynes (1995) that essentially determines if a disturbance in a shear flow will roll-up into a vortex or be sheared apart. We tested a modified version of \(\xi \), roughly equivalent to the Brunet and Haynes (1995) index,

$${\xi }_{mod}=\frac{{\left|\eta \right|}_{850}}{{\left[{\beta }_{700}^{*}\left(R/2\Omega \right)\right]}^{2}}=1.7\times {10}^{-5}{s}^{-1}$$
(2)

which was found to perform as well as Eq. 1, due to large gradients in both \({\beta }^{*}\) and \(\eta \). The Brunet and Haynes (1995) index is roughly equivalent to \(\frac{\sqrt{\left|\eta \right|}}{{\beta }^{*}}\), where the numerator represents disturbance amplitude and the denominator the destructive potential of the shear flow. Using this index, Asaadi et al. (2016, 2017) demonstrated that the coexistence of a nonlinear critical layer and a region of weak meridional Potential Vorticity (PV) gradient over several days, is likely to be necessary for easterly waves to develop into named storms, and seems to explain the well-known fact that only about one quarter of African easterly waves contribute to tropical cyclogenesis. In this paper we continue to use Eq. 1 but note that either equation could be used.

T18 imposed a limit to the denominator of Eq. 1 (\({\beta }_{700}^{*}\ge 5\times {10}^{-12}{\mathrm{m}}^{-1}{\mathrm{s}}^{-1}\)). In the current paper an additional limit is applied to the numerator (\({\left|\eta \right|}_{850}\ge 1.75\times {10}^{-5}{\mathrm{s}}^{-1}\)). This limit was inspired by the insight gained from Brunet and Haynes (1995), which suggested small values of \({\beta }^{*}\) might allow a weak disturbance to roll up into a vortex, but it could take an unrealistically long time, suggesting a limit on the disturbance amplitude (here, the absolute vorticity) would be required. The value chosen is less than the \({\left|\eta \right|}_{850}\) thresholds tested by T18.

The poleward boundaries are mostly defined by Emanuel's (e.g., Bister and Emanuel 2002) maximum potential intensity,

$${V}_{PI}=40\,{\mathrm{ms}}^{-1}$$
(3)

and 700 hPa relative humidity,

$${RH}_{700}=40 \%$$
(4)

The addition of a windshear threshold was required to eliminate mid-summer formation in the North-Indian ocean basin, and winter formation in large parts of the northern hemisphere ITCZ,

$${V}_{sh}=\sqrt{{\left({u}_{200}-{u}_{850}\right)}^{2}+{\left({v}_{200}-{v}_{850}\right)}^{2}}=20 \,{\mathrm{ms}}^{-1}$$
(5)

2.3 Formation boundaries in ERA-Interim

The TC formation boundaries applied to ERA-Interim data (Fig. 1) provide insight into how global atmospheric and oceanic circulations affect TC formation. The favourable formation regions are in general thinner in the east of each ocean basin compared with the west but are otherwise unique to each ocean and hemisphere. (Table 1 lists the ocean basin name acronyms depicted in Fig. 1b and used throughout the paper.) The patterns are unique since they are dominated by ocean currents, which are shaped by the distribution of continental land masses and prevailing winds (See the Comet Project: https://ftp.comet.ucar.edu/ootw/tropical/textbook_2nd_edition/navmenu.php_tab_4_page_0.0.0.htm). Oceanic anticyclones transport cooler upwelling water equatorward in the east and transport warmer water poleward in the west of each basin except the North Indian (NI) where the anticyclone is inhibited by the Asian continent. This pattern is evident in the \({V}_{PI}\) boundaries (red contours), given the strong influence of SST on \({V}_{PI}\). Lower troposphere atmospheric anticyclones (comprising easterly trade winds and midlatitude westerly winds) impact the humidity in a similar way, with dry air advected equatorward and westward in the eastern ocean basins, and moist air advected poleward in the western ocean basins, as indicated by the \({RH}_{700}\) boundaries (blue contours). Cross-equatorial flow in the summer months pushes the \(\xi \) boundary poleward (orange contours) ensuring low-latitude formation is a rare occurrence, and as mentioned above, wind shear (green contours) plays a secondary role in the summer months except in NI basin, where shear associated with the monsoon is excessive.

Table 1 Names and codes for the TC ocean basins depicted in Fig. 1b, featured in Tables 4, 5, 6 and 7, and discussed throughout the text

Applying the foregoing TC formation boundary thresholds to CMIP5 models, we use the pattern and position of each threshold quantity to diagnose how well the atmospheric/oceanic circulations relevant to TC formation are represented in each model, and the impact this has on model TC formation.

2.4 CMIP5 models

Our research group originally analysed 20 CMIP5 models for TC formation using the OWZP TC detector (e.g., Tory et al. 2014). Seven of these models were rejected for producing too few TCs (less than one third of observed TC numbers) and one (MRI-CGCM3)Footnote 2 for producing more than double the observed number (Table 2). In the current study the same eight models are rejected. Table 2 suggests that low resolution likely impacts a model's ability to generate a realistic global TC climatology (as defined by the OWZP TC detection scheme). However, the two BCC models are exceptions that illustrate resolution alone does not determine TC frequency. Projections are based on differences between the CMIP5 historical (1970—2000) and rcp85 (2070—2100) scenarios.

Table 2 Model details

The development of the TC formation boundary methodology using ERA-Interim data biases the results to the specific atmosphere–ocean model used in that analysis. The different resolutions and model physics in the CMIP5 models could mean the TC formation boundary thresholds set in T18 are not ideal in the climate models, which justifies limited retuning of thresholds. Table 3 lists the retuned thresholds, plus three-character model-name abbreviations used throughout the paper. Subjective threshold changes were applied to adjust the boundaries to better match the TC detections. A relaxation of the \(\xi \) threshold was applied to the medium- and low-resolution models. This was necessary because the double derivatives inherent in \(\xi \) smear out the parameter when applied to coarser resolution models, which leads to a poleward drift of the T18 threshold location. The higher resolution models required no adjustment to the \({V}_{PI}\) and \({V}_{sh}\) thresholds, although a slight relaxing of the MI5 \({V}_{sh}\) threshold was introduced to better include a region of frequent formation in the NI basin. The \({V}_{sh}\) threshold was relaxed in four other models (Table 3) for the same reason, plus it provided improvements to the WNP basin (see later sections). Changes to the \({RH}_{700}\) thresholds appear to be resolution independent, with both high- and medium-resolution models seeing thresholds reduced to 35% necessary to better represent the eastern SI basin and eastern NA basin. A more drastic reduction of the \({RH}_{700}\) threshold was required in CN5, which suggests the model may have a dry bias, and may explain relatively low annual TS numbers (despite relatively high model resolution, Table 2). Finally, small increases in \({V}_{PI}\) thresholds for GFG and GFM, and for CSI a decrease in the \({V}_{PI}\) and an increase in the \({RH}_{700}\) thresholds, provided modest improvements in most basins.

Table 3 The twelve CMIP5 models used in the study listed in grid resolution order (highest to lowest)

To minimize repetition in the upcoming sections, a subset of four models are chosen to focus the discussion, before drawing comparisons with the remaining models. To capture the influence of model resolution we choose to include in the subset the highest (BCM) and lowest (BC1) resolution models, and two medium resolution models that represent the diversity of model performance (AC0 and GF3). The figure subplot labels are in red for these four models in Figs. 2, 3, 4 for easy identification. In choosing the four models the interdependency between models (e.g., Knutti et al. 2013) was considered. To investigate resolution dependency, it is beneficial for the models to come from the same family, whereas, to illustrate diversity the remaining two models were chosen from separate model families. There are three model families in the models used in this study (indicated in the last column of Table 3) plus four unrelated models. The BCC family includes two models from the Beijing Climate Center. The GFDL family includes three models from the Geophysical Fluid Dynamics Laboratory, and the ACC family includes the two Australian Community Climate Earth-System Simulator (ACCESS) models plus version 2 of the Hadley Centre Global Environment Model (HadGEM2).

Fig. 2
figure 2figure 2figure 2

As in Fig. 1 but for the CMIP5 models, historical scenario, with ERA-Interim boundaries (black dotted lines) included for comparison. Where retuned thresholds are used, original thresholds are included (fine contours) for comparison. Individual figure panels are labelled with the respective model abbreviations (Table 2). The representative four models (Sect. 2.4) have the model abbreviations in red text

In the next two sections TC formation boundary plots are introduced for all twelve models, for the historical (Fig. 2) and rcp85 (Fig. 3) scenarios, with 31 seasons of TD and TS detection locations marked by red and blue dots, respectively. Total numbers of TD and TS detections for each model, scenario and basin are listed in Table 4 for the three summer months, January—March (southern hemisphere) and July—September (northern hemisphere). It is beyond the scope of this paper to analyse these detection distributions and changes; however the reader is referred to the series of papers that analyse the OWZP detected TS tracks, by Sam Bell and co-authors (Bell et al. 2018, 2019a, b, c, 2020a, b). To avoid unwieldy referencing within the text, model plots are referred to only by the model abbreviation (Table 3), and for easy reference, appear in alphabetical order in the figures.

Fig. 3
figure 3figure 3figure 3

As in Fig. 2 but for the rcp85 scenario

Table 4 Total number of Tropical cyclone detections for each basin for the summer months (July—September, northern hemisphere, and January—March, southern hemisphere)

3 CMIP5 model performance

All four of the selected models show the NI basin is made unfavourable by strong shear (Fig. 2), and in the remaining basins the basic pattern of broader favourable formation zones in the west than the east is evident (cf. Fig. 1). In general, the same threshold quantity patterns that determine the poleward and equatorward boundaries identified in the reanalyses are seen in these CMIP5 models. However, the favourable formation regions in each basin can vary substantially in geographic coverage between models and reality, with the most notable example the NA basin. Figure 2 shows that after the additional threshold fine-tuning (mentioned in Sect. 2.3) the threshold set (defining the favourable formation regions) do represent well the regions in which TCs are detected (red and blue dots) in each model, which provides some confidence that the relationship between large-scale global circulations and TC formation is reasonably well represented in the CMIP5 models by the four threshold quantities identified in ERA-Interim reanalysis data. A more detailed analysis follows for each ocean basin separately, culminating in a subjective performance assessment summarised in Table 5.

Table 5 Summary of model errors for each basin

3.1 North Indian Ocean basin

The mid-summer analysis covers the period when the NI basin is dominated by the monsoon circulation, which brings very strong shear and strong cross-equatorial flow that pushes the \(\upxi \) boundary far from the equator (as far as 15° north in the Arabian Sea, Fig. 1). Consequently, only the northern-most parts of the Arabian Sea and Bay of Bengal see TC formation during these months. The active TC formation periods in the NI basin are limited to the shoulder seasons, when the oppressive monsoon influence is weaker. However, these transition seasons are difficult to analyse, due to the higher climatic variability, consequently projections for this basin are not made. However, it is still useful to investigate how well the models reproduce this "null" season.

Assuming Fig. 1 represents reality, a subjective comparison with the four representative models in Fig. 2 provides a qualitative performance assessment. The boundaries and detections are perhaps best captured by AC0. In BCM and GF3 the shear is a little weak providing a broader region of favourable conditions, which in the former appears to be responsible for unrealistically high numbers of TC detections. Interestingly, BC1 shows all the detections occurred inside the high shear region. This may indicate more shear variability in this model (TCs forming during reduced shear periods), or simply that TC-like circulations are less sensitive to shear in coarse resolution models.

A similar diversity in results is present in the remaining models. Of note is a reduced eastward extension of the shear threshold in CC4, GF3 and MI5, which opens up more of the Bay of Bengal for TC formation.

3.2 Western North Pacific Ocean basin

The poleward extent of this basin is determined by the \({V}_{PI}\) threshold in ERA-Interim (Fig. 1) and in all twelve models investigated (Fig. 2). The \({V}_{PI}\) boundary in BC1 is similar in latitude to the 35° diagnosed in ERA-Interim, while in the other three representative models it is located near 30° north. Perhaps as a consequence, TC detections in BCM and GF3 are concentrated south of about 25°, although TC detections in AC0 are less confined by this boundary. An equatorward bias of the \({V}_{PI}\) threshold position is common to most models, with CSI the most extreme case. CN5 is an exception because it has excessive \({V}_{PI}\) in most basins.

The equatorward boundary, determined by the \(\xi \) threshold is reasonably well represented in all four representative models, with the latitudinal variation west of 150° longitude best represented by AC0. Here the boundary in BC1 and BCM is formed instead by an eastward extension of the NI monsoon shear threshold (a pattern in CSI and HG2 also). The \(\xi \) threshold also defines well the equatorward boundary of the TC detections. Consequently, a poleward bias of the \(\xi \) threshold in GF3, together with the above-mentioned equatorward bias of the \({V}_{PI}\) threshold, results in an overly narrow latitude band of TC formation in GF3.

The division between the West and East North Pacific basins is somewhat arbitrary. A logical boundary is where the latitudinally broad favourable formation region that characterizes the WNP meets the characteristic latitudinally narrow eastern region. However, since the models differ substantially in how they represent the two basins and some produce a nearly uniform latitudinal width across the Pacific of the favourable formation region, it is necessary to comment specifically on model performance in the central Pacific where this boundary should be defined. In reality, favourable TC formation in the central Pacific is confined to a narrow latitude band between about 10 and 15° N, due to dry air and shear (Fig. 1) associated with the Tropical Upper Tropospheric Trough (TUTT, e.g. T18, Wang and Wu 2018). The TUTT is dynamically linked to the lower troposphere Pacific anticyclone through upper tropospheric convergence of the associated secondary circulation. All four representative models reproduce the TUTT-associated eastern Pacific dry air and shear, but with varying success (Fig. 2). AC0 best reproduces the pattern, correctly excluding the Hawaiian Islands, but with slightly too strong shear. BC1, BCM and GF3 fail to extend the dry air sufficiently into the central Pacific, and consequently there are too many TC detections there. CC4 is also too moist in the central Pacific, although shear excludes TC formation near the Hawaiian Islands. In contrast dry air and shear extend too far into the central Pacific in CSI and MI5.

Interestingly, Wang and Wu's (2018) assessment of TUTT reproduction (using 200 hPa zonal wind, \({U}_{200}\), skill score) in 43 models also found MI5 performed relatively poorly, but another MIROC model (MIROC-ESM) had the worst skill score of all due to a westward bias of the TUTT location (see their Fig. 4). In contrast they found an eastward TUTT bias in GF3, which may explain the weak influence of dry air and shear in the GF3 central Pacific. They found AC0 and AC1, the models that best represent our central Pacific formation boundaries, had the best \({U}_{200}\) skill scores.

Fig. 4
figure 4figure 4figure 4

Historical (yellow) and RCP85 (blue) TC formation boundaries overlayed. Overlapping boundaries appear green. The ERA-Interim boundaries from Fig. 1 (red dashed lines) are included for comparison

Model performance in this basin is mostly judged on (1) the latitude of the northern boundary (too low [L], good [G], too high [H], compared with ERA-Interim), and (2) how well the TUTT and related circulations define the shape and position of the eastern boundary (too far west or too strong [Wstr], good [G], or too far east or too weak [Ewk]). The letter codes are listed in Table 5, along with an overall subjective performance assessment of Good, Pass or Fail. The letter codes are described in Table 6. Only models listed Good or Pass are used in the projection assessment in Sect. 4.

Table 6 Description of the two performance limitations for each basin, and their corresponding letter codes listed in Table 5

3.3 Eastern North Pacific Ocean basin

Compared with the WNP the favourable TC formation region of the ENP basin is latitudinally narrow (Fig. 1). The equatorward boundary follows an almost constant latitude near 10° N, varying by only a few degrees, with the maximum poleward extent of about 12° N near 110° W. This is near the broadest part of the basin due to the poleward extension of the \({V}_{PI}\) threshold towards the Baja Peninsula. Further west the \({V}_{PI}\) threshold contracts equatorward where it meets the \({RH}_{700}\) threshold, together confining the poleward extent of the formation region to 15° N as it extends into the central Pacific, where the TUTT influence dominates.

The \(\xi \) threshold is well represented in AC0 and GF3, but too close to the equator in BC1 and BCM (Fig. 2). The equatorward contraction of the \({V}_{PI}\) threshold with distance west of 110° W is a little too large in AC0 and underrepresented in GF3, resulting in narrower and broader bands of favourable formation and corresponding TC detections, respectively. In both BC1 and BCM the \({V}_{PI}\) threshold is too poleward, likely due to a weak Pacific anticyclone, with corresponding TC detections present at comparatively high latitudes (cf. Fig. 1). A concern for BC1 and BCM is too few detections east of 110° W, where TC formation density in the real world is very high. In these models, precursor disturbances propagating into the ENP from the North Atlantic are likely to be negatively impacted by very dry air and strong shear, especially in BCM (Fig. 2). Additionally, Sheffield et al. (2013) show the storm tracks in BC1 have a strong equatorward bias (near 8° N, see their Fig. 4), where in the real world TCs do not form. Their figure also shows a similar southward storm track bias for CC4, which also suffers from minimal TC detections west of 110° W and has the \(\xi \) threshold too close to the equator (Fig. 2).

The relationship between Sheffield et al.’s storm track bias and OWZP TC detections can be investigated in four additional models (CN5, GFM, HG2, MI5). HG2 and MI5 have small storm-track biases and healthy ENP TC detections. CN5 and GFM have slightly larger storm-track biases and fewer TC detections. Biases in storm track intensity identified by Sheffield et al. (2013) do not appear to be correlated with detection numbers except perhaps in the NA basin, where large-magnitude weak biases coincide with very few TC detections.

The performance in this basin is based on (1) the latitudinal width of the formation band (too narrow [N], good [G], too broad [B]), (2) the exclusion of the Hawaiian Islands (no [N], good [G], excessive [E]).

3.4 North Atlantic Ocean basin

In the eastern part of the NA basin favourable formation is confined south of 20° N by the \({V}_{PI}\) and \({RH}_{700}\) boundaries. Across the entire basin the \(\xi \) boundary is located at a near-constant 10° N (Fig. 1). The western half of the basin extends well north of 30° N, the most poleward of any ocean basin. The border between the narrow east and broad west is determined by the \({\mathrm{RH}}_{700}\) boundary, i.e., how far the dry air extends into the central NA basin. TC detections are relatively evenly dispersed apart from higher concentrations off the African coast and slightly higher concentrations towards the western regions.

This basin is poorly modelled by all 12 CMIP5 models. With respect to TC detections, four models (AC0, AC3, BCM, HG2) produce very few TCs at lower latitudes (equatorward of 15° N), and nearly all models produce very few TCs, or none at all, in the Gulf of Mexico and the north west of the basin (Fig. 2).

Dry air extends too far westward in all of the four representative models. While it is a minor flaw in GF3, it is excessive in BC1 and BCM, leading to gaping TC formation holes that split the basin into two parts: southeast and northwest (Fig. 2). A similar TC formation hole is present in AC0 due to a combination of large westward dry air extension and excessive shear. Similar basin splitting (TC formation holes)Footnote 3 are also evident in AC3 and HG2, and to a lesser extent in CC4, CSI and MI5 (Fig. 2).

Considering the two parts of the split basin separately, Fig. 2 shows GF3 produces an abundance of TCs in the southeast part, BC1 produces a small number, and AC0 and BCM very few. The ability of models to produce TCs in these regions may be related to how broad in latitude the favourable formation band is and its proximity to the equator. For example, the favourable TC formation band in GF3 extends between about 10 and 18° N, closely matching the observed band (Fig. 1), whereas AC0 spans 8–13° N, which is both narrower and more equatorward than observed. We speculate that if the favourable region is too narrow and positioned too far equatorward, the propagation of African Easterly Waves (AEWs, known to be the dominant TC precursor in this region) will not align with the favourable TC formation region.Footnote 4 AC3 and HG2 also have too narrow and too equatorward southeast formation regions with very few TC detections. Conversely, studies in regional climate models that suppressed AEWs found TCs formed in similar numbers, but within other disturbances (Patricola et al. 2018).

AEW limitations in CMIP5 models have been documented by Dixon et al. (2017, too few AEWs because the Saharan Heat Low is too far westward) and Martin and Thorncroft (2015), insufficient convective diabatic AEW enhancement over poorly resolved Guinea Highlands), but the results are generalised, and thus unable to be related to individual model performance.

To the north and west of the split basin few TC detections appear in AC0, BC1 and GF3, despite the diagnosed favourable formation environment (Fig. 2). We speculate that very few precursor storms with the potential for TC formation make it to this side of the basin, due to a combination of too few precursor disturbances entering the NA basin in the east, and an environment too hostile to support amplification or precursor development in the central NA basin. Other models that show a low detection bias in the northwest include: AC3, CC4, GFG and GFM. All twelve models fail to produce reasonable TC climatologies in at least one of the southeast, Caribbean or northwest regions.

The failure of the models to produce TCs in the northwest region where the formation boundaries do suggest the formation environment is favourable, meant that good model performance had to also consider detection distributions. The basin is divided into two regions, lower latitudes (L, < 25° N, Africa to the Caribbean) and higher latitudes (H, > 25° N). Each region is rated either good (G) or unsatisfactory (U).

3.5 South Indian Ocean basin

The best performing basin in all models is the SI basin. Of the representative four, GF3 best captures the shape and relative locations of the threshold boundaries and has the best distribution of TC detections (cf. Fig. 1, 2). Compared with ERA-Interim, the equatorward boundary in GF3 is slightly poleward, and slightly equatorward in the other three models. AC0 also performs well, although the meridional variation of the \({RH}_{700}\) threshold across the basin is exaggerated, producing a poleward bias in the threshold position in the west. However, the bias is compensated for by the \({V}_{PI}\) threshold taking over to define the formation boundary. Similar \({RH}_{700}\) biases are present in BC1 and BCM, but the compensating threshold that defines the TC formation boundary in those models is \({V}_{sh}\). In fact, the \({RH}_{700}\) bias is present in all 8 remaining models with \({V}_{PI}\) compensating in AC3, CN5, CSI, GFG, GFM, HG2 and MI5, and \({V}_{sh}\) compensating in CC4.

The relatively good performance of the models in this basin (only CN5 fails), allows a more critical performance assessment. The performance reported in Table 5 represents (1) the western and (2) eastern halves of the basin, based on the latitudinal width of the favourable formation region (narrow [N], good [G], broad [B]).

3.6 West South Pacific Ocean basin

West of 150° W, the TC formation boundaries are well-reproduced by all models except CN5. Of the four representative models, GF3 best reproduces the TC formation boundaries and distribution of TC detections. Like the SI basin AC0 is the next best performer, although strong shear and a slightly too poleward location of the \(\xi \) boundary combine to produce a latitudinally narrow TC formation region. The opposite is true for BC1, where all poleward thresholds are located too far south, broadening the TC formation region a few degrees too far poleward. BCM better reproduces the meridional width of the formation region, but it is too zonal, i.e., it fails to capture the slight poleward expansion of the South Pacific Convergence Zone towards the centre of the ocean basin. A too latitudinally-broad formation region is also present in BC1, CN5, GFG and MI5 due to the \({RH}_{700}\) boundary being almost absent, and either or both the \({V}_{PI}\) and \({V}_{sh}\) boundaries located too far poleward. HG2 has the \(\xi \) boundary more equatorward than any other model between 150 and 180° East, due to low values of \({\beta }^{*}\) there (Fig. 2). We assume it is an anomaly, since there are no TC detections and it is not present in the rcp85 climate (Fig. 3).

Like its northern counterpart the eastern boundary of the WNP can be defined by the eastward extent of the higher-latitude TC formation region. This boundary is also influenced by the TUTT and associated circulation, imposing shear and subsiding dry air on the region (c.f. Sect. 3.2). All models under-represent the TUTT and associated processes, leading to the eastern WSP boundary extending too far eastward. Consequently, model performance assessment in this basin is determined by (1) the latitudinal width (narrow [N], good [G], broad [B]) and (2) the location of the eastern-most edge poleward of 15° S (Good [G], moderately eastward [M], excessively eastward [E]).

3.7 Australian region

The Australian region makes up the east and west of the previous basins respectively, where the models perform best. Between the two basins the Australian continent defines the poleward boundary (110—155°E) and is thus fixed for all models. Equatorward the \(\xi \) boundary is reasonably well represented by each model, although in AC0, BC1 and BCM it extends slightly equatorward than that diagnosed in ERA-Interim (Fig. 1).

The performance summarised in Table 5 is determined by the latitudinal width of (1) the eastern SI and (2) the western WSP basins (narrow [N], good [G], broad [B]).

3.8 East South Pacific Ocean basin

This basin is the home of the infamous Pacific cold-tongue bias and double ITCZ, in which most climate models in the past, and many continue to, produce a southern hemisphere ITCZ (that mirrors the northern ITCZ) extending across the entire Pacific Ocean (e.g., Li and Xie 2014). The \({RH}_{700}\) boundary in Fig. 1 shows that in reality, very dry conditions extend deep into the tropics suppressing convection in the eastern half of the South Pacific Ocean basin, which takes an enormous "bite" out of this tropical region within which TCs do not form. Even without the dry air, much of the basin (east of about 110° W) would be hostile to TC formation, due to overlapping \({V}_{PI}\) (from the south) and \(\xi \) (from the north) boundaries and strong shear. The \({V}_{PI}\) and \(\xi \) boundary overlap is due in part to a more poleward location of the \(\xi \) boundary (12—13°S) compared with almost all favourable TC formation regions (e.g., 7—10°S in most of the WNP, ENP, NA, SI and WSP basins). This is due to the absence of low-latitude convection in the ESP.Footnote 5

In all 12 models investigated, all four threshold boundaries are less restrictive than reality (cf. Figs. 2 and 1). This is especially so for those models with a prominent southern ITCZ (including BC1, BCM, CN5, GF3, GFG and GFM, Fig. 2), which contributes to a more equatorward position of the \(\xi \) boundary (as mentioned above). Indeed, all models show the \(\xi \) boundary sloping equatorward from west to east across the Pacific (Fig. 2), rather than poleward (Fig. 1). The other three threshold boundaries contribute to a much-reduced eastern Pacific "bite" leaving behind in most models a narrow ITCZ-like band of favourable formation. CSI and MI5 do not produce an ITCZ-related favourable formation band, and AC0 and AC3 produce only small regions of favourability in the far east of the basin (Fig. 2).

The model performance in this basin is based on (1) the presence of a southern ITCZ-related favourable formation band (Not present [G], Narrow [N], broad [B]), and (2) whether detections are present (None present [G], few present [F], many present [M]).

3.9 South Atlantic Ocean basin

In the ERA-Interim reanalyses the SA basin is dominated by dry air that extends across the entire ocean from east to west, leaving only a small band of favourability off the Brazilian coast at relatively high latitudes. Very few TCs form here. Only two were detected by the OWZP scheme in ERA-Interim in a 34-year period. Many models fail to reproduce this extensive cross-ocean advection of dry air (defined by the \({RH}_{700}\) boundary position), resulting in larger areas diagnosed to be favourable for TC formation, that contain TC detections. These include BC1, BCM, CN5, GF3 and to a lesser extent HG2 (Fig. 2). GFG and GFM also have larger favourable formation areas, but they contain relatively few detections.

Insufficient extension of dry air across the SA basin produces in some models two somewhat distinct formation regions, consisting of an enlarged higher latitude formation region off Brazil, and a tropical band that may extend part way or entirely across the ocean basin (e.g., GF3, Fig. 2). How far the tropical band extends across the basin is also determined by the position of the \(\xi \) boundary, with more equatorward positions further opening the space between the \(\xi \) and \({RH}_{700}\) boundary. Models with a tropical band containing detections include BC1, BCM, CN5 and GF3, and to a lesser extent HG2 and MI5 (Fig. 2).

Performance in this basin is based on (1) the existence of the tropical formation band with detections (not present [G], small and/or contain few detections [S], large and/or contain many detections [L]) and (2) the presence of the higher-latitude band (not present [N], realistic size with few detections [G], large and/or contain many detections [L]).

3.10 Model performance summary

Table 5 summarises model performance and individual basin representation by the 12 models (last row). If two performance measures in a basin are rated 'G' then that basin is rated 'Good'. If only one performance measure is rated 'G', or none are rated 'G' the basin is usually rated 'Pass' and 'Fail' respectively (with eight and three exceptions respectively). These eleven exceptions result from a secondary subjective performance assessment that awards a Pass to two relatively minor biases, and a Fail when a single bias is severe. The worst performing basin is the NA, in which all 12 models failed the subjective performance assessment, hence it is excluded from the projection analysis. As noted previously the NI basin is also excluded. The ESP and SA basins performed poorly with 8 and 7 deemed failures respectively, suggesting projections made in these basins should be treated with a degree of caution. The remaining four basins plus the Australian region have more than half the models passing the performance assessment.

4 Future changes in TC formation regions

In this section limited projections are proposed for all TC basins except NI and NA. The projection assessment, summarised in Table 7, is subjective and based on changes in TC formation boundary locations between the historical (Fig. 2) and rcp85 (Fig. 3) scenarios. The historical and rcp85 boundaries are overlaid for easy comparison in Fig. 4.

Table 7 Projection summaries for the selected ocean basins

4.1 Western North Pacific Ocean basin

All eight models included in the WNP projection analysis show a small poleward expansion of the poleward boundary (Fig. 4), due to a poleward shift of the \({V}_{PI}\) boundary (Fig. 3). This change is accompanied by increased numbers of TC detections, which may signal a poleward expansion of TC formation in a future warmer climate. However, a general increase in TC detections that fall outside the boundary is noted,Footnote 6 perhaps indicating that the TC detection methodology, or the TC boundary analysis, is not ideal at these high latitudes in the RCP85 climate.

The central North Pacific errors in the TC formation boundaries attributed to TUTT-circulation biases (Sect. 3.2) persist in the RCP85 climate for all models, with only small changes. Five of the eight WNP models show minimal change in the WNP eastern boundary position (AC0, CC4, CN5, GFG, GFGM), whereas AC3, BC1 and HG2 show a small eastward expansion (Fig. 4), which may be related to future changes in El Nino events (Bell et al. 2020a). This change is not likely to be associated with shifts in TUTT location, since both AC0 and HG2 experienced a 2° longitude westward shift of the TUTT location, according to Fig. 6d of Wang and Wu (2018).Footnote 7

4.2 Eastern North Pacific Ocean basin

The ENP has seven models included in the projection analysis, six in common with the WNP (Table 5). AC0, shows a slight north–south expansion of the favourable formation zone (Fig. 4). A similar pattern is also present in AC3, GFG, HG2 and MI5, whereas GFM shows very minimal change. In each case the poleward expansion extends towards the Hawaiian Islands, and even envelops the islands in GFG. Only CC4 shows the opposite behaviour with a contraction of the favourable formation zone away from the Hawaiian Islands (Fig. 4). An expansion of the TC favourability zone towards the Hawaiian Islands from the west is also evident in AC3, GFG and HG2 (Fig. 4), due to increased \({RH}_{700}\) and reduced shear (cf. Figs. 2 and 3).

The consensus projection is for a slight latitudinal expansion of the TC favourability zone across most of the ENP, with TC formation becoming more favourable near the Hawaiian Islands, potentially from two directions, the south and west. This change may be associated with a common CMIP5 future of a more El Nino-like state in which the central Pacific warms more than the west (e.g., Chand et al. 2017, Bell et al. 2020a). Recently, Seager et al. (2019) questioned this warming scenario, suggesting it may be an artefact of the cold-tongue bias (Sect. 3.8) present in many CMIP5 models, in which case the western ENP might experience minimal TC formation boundary changes.

4.3 South Indian Ocean basin

The SI is the best performing basin, with only one model (CN5) failing the subjective performance assessment (Table 5). Changes in the position of the equatorward boundary are generally small, with most models showing no perceptible change, or a slight equatorward shift.

Six models (AC0, AC3, BCM, CSI, HG2 and MI5) show an equatorward shift of the poleward boundary in the east of the basin (near the Australian continent, generally due to decreasing \({RH}_{700}\), cf. Figs. 2 and 3) contributing to a meridional narrowing of the region of favourable formation. Exceptions include CC4, GF3, GFG and GFM, which show no shift, and BC1 shows a slight poleward shift of the boundary (Fig. 4).

The meridional narrowing in the east is consistent with changes in the Indian Ocean Dipole (IOD) suggested by Zheng et al. (2013). Their analysis showed an ensemble-mean strengthening of the SST gradient across the ocean basin (greater warming in the west than east), accompanied by a strengthening of the trade easterlies (weakening of the Walker Circulation) and a thinning of the thermocline in the east of the basin. Of the seven models in common with our study, the strongest changes in the IOD they identify match models with a well-defined contraction of the favourable formation zone (CSI, GFM and MI5). Additionally, Bell et al. (2020a) found a substantial reduction of TC detections in the eastern SI basin during future El Nino events.

These results suggest a plausible future scenario is for a weakening of the Walker Circulation, leading to a strengthening of the trade easterlies and SI basin anticyclone, which narrows the favourable formation zone in the east of the basin.

4.4 Western South Pacific Ocean basin

Only three models failed the WSP subjective performance test (BC1, BCM and CN5). Common to most models and basins the \({V}_{PI}\) threshold moved slightly poleward in rcp85 (Fig. 3) compared with the historical scenario (Fig. 2), but this only produced an expansion in the formation region in MI5. In most models it was countered by an increase in windshear producing a contraction of the formation region (AC0, AC3, CSI, GF3 and HG2, Fig. 4) that is generally larger in the central South Pacific than nearer the Australian continent. The opposite change, a slight poleward expansion of the formation boundary, is found in CC4 and to a lesser extent GFM, due to decreasing windshear.

Again, changes to the equatorward boundary are minimal. Exceptions include an equatorward expansion in CSI and MI5. (We ignore the apparent contraction in HG2 associated with the suspected anomalous low-latitude favourable region between 150 and 180° East in Fig. 2, mentioned in Sect. 3.6).

A plausible future scenario for the WSP basin is for increasing wind shear to override and reverse any poleward expansion tendency associated with increasing \({V}_{PI}\).

4.5 Australian region

The Australian region incorporates the best performing parts of the SI and WSP basins. A combination of minimal change in the equatorward boundary with an equatorward shift of the poleward boundaries to the east and west of the Australian continent, suggest a contraction of the TC formation band is plausible in the Australian region.

4.6 Eastern South Pacific Ocean basin

Only four models (AC0, AC3, CSI and MI5) were deemed suitable for a projection analysis in ESP (Table 5). All four models show the three threshold quantities \({V}_{PI}\), \({RH}_{700}\) and \({V}_{sh}\) become more restrictive in the future scenario (Fig. 4). The small favourable formation zones in AC0 and AC3 in the ESP far east, noted in Sect. 3.8, show little change in the rcp85 future (Fig. 3). The four-model consensus of a more hostile future environment for TC formation, in a region that currently never experiences TCs, suggests the region will continue to be devoid of TCs.

4.7 South Atlantic Ocean basin

Five models passed the subjective performance assessment (CC4, CSI, GFG, GFM and MI5) for the SA basin (Table 5). The \({RH}_{700}\) and \({V}_{sh}\) quantities tend to become more restrictive (c.f. Figs. 2 and 3), leading to a general contraction of the favourable TC formation region. Minor exceptions include, a slight eastward expansion in CSI due to a shift in \({RH}_{700}\), and a slight poleward expansion in CC4 due to a shift in \({V}_{PI}\).

All boundary changes in these models are minor, although most suggest a contraction of the favourable TC formation region. Thus, continued very infrequent TC formation is plausible.

5 Summary

Motivated by a desire to understand why TCs form where they do in climate models, T18 developed a set of threshold quantities that defined TC formation, using TC detections in ERA-Interim data. In this paper the thresholds and detections are applied to twelve CMIP5 models for the historical and rcp85 scenarios. Results from the application to the historical scenario were used to assess each model's ability to recreate realistic mid-summer-season TC formation climatologies. A subjective performance assessment for each TC basin in each model was used to determine which basin/model pair could be used in the projection analysis. This paper only provides a general assessment of model performance and projections, due to the large number of model/basin combinations.

A considerable variation in performance was found between models and between individual basins within models (Table 5). The NI basin was excluded from the projection analysis because few TCs form in the mid-summer study period, when monsoon shear suppresses formation. The NA basin was also excluded because all twelve models failed to reproduce realistic TC formation climatologies, mostly because the basin was too dry and the shear too strong. The Australian region was best represented with eleven of the twelve models passing the performance assessment, due largely to good performance in the neighbouring SI and WSP ocean basins. The ESP and SA basins were in general not well-reproduced with only four and five models passing respectively, whereas the northern Pacific basins were better represented with eight models included in WNP and seven in ENP. A common problem in the ESP was the presence of a southern hemisphere ITCZ enabling TC formation across the entire South Pacific. Confidence in projections was reflected in the number of models that passed the performance assessment for each basin.

Projections, expressed as plausible scenarios (summarised in Table 7), are based on consensus changes in formation boundary threshold locations and TC detection. Minimal change in the WNP basin boundaries is proposed, with perhaps a slight poleward expansion, but into a relatively hostile environment where a high proportion of TDs fail to become TCs. A slight meridional broadening is proposed for the ENP, with a possible extension towards the Hawaiian Islands. In the SI basin a contraction in the east due to an equatorward shift of the poleward boundary is proposed, likely due to a strengthening of the oceanic and atmospheric anticyclones. A similar contraction is proposed for the WSP, but due to increased wind shear instead, although the models that do not show increased shear suggest a slight poleward expansion is possible. The Australian region, bordered by these two basins, shares the same general projection of contracting favourable formation regions. While the ESP and SA basins are generally not well represented by the models, the consensus results suggest the already highly unfavourable TC formation environments will become even more unfavourable.