1 Introduction

Large-scale circulation patterns exert a direct influence on the regional climate. For instance, persistent high pressure (blocking) systems (Rex 1950; Jury et al. 2019) and the North Atlantic Oscillation (NAO) pattern (Hurrell et al. 2003; Folland et al. 2009) disturb the predominant cyclonic westerly flow (Sillmann and Croci-Maspoli 2009), which largely modulates the European climate. As a consequence, changes in seasonal extreme events occur, such as those related to high temperature (Buehler et al. 2011; Barriopedro et al. 2011; Favá et al. 2015; Schaller et al. 2018), wet spells and precipitation extremes (Busuioc et al. 2001; Casanueva et al. 2014; Sousa et al. 2017) or droughts (Bladé et al. 2011).

Similarly, in the South Hemisphere (SH), the Southern Annular Mode (SAM), also referred as the Antarctic Oscillation, influences the climate systems at high and middle latitudes (Gong and Wang 1998, 1999; Thompson and Wallace 2000; Thompson et al. 2000), displaying a ‘seesaw’ pattern for the atmospheric mass (sea-level pressure -SLP- or geopotential heights). In essence, the SAM consists of a belt of strong westerly winds or low pressures surrounding Antarctica which exhibits a northward/southward displacement as its main mode of variability. For example, it is associated with storms and cold fronts that move from west to east largely determining precipitation in southern Australia (Stammerjohn et al. 2008; Risbey et al. 2009). Additionally, various modes of large-scale climate variability such as the El Niño-Southern Oscillation (ENSO), the Interdecadal Pacific Oscillation (IPO), the Indian Ocean Dipole (IOD), the Subtropical ridge (STR) or the Madden Julian Oscillation (MJO) largely determine the regional climate of the SH (Dey et al. 2019).

Hence, an adequate representation of atmospheric circulation and high/ low pressure variability becomes essential for a proper description of the main regional climate features. In this context, synoptic weather types are a useful tool as they summarize the whole range of variability of the data into a few, construable patterns (Huth et al. 1993, 2008; Littmann 2000; Stryhal and Huth 2017). A well-known circulation classification method is the Lamb Weather Type (LWT) Classification. The LWT classification is a subjective clustering approach created by the climatologist Hubert H. Lamb (Lamb 1972) with the aim of studying the synoptic climatology over the British Isles. The system relies on a deterministic weather type classification based on a number of rules requiring meteorological expert knowledge for the interpretation of daily SLP charts, providing a straightforward and well defined physical interpretation of the SLP patterns. Later in the computer era, Jenkinson and Collison (1977) developed a more objective scheme following Lamb’s principles, known as the Jenkinson-Collison weather type classification (JC-WT hereafter). The JC-WT approach is an automated procedure using a set of equations based upon SLP able to reproduce circulation types with negligible differences from the original LWT catalogue (Jones et al. 1993). Furthermore, unlike the original LWT approach, the JC-WT scheme has the advantage of being automatically applicable to different geographical locations through the introduction of some adjustment parameters to account for the relative grid spacing as a function of latitude.

The advantages of this circulation typing method has since then been exploited in different studies in the Northern Hemisphere (NH). For instance, JC-WTs have been centered in the Iberian Peninsula \([10^\circ W, 40^\circ N]\) (Trigo and DaCamara 2000), \([5^\circ W, 40^\circ N]\) (Ramos et al. 2014), western Mediterranean basin \([5^\circ E, 40^\circ N]\) (Grimalt-Gelabert et al. 2013), southern Scandinavia \([15^\circ E, 60^\circ N]\) (Chen 2000), Central Europe \([10^\circ E, 50^\circ N]\) (Donat et al. 2010), south-west Russia \([55^\circ E, 55^\circ N]\) (Spellman 2017), Ireland \([10^\circ W, 55^\circ N]\) (Fealy and Mills 2018), Serbia \([20^\circ E, 42.5^\circ N]\) (Putnikovic et al. 2016) and southeastern China (Wang and Sun 2020; Wu et al. 2020). Otero et al. (2018) computed, for the first time, a spatially continuous application of the JC-WT classification over Europe \([13^\circ W-34^\circ E, 34^\circ N-71^\circ N]\) and, in an unprecedented study, Brands (2022) computed JC-WTs on every grid-box covering the majority of the NH extratropic region in order to evaluate GCM performance. The JC-WT classification has been seldom used in the southern hemisphere, excepting, to the best of our knowledge, by Sarricolea et al. (2018) in southern Chile \([72.5^\circ W, 42.5^\circ S]\), requiring in this case an adaptation of the model equations to the SH circulation, as illustrated in this study.

The idea of evaluating model performance by means of weather types started long time ago (Jones et al. 1993; Hulme et al. 1993), and the process-based evaluation paradigm has recently come into focus particularly within the downscaling community (Maraun et al. 2017), primarily aimed in this case at performing an optimal, objective selection of GCMs for downscaling experiments (Pickler and Mölg 2021). The JC-WT classification, built upon SLP fields only, is somewhat limited in its ability to infer other atmospheric features related to atmospheric circulation in a broad context. However, important information regarding low level circulation physics can also be inferred from the analysis of the JC intermediate parameters, such as air flows and shear vorticities. The relatively easy interpretation of these airflow indices and the simplicity of the JC method, which allows transferability to other regions with a simple implementation, make it the preferred classification scheme in previous studies (e.g. Otero et al. 2018). Moreover, as shown by Conway and Jones (1998), circulation patterns fundamentally control meteorological characteristics on the surface, whereby the use of SLP has a lot of advantages. Quoting Sterl (2004), ‘if the pressure fields of two models differ, most other fields will differ too’, which highlights the importance of SLP as the basic dynamical variable of a model and its relevance for model assessment. In this sense, model performance can be assessed by relying on SLP-based classification techniques such as the JC-WT classification, as an effective tool for model ranking and selection (see e.g., Fernandez-Granja et al. 2021))

Model evaluation involves the analytical comparison of model outputs against observations (or reanalysis, as pseudo-observations), taking into account that not only the model, but also the observations may be uncertain (Gettelman and Rood 2016; Kotlarski et al. 2019. Most often, reanalysis products are used as reference, requiring a careful consideration of the divergence in their representation of climate features, particularly in regions where their background observational network has been historically sparse (Sterl 2004; Lavin-Gullon et al. 2021) and those more sensitive to temporal non-stationarities in the assimilated data incorporated to the model by the observing systems (Bengtsson 2004). As a result, it is widely recognized the need to consider multiple observational products when evaluating climate models (Gibson et al. 2019) and considerable effort has been devoted to the intercomparison of reanalysis products in different climate research contexts (e.g. Chen et al. 2008; Ben Daoud et al. 2009; Brands et al. 2012) and the understanding of the underlying causes for their disagreement (Fujiwara et al. 2017).

This study describes the implementation of the JC-WT classification for its application worldwide, including its adaptation to SH locations, and presents a 6-hourly database of JC-WT catalogues calculated from five different reanalysis products with a global coverage (excluding a 10\(^\circ\)-width band along the equator). The objective of this work is two-fold. First, we undertake an assessment of the JC-WT applicability across the globe, taking as reference the latest IPCC-WGI Reference Regions (Iturbide et al. 2020) for summarizing the results. These allow for the identification of the global areas where the JC-WT classification provides an effective tool for characterizing the SLP patterns. Second, the different JC-WT reanalysis catalogues are compared with the aim of unveiling regions with high observational uncertainty. Caution is advocated in these areas, where the usage of more than one reanalysis is required in order to adequately evaluate the range of uncertainty in circulation-dependent modelling and/or downscaling exercises. The global JC-WT catalogues and the code implementing the worldwide JC-WT formulation used in this study are made available through dedicated Zenodo and GitHub open repositories.

2 Methodology and data

2.1 Jenkinson–Collinson classification

We follow the JC-WT formulation developed by Jenkinson and Collison (1977) that yields 27 different types. As input, we use 6-hourly, instantaneous sea-level pressure (SLP) data, which are sampled using a cross-shaped point pattern (Fig. 1) formed by 16 points with a separation of \(5^{\circ }\) in latitude and \(10^{\circ }\) in longitude. Due to its shape, in the following, we will refer to this scheme simply as “cross”. This cross can, in principle, be centered on any extra-tropical location.

The JC-WT classification is a function of 6 parameters related to wind-flow characteristics. Their corresponding equations are summarized in Table 1: southerly flow (S), westerly flow (W), total flow (F), southerly shear vorticity (\(Z_{S}\)), westerly shear vorticity (\(Z_{W}\)) and total shear vorticity (Z) computed upon the SLP records provided at a given time (6-hourly records in this study). The original parameter equations proposed by Jenkinson and Collison (1977) require further adjustments for their application in SH locations (Table 1). The result is a set of 27 weather types representing pure cyclonic (C) and anticyclonic (A) circulation over the center point, 8 pure directional types (N, NE, E, ..., NW), 16 hybrid types (mixing A or C with any of the directional types) and a 27th type accounting for unclassified records, that is, days with chaotic weak flow or days when incompatible hybrids are formed (Fig. 2). Equations for S, \(Z_{W}\) and \(Z_{S}\) use a number of adjustment coefficients (s, \(z_{w}^-\), \(z_{w}^+\), \(z_s\)) which take into account the relative grid spacing at different center latitudes (\(\psi\), in degrees; Jones et al. 2013).

Table 1 Equations of the different circulation parameters of the JC-WT classification, for Northern and Southern Hemispheres
Fig. 1
figure 1

Spatial distribution of the grid point pattern (“cross”) of 16 points (in black) used in the JC-WT equations from Table 1. The background grid (in blue) represents the common, \(2.5^{\circ }\) regular grid, where the SLP has been interpolated. The square indicates the central grid cell of the cross, located at latitude \(\psi\) (see equations in Table 1)

In order to produce the classification for the entire globe, the center of the cross is displaced from one grid-box to another through all points of a reference 2.5\(^\circ\) regular SLP grid within a band from \(80^{\circ }\)S to \(80^{\circ }\)N. Note that grid-boxes within \(90^{\circ }\)S–\(80^{\circ }\)S and \(80^{\circ }\)N–\(90^{\circ }\)N are beyond the range of the JC-WT method since the cross extends \(10^{\circ }\) north-south from its center (Fig. 1). The JC-WT method was conceived for extratropical application since significant pressure gradients within the cross, required for a meaningful classification, are expected to occur mainly in these latitudes. Consequently, the JC-WT domain of analysis is recommended to any mid-to-high latitude region (\(\sim 30^{\circ }\)\(70^{\circ }\)) by Jones et al. (2013). However, in this study we calculate the JC-WT classification beyond these latitudes to obtain a more complete picture of the geographical limits of applicability of the method, also considering for the first time a spatially continuous application over the southern hemisphere. Brands (2022) performs JC-WT classification over all cells in the northern hemisphere for the latitudes between \(30^{\circ }\) N and \(70^{\circ }\) N only. Note that the \(z_{w}^\mp\) coefficients are undefined at \(\psi = \pm 5^{\circ }\) cross center latitudes (Table 1), and take nonphysical, negative values in between. Therefore, this latitudinal band has been excluded from our calculations.

The worldwide JC-WT method implementation here presented is available through the R package transformeR (v2.0.2, Iturbide et al. 2019). Its application is also illustrated through a worked example in the companion paper notebook (see Sec. Availability of data and materials).

2.2 Reanalysis data

We computed the global JC-WT classification using the 6-hourly SLP fields from five reanalysis products. Table 2 summarizes their features and provides references for further details. Prior to JC-WT application, all reanalyses were conservatively interpolated to a common 2.5\(^{\circ }\) regular longitude-latitude grid (Fig. 1). In order to compare all reanalyses, we considered their common 27-year period 1979–2005, coincident with the AR5 CMIP5 historical baseline ( Taylor et al. 2012). ERA-20C is produced through the assimilation of just SLP and marine winds, being therefore not fully comparable to the others, which assimilate a wider range of surface, upper-air and satellite observations. However, it has been also included in the intercomparison experiment for the sake of diversity. The reanalysis uncertainty is here analyzed by following an “all-against-all” validation scheme, so every reanalysis have been validated against each other.

Table 2 Set of reanalyses used in this study, their nominal resolution at the equator (in \(^\circ\)) and modelling centers producing them
Fig. 2
figure 2

Example of 6-hourly record of the Jenkinson-Collison Weather Type catalogue obtained from ERA-5, corresponding to the SLP state at \(\lbrace\)1979-01-01\(\rbrace\) 00:00:00 UTC. The 26 JC-WT circulation types are indicated in the legend, ranging from purely anticyclonic (A) and its hybrids (ANE to AN), pure directional (NE to N) and purely cyclonic (C) and its hybrids (CNE to CN). Type 27, unclassified (U), is depicted in grey. For reference, dashed grey lines depict some latitudes such as the Equator, the Tropic of Cancer (23.5\(^{\circ }\) N) and Capricorn (23.5\(^{\circ }\) S) and the Arctic and Antarctic Circles (± 66.5\(^{\circ }\)), and they are also included in subsequent figures

2.3 Jenkinson–Collinson classification assessment

2.3.1 Weather type frequencies and transition probabilities

One salient feature of a weather type is its probability of occurrence, which can be estimated by the relative frequency of occurrence in a sample, i.e. the proportion of 6-h records classified in a particular category over the complete time series length. JC-WT persistence or, more generally, transition probabilities between two different types are also important. They determine key temporal features such as spell duration, serving as an effective tool for the assessment of the model ability to reproduce circulation pattern sequences (Gibson et al. 2016; Hochman et al. 2019; Fernandez-Granja et al. 2021). In order to measure the differences among reanalyses, we assess both the JC-WT probabilities of occurrence for each individual type, as well as the probability of transition of one type into another. The latter are analysed using a transition probability matrix (TPM), briefly described next. Let the discrete random variable \(X_t\) represent a particular JC-WT at time t, whose values \(x_t\in \left\{ 1,\ldots ,K \right\}\), where \(K=27\) is the total number of WTs. We consider this variable at two consecutive time steps, \(X_{t-1}\) and \(X_t\), to construct the \(K\times K\) transition probability matrix A, where \(A_{ij} = p(X_t\!=\!j \mid X_{t-1}\!=\!i)\), representing the probability of transitioning from WT i to WT j. Hence, each row of the matrix sums to one, \(\sum _{j} A_{ij}=1\). The TPM thus provides a visual “fingerprint” on how a given dataset reproduces the JC-WT classification when centered on a given grid cell. TPMs can be compared to a reference through specific evaluation measures (Sect. 2.3.2). Further details and examples of TPMs are presented in Fernandez-Granja et al. (2021).

2.3.2 Evaluation measures

Lamb weather types (and its automated interpretation from Jenkinson–Collison) method has proven to be useful for many regions in the Northern Hemisphere, ranging from maritime climates like the British Isles or the Iberian Peninsula (Jones et al. 1993; Trigo and DaCamara 2000) to extremely continental climates in central Asia (Wang et al. 2017). Furthermore, the JC-WTs can be linked to the variability in near-surface variables such as cloudiness, sunshine, wind-speed and even atmospheric chemistry (Brands et al. 2014) and the identification of wind storms in Central Europe (Donat et al. 2010). Sarricolea et al. (2018) investigated the JC-WTs associated with the teleconnections affecting central-southern Chile, namely El Niño Southern Oscillation (Trenberth 1997), Pacific Decadal Oscillation (Mantua and Hare 2002) and Antarctic Oscillation (Limpasuvan and Hartmann 1999) also known as Southern Annular Mode (Gong and Wang 1999). Similarly, in order to assess how meaningful the obtained JC-WTs all around the globe are, we identified the weather type yielding the strongest positive correlation with three main teleconnection indices, which largely affect climate variability in both hemispheres, namely the North Atlantic Oscillation, Pacific-North American Pattern (Barnston and Livezey 1987) and the Antarctic Oscillation. Our results corroborate the expected most frequent JC-WT linked to these teleconnections in large regions of the world (Fig. 11 in the Appendix) and are in line with previous studies based on mean sea-level pressure or geopotential height patterns (Barnston and Livezey 1987; Hurrell 1995; Gong and Wang 1999). Thus, we assume that the application the JC-WT classification is physically meaningful in large spatial domains of both hemispheres and in the following the limits of this applicability are examined in depth.

Two main aspects are addressed in our evaluation of the JC-WT classification. Firstly, we investigate the suitability of the method for each region of the world by using two criteria: (1) the number of different types, measuring the regional circulation’s diversity, and (2) the occurrence of the Unclassified (U) type, witnessing weak pressure gradient with no clear vorticity tendency, also known as barometric swamp (Grimalt-Gelabert et al. 2013). Low diversity and frequent barometric swamp are here assumed to be indicative of the method working at its theoretical limits, i.e. in climate regimes for which it has not been developed; most prominently the monsoon and Intertropical Convergence Zone. Applying the method under these circumstance makes little sense since synoptic variability is either missing at the considered scale or it is represented by other variables than SLP. We here intentionally push the method to its limits in order to explore whether it can be applied anywhere in between 30 and 70 degrees North or South, as was estimated by Jones et al. (2013), or even beyond. For the diversity criterion, we consider weather types attaining relative frequencies above \(0.1 \%\), i.e. 40 or more occurrences in the total record of nearly 40,000 time steps. This threshold has been chosen since some JC-WTs were found to occur with relative frequencies as small as \(0.47 \%\) in its original formulation for the British Isles (Perry and Mayes 1998). In light of both measures (weather type diversity and type U frequency) jointly considered, we aim to obtain empirical evidence about the global geographical boundaries for the applicability of the JC-WT classification.

Secondly, reanalysis uncertainty is addressed. In order to evaluate the discrepancies among reanalyses, we measure the differences in their resulting JC-WT classification considering the relative bias of their WT frequencies, as well as the statistical significance of these differences using a two-proportions Z-test. Annual and seasonal relative biases for each WT are computed among all pairs of reanalyses in Table 2. The relative bias is a non-dimensional measure, which is zero for a perfect agreement of frequencies. The null hypothesis for the two-proportions Z-test is that the relative frequencies of a given type for two different reanalyses are the same. We use this parametric test in order to identify significant differences in their resulting weather type frequencies. Here, we use the Z-test implementation in the prop.test function from the R package stats (v3.6.3, R Core Team 2020), which includes an exact test for small samples, better suited for infrequent weather types. The test was performed using a 95% confidence level.

Reanalysis differences in their representation of transition probabilities were evaluated by the TPM Score (TPMS, Eq. 1), envisaged to provide a quantitative measure of dissimilarity between two transition probability matrices (Fernandez-Granja et al. 2021). TPMS is defined as follows:

$$\begin{aligned} TPMS = \sum _{p\in A^{*}}\mid p-p_{ref}\mid \end{aligned}$$
(1)

where p and \(p_{ref}\) are the transition probabilities in the test and in the reference datasets, respectively. The (absolute) difference is calculated considering a subset of transition probabilities \(A^*\) from the full matrix (A), that are significantly different between the two reanalyses considered in each comparison, following the two-proportion Z-test. In order to include the “missing” transitions in the score (i.e. either transitions that exist in the reference but do not occur the test dataset, or transitions that occur in the test but do not in the reference dataset), these are assigned a zero probability (i.e. either \(p = 0\) or \(p_{ref}=0\)) and included in the \(A^*\) subset. As a result, the larger the TPMS departure from zero (perfect agreement), the larger the dissimilarity of the TPM fingerprints between the reanalyses for a given center grid cell.

2.4 Regional synthesis

Finally, in order to provide a synthetic overview of the main results, we provide a regional assessment of our results. To this aim, we use the latest set of climatic reference regions used in by the IPCC for the assessment of historical trends and future climate change projections (Iturbide et al. 2020). In order to avoid the inclusion of unsuitable areas for the application of the JC-WT methodology and provide a more meaningful summary valid for regional intercomparison purposes, we exclude from the original polygon dataset the whole intertropical range (\(23.5^\circ S-23.5^\circ N\)), thus producing a modification of some of the original IPCC regions to exclude this area. The IPCC regions affected are indicated with an asterisk throughout the text. The modified polygon layer is included as supplementary material of this study to ensure the reproducibility of the results (see Sect. Availability of data and materials).

3 Results and discussions

3.1 Suitability of the JC-WT classification worldwide

For brevity, the following results are based on ERA-5 unless otherwise indicated. The global distribution of the total number of distinct WTs (Fig. 3, left) shows a marked latitudinal gradient, with a decreasing diversity of types towards the tropics. Conversely, the frequency of the U type (Fig. 3, right) exhibits a sudden increase in the tropics, following a pattern similar to type diversity (left). In general, the lower the diversity of types, the higher the frequency of the U type, with a few regional exceptions, where reduced diversity coincides with small U numbers. Such exceptions are found year-round over Antarctica and the Tibetan Plateau; and to a lesser degree also over Greenland and the southern Indian Ocean at mid-latitudes.

Fig. 3
figure 3

Summary of the Jenkinson–Collison global classification calculated upon the SLP from ERA-5 (6-hourly, 1979–2005), considering the whole annual series (top row) and DJF and JJA seasons (rows 2–3 respectively). Left column: Number of weather types per grid-box with a relative frequency of occurrence above \(0.1 \%\). Right column: Relative frequency of the Unclassified type (U) per grid-box

Visual inspection of the maps in Fig. 3 reveals a remarkable empirical threshold of around 16 different types coinciding with at least half the temporal record falling into the U type. This threshold will be interpreted as applicability criterion in the forthcoming and is generally met polewards at the Tropics latitudes on both hemispheres (\(\pm\, 23.5^{\circ }\)), which clearly extends the range of application suggested by Jones et al. (2013), from 30\(^{\circ }\) to 70\(^{\circ }\).

The aforementioned thresholds exhibit seasonal excursions which affects the Mediterranean, Middle-East and southwestern United States during JJA season, as well as the mid-latitude eastern South Atlantic and eastern South Pacific for DJF. This seasonal fluctuations go hand in hand with the seasonal shifts of the Intertropical Convergence Zone (ITCZ, Barry and Carleton 2013), where local scale processes of deep convection are predominant and may alter to some extent the applicability of the JC-WT classification even in the extra-tropics.

In the Mediterranean Basin, for instance, suitability is optimal during DJF (boreal winter, maximum JC-WT diversity and a negligible type U frequency), which is due to a southward shift of the Atlantic storm tracks in combination with autochthonous cyclogenesis over the Mediterranean Sea (Fita et al. 2007). In JJA (boreal summer), however, the U type frequency increases in hand with a shrinking type diversity, hence compromising the usefulness of the classification during this season. Miró et al. (2020) used a modified version of the JC method, which combines the JC classification at the surface with an upper air classification based on 500 hPa geopotential height, obtaining a better differentiation within the U type, for a small region in the Pyrenees. Further research is needed for an overall and systematic application of this modified version, since expert local knowledge is needed to fine-tune the parameters. In summary, for both hemispheres, the favourable area widens towards the equator during their winter and retreats towards the respective pole during summer (Fig. 3).

3.2 Reanalysis uncertainty

After determining the regional applicability limits of the JC-WT classification, we next analyze its consistency among different reanalyses. In order to summarize the results, we will here refer to the modified IPCC-AR6 regions (Sect. 2.4, Fig. 4).

Fig. 4
figure 4

Annual Transition Probability Matrix Scores (TPMS, Sect. 2.3.2) of NCEP, ERA-Interim, ERA-20C and JRA-55 against ERA-5 (used as reference). The modified IPCC region polygon layer with the region identification codes are depicted in the top-left. Note that the original IPCC regions that have been modified to exclude the intertropical range (see Sect. 2.4) are marked with an asterisk

Overall, ERA-20C and ERA-Interim are more similar to ERA-5 than NCEP and JRA-55, as revealed by TPMS (Figs. 4 and 10 in the Appendix). For the latter two datasets, large TPMS values are obtained over Greenland (GIC), Antarctica (WAN, EAN), Northern Central-America (NCA*), West North-America (WNA), Central North-America (CNA), central Asia (WCA, ECA, EAS), Southern Asia (SAS*) and the Tibetan Plateau (TIB), where values in excess of 10 are found. In the Southern Hemisphere, large discrepancies between reanalyses are found over South America (SWS* and SES regions). The above mentioned regions present complex orography, with high-altitude mountain systems such as the Himalayas, the Andes and the Rocky Mountains or thick ice sheets like Antarctica and Greenland. The discrepancies found in these regions of complex terrain are likely related to the differences in the SLP fields from the reanalyses (Fig. 8 in the Appendix), estimated from pressure reduction algorithms that are very sensitive to the representation of orography and its spatial resolution (Sterl 2004; Lavin-Gullon et al. 2021; Brands 2022,e.g.). The magnitude of these differences is potentially large and can reach several hPa in certain areas (Fita et al. 2019). They can be particularly critical for Antarctica, which is mostly ice sheet several kilometers high. Therefore, the poor representation of the SLP by the different reanalyses prevents from the application of JC-WTs classification in these regions, as reflected by their high TPMS values (Fig. 4).

Fig. 5
figure 5

Seasonal transition probability matrix scores (TPMS, Sect. 2.3.2) of JRA-55 reanalysis against ERA-5 (reference)

The seasonal analysis of the TPMS between JRA-55 and ERA-5 (Fig. 5) generally reveals a seasonal march of the largest values towards the pole of the summer hemisphere. This is similar to the results found for type diversity and U frequency (Sect. 3.1), where the ITCZ may determine the JC-WT applicability to some point in that oscillating stripe.

In particular, the seasonal TPMS values for North Central-America (NCA*) and the Sahara region (SAH*) are larger than the annual ones (Fig. 5). In the Mediterranean (MED), Central North-America (CNA) and in West North-America (WNA), the TPMS values are largest in JJA and lowest in DJF. Reflecting the aforementioned seasonal march, the TPMS values for South South-America (SSA), West South-Africa (WSAF*), East South-Africa (ESAF*), Madagascar (MDG*) and Central Australia (CAU) reach their maximum in DJF, with a much larger magnitude than in any other season.

According to Fig. 5, moderately high TPMS values are found year-round in the mid-latitude Indian Ocean, which extend to the eastern South Atlantic and eastern South Pacific during DJF (austral summer), particularly affecting the Southern-Ocean (SOO) region. These differences might be caused by scarce observations in that area and, noteworthy, they again coincide with a relatively low type diversity, as reported in the previous section.

The large TPMS values found along the margins of the East Antarctic Ice Sheet (EAIS) are probably related to an abrupt shift of the easterly katabatic winds blowing down the ice sheet to quasi-persistent westerlies at mid-latitudes, driving the Antarctic Ocean divergence zone (Davis and McNider 1997). This singular regional modulation of the wind field might by resolved in a distinct manner by the two reanalyses and so is the pressure reduction to mean sea-level over the ice sheet itself. Notably, the TPMS values over the West Antarctic Ice Sheet are systematically lower than those over the EAIS (compare WAN to EAN regions respectively).

Fig. 6
figure 6

Left: Annual and seasonal relative biases of weather type frequencies for the different reanalyses (in columns) against ERA-5 (reference) for MED. Asterisks indicate statistically significant differences following the two proportion Z-test (Sect. 2.3.2). Right: Annual and seasonal regional weather type frequencies as represented by ERA-5 for MED, used as reference for the relative biases on the left. The boxes represent the spatial variability of type frequencies within MED, where the box upper/lower boundaries show the inter-quartile range and the lower/upper whiskers extend to the 10th and 90th percentiles respectively. Circles indicate the median frequency

Fig. 7
figure 7

TPMS (in red) of JRA-55 against ERA-5 and the number of WTs (in blue) of ERA-5 for each IPCC region. The three different panels correspond to annual, DJF and JJA. The limits of the boxes show the spatial interquartile range and whiskers depict the 10th and 90th percentile. The median values are represented with a circle inside the boxes. In the three panels, regions are sorted by decreasing order of the 75th percentile of the number of WTs at annual scale

In order to shed some light on regional TPMS, we investigate the relative biases of WTs frequencies for each IPCC region, which reveal misrepresentations of the synoptic conditions and their frequencies by the applied reanalyses. As an illustrative example, we show the results for the Mediterranean (MED) region in Fig. 6. The respective results for the remaining IPCC regions can be obtained by following the working example on analysis reproducibility indicated in Sect. Availability of data and materials.

Results show biases of different magnitude and sign, depending on the reanalysis. The majority of statistically significant differences (marked with asterisks) occur for the most frequent WTs (see WTs frequencies for ERA-5 in the right panel). The largest and most significant biases are found during JJA, which coincides with large TPMS values (Fig. 5). For the least frequent weather types, reanalyses do not exhibit significant differences with respect to ERA-5, especially in spring, autumn and for the annual results. The Unclassified type is the most frequent one in all seasons except DJF. It exhibits significant differences among reanalyses in all seasons, especially during JJA and SON.

All in all, we found a relationship between three factors, namely large values of TPMS (Figs. 4 and  5), small number of WTs and high frequency of the U type (Fig. 3). These three factors should be analyzed together due to the fact that large TPMS in some regions might come from the limitations in the suitability of the JC-WT classification. In other regions where the classification method is suitable, the TPMS purely reflects the uncertainly among reanalyses. However, the relationship seems to be clearer between large TPMS and low number of LWTs. Figure 7 summarizes Figs. 3, 4 and 5 for all the IPCC regions and analizes the connection between the number of distinct WTs and TPMS.

Considering the criteria of JC-WT diversity, the method might be less suitable for regions like SAS* and EAN, where few distinct WTs appear (median below 16 types). Seasonally, other regions also stand out by their low number of WTs, e.g. CAU or WSAF* in boreal winter and TIB, ARP*, SAH*, NCA* and WCA in boreal summer (Fig. 7). There is a large spatial variability in terms of WTs diversity in some regions where the applicability of the WT classification is questionable, such as EAN, TIB, ARP*, SAH* or WSAF*. Regardless of their median number of WTs, these regions show grid cells where all WTs are registered. Additionally, there are some regions that stand out for having large TPMS (i.e. reanalysis uncertainty) despite of their large number of WTs: EAN, NCA*, SWS*, ESAF*, SES, EAS, ECA and GIC; only in DJF we can add TIB, SSA and WAN; and only in JJA we can add MED, WNA and CNA.

4 Conclusions

In the present study, we assess the potential applicability of JC-WT classification for the entire world, except the inner tropics. Furthermore, we evaluate the consistency of the JC-WTs classifications obtained from five different state-of-the-art reanalysis products of varying characteristics and resolutions by means of the TPMS and relative bias in the WTs frequencies. To this aim, we first formulate a modification of JC-WT method to allow for its use in the Southern Hemisphere and then use two different criteria to evaluate its applicability worldwide: the number of JC-WTs reflecting “type diversity” and the relative frequency of the Unclassified type. A roll-out of up to 41 regions extracted from the latest set of IPCC Working Group I reference regions has been used to divide the globe in order to address these analyses on the regional scale.

We show that the JC-WT approach can be reliably applied over most of the global areas within \(23.5^{\circ }\) and \(80^{\circ }\) latitude on both hemispheres. For most of this area of applicability, we find a large diversity of WTs, while the type U (representing null pressure gradient situations) occurs with low frequency. As a warning of the loss of applicability of the method, we find a transition from which the diversity of WTs decreases sharply at the same time that the type U becomes the dominant type. This transition marks an empirical applicability threshold of 16 distinct WTs.

The TPMS maps, which provide a measure of WTs transition probability agreement, unveil a general consistency between reanalyses within this applicability range. On the one hand, there are regions where the JC-WT approach is in principle applicable but its practical use is hampered by a large reanalysis uncertainty. These are the Mediterranean (MED) in JJA, and Madagascar (MDG*) and East Southern-Africa (ESAF*) in DJF. On the other hand, there are regions where JC-WT is less suitable irrespective of the reanalysis uncertainty. These are Arabian-Peninsula (ARP*) and SAHara (SAH*) in JJA, and West Southern-Africa (WSAF*) and Central Australia (CAU) in DJF. Likewise, there are regions with complex orography where the JC-WT classification should be taken with caution as the SLP is estimated through pressure reduction algorithms, differently for each dataset. Our analyses confirm the expected large discrepancies among reanalyses there: Greenland (GIC), Antarctica (WAN, EAN), Northern Central-America (NCA*), West North-America (WNA), Central North-America (CNA), central Asia (WCA, ECA, EAS), Southern Asia (SAS*) and the Tibetan Plateau (TIB).

Both the 16-type diversity threshold and large TPMS areas exhibit a seasonal march towards the pole of the respective summer hemisphere. This excursion is particularly strong for the TPMS during JJA (boreal summer).

Generally, ERA-20C, ERA-Interim and ERA-5 represent more similar transition probabilities than NCEP and JRA-55, in accordance with their respective similarities in terms of SLP fields, specially in areas of complex orography.

This is the first study addressing the global application of the JC-WTs. We define its limits of applicability, providing an extension of this popular classification method that can serve as a potential tool for process-based climate model evaluation.