Skip to main content
Log in

Circulation typing with fuzzy rotated T-mode principal component analysis: methodological considerations

  • Research
  • Published:
Theoretical and Applied Climatology Aims and scope Submit manuscript

Abstract

Circulation typing using rotated T-mode decomposition (i.e., variable is time series and observation is grid points) principal component analysis (PCA) requires up to four important decisions to optimize the actual classification output (i.e., the time series and associated maps): (i) the determination of the most appropriate number of rotated PCs to retain, (ii) the selection of a simple structure rotation algorithm, (iii) the consideration of constructing a hyperplane width threshold used to separate noise from the signal by establishing a confidence interval for a zero PC loading, and (iv) deciding on the various classification types, e.g., hard/overlapping, hard/non-overlapping, fuzzy/overlapping. This study examines the sensitivity of the validity of the classification outputs to each of these decisions. Rather than adhere to the traditional eigenvalue methods that make this decision prior to rotation, we develop a method to select the optimal number of rotated PCs to retain, using the congruence coefficient to assess the validity of each PC after rotation, by documenting the distance between the PCs to the patterns portrayed in the similarity matrix from which those PCs were derived (e.g., correlation, covariance). After an optimal number of components is determined, the PC loadings are rotated using a number of algorithms to establish the sensitivity of the validity of the solution to various orthogonal (Varimax) and oblique rotations (Direct Oblimin, Promax, Simplimax). In the present study, the Direct Oblimin rotation method is found to perform most accurately in the congruence coefficient matching. Next, the accuracy of the solution to the choice of the hyperplane width threshold is examined and the threshold is found to be critical in affecting the shape of the maps and the time series. For the classification types tested, we find for the two overlapping classifications, the congruence matching to the similarity matrix patterns yields excellent matches for hyperplane width threshold magnitude values within 0.2 to 0.3; thus, that range is found optimal for separating noise from the signal in this study. As the hyperplane width was increased beyond 0.3, the congruence match with the similarity matrix patterns decreased. Overlapping classifications that allow for the assignment of observation to more than one circulation type are found to have higher congruence matches with the similarity matrix patterns compared to non-overlapping classifications. Further, fuzzy/overlapping classifications, that provide non-binary weights for every observation, are found to have the largest congruence matches with the correlation patterns. Thus, despite the trade-off of less simplicity, fuzzy/overlapping classifications can be considered as an alternative to hard classifications when the goal is to interpret patterns that are well supported by the similarity matrix.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

ERA5 data are available at https://cds.climate.copernicus.eu/cdsapp#!/dataset.

References

  • Ashok K, Shamal M, Sahai AK, Swapna P (2017) Nonlinearities in the evolutional distinctions between El Niño and La Niña types. J Geophys Res: Oceans 122:9649–9662

    Google Scholar 

  • Baggaley A, Cattell RC (1956) A comparison of exact and approximate linear function estimates of oblique factor scores. Brit J of Statist Psychol 9:83–86

    Google Scholar 

  • Bardossy A, Duckstein L, Bogardi I (1995) Fuzzy rule-based classification of atmospheric circulation patterns. Int J Climatol 15:1087–1097

    Google Scholar 

  • Barrett P (1986) Factor comparison: An examination of three methods. Pers Individ Differ 7:327–340

    Google Scholar 

  • Bartzokas A, Metaxas DA (1996) Northern Hemisphere gross circulation types. Climatic change and temperature distribution. Meteorol Z 5:99–109

    Google Scholar 

  • Beck C, Philipp A (2010) Evaluation and comparison of circulation type classifications of the European domain. Phys Chem Earth, Parts A/B/C, 374–387

  • Beck C, Philipp A, Streicher F (2013) The effect of domain size on the relationship between circulation type classifications and surface climate. Int J Climatol 36:2692–2709

    Google Scholar 

  • Brokken FB (1983) Orthogonal Procrustes rotation maximizing congruence. Psychometrika 48:343–352

    Google Scholar 

  • Campitelli E, Díaz LB, Vera C (2022) Assessment of zonally symmetric and asymmetric components of the Southern Annular Mode using a novel approach. Clim Dyn 58:161–178. https://doi.org/10.1007/s00382-021-05896-5

    Article  Google Scholar 

  • Cattell RB (1966) The scree test for the number of factors. Multivar Behav Res 1:245–276

    Google Scholar 

  • Chen M, Li T, Shen X, Wu B (2016) Relative Roles of Dynamic and Thermodynamic Processes in Causing Evolution Asymmetry between El Niño and La Niña. J Clim 29:2201–2220

    Google Scholar 

  • Compagnucci RH, Richman MB (2008) Can principal component analysis provide atmospheric circulation or teleconnection patterns? Int J Climatol 6:703–726

    Google Scholar 

  • Compagnucci RH, Araneo D, Canziani PO (2001) Principal sequence pattern analysis: a new approach to classifying the evolution of atmospheric systems. Int J Climatol 21:197–217

    Google Scholar 

  • Davis C, Archer E, Vincent K, Engelbrecht F, Landman W, Cull T, Tadross M (2011) Climate Risk and Vulnerability: A Handbook for Southern Africa.

  • Demuzere M, Kassomenos P, Philipp A (2010) The COST733 circulation type classification software: An example for surface ozone concentrations in Central Europe. Theor Appl Climatol 105:143–166

    Google Scholar 

  • Dholakia SG, Bhavsar CD (2017) Factor recovery by principal component analysis and Harris component analysis. Asian J Res Soc Sci Humanit 7:2249–7315

    Google Scholar 

  • Fernández Mills G (1995) Principal component analysis of precipitation and rainfall regionalization in Spain. Theor Appl Climatol 50:169–183

    Google Scholar 

  • Gillett NP, Kell TD, Jones PD (2006) Regional climate impacts of the Southern Annular Mode. Geophys Res Lett 33.

  • Gong X, Richman MB (1995) On the Application of Cluster Analysis to Growing Season Precipitation Data in North America East of the Rockies. J Clim 4:897–931

    Google Scholar 

  • Guadagnoli E, Velicer W (1991) A comparison of pattern matching indices. Multivar Behav Res 26:323–343

    Google Scholar 

  • Guirguis KJ, Avissar R (2008) An analysis of precipitation variability, persistence, and observational data uncertainty in the western United States. J Hydrometeor 9:843–865

    Google Scholar 

  • Hall A, Visbeck M (2002) Synchronous Variability in the Southern Hemisphere Atmosphere, Sea Ice, and Ocean Resulting from the Annular Mode. J Clim 15:3043–3057

    Google Scholar 

  • Haven S, ten Berge JM (1977) Tucker's coefficient of congruence as a measure of factorial invariance: An empirical study. Psychologische Instituten der Rijksuniversiteit Groningen

  • Hendrickson AE, White PO (1964) Promax: a quick method to oblique simple structure. Brit J Stat Psych 17:65

    Google Scholar 

  • Hersbach H et al (2020) The ERA5 global reanalysis. Q J R Meteorol Soc 730:1999–2049

    Google Scholar 

  • Hoerling MP, Kumar A, Zhong M (1997) El Niño La Niña, and the Nonlinearity of Their Teleconnections. J Clim 10:1769–1786

    Google Scholar 

  • Huth R (1996) An intercomparison of computer-assisted circulation classification methods. Int J Climatol 16:893–922

    Google Scholar 

  • Huth R (1997) Continental scale circulation in the UKHI GCM. J Climate 10:1545–1561

    Google Scholar 

  • Huth R, Beranová R (2021) How to recognize a true mode of atmospheric variability. Earth Space Sci 8:e2020EA001275

    Google Scholar 

  • Huth R, Canziani P (2003) Classification of hemispheric monthly mean stratospheric potential vorticity fields. Ann Geophys 21:808–817

    Google Scholar 

  • Huth R, Beck C, Philipp A, Demuzere M, Ustrnul Z, Cahynová M, Kysely J, Tveito OE (2008) Classifications of atmospheric circulation patterns: recent advances and applications. Ann N Y Acad Sci 1146:105–152

    Google Scholar 

  • Ibebuchi CC (2021a) On the relationship between circulation patterns, the southern annular mode, and rainfall variability in Western Cape. Atmosphere 12:753

    Google Scholar 

  • Ibebuchi CC (2021b) Revisiting the 1992 severe drought episode in South Africa: the role of El Niño in the anomalies of atmospheric circulation types in Africa south of the equator. Theor Appl Climatol 146:723–740

    Google Scholar 

  • Ibebuchi CC (2022) Patterns of atmospheric circulation in Western Europe linked to heavy rainfall in Germany: preliminary analysis into the 2021 heavy rainfall episode. Theor Appl Climatol 148:269–283

    Google Scholar 

  • Im S-H, An S-I, Kim ST, Jin F-F (2015) Feedback processes responsible for El Niño-La Niña amplitude asymmetry. Geophys Res Lett 42:5556–5563

    Google Scholar 

  • Jennrich RI, Sampson PF (1966) Rotation for simple loadings. Psychometrika 31:313

    Google Scholar 

  • Kaiser HF (1958) The Varimax criterion for analytic rotation in factor analysis. Psychometrika 9:187–200

    Google Scholar 

  • Kang I-S, Kug J-S (2002) El Niño and La Niña sea surface temperature anomalies: Asymmetry characteristics associated with their wind stress anomalies. J Geophys Res 107(D19):4372

    Google Scholar 

  • Karl TR, Koscielny AJ (1982) Drought in the United States: 1895–1981. J Climatol 2:313–329

    Google Scholar 

  • Kidson JW (1997) The utility of surface and upper air data in synoptic climatological specification of surface climatic variables. Int J Climatol 4:399–414

    Google Scholar 

  • Kiers HAL (1994) Simplimax: Oblique rotation to an optimal target with simple structure. Psychometrika 59:567–579

    Google Scholar 

  • Kim M, Yoo C, Sung M, Lee S (2021) Classification of Wintertime Atmospheric Teleconnection Patterns in the Northern Hemisphere. J Clim 34:1847–1861

    Google Scholar 

  • Li M, Zhang Q, Zhang F (2016) Hail day frequency trends and associated atmospheric circulation patterns over China during 1960–2012. J Climate 29:7027–7044

    Google Scholar 

  • Li W, Bi Xu, Sheng L, Luo Y, Sun J (2021) Modulations of Synoptic Weather Patterns on Warm-Sector Heavy Rainfall in South China: Insights From High-Density Observations With Principal Component Analysis. Front Earth Sci 9:2296–6463

    Google Scholar 

  • Lorenz EN (1956) Empirical orthogonal functions and statistical weather prediction. Scientific Report No. 1, Contract AF19 (604)-1566, Meteorology Department, Massachusetts Institute of Technology, Cambridge, MA.

  • Lorenz EN (1970) Climate change as a mathematical problem. J Appl Meteor 9:325–329

    Google Scholar 

  • Lorenzo-Seva U, Ten Berge JMF (2006) Tucker’s congruence coefficient as a meaningful index of factor similarity. Methodology: Eur J Res Methods Behav Soc Sci 2:57–64

    Google Scholar 

  • Lundberg U, Ekman G (1973) Subjective geographic distance: A multidimensional comparison. Psychometrika 38:113–122

    Google Scholar 

  • Mächel H, Kapala A (2013) Multivariate testing of spatio-temporal consistence of daily precipitation records. Adv Sci Res 10:85–90

    Google Scholar 

  • McBratney AB, Moore AW (1985) Application of fuzzy sets to climatic classification. Agric for Meteorol 35:165–185

    Google Scholar 

  • Montroy DL, Richman MB, Lamb PJ (1998) Observed non-linearities of monthly teleconnections between tropical Pacific sea-surface temperature anomalies and central and eastern North American precipitation. J Climate 11:812–835

    Google Scholar 

  • Netzel P, Stepinski T (2016) On Using a Clustering Approach for Global Climate Classification. J Clim 29:3387–3401

    Google Scholar 

  • North G, Bell T, Cahalan R, Moeng F (1982) Sampling Errors in the Estimation of Empirical Orthogonal Functions. Mon Wea Rev 110:699–706

    Google Scholar 

  • O’Lenic EA, Livezey RE (1988) Practical considerations in the use of rotated principal component analysis (RPCA) in diagnostic studies of upper-air height fields. Mon Wea Rev 116:1682–1689

    Google Scholar 

  • Philipp A (2009) Comparison of principal component and cluster analysis for classifying circulation pattern sequences for the European domain. Theor Appl Climatol 96:31–41

    Google Scholar 

  • Philipp AP, Della-Marta M, Jacobeit J, Fereday DR, Jones PD, Moberg A, Wanner H (2007) Long-term variability of daily North Atlantic-European pressure patterns since 1850 classified by simulated annealing clustering. J Climate 20:4065–4095

    Google Scholar 

  • Philipp A, Bartholy J, Beck C, Erpicum M et al (2010) COST733CAT – a database of weather and circulation type classifications. Phys Chem Earth, Parts a/b/c 35:360–373

    Google Scholar 

  • Philipp A, Beck C, Huth R, Jacobeit J (2016) Development and comparison of circulation type classifications using the COST733 dataset and software. Int J Climatol 36:2673–2691

    Google Scholar 

  • Pinneau SR, Newhouse A (1964) Measures of invariance and comparability in factor analysis for fixed variables. Psychometrika 29:271–281

    Google Scholar 

  • Pirhalla DE, Lee CC, Sheridan SC, Ransibrahmanakul V (2022) Atlantic coastal sea level variability and synoptic-scale meteorological forcing. JAMC 61:205–222

    Google Scholar 

  • Plaut G, Simonnet E (2001) Large-scale circulation classification, weather regimes, and local climate over France, the Alps and Western Europe. Clim Res 17:303–324

    Google Scholar 

  • Richman MB (1986) Rotation of Principal Components. J Climatol 3:293–335

    Google Scholar 

  • Richman MB, Easterling WE (1988) Procrustes target analysis: A multivariate tool for identification of climate fluctuations. J Geophys Res Atmos 93:10989–11003

    Google Scholar 

  • Richman MB, Gong X (1999) Relationships between the definition of the hyperplane width to the fidelity of principal component loadings patterns. J Clim 6:1557–1576

    Google Scholar 

  • Richman MB, Lamb PJ (1985) Climatic pattern analysis of three and seven-day summer rainfall in the Central United States: some methodological considerations and regionalization. J Climate Appl Meteor 12:1325–1343

    Google Scholar 

  • Roberts D, Hamann A (2012) Predicting potential climate change impacts with bioclimate envelope models: a palaeoecological perspective. Glob Ecol Biogeogr 21:121–133

    Google Scholar 

  • Serrano A, Garcia J, Mateos VL, Cancillo ML, Garrido J (1999) Monthly modes of variation of precipitation over the Iberian Peninsula. J Climate 12:2894–2919

    Google Scholar 

  • Stryhal J, Huth R (2017) Classifications of winter Euro-Atlantic circulation patterns: An intercomparison of five atmospheric reanalyses. J Climate 30:7847–7861

    Google Scholar 

  • Tataryn DJ, Gorsuch WJM, RL, (1999) Setting the value of k in Promax: A Monte Carlo study. Educ Psychol Measur 59:384–391

    Google Scholar 

  • Tveito O, Huth R, Philipp A, et al (2016) COST Action 733: Harmonization and Application of Weather Type Classifications for European Regions. Universität Augsburg

  • Walker G (1904) Walker: Identifying the Southern Oscillation. Retrieved on 14 April 2022 https://iridl.ldeo.columbia.edu/maproom/ENSO/New/walker.html.

  • Wang CK, Zhao A, Huang X, Chen X, Rao X (2021) The crucial role of synoptic pattern in determining the spatial distribution and diurnal cycle of heavy rainfall over the south China coast. J Climate 34:2441–2458

    Google Scholar 

  • Watterson IG (2001) Zonal wind vacillation and its interaction with the ocean: Implication for interannual variability and predictability. J Geophys Res 106:965–997

    Google Scholar 

  • Woodhouse CA, Russell JL, Cook R (2009) Two modes of North American drought data from instrumental and paleoclimatic data. J Climate 22:4336–4347

    Google Scholar 

  • Yi Z, Wang Y, Chen W, Guo B, Zhang B, Che H, Zhang X (2021) Classification of the Circulation Patterns Related to Strong Dust Weather in China Using a Combination of the Lamb-Jenkinson and k-Means Clustering Methods. Atmosphere 212:1545

    Google Scholar 

  • Yuchechen AE, Canziani PO, Bischoff SA (2017) Stratosphere/troposphere joint variability in southern South America as estimated from a principal components analysis. Meteorol Atmos Phys 129:247–271

    Google Scholar 

  • Zadeh LA (1965) Fuzzy Sets. Inf Control 8:338–353

    Google Scholar 

  • Zhang JP, Zhu T, Zhang QH, Li CC, Shu HL, Ying Y, Dai ZP, Wang X, Liu XY, Liang AM, Shen HX, Yi BQ (2012) The impact of circulation patterns on regional transport pathways and air quality over Beijing and its surroundings. Atmos Chem 12:5031–5053

    Google Scholar 

  • Zhao Z, Xi H, Russo A, Du H, Gong Y, Xiang J, Zhou Z, Zhang J, Li C, Zhou C (2019) The Influence of Multi-Scale Atmospheric Circulation on Severe Haze Events in Autumn and Winter in Shanghai. China Sustain 11:5979

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

All authors worked together on all aspects of the manuscript.

Corresponding author

Correspondence to Chibuike Chiedozie Ibebuchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval

No human subject is involved in this study and Figures belong to the authors. The paper is also not under consideration in any Journal. There is also no conflict of interest in this paper.

Consent to participate

No human research is used. The authors consent this paper to be considered.

Consent for publication

The authors consent this paper to being published.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Tables 10 and 11

Table 10 Congruence matches between the rotated PC loadings to the correlation patterns for oblique solutions (m = 2, 3, 4) under the Promax method
Table 11 Congruence coefficient between the anomaly SLP composite maps of the CTs from the same principal component. The CTs are fuzzy overlapping CTs at T = 0.2

Figures 9, 1011, 12, 13 and 14

Fig. 9
figure 9

Idealized locations of synoptic rain-bearing systems in the study region during their active periods. CW: cross-equatorial northeast trade wind; MCT: Mozambique Channel Trough; SIOHP: western branch of the South Indian Ocean high-pressure AGC: Agulhas warm current; CF: cold fronts; SAOHP: South Atlantic Ocean high-pressure; AC: Angola warm current; BC: Benguela cold current; ITCZ: Inter-tropical convergence Zone during its southward track in austral summer; KL: Kalahari low; AL: Angola low; SICZ: South Indian Ocean Convergence Zone

Fig. 10
figure 10

Standardized DJF annual mean loadings of PC3 and annual mean SAM index from 1979 to 2020

Fig. 11
figure 11

Standardized SLP anomaly map on 1984–09-04

Fig. 12
figure 12

Distribution of the PC loadings from all 4 retained PCs. The horizontal dashed line indicates the 0.92 PC loading magnitude

Fig. 13
figure 13

Histogram of the congruence coefficients (in absolute value) between the PC scores and the 15,341 daily SLP observations, for the non-overlapping (a), and overlapping (b) classification. The vertical line indicates the 0.92 congruence coefficient. For the overlapping classification, the weighted mean map of the 4 PC scores (i.e., from the four retained PCs and using the PC loadings as the weights) is matched to the SLP anomaly map of the day. For the non-overlapping, (only one) PC score that has the largest loading magnitude is matched to the SLP anomaly of the day

Fig. 14
figure 14

Effect of sampling variability on the PC loadings. The X-axis contains the original PC loadings and the Y-axis contains the count of PC loadings after resampling the data that changed to within the range of \(-\) 0.05 to + 0.05. The red, yellow, brown, blue, and green vertical lines demarcate PC loadings values of 0.1, 0.2, 0.3, 0.4, and 0.5 respectively

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ibebuchi, C.C., Richman, M.B. Circulation typing with fuzzy rotated T-mode principal component analysis: methodological considerations. Theor Appl Climatol 153, 495–523 (2023). https://doi.org/10.1007/s00704-023-04474-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00704-023-04474-5

Keywords

Navigation