Abstract
Circulation typing using rotated T-mode decomposition (i.e., variable is time series and observation is grid points) principal component analysis (PCA) requires up to four important decisions to optimize the actual classification output (i.e., the time series and associated maps): (i) the determination of the most appropriate number of rotated PCs to retain, (ii) the selection of a simple structure rotation algorithm, (iii) the consideration of constructing a hyperplane width threshold used to separate noise from the signal by establishing a confidence interval for a zero PC loading, and (iv) deciding on the various classification types, e.g., hard/overlapping, hard/non-overlapping, fuzzy/overlapping. This study examines the sensitivity of the validity of the classification outputs to each of these decisions. Rather than adhere to the traditional eigenvalue methods that make this decision prior to rotation, we develop a method to select the optimal number of rotated PCs to retain, using the congruence coefficient to assess the validity of each PC after rotation, by documenting the distance between the PCs to the patterns portrayed in the similarity matrix from which those PCs were derived (e.g., correlation, covariance). After an optimal number of components is determined, the PC loadings are rotated using a number of algorithms to establish the sensitivity of the validity of the solution to various orthogonal (Varimax) and oblique rotations (Direct Oblimin, Promax, Simplimax). In the present study, the Direct Oblimin rotation method is found to perform most accurately in the congruence coefficient matching. Next, the accuracy of the solution to the choice of the hyperplane width threshold is examined and the threshold is found to be critical in affecting the shape of the maps and the time series. For the classification types tested, we find for the two overlapping classifications, the congruence matching to the similarity matrix patterns yields excellent matches for hyperplane width threshold magnitude values within 0.2 to 0.3; thus, that range is found optimal for separating noise from the signal in this study. As the hyperplane width was increased beyond 0.3, the congruence match with the similarity matrix patterns decreased. Overlapping classifications that allow for the assignment of observation to more than one circulation type are found to have higher congruence matches with the similarity matrix patterns compared to non-overlapping classifications. Further, fuzzy/overlapping classifications, that provide non-binary weights for every observation, are found to have the largest congruence matches with the correlation patterns. Thus, despite the trade-off of less simplicity, fuzzy/overlapping classifications can be considered as an alternative to hard classifications when the goal is to interpret patterns that are well supported by the similarity matrix.
Similar content being viewed by others
Data availability
ERA5 data are available at https://cds.climate.copernicus.eu/cdsapp#!/dataset.
References
Ashok K, Shamal M, Sahai AK, Swapna P (2017) Nonlinearities in the evolutional distinctions between El Niño and La Niña types. J Geophys Res: Oceans 122:9649–9662
Baggaley A, Cattell RC (1956) A comparison of exact and approximate linear function estimates of oblique factor scores. Brit J of Statist Psychol 9:83–86
Bardossy A, Duckstein L, Bogardi I (1995) Fuzzy rule-based classification of atmospheric circulation patterns. Int J Climatol 15:1087–1097
Barrett P (1986) Factor comparison: An examination of three methods. Pers Individ Differ 7:327–340
Bartzokas A, Metaxas DA (1996) Northern Hemisphere gross circulation types. Climatic change and temperature distribution. Meteorol Z 5:99–109
Beck C, Philipp A (2010) Evaluation and comparison of circulation type classifications of the European domain. Phys Chem Earth, Parts A/B/C, 374–387
Beck C, Philipp A, Streicher F (2013) The effect of domain size on the relationship between circulation type classifications and surface climate. Int J Climatol 36:2692–2709
Brokken FB (1983) Orthogonal Procrustes rotation maximizing congruence. Psychometrika 48:343–352
Campitelli E, Díaz LB, Vera C (2022) Assessment of zonally symmetric and asymmetric components of the Southern Annular Mode using a novel approach. Clim Dyn 58:161–178. https://doi.org/10.1007/s00382-021-05896-5
Cattell RB (1966) The scree test for the number of factors. Multivar Behav Res 1:245–276
Chen M, Li T, Shen X, Wu B (2016) Relative Roles of Dynamic and Thermodynamic Processes in Causing Evolution Asymmetry between El Niño and La Niña. J Clim 29:2201–2220
Compagnucci RH, Richman MB (2008) Can principal component analysis provide atmospheric circulation or teleconnection patterns? Int J Climatol 6:703–726
Compagnucci RH, Araneo D, Canziani PO (2001) Principal sequence pattern analysis: a new approach to classifying the evolution of atmospheric systems. Int J Climatol 21:197–217
Davis C, Archer E, Vincent K, Engelbrecht F, Landman W, Cull T, Tadross M (2011) Climate Risk and Vulnerability: A Handbook for Southern Africa.
Demuzere M, Kassomenos P, Philipp A (2010) The COST733 circulation type classification software: An example for surface ozone concentrations in Central Europe. Theor Appl Climatol 105:143–166
Dholakia SG, Bhavsar CD (2017) Factor recovery by principal component analysis and Harris component analysis. Asian J Res Soc Sci Humanit 7:2249–7315
Fernández Mills G (1995) Principal component analysis of precipitation and rainfall regionalization in Spain. Theor Appl Climatol 50:169–183
Gillett NP, Kell TD, Jones PD (2006) Regional climate impacts of the Southern Annular Mode. Geophys Res Lett 33.
Gong X, Richman MB (1995) On the Application of Cluster Analysis to Growing Season Precipitation Data in North America East of the Rockies. J Clim 4:897–931
Guadagnoli E, Velicer W (1991) A comparison of pattern matching indices. Multivar Behav Res 26:323–343
Guirguis KJ, Avissar R (2008) An analysis of precipitation variability, persistence, and observational data uncertainty in the western United States. J Hydrometeor 9:843–865
Hall A, Visbeck M (2002) Synchronous Variability in the Southern Hemisphere Atmosphere, Sea Ice, and Ocean Resulting from the Annular Mode. J Clim 15:3043–3057
Haven S, ten Berge JM (1977) Tucker's coefficient of congruence as a measure of factorial invariance: An empirical study. Psychologische Instituten der Rijksuniversiteit Groningen
Hendrickson AE, White PO (1964) Promax: a quick method to oblique simple structure. Brit J Stat Psych 17:65
Hersbach H et al (2020) The ERA5 global reanalysis. Q J R Meteorol Soc 730:1999–2049
Hoerling MP, Kumar A, Zhong M (1997) El Niño La Niña, and the Nonlinearity of Their Teleconnections. J Clim 10:1769–1786
Huth R (1996) An intercomparison of computer-assisted circulation classification methods. Int J Climatol 16:893–922
Huth R (1997) Continental scale circulation in the UKHI GCM. J Climate 10:1545–1561
Huth R, Beranová R (2021) How to recognize a true mode of atmospheric variability. Earth Space Sci 8:e2020EA001275
Huth R, Canziani P (2003) Classification of hemispheric monthly mean stratospheric potential vorticity fields. Ann Geophys 21:808–817
Huth R, Beck C, Philipp A, Demuzere M, Ustrnul Z, Cahynová M, Kysely J, Tveito OE (2008) Classifications of atmospheric circulation patterns: recent advances and applications. Ann N Y Acad Sci 1146:105–152
Ibebuchi CC (2021a) On the relationship between circulation patterns, the southern annular mode, and rainfall variability in Western Cape. Atmosphere 12:753
Ibebuchi CC (2021b) Revisiting the 1992 severe drought episode in South Africa: the role of El Niño in the anomalies of atmospheric circulation types in Africa south of the equator. Theor Appl Climatol 146:723–740
Ibebuchi CC (2022) Patterns of atmospheric circulation in Western Europe linked to heavy rainfall in Germany: preliminary analysis into the 2021 heavy rainfall episode. Theor Appl Climatol 148:269–283
Im S-H, An S-I, Kim ST, Jin F-F (2015) Feedback processes responsible for El Niño-La Niña amplitude asymmetry. Geophys Res Lett 42:5556–5563
Jennrich RI, Sampson PF (1966) Rotation for simple loadings. Psychometrika 31:313
Kaiser HF (1958) The Varimax criterion for analytic rotation in factor analysis. Psychometrika 9:187–200
Kang I-S, Kug J-S (2002) El Niño and La Niña sea surface temperature anomalies: Asymmetry characteristics associated with their wind stress anomalies. J Geophys Res 107(D19):4372
Karl TR, Koscielny AJ (1982) Drought in the United States: 1895–1981. J Climatol 2:313–329
Kidson JW (1997) The utility of surface and upper air data in synoptic climatological specification of surface climatic variables. Int J Climatol 4:399–414
Kiers HAL (1994) Simplimax: Oblique rotation to an optimal target with simple structure. Psychometrika 59:567–579
Kim M, Yoo C, Sung M, Lee S (2021) Classification of Wintertime Atmospheric Teleconnection Patterns in the Northern Hemisphere. J Clim 34:1847–1861
Li M, Zhang Q, Zhang F (2016) Hail day frequency trends and associated atmospheric circulation patterns over China during 1960–2012. J Climate 29:7027–7044
Li W, Bi Xu, Sheng L, Luo Y, Sun J (2021) Modulations of Synoptic Weather Patterns on Warm-Sector Heavy Rainfall in South China: Insights From High-Density Observations With Principal Component Analysis. Front Earth Sci 9:2296–6463
Lorenz EN (1956) Empirical orthogonal functions and statistical weather prediction. Scientific Report No. 1, Contract AF19 (604)-1566, Meteorology Department, Massachusetts Institute of Technology, Cambridge, MA.
Lorenz EN (1970) Climate change as a mathematical problem. J Appl Meteor 9:325–329
Lorenzo-Seva U, Ten Berge JMF (2006) Tucker’s congruence coefficient as a meaningful index of factor similarity. Methodology: Eur J Res Methods Behav Soc Sci 2:57–64
Lundberg U, Ekman G (1973) Subjective geographic distance: A multidimensional comparison. Psychometrika 38:113–122
Mächel H, Kapala A (2013) Multivariate testing of spatio-temporal consistence of daily precipitation records. Adv Sci Res 10:85–90
McBratney AB, Moore AW (1985) Application of fuzzy sets to climatic classification. Agric for Meteorol 35:165–185
Montroy DL, Richman MB, Lamb PJ (1998) Observed non-linearities of monthly teleconnections between tropical Pacific sea-surface temperature anomalies and central and eastern North American precipitation. J Climate 11:812–835
Netzel P, Stepinski T (2016) On Using a Clustering Approach for Global Climate Classification. J Clim 29:3387–3401
North G, Bell T, Cahalan R, Moeng F (1982) Sampling Errors in the Estimation of Empirical Orthogonal Functions. Mon Wea Rev 110:699–706
O’Lenic EA, Livezey RE (1988) Practical considerations in the use of rotated principal component analysis (RPCA) in diagnostic studies of upper-air height fields. Mon Wea Rev 116:1682–1689
Philipp A (2009) Comparison of principal component and cluster analysis for classifying circulation pattern sequences for the European domain. Theor Appl Climatol 96:31–41
Philipp AP, Della-Marta M, Jacobeit J, Fereday DR, Jones PD, Moberg A, Wanner H (2007) Long-term variability of daily North Atlantic-European pressure patterns since 1850 classified by simulated annealing clustering. J Climate 20:4065–4095
Philipp A, Bartholy J, Beck C, Erpicum M et al (2010) COST733CAT – a database of weather and circulation type classifications. Phys Chem Earth, Parts a/b/c 35:360–373
Philipp A, Beck C, Huth R, Jacobeit J (2016) Development and comparison of circulation type classifications using the COST733 dataset and software. Int J Climatol 36:2673–2691
Pinneau SR, Newhouse A (1964) Measures of invariance and comparability in factor analysis for fixed variables. Psychometrika 29:271–281
Pirhalla DE, Lee CC, Sheridan SC, Ransibrahmanakul V (2022) Atlantic coastal sea level variability and synoptic-scale meteorological forcing. JAMC 61:205–222
Plaut G, Simonnet E (2001) Large-scale circulation classification, weather regimes, and local climate over France, the Alps and Western Europe. Clim Res 17:303–324
Richman MB (1986) Rotation of Principal Components. J Climatol 3:293–335
Richman MB, Easterling WE (1988) Procrustes target analysis: A multivariate tool for identification of climate fluctuations. J Geophys Res Atmos 93:10989–11003
Richman MB, Gong X (1999) Relationships between the definition of the hyperplane width to the fidelity of principal component loadings patterns. J Clim 6:1557–1576
Richman MB, Lamb PJ (1985) Climatic pattern analysis of three and seven-day summer rainfall in the Central United States: some methodological considerations and regionalization. J Climate Appl Meteor 12:1325–1343
Roberts D, Hamann A (2012) Predicting potential climate change impacts with bioclimate envelope models: a palaeoecological perspective. Glob Ecol Biogeogr 21:121–133
Serrano A, Garcia J, Mateos VL, Cancillo ML, Garrido J (1999) Monthly modes of variation of precipitation over the Iberian Peninsula. J Climate 12:2894–2919
Stryhal J, Huth R (2017) Classifications of winter Euro-Atlantic circulation patterns: An intercomparison of five atmospheric reanalyses. J Climate 30:7847–7861
Tataryn DJ, Gorsuch WJM, RL, (1999) Setting the value of k in Promax: A Monte Carlo study. Educ Psychol Measur 59:384–391
Tveito O, Huth R, Philipp A, et al (2016) COST Action 733: Harmonization and Application of Weather Type Classifications for European Regions. Universität Augsburg
Walker G (1904) Walker: Identifying the Southern Oscillation. Retrieved on 14 April 2022 https://iridl.ldeo.columbia.edu/maproom/ENSO/New/walker.html.
Wang CK, Zhao A, Huang X, Chen X, Rao X (2021) The crucial role of synoptic pattern in determining the spatial distribution and diurnal cycle of heavy rainfall over the south China coast. J Climate 34:2441–2458
Watterson IG (2001) Zonal wind vacillation and its interaction with the ocean: Implication for interannual variability and predictability. J Geophys Res 106:965–997
Woodhouse CA, Russell JL, Cook R (2009) Two modes of North American drought data from instrumental and paleoclimatic data. J Climate 22:4336–4347
Yi Z, Wang Y, Chen W, Guo B, Zhang B, Che H, Zhang X (2021) Classification of the Circulation Patterns Related to Strong Dust Weather in China Using a Combination of the Lamb-Jenkinson and k-Means Clustering Methods. Atmosphere 212:1545
Yuchechen AE, Canziani PO, Bischoff SA (2017) Stratosphere/troposphere joint variability in southern South America as estimated from a principal components analysis. Meteorol Atmos Phys 129:247–271
Zadeh LA (1965) Fuzzy Sets. Inf Control 8:338–353
Zhang JP, Zhu T, Zhang QH, Li CC, Shu HL, Ying Y, Dai ZP, Wang X, Liu XY, Liang AM, Shen HX, Yi BQ (2012) The impact of circulation patterns on regional transport pathways and air quality over Beijing and its surroundings. Atmos Chem 12:5031–5053
Zhao Z, Xi H, Russo A, Du H, Gong Y, Xiang J, Zhou Z, Zhang J, Li C, Zhou C (2019) The Influence of Multi-Scale Atmospheric Circulation on Severe Haze Events in Autumn and Winter in Shanghai. China Sustain 11:5979
Author information
Authors and Affiliations
Contributions
All authors worked together on all aspects of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval
No human subject is involved in this study and Figures belong to the authors. The paper is also not under consideration in any Journal. There is also no conflict of interest in this paper.
Consent to participate
No human research is used. The authors consent this paper to be considered.
Consent for publication
The authors consent this paper to being published.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ibebuchi, C.C., Richman, M.B. Circulation typing with fuzzy rotated T-mode principal component analysis: methodological considerations. Theor Appl Climatol 153, 495–523 (2023). https://doi.org/10.1007/s00704-023-04474-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-023-04474-5