Probabilistic clustering of extratropical cyclones using regression mixture models

Gaffney, Scott J.; Robertson, Andrew W.; Smyth, Padhraic; Camargo, Suzana J.; Ghil, Michael

doi:10.1007/s00382-007-0235-z

Probabilistic clustering of extratropical cyclones using regression mixture models

Published: 06 March 2007

Volume 29, pages 423–440, (2007)
Cite this article

Climate Dynamics Aims and scope Submit manuscript

Scott J. Gaffney¹,
Andrew W. Robertson^2,6,
Padhraic Smyth³,
Suzana J. Camargo² &
…
Michael Ghil^4,5

1787 Accesses
127 Citations
Explore all metrics

Abstract

A probabilistic clustering technique is developed for classification of wintertime extratropical cyclone (ETC) tracks over the North Atlantic. We use a regression mixture model to describe the longitude-time and latitude-time propagation of the ETCs. A simple tracking algorithm is applied to 6-hourly mean sea-level pressure fields to obtain the tracks from either a general circulation model (GCM) or a reanalysis data set. Quadratic curves are found to provide the best description of the data. We select a three-cluster classification for both data sets, based on a mix of objective and subjective criteria. The track orientations in each of the clusters are broadly similar for the GCM and reanalyzed data; they are characterized by predominantly south-to-north (S–N), west-to-east (W–E), and southwest-to-northeast (SW–NE) tracking cyclones, respectively. The reanalysis cyclone tracks, however, are found to be much more tightly clustered geographically than those of the GCM. For the reanalysis data, a link is found between the occurrence of cyclones belonging to different clusters of trajectory-shape, and the phase of the North Atlantic Oscillation (NAO). The positive phase of the NAO is associated with the SW–NE oriented cluster, whose tracks are relatively straight and smooth (with cyclones that are typically faster, more intense, and of longer duration). The negative NAO phase is associated with more-erratic W–E tracks, with typically weaker and slower-moving cyclones. The S–N cluster is accompanied by a more transient geopotential trough over the western North Atlantic. No clear associations are found in the case of the GCM composites. The GCM is able to capture cyclone tracks of quite realistic orientation, as well as subtle associated features of cyclone intensity, speed and lifetimes. The clustering clearly highlights, though, the presence of serious systematic errors in the GCM’s simulation of ETC behavior.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Eastern North Pacific tropical cyclone activity in historical and future CMIP5 experiments: assessment with a model-independent tracking scheme

Article 28 May 2019

A modified algorithm for identifying and tracking extratropical cyclones

Article 07 June 2017

Circular characteristics of the Philippine storm tracks

Article 13 October 2021

References

Anderson D, Hodges KI, Hoskins BJ (2003) Sensitivity of feature-based analysis methods of storm tracks to the form of background field. Mon Wea Rev 131(3):565–573
Article Google Scholar
Bernardo JM, Smith AFM (1994) Bayesian theory. Wiley, New York
Google Scholar
Blackmon ML, Wallace JM, Lau N-C, Mullen SL (1977) An observational study of the Northern Hemisphere wintertime circulation. J Atmos Sci 34:1040–1053
Article Google Scholar
Blender R, Fraedrich K, Lunkeit F (1997) Identification of cyclone-track regimes in the North Atlantic. Quart J Royal Meteor Soc 123:727–741
Article Google Scholar
Camargo SJ, Robertson AW, Gaffney SJ, Smyth P (2007a) Cluster analysis of typhoon tracks. Part I: General properties. J Climate (in press)
Camargo SJ, Robertson AW, Gaffney SJ, Smyth P (2007b) Cluster analysis of typhoon tracks. Part II: Large-scale circulation and ENSO. J Climate (in press)
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J Royal Stat Soc B 39:1–38
Google Scholar
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J Roy Stat Soc B 39:1–38
Google Scholar
DeSarbo WS, Cron WL (1988) A maximum likelihood methodology for clusterwise linear regression. J Classificat 5(1):249–282
Article Google Scholar
Draper NR, Smith H (1981) Applied regression analysis, 2nd edn. Wiley , New York
Google Scholar
Elsner JB (2003) Tracking hurricanes. Bull Amer Meteor Soc 84(3):353–356
Article Google Scholar
Elsner JB, Liu Kb, Kocher B (2000) Spatial variations in major US hurricane activity: statistics and a physical mechanism. J Climate 13:2293–2305
Article Google Scholar
Fraley C, Raftery AE (1998) How many clusters? Which clustering methods? Answers via model-based cluster analysis. Comput J 41(8):578–588
Article Google Scholar
Fraley C, Raftery AE (2002) Model-based clustering, discriminant analysis, and density estimation. J Amer Stat Assoc 97(458):611–631
Article Google Scholar
Fyfe JC (2003) Extratropical southern hemisphere cyclones: Harbingers of climate change. J Climate 16:2802–2805
Article Google Scholar
Gaffney SJ (2004) Probabilistic curve-aligned clustering and prediction with regression mixture models. Ph.D. Dissertation, Department of Computer Science, University of California, Irvine
Gaffney S, Smyth P (1999) Trajectory clustering with mixtures of regression models. In: Surajit Chaudhuri, David Madigan (eds) Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining, August 15–18, 1999. ACM Press, New York, pp 63–72
Gaffney SJ, Smyth P (2003) Curve clustering with random effects regression mixtures. In: Bishop CM, Frey BJ (eds) Proceedings of the 9th international workshop on artificial intelligence and statistics, Key West, FL, January 3–6, 2003
Gneiting T, Raftery AE (2004) Strictly proper scoring rules, prediction, and estimation. Technical Report 463, Department of Statistics, University of Washington
Hack JJ, Kiehl JT, Hurrell JW (1998) The hydrologic and thermodynamic characteristics of the NCAR CCM3. J Climate 11:1179–1206
Article Google Scholar
Hannachi A, O’Neill A (2001) Atmospheric multiple equilibria and non-Gaussian behaviour in model simulations. Quart J Royal Meteor Soc 127(573):939–958
Article Google Scholar
Hartigan JA, Wong MA (1978) Algorithm AS 136: A K-means clustering algorithm. Appl Stat 28:100–108
Article Google Scholar
Hodges KI (1994) A general method for tracking analysis and its applications to meteorological data. Mon Wea Rev 122(11):2573–2586
Article Google Scholar
Hodges KI (1995) Feature tracking on the unit sphere. Mon Wea Rev 123(12):3458–3465
Article Google Scholar
Hodges KI (1998) Feature-point detection using distance transforms: Application to tracking tropical convective complexes. Mon Wea Rev 126(3):785–795
Article Google Scholar
Hoskins BJ, Hodges KI (2002) New perspectives on the Northern Hemisphere winter storm tracks. J Atmos Sci 59(6):1041–1061
Article Google Scholar
Hurrell JW, Kushnir Y, Ottersen G, Visbeck M (2003) An overview of the North Atlantic Oscillation. Geophys Monogr 134:2217–2231
Google Scholar
Kalnay E, Kanamitsu M, Kistler R, Collins W, Deaven D, Gandin L, Iredell M, Saha S, White G, Woollen J, Zhu Y, Chelliah M, Ebisuzaki W, Higgins W, Janowiak J, Mo KC, Ropelewski C, Wang J, Leetmaa A, Reynolds R, Jenne R, Joseph D (1996) The NCEP/NCAR 40-year reanalysis project. Bull Amer Meteor Soc 77:437–441
Article Google Scholar
Kimoto M, Ghil M (1993) Multiple flow regimes in the northern hemisphere winter. Part II: Sectorial regimes and preferred transitions. J Atmos Sci 16:2645–2673
Article Google Scholar
König W, Sausen R, Sielman F (1993) Objective identification of cyclones in GCM simulations. J Climate 6(12):2217–2231
Article Google Scholar
Lau N-C (1988) Variability of the observed midlatitude storm tracks in relation to low-frequency changes in the circulation pattern. J Atmos Sci 45:2718–2743
Article Google Scholar
Le Treut H, Kalnay E (1990) Comparison of observed and simulated cyclone frequency distribution as determined by an objective method. Atmosfera 3:57–71
Google Scholar
Lenk PJ, DeSarbo WS (2000) Bayesian inference for finite mixtures of generalized linear models with random effects. Psychometrika 65(1):93–119
Article Google Scholar
Lwin T, Martin PJ (1989) Probits of mixtures. Biometrics 45:721–732
Article Google Scholar
Mailier PJ, Stephenson DB, Ferro CAT, Hodges KJ (2006) Serial clustering of extratropical cyclones. Mon Wea Rev 134:2224–2240
Article Google Scholar
McLachlan GJ, Basford KE (1988) Mixture models: inference and applications to clustering. Marcel Dekker, New York
Google Scholar
McLachlan GJ, Krishnan T (1997) The EM algorithm and extensions. Wiley, New York
Google Scholar
McLachlan GJ, Peel D (2000) Finite mixture models. Wiley, New York
Google Scholar
Mesrobian E, Muntz R, Shek E, Mechoso CR, Farrara J, Spahr J, Stolorz P (1995) Real time data mining, management, and visualization of GCM output. Supercomputing ’94, IEEE Computer Society, pp 81–87
Mo K, Ghil M (1988) Cluster analysis of multiple planetary flow regimes. J Geophys Res 93D:10927–10952
Article Google Scholar
MunichRe (2002) Winter storms in Europe (II). Analysis of 1999 losses and loss potentials. Technical Report 302-03109, 72 pp [Available from Münchner Rückversicherungs-Gesellschaft, Königinstr. 107, 80802 München, Germany]
Preisendorfer RW (1988) Principal component analysis in meteorology and oceanography. Elsevier, Amsterdam
Google Scholar
Ramsay JO, Silverman BW (1997) Functional data analysis. Springer, New York
Google Scholar
Ramsay JO, Silverman BW (2002) Applied functional data analysis: methods and case studies. Springer, New York
Google Scholar
Robertson AW, Metz W (1990) Transient-eddy feedbacks derived from linear theory and observations. J Atmos Sci 47:2743–2764
Article Google Scholar
Ross J. Murray, Ian Simmonds (1991) A numerical scheme for tracking cyclone centres from digital data. Part I: Development and operation of the scheme. Aust Meteor Mag 39:155–166
Google Scholar
Saunders MA (1999) Earth’s future climate. Philos Trans Roy Soc Lond A 357:3459–3480
Article Google Scholar
Simmons AJ, Hoskins BJ (1978) The life cycles of some nonlinear baroclinic waves. J Atmos Sci 35(3):414–432
Article Google Scholar
Smyth P, Ide K, Ghil M (1999) Multiple regimes in northern hemisphere height fields via mixture model clustering. J Atmos Sci 56(21):3704–3723
Article Google Scholar
Smyth P (2000) Model selection for probabilistic clustering using cross-validated likelihood. Stat Comput 10(1):63–72
Article Google Scholar
von Storch H, Zwiers FW (1999) Statistical analysis in climate research. Cambridge University Press, Cambridge
Google Scholar
Terry J, Atlas R (1996) Objective cyclone tracking and its applications to ERS-1 scatterometer forecast impact studies. In: 15th conference on weather analysis and forecasting, Norfolk, VA. American Meteorological Society
Trenberth KE (1986) An assessment of the impact of transient eddies on the zonal flow during a blocking episode using localized Eliassen-Palm flux diagnostics. J Atmos Sci 43:2070–2087
Article Google Scholar
Vautard R (1990) Multiple weather regimes over the North Atlantic: analysis of precursors and successors. Mon Wea Rev 45:2845–2867
Google Scholar
Vrac M, Chedin A, Diday E (2005) Clustering a global field of atmospheric profiles by mixture decomposition of copulas. J Atmos Ocean Technol 22(10):1445–1459
Article Google Scholar
Wang K, Gasser T (1997) Alignment of curves by dynamic time warping. Annal Stat 25:1251–1276
Article Google Scholar
Yiou P, Nogaj M (2004) Extreme climatic events and weather regimes over the North Atlantic: when and where? Geophys Res Lett 31:L07202. doi:10.1029/2003GL019119
Article Google Scholar

Download references

Acknowledgments

We wish to thank Kevin Hodges for helpful discussions, and Jim Boyle and Peter Glecker for help in obtaining the NCAR CCM3 data. We are grateful to Kevin Hodges and two anonymous referees for their constructive reviews which substantially improved the paper. The NCEP–NCAR Reanalysis data were provided by the NOAA CIRES Climate Diagnostics Center, Boulder, Colorado, from their Web site available online at http://www.cdc.noaa.go. This work was supported in part by a Department of Energy grant DE-FG02-02ER63413 (MG and AWR), by NOAA through a block grant to the International Research Institute for Climate and Society (SJC and AWR), and by the National Science Foundation under grants No. SCI-0225642, IIS-0431085, and ATM-0530926 (SJG and PS).

Author information

Authors and Affiliations

Data Mining Research, Yahoo! Inc., Sunnyvale, CA, USA
Scott J. Gaffney
International Research Institute for Climate and Society (IRI), The Earth Institute at Columbia University, Palisades, NY, USA
Andrew W. Robertson & Suzana J. Camargo
Department of Computer Science, University of California, Irvine, CA, USA
Padhraic Smyth
Department of Atmospheric and Oceanic Sciences and IGPP, University of California, Los Angeles, CA, USA
Michael Ghil
Département Terre-Atmosphère-Océan and Laboratoire de Météorologie Dynamique (CNRS and IPSL), Ecole Normale Supérieure, Paris, France
Michael Ghil
IRI - Monell 230, 61 Route 9W, Palisades, NY, 10964, USA
Andrew W. Robertson

Authors

Scott J. Gaffney
View author publications
You can also search for this author in PubMed Google Scholar
Andrew W. Robertson
View author publications
You can also search for this author in PubMed Google Scholar
Padhraic Smyth
View author publications
You can also search for this author in PubMed Google Scholar
Suzana J. Camargo
View author publications
You can also search for this author in PubMed Google Scholar
Michael Ghil
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew W. Robertson.

Appendix A: Expectation maximization algorithm

The EM algorithm is an iterative maximum likelihood (ML) procedure that provides a general and efficient framework for parameter estimation. At a base level, EM is an approximate root-finding procedure used to seek the root of the likelihood equation by iteratively searching for a set of parameters that maximize the probability of the observed data. EM is primarily used for finding ML parameter estimates in missing- or hidden-data problems. Parameter estimation in hidden-data problems is difficult because the likelihood equation takes on a complex form, often involving an integral or a sum over the hidden data itself.

For example, Eq. (5) in Sect. 3.3 gives the likelihood of ϕ given both Z and T (repeated here):

$$ L(\phi|{\bf Z},{\bf T}) = p({\bf Z}|{\bf T},\phi) = \prod_i^n \sum_k^K \alpha_k\, f_k({{\bf z}}_i|{{\bf T}}_i{\varvec{\beta}}_k,\Sigma_k). $$

(A1)

Notice that the hidden data in this case are the unknown cluster memberships which must be summed-out of the likelihood to arrive at L(ϕ|Z,T). It is understood in hidden-data problems that this operation cannot be easily carried out. The EM algorithm is an iterative two-step procedure used to circumvent this integration (or sum) by (1) indirectly estimating values for the unobserved data, and (2) finding the ML parameter estimates that correspond to the now completely observed data. The new ML estimates from step (2) are then used to re-estimate the hidden data in step (1), and these iterations are continued until some stopping criterion is reached (typically this involves stopping when the change in log-likelihood falls below a particular threshold, and thus the iterations have stabilized).

In the first step, the E-step, we estimate the hidden cluster memberships by forming the ratio of the likelihood of trajectory i under cluster k, to the sum-total likelihood of trajectory i under all clusters:

$$ w_{ik} = \frac{\alpha_k f_k({{\bf z}}_i|{{\bf T}}_i{\varvec{\beta}}_k,\Sigma_k)} {\sum_j^K \alpha_j f_j({{\bf z}}_i|{{\bf T}}_i{\varvec{\beta}}_j,\Sigma_j)}. $$

(A2)

These w _ik give the probabilities that the ith trajectory was generated from cluster k. They represent a posterior expectation for the value of the actual binary cluster memberships (i.e., the ith trajectory was either generated by the kth cluster or it was not).

In the second step, the M-step, the expected cluster memberships from the E-step are used to form the weighted log-likelihood function:

$$ {\mathcal{L}}(\phi|{\bf Z},{\bf T}) = \sum_i^n \sum_k^K w_{ik} \log \alpha_k\, f_k({{\bf z}}_i|{{\bf T}}_i{\varvec{\beta}}_k,\Sigma_k). $$

(A3)

The membership probabilities weight the contribution that the kth density component adds to the overall likelihood. In the case where the w _ik are binary, and thus cluster membership is perfectly known, this reduces to the usual fully-observed log-likelihood. This weighted log-likelihood is then maximized with respect to the parameter set ϕ.

For the sake of completeness, we give each of the re-estimation equations below. Let ${{{\bf w}}_{ik} = w_{ik}{{\bf I}}_{n_i},}$ where ${{{\bf I}}_{n_i}}$ is an n _i-vector of ones, and let W _k = diag(w′_1k, ..., w′_nk) be an N × N diagonal matrix. Then, in the M-step we use W _k to calculate the mixture parameters

$$ \hat{{\varvec{\beta}}}_k = ({{\bf T}}'{{\bf W}}_k{{\bf T}})^{-1}{{\bf T}}'{{\bf W}}_k{{\bf Z}}, $$

(A4)

$$ \hat{\Sigma_k} = \frac{({{\bf Z}}-{{\bf T}}\hat{\beta}_k)'{{\bf W}}_k ({{\bf Z}}-{{\bf T}}\hat{\beta}_k)}{\sum_i^n w_{ik}}, $$

(A5)

and the mixture weights

$$ \hat{\alpha_k} = {\frac{1}{n}} \sum_i^n w_{ik} $$

(A6)

for k = 1,...,K. These update equations are equivalent to the well-known weighted least-squares solution in regression (Draper and Smith 1981). The diagonal elements of W _k represent the weights to be applied to Z and T during the weighted regression.

Because most of the difficult work is carried out in estimating the cluster memberships, the maximization carried out in the M-step is straightforward. This is a common attribute of the EM algorithm. Dempster et al. (1977b) showed that under fairly general conditions, the likelihood will never decrease during the E- and M-step iterations. Due to the presence of local maxima on the likelihood surface, the solution is not guaranteed to correspond to a global maximum. However, we can increase the chances of finding the global maximum by running the EM algorithm multiple times from different starting points in parameter space and selecting the parameters that result in the highest overall likelihood.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gaffney, S.J., Robertson, A.W., Smyth, P. et al. Probabilistic clustering of extratropical cyclones using regression mixture models. Clim Dyn 29, 423–440 (2007). https://doi.org/10.1007/s00382-007-0235-z

Download citation

Received: 26 September 2006
Accepted: 26 January 2007
Published: 06 March 2007
Issue Date: September 2007
DOI: https://doi.org/10.1007/s00382-007-0235-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Probabilistic clustering of extratropical cyclones using regression mixture models

Abstract

Access this article

Similar content being viewed by others

Eastern North Pacific tropical cyclone activity in historical and future CMIP5 experiments: assessment with a model-independent tracking scheme

A modified algorithm for identifying and tracking extratropical cyclones

Circular characteristics of the Philippine storm tracks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix A: Expectation maximization algorithm

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Probabilistic clustering of extratropical cyclones using regression mixture models

Abstract

Access this article

Similar content being viewed by others

Eastern North Pacific tropical cyclone activity in historical and future CMIP5 experiments: assessment with a model-independent tracking scheme

A modified algorithm for identifying and tracking extratropical cyclones

Circular characteristics of the Philippine storm tracks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix A: Expectation maximization algorithm

Appendix A: Expectation maximization algorithm

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation