Abstract
Mixtures of skew component distributions are being applied widely to model and partition data into clusters that exhibit non-normal features such as asymmetry and tails heavier than the normal. The number of contributions on skew distributions are now so many that it is beyond the scope of this paper to include them all here. However, many of these developments can be considered as special cases of a (location-scale variant) of the fundamental skew normal (CFUSN) distribution or of the fundamental skew t (CFUST) distribution. We therefore focus on mixtures of CFUSN and CFUST distributions, along with a recently proposed extension that can be viewed as a scale-mixture of the CFUSN distribution, namely the canonical fundamental skew (symmetric generalized) hyperbolic (CFUSH) distribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allard, A., Soubeyrand, S.: Skew-normality for climatic data and dispersal models for plant epidemiology: when application fields drive spatial statistics. Spat. Stat. 1, 50–64 (2012)
Arellano-Valle, R.B., Azzalini, A.: On the unification of families of skew-normal distributions. Scand. J. Stat. 33, 561–574 (2006)
Arellano-Valle, R.B., Branco, M.D., Genton, M.G.: A unified view on skewed distributions arising from selections. Can. J. Stat. 34, 581–601 (2006)
Arellano-Valle, R.B., Genton, M.G.: On fundamental skew distributions. J. Multivar. Anal. 96, 93–116 (2005)
Asparouhov, T., Muthén, B.: Structural equation models and mixture models with continuous non-normal skewed distributions. Struct. Equ. Model.:Multidisc. J. 23, 1–19 (2016)
Azzalini, A.: A class of distributions which includes the normal ones. Scand. J. Stat. 12, 171–178 (1985)
Azzalini, A.: The skew-normal distribution and related multivariate families. Scand. J. Stat. 32, 159–188 (2005)
Azzalini, A.: The Skew-Normal and Related Families. Cambridge University Press, Cambridge (2014). Institute of Mathematical Statistics Monographs
Azzalini, A., Browne, R.P., Genton, M.G., McNicholas, P.D.: On nomenclature for, and the relative merits of, two formulations of skew distributions. Stat. Probab. Lett. 110, 201–206 (2016)
Azzalini, A., Capitanio, A.: Distributions generated by perturbation of symmetry with emphasis on a multivariate skew \(t\) distribution. J. Roy. Stat. Soc. B 65, 367–389 (2003)
Azzalini, A., Dalla Valle, A.: The multivariate skew-normal distribution. Biometrika 83, 715–726 (1996)
Contreras-Reyes, J.E., Arellano-Valle, R.B.: Growth estimates of cardinalfish (Epigonus Crassicaudus) based on scale mixtures of skew-normal distributions. Fish. Res. 147, 137–144 (2013)
Contreras-Reyes, J.E., López Quintero, F.O., Yáñez, A.A.: Towards age determination of Southern King crab (Lithodes Santolla) off Southern Chile using flexible mixture modeling. J. Marine Sci. Eng. 6, 157 (2018)
Forbes, F., Wraith, D.: A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering. Stat. Comput. 24, 971–984 (2013)
Genton, M.G. (ed.): Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality. Chapman & Hall/CRC, Boca Raton/Florida (2004)
Hejblum, B.P., Alkhassim, C., Gottardo, R., Caron, F., Thiébaut, R.: Sequential Dirichlet process mixtures of multivariate skew \(t\)-distributions for model-based clustering of flow cytometry data. Ann. Appl. Stat. 13, 638–660 (2019)
Hohmann, L., Holtmann, J., Eid, M.: Skew \(t\) mixture latent state-trait analysis: a Monte Carlo simulation study on statistical performance. Front. Psychol. 9, 1323 (2018)
Karlis, D., Santourian, A.: Model-based clustering with non-elliptically contoured distributions. Stat. Comput. 19, 73–83 (2009)
Kollo, T.: Multivariate skewness and kurtosis measures with an application in ICA. J. Multivar. Anal. 99, 2328–2338 (2008)
Kollo, T., Srivastava, M.S.: Estimation and testing of parameters in multivariate Laplace distribution. Commun. Stat. - Theor. Methods 33, 2363–2387 (2007)
Lee, S.X., Lin, T.-I., McLachlan, G.J.: Mixtures of factor analyzers with fundamental skew symmetric distributions. arXiv:1802.02467 (2018)
Lee, S.X., McLachlan, G.J.: On mixtures of skew normal and skew \(t\)-distributions. Adv. Data Anal. Classif. 7, 241–266 (2013a)
Lee, S.X., McLachlan, G.J.: Model-based clustering and classification with non-normal mixture distributions (with discussion). Stat. Methods Appl. 22, 427–479 (2013b)
Lee, S.X., McLachlan, G.J.: Finite mixtures of multivariate skew \(t\)-distributions: some recent and new results. Stat. Comput. 24, 181–202 (2014a)
Lee, S.X., McLachlan, G.J.: Maximum likelihood estimation for finite mixtures of canonical fundamental skew \(t\)-distributions: the unification of the unrestricted and restricted skew \(t\)-mixture models. arXiv:1401.8182v1 [stat.ME] (2014b)
Lee, S.X., McLachlan, G.J.: EMMIXcskew: an R package for the fitting of a mixture of canonical fundamental skew \(t\)-distributions. arXiv:1509.02069.v1 [stat.CO] (2015)
Lee, S.X., McLachlan, G.J.: EMMIXcskew: an R Package for the fitting of a mixture of canonical fundamental skew \(t\)-distributions. J. Stat. Softw. 83(3) (2018)
Lee, S.X., McLachlan, G.J.: Finite mixtures of canonical fundamental skew \(t\)-distributions: the unification of the restricted and unrestricted skew \(t\)-mixture models. Stat. Comput. 26, 573–589 (2016)
Lee, S.X., McLachlan, G.J., Pyne, S.: Modelling of inter-sample variation in flow cytometric data with the joint clustering and matching (JCM) procedure. Cytometry Part A 89A, 30–43 (2016)
Lin, T.-I.: Maximum likelihood estimation for multivariate skew normal mixture models. J. Multivar. Anal. 101, 257–265 (2009a)
Lin, T.-I.: Robust mixture modeling using the multivariate skew \(t\)-distributions. Stat. Comput. 20, 343–356 (2009b)
Lin, T.-I., Lee, J.C., Hsieh, W.: Robust mixture modeling using the skew \(t\) distribution. Stat. Comput. 17, 81–92 (2007a)
Lin, T.-I., Lee, J.C., Yen, S.Y.: Finite mixture modelling using the skew normal distribution. Stat. Sinica 17, 909–927 (2007b)
Maleki, M., Wraith, D., Arellano-Valle, R.B.: Robust finite mixture modeling of multivariate unrestricted skew-normal generalized hyperbolic distributions. Stat. Comput. 29, 415–428 (2019)
McLachlan, G.J., Lee, S.X.: Comment “On the nomenclature for, and the relative merits of two formulations of skew distributions,” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas. Stat. Probab. Lett. 116, 1–5 (2016)
McLachlan, G.J., Lee, S.X.: Comment on “Hidden truncation hyperbolic distributions, mixtures thereof, and their application for clustering” by Murray, Browne, and McNicholas. arXiv:1904.12057 (2019)
McLachlan, G.J., Lee, S.X., Rathnayake, S.I.: Finite mixture models. Ann. Rev. Stat. Appl. 6, 355–378 (2019)
Mousavi, S.A., Amirzadeh, V., Rezapour, M., Sheikhy, A.: Multivariate tail conditional expectation for scale mixtures of skew-normal distribution. J. Stat. Comput. Simul. 89, 3167–3181 (2019)
Murray, P.M., Browne, R.B., McNicholas, P.D.: Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering. J. Multivar. Anal. 161, 141–156 (2017)
Murray, P.M., Browne, R.B., McNicholas, P.D.: Note of Clarification on “Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering”, by Murray, Browne, and McNicholas, J. Multivariate Anal. 161 (2017) 141–156. J. Multivar. Anal. 171, 475–476 (2019)
Pyne, S., et al.: Automated high-dimensional flow cytometric data analysis. Proc. Natl. Acad. Sci. USA 106, 8519–8524 (2009)
Pyne, S., et al.: Joint modeling and registration of cell populations in cohorts of high-dimensional flow cytometric data. PLoS One 9(7), e100334 (2014)
Riggi, S., Ingrassia, S.: Modeling high energy cosmic rays mass composition data via mixtures of multivariate skew-\(t\) distributions. arXiv:13011178 [astro-phHE] (2013)
Sahu, S.K., Dey, D.K., Branco, M.D.: A new class of multivariate skew distributions with applications to Bayesian regression models. Can. J. Stat. 31, 129–150 (2003)
Seshadri, V.: Halphen’s laws. In: Encyclopedia of Statistical Sciences, pp. 302–306. Wiley, New York (1997)
Tagle, F., Castruccio, S., Crippa, P., Genton, M.G.: A non-Gaussian spatio-temporal model for daily wind speeds based on a multi-variate skew-\(t\) distribution. J. Time Ser. Anal. 40, 312–326 (2019)
Voigt, T., Fried, R.: Distance based feature construction in a setting of astronomy. In: Lausen, B., Krolak-Schwerdt, S., Böhmer, M. (eds.) Data Science, Learning by Latent Structures, and Knowledge Discovery. SCDAKO, pp. 475–485. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-44983-7_42
Wraith, D., Forbes, F.: Clustering using skewed multivariate heavy tailed distributions with flexible tail behaviour. Comput. Stat. Data Anal. 90, 61–72 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
McLachlan, G.J., Lee, S.X. (2019). Flexible Modelling via Multivariate Skew Distributions. In: Nguyen, H. (eds) Statistics and Data Science. RSSDS 2019. Communications in Computer and Information Science, vol 1150. Springer, Singapore. https://doi.org/10.1007/978-981-15-1960-4_4
Download citation
DOI: https://doi.org/10.1007/978-981-15-1960-4_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1959-8
Online ISBN: 978-981-15-1960-4
eBook Packages: Computer ScienceComputer Science (R0)