Skip to main content

Flexible Modelling via Multivariate Skew Distributions

  • Conference paper
  • First Online:
Statistics and Data Science (RSSDS 2019)

Abstract

Mixtures of skew component distributions are being applied widely to model and partition data into clusters that exhibit non-normal features such as asymmetry and tails heavier than the normal. The number of contributions on skew distributions are now so many that it is beyond the scope of this paper to include them all here. However, many of these developments can be considered as special cases of a (location-scale variant) of the fundamental skew normal (CFUSN) distribution or of the fundamental skew t (CFUST) distribution. We therefore focus on mixtures of CFUSN and CFUST distributions, along with a recently proposed extension that can be viewed as a scale-mixture of the CFUSN distribution, namely the canonical fundamental skew (symmetric generalized) hyperbolic (CFUSH) distribution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Allard, A., Soubeyrand, S.: Skew-normality for climatic data and dispersal models for plant epidemiology: when application fields drive spatial statistics. Spat. Stat. 1, 50–64 (2012)

    Article  Google Scholar 

  • Arellano-Valle, R.B., Azzalini, A.: On the unification of families of skew-normal distributions. Scand. J. Stat. 33, 561–574 (2006)

    Article  MathSciNet  Google Scholar 

  • Arellano-Valle, R.B., Branco, M.D., Genton, M.G.: A unified view on skewed distributions arising from selections. Can. J. Stat. 34, 581–601 (2006)

    Article  MathSciNet  Google Scholar 

  • Arellano-Valle, R.B., Genton, M.G.: On fundamental skew distributions. J. Multivar. Anal. 96, 93–116 (2005)

    Article  MathSciNet  Google Scholar 

  • Asparouhov, T., Muthén, B.: Structural equation models and mixture models with continuous non-normal skewed distributions. Struct. Equ. Model.:Multidisc. J. 23, 1–19 (2016)

    Article  Google Scholar 

  • Azzalini, A.: A class of distributions which includes the normal ones. Scand. J. Stat. 12, 171–178 (1985)

    MathSciNet  MATH  Google Scholar 

  • Azzalini, A.: The skew-normal distribution and related multivariate families. Scand. J. Stat. 32, 159–188 (2005)

    Article  MathSciNet  Google Scholar 

  • Azzalini, A.: The Skew-Normal and Related Families. Cambridge University Press, Cambridge (2014). Institute of Mathematical Statistics Monographs

    MATH  Google Scholar 

  • Azzalini, A., Browne, R.P., Genton, M.G., McNicholas, P.D.: On nomenclature for, and the relative merits of, two formulations of skew distributions. Stat. Probab. Lett. 110, 201–206 (2016)

    Article  MathSciNet  Google Scholar 

  • Azzalini, A., Capitanio, A.: Distributions generated by perturbation of symmetry with emphasis on a multivariate skew \(t\) distribution. J. Roy. Stat. Soc. B 65, 367–389 (2003)

    Article  MathSciNet  Google Scholar 

  • Azzalini, A., Dalla Valle, A.: The multivariate skew-normal distribution. Biometrika 83, 715–726 (1996)

    Article  MathSciNet  Google Scholar 

  • Contreras-Reyes, J.E., Arellano-Valle, R.B.: Growth estimates of cardinalfish (Epigonus Crassicaudus) based on scale mixtures of skew-normal distributions. Fish. Res. 147, 137–144 (2013)

    Google Scholar 

  • Contreras-Reyes, J.E., López Quintero, F.O., Yáñez, A.A.: Towards age determination of Southern King crab (Lithodes Santolla) off Southern Chile using flexible mixture modeling. J. Marine Sci. Eng. 6, 157 (2018)

    Article  Google Scholar 

  • Forbes, F., Wraith, D.: A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering. Stat. Comput. 24, 971–984 (2013)

    Article  MathSciNet  Google Scholar 

  • Genton, M.G. (ed.): Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality. Chapman & Hall/CRC, Boca Raton/Florida (2004)

    MATH  Google Scholar 

  • Hejblum, B.P., Alkhassim, C., Gottardo, R., Caron, F., Thiébaut, R.: Sequential Dirichlet process mixtures of multivariate skew \(t\)-distributions for model-based clustering of flow cytometry data. Ann. Appl. Stat. 13, 638–660 (2019)

    Article  MathSciNet  Google Scholar 

  • Hohmann, L., Holtmann, J., Eid, M.: Skew \(t\) mixture latent state-trait analysis: a Monte Carlo simulation study on statistical performance. Front. Psychol. 9, 1323 (2018)

    Article  Google Scholar 

  • Karlis, D., Santourian, A.: Model-based clustering with non-elliptically contoured distributions. Stat. Comput. 19, 73–83 (2009)

    Article  MathSciNet  Google Scholar 

  • Kollo, T.: Multivariate skewness and kurtosis measures with an application in ICA. J. Multivar. Anal. 99, 2328–2338 (2008)

    Article  MathSciNet  Google Scholar 

  • Kollo, T., Srivastava, M.S.: Estimation and testing of parameters in multivariate Laplace distribution. Commun. Stat. - Theor. Methods 33, 2363–2387 (2007)

    Article  MathSciNet  Google Scholar 

  • Lee, S.X., Lin, T.-I., McLachlan, G.J.: Mixtures of factor analyzers with fundamental skew symmetric distributions. arXiv:1802.02467 (2018)

  • Lee, S.X., McLachlan, G.J.: On mixtures of skew normal and skew \(t\)-distributions. Adv. Data Anal. Classif. 7, 241–266 (2013a)

    Article  MathSciNet  Google Scholar 

  • Lee, S.X., McLachlan, G.J.: Model-based clustering and classification with non-normal mixture distributions (with discussion). Stat. Methods Appl. 22, 427–479 (2013b)

    Article  MathSciNet  Google Scholar 

  • Lee, S.X., McLachlan, G.J.: Finite mixtures of multivariate skew \(t\)-distributions: some recent and new results. Stat. Comput. 24, 181–202 (2014a)

    Google Scholar 

  • Lee, S.X., McLachlan, G.J.: Maximum likelihood estimation for finite mixtures of canonical fundamental skew \(t\)-distributions: the unification of the unrestricted and restricted skew \(t\)-mixture models. arXiv:1401.8182v1 [stat.ME] (2014b)

  • Lee, S.X., McLachlan, G.J.: EMMIXcskew: an R package for the fitting of a mixture of canonical fundamental skew \(t\)-distributions. arXiv:1509.02069.v1 [stat.CO] (2015)

  • Lee, S.X., McLachlan, G.J.: EMMIXcskew: an R Package for the fitting of a mixture of canonical fundamental skew \(t\)-distributions. J. Stat. Softw. 83(3) (2018)

    Google Scholar 

  • Lee, S.X., McLachlan, G.J.: Finite mixtures of canonical fundamental skew \(t\)-distributions: the unification of the restricted and unrestricted skew \(t\)-mixture models. Stat. Comput. 26, 573–589 (2016)

    Article  MathSciNet  Google Scholar 

  • Lee, S.X., McLachlan, G.J., Pyne, S.: Modelling of inter-sample variation in flow cytometric data with the joint clustering and matching (JCM) procedure. Cytometry Part A 89A, 30–43 (2016)

    Article  Google Scholar 

  • Lin, T.-I.: Maximum likelihood estimation for multivariate skew normal mixture models. J. Multivar. Anal. 101, 257–265 (2009a)

    Article  MathSciNet  Google Scholar 

  • Lin, T.-I.: Robust mixture modeling using the multivariate skew \(t\)-distributions. Stat. Comput. 20, 343–356 (2009b)

    Article  MathSciNet  Google Scholar 

  • Lin, T.-I., Lee, J.C., Hsieh, W.: Robust mixture modeling using the skew \(t\) distribution. Stat. Comput. 17, 81–92 (2007a)

    Article  MathSciNet  Google Scholar 

  • Lin, T.-I., Lee, J.C., Yen, S.Y.: Finite mixture modelling using the skew normal distribution. Stat. Sinica 17, 909–927 (2007b)

    MathSciNet  MATH  Google Scholar 

  • Maleki, M., Wraith, D., Arellano-Valle, R.B.: Robust finite mixture modeling of multivariate unrestricted skew-normal generalized hyperbolic distributions. Stat. Comput. 29, 415–428 (2019)

    Article  MathSciNet  Google Scholar 

  • McLachlan, G.J., Lee, S.X.: Comment “On the nomenclature for, and the relative merits of two formulations of skew distributions,” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas. Stat. Probab. Lett. 116, 1–5 (2016)

    Article  MathSciNet  Google Scholar 

  • McLachlan, G.J., Lee, S.X.: Comment on “Hidden truncation hyperbolic distributions, mixtures thereof, and their application for clustering” by Murray, Browne, and McNicholas. arXiv:1904.12057 (2019)

  • McLachlan, G.J., Lee, S.X., Rathnayake, S.I.: Finite mixture models. Ann. Rev. Stat. Appl. 6, 355–378 (2019)

    Article  MathSciNet  Google Scholar 

  • Mousavi, S.A., Amirzadeh, V., Rezapour, M., Sheikhy, A.: Multivariate tail conditional expectation for scale mixtures of skew-normal distribution. J. Stat. Comput. Simul. 89, 3167–3181 (2019)

    Article  MathSciNet  Google Scholar 

  • Murray, P.M., Browne, R.B., McNicholas, P.D.: Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering. J. Multivar. Anal. 161, 141–156 (2017)

    Article  MathSciNet  Google Scholar 

  • Murray, P.M., Browne, R.B., McNicholas, P.D.: Note of Clarification on “Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering”, by Murray, Browne, and McNicholas, J. Multivariate Anal. 161 (2017) 141–156. J. Multivar. Anal. 171, 475–476 (2019)

    Article  Google Scholar 

  • Pyne, S., et al.: Automated high-dimensional flow cytometric data analysis. Proc. Natl. Acad. Sci. USA 106, 8519–8524 (2009)

    Google Scholar 

  • Pyne, S., et al.: Joint modeling and registration of cell populations in cohorts of high-dimensional flow cytometric data. PLoS One 9(7), e100334 (2014)

    Article  Google Scholar 

  • Riggi, S., Ingrassia, S.: Modeling high energy cosmic rays mass composition data via mixtures of multivariate skew-\(t\) distributions. arXiv:13011178 [astro-phHE] (2013)

  • Sahu, S.K., Dey, D.K., Branco, M.D.: A new class of multivariate skew distributions with applications to Bayesian regression models. Can. J. Stat. 31, 129–150 (2003)

    Article  MathSciNet  Google Scholar 

  • Seshadri, V.: Halphen’s laws. In: Encyclopedia of Statistical Sciences, pp. 302–306. Wiley, New York (1997)

    Google Scholar 

  • Tagle, F., Castruccio, S., Crippa, P., Genton, M.G.: A non-Gaussian spatio-temporal model for daily wind speeds based on a multi-variate skew-\(t\) distribution. J. Time Ser. Anal. 40, 312–326 (2019)

    Article  MathSciNet  Google Scholar 

  • Voigt, T., Fried, R.: Distance based feature construction in a setting of astronomy. In: Lausen, B., Krolak-Schwerdt, S., Böhmer, M. (eds.) Data Science, Learning by Latent Structures, and Knowledge Discovery. SCDAKO, pp. 475–485. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-44983-7_42

    Chapter  Google Scholar 

  • Wraith, D., Forbes, F.: Clustering using skewed multivariate heavy tailed distributions with flexible tail behaviour. Comput. Stat. Data Anal. 90, 61–72 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Geoffrey J. McLachlan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

McLachlan, G.J., Lee, S.X. (2019). Flexible Modelling via Multivariate Skew Distributions. In: Nguyen, H. (eds) Statistics and Data Science. RSSDS 2019. Communications in Computer and Information Science, vol 1150. Springer, Singapore. https://doi.org/10.1007/978-981-15-1960-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-1960-4_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-1959-8

  • Online ISBN: 978-981-15-1960-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics