Model-Based Clustering for Cylindrical Data

SenGupta, Ashis; Roy, Moumita; Chattopadhyay, Asis Kumar

doi:10.1007/978-3-030-62900-7_17

Ashis SenGupta⁹,
Moumita Roy¹⁰ &
Asis Kumar Chattopadhyay¹¹

Part of the book series: Emerging Topics in Statistics and Biostatistics ((ETSB))

798 Accesses
3 Citations

Abstract

The objective of this paper is to perform clustering based on data consisting of both linear and circular variables, that is the data that lie on the surface of a cylinder. There are many circular–linear distributions available in the literature. We use the pragmatic approach of specifying the conditional rather than the marginal, which is often easier. Adopting Arnold et al. (Lecture Notes in Statistics: Conditionally Specified Distributions, Springer Verlag Publisher, Berlin Heidelberg, 1992), we provide the conditional distribution of θ given x and that of x given θ. Here, a mixture model approach based on the joint distribution of the linear and the circular variable is proposed. In particular, two types of such mixture models are used. One is based on the joint distribution of the marginal distribution of the linear variable and the conditional distribution of the circular variable given the linear variable and the other vice versa. Convergence property of Expectation Maximization (EM) algorithm for the members of the curved exponential family used for our models is studied. A real-life application on meteorological data is made of the proposed approaches. Comparison of the two models is done based on this example. The distinctive and important feature of preserving the geometry of the cylindrical manifold by our clustering method and its marked deviation from that for data on \(\Re ^p\) is also revealed by this example.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A cylindrical distribution with heavy-tailed linear part

Article 07 February 2019

Clustering Circular Data via Finite Mixtures of von Mises Distributions and an Application to Data on Wind Directions

Article 03 January 2024

Clustering on the Torus

Article 29 April 2021

References

Abe, T., Ley, C. (2017). A tractable, parsimonious and highly flexible model for cylindrical data, with applications. Econometrics and Statistics, 4, 91–104.
Article MathSciNet Google Scholar
Arnold, B. C., Castillo, E., Sarabia, J. M. (1992). Conditionally specified distributions. Lecture Notes in Statistics. Berlin Heidelberg: Springer.
Google Scholar
Dingxi, Q., Tamhane, A. (2007). A comparative study of k-means algorithm and the normal mixture model for clustering: Univariate case. Journal of Statistical Planning and Inference, 137, 3722–3740.
Article MathSciNet Google Scholar
Fisher, N. I., Lee, A. J. (1992). Regression models for an angular response. Biometrics, 48(3), 665–677.
Article MathSciNet Google Scholar
Green, P. J. (1984). Iteratively reweighted least squares for maximum likelihood estimation, and some robust and resistant alternatives. Journal of the Royal Statistical Society, Series B, 46, 149–192.
MathSciNet MATH Google Scholar
Jammalamadaka, S. R., SenGupta A. (2001). Topics in circular statistics. New Jersey: World Scientific Publishers.
Book Google Scholar
Johnson, R. A., Wehrly, T. E. (1978). Some angular-linear distributions and related regression models. Journal of the American Statistical Association, 73, 602–606.
Article MathSciNet Google Scholar
Johnson, R. A., Wichern, D. W. (2007). Applied multivariate statistical analysis. New Jersey: Pearson Prentice Hall
MATH Google Scholar
Kato, S., Shimizu, K. (2008). Dependent models for observations which include angular ones. Journal of Statistical Planning and Inference, 138, 3538–3549.
Article MathSciNet Google Scholar
Lagona, F., Picone, M., Maruotti, A. (2015). A Hidden Markov model for the analysis of cylindrical time series. Environmetrics, 26, 534–544.
Article MathSciNet Google Scholar
Mardia, K. V., Sutton, T. W. (1978). A model for cylindrical variables with applications. Journal of the Royal Statistical Society. Series B (Methodological), 40(2), 229–233.
Article Google Scholar
McLachlan, G. J., Krishnan, T. (1997). The EM algorithm and extensions. New York: Wiley.
MATH Google Scholar
McLachlan, G. J., Peel, D. (2001). Finite mixture models. Wiley Series in Probability and Statistics, United States of America. Hoboken: Wiley.
Google Scholar
Modlin, D., Fuentes, M., Reich, B. (2012). Circular conditional autoregressive modeling of vector fields. Environmetrics, 23, 46–53.
Article MathSciNet Google Scholar
Seal, B., SenGupta, A. (2012). On the foundations of dependency models and parameters for random variables on cylinder. Calcutta Statistical Association Bulletin, 64, 151–165.
Article MathSciNet Google Scholar
SenGupta, A. (2004). On the construction of probability distributions for directional data. Bulletin of the Calcutta Mathematical Society, 96(2), 139–154.
MathSciNet MATH Google Scholar
SenGupta, A., Ugwuowo, F. (2006). Asymmetric circular-linear multivariate regression models with applications to environmental data. Environmental and Ecological Statistics, 13(3), 299–309.
Article MathSciNet Google Scholar
Vermunt, J. K., Magidson, J. (2005). Hierarchical mixture models for nested data structures. In C. Weihs, W. Gaul (Eds.), Classification – the ubiquitous challenge. Studies in Classification, Data Analysis, and Knowledge Organization (pp. 240–247). Berlin: Springer.
Google Scholar
Wu, J. C. F. (1983). On the convergence properties of the EM algorithm. Annals of Statistics, 11(1), 95–103.
Article MathSciNet Google Scholar

Download references

Acknowledgements

The authors greatly appreciate the thorough and scholarly report on the paper by the Referee and express their thanks hereby. The research of the second author of the paper was funded by the Senior Research Fellowship from the University Grants Commission, Government of India. She is also thankful to the Indian Statistical Institute and the University of Calcutta for providing the necessary facilities.

Author information

Authors and Affiliations

Applied Statistics Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Ashis SenGupta
Department of Statistics, Midnapore College (Autonomous), Midnapore, West Bengal, India
Moumita Roy
Department of Statistics, University of Calcutta, Kolkata, West Bengal, India
Asis Kumar Chattopadhyay

Authors

Ashis SenGupta
View author publications
You can also search for this author in PubMed Google Scholar
Moumita Roy
View author publications
You can also search for this author in PubMed Google Scholar
Asis Kumar Chattopadhyay
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Statistics, University of North Carolina Wilmington, Wilmington, NC, USA
Indranil Ghosh
Department of Mathematics and Statistics, McMaster University, Hamilton, ON, Canada
N. Balakrishnan
Department of Statistical Science, Southern Methodist University, Dallas, TX, USA
Hon Keung Tony Ng

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

SenGupta, A., Roy, M., Chattopadhyay, A.K. (2021). Model-Based Clustering for Cylindrical Data. In: Ghosh, I., Balakrishnan, N., Ng, H.K.T. (eds) Advances in Statistics - Theory and Applications. Emerging Topics in Statistics and Biostatistics . Springer, Cham. https://doi.org/10.1007/978-3-030-62900-7_17

Download citation

DOI: https://doi.org/10.1007/978-3-030-62900-7_17
Published: 18 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62899-4
Online ISBN: 978-3-030-62900-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Model-Based Clustering for Cylindrical Data

Abstract

Access this chapter

Similar content being viewed by others

A cylindrical distribution with heavy-tailed linear part

Clustering Circular Data via Finite Mixtures of von Mises Distributions and an Application to Data on Wind Directions

Clustering on the Torus

References

Acknowledgements

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Model-Based Clustering for Cylindrical Data

Abstract

Access this chapter

Similar content being viewed by others

A cylindrical distribution with heavy-tailed linear part

Clustering Circular Data via Finite Mixtures of von Mises Distributions and an Application to Data on Wind Directions

Clustering on the Torus

References

Acknowledgements

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation