Advertisement

Pattern Analysis and Applications

, Volume 19, Issue 3, pp 631–645 | Cite as

SWGMM: a semi-wrapped Gaussian mixture model for clustering of circular–linear data

  • Anandarup RoyEmail author
  • Swapan K. Parui
  • Utpal Roy
Theoretical Advances

Abstract

Finite mixture models are widely used to perform model-based clustering of multivariate data sets. Most of the existing mixture models work with linear data; whereas, real-life applications may involve multivariate data having both circular and linear characteristics. No existing mixture models can accommodate such correlated circular–linear data. In this paper, we consider designing a mixture model for multivariate data having one circular variable. In order to construct a circular–linear joint distribution with proper inclusion of correlation terms, we use the semi-wrapped Gaussian distribution. Further, we construct a mixture model (termed SWGMM) of such joint distributions. This mixture model is capable of approximating the distribution of multi-modal circular–linear data. An unsupervised learning of the mixture parameters is proposed based on expectation maximization method. Clustering is performed using maximum a posteriori criterion. To evaluate the performance of SWGMM, we choose the task of color image segmentation in LCH space. We present comprehensive results and compare SWGMM with existing methods. Our study reveals that the proposed mixture model outperforms the other methods in most cases.

Keywords

Circular–linear joint distribution Semi-wrapped Gaussian distribution Statistical mixture model Clustering 

References

  1. 1.
    Agiomyrgiannakis Y, Stylianou Y (2009) Wrapped gaussian mixture models for modeling and high-rate quantization of phase data of speech. IEEE Trans Audio Speech Lang Process 17(4):775–786CrossRefGoogle Scholar
  2. 2.
    Amayri O, Bouguila N (2013) Beyond hybrid generative discriminative learning: spherical data classification. Pattern Anal Appl pp 1–21Google Scholar
  3. 3.
    Bahlmann C (2006) Directional features in online handwriting recognition. Pattern Recognit 39:115–125CrossRefGoogle Scholar
  4. 4.
    Banerjee A, Dhillon IS, Ghosh J, Sra S (2005) Clustering on the unit hypersphere using von Mises–Fisher distributions. J Mach Learn Res 6:1345–1382MathSciNetzbMATHGoogle Scholar
  5. 5.
    Boutemedjet S, Bouguila N, Ziou D (2009) A hybrid feature extraction selection approach for high-dimensional non-gaussian data clustering. IEEE Trans Pattern Anal Mach Intell 31(8):1429–1443CrossRefGoogle Scholar
  6. 6.
    Fernández-Durán JJ (2007) Models for circular–linear and circular–circular data constructed from circular distributions based on nonnegative trigonometric sums. Biometrics 63:579–585MathSciNetCrossRefzbMATHGoogle Scholar
  7. 7.
    Figueiredo MAT, Jain AK (2002) Unsupervised learning of finite mixture models. IEEE Trans Pattern Anal Mach Intell 24(3):381–396CrossRefGoogle Scholar
  8. 8.
    Freixenet J, Muñoz X, Raba D, Martí J, Cufí X (2002) Yet another survey on image segmentation: region and boundary information integration. In: Proceedings of European conference on computer vision-Part III. Springer, Berlin, pp 408–422Google Scholar
  9. 9.
    Gong H, Shi J (2011) Conditional entropies as over-segmentation and under-segmentation metrics for multi-part image segmentation. Technical Report MS-CIS-11-17, University of Pennsylvania Department of Computer and Information ScienceGoogle Scholar
  10. 10.
    Jammalamadaka SR, Sengupta A (2001) Topics in Circular Statistics. World Scientific Publication Co Inc, New YorkGoogle Scholar
  11. 11.
    Johnson RA, Wehrly TE (1978) Some angular-linear distributions and related regression models. J Am Stat Assoc 73(363):602–606MathSciNetCrossRefzbMATHGoogle Scholar
  12. 12.
    Law MHC, Figueiredo MAT, Jain AK (2004) Simultaneous feature selection and clustering using mixture models. IEEE Trans Pattern Anal Mach Intell 26(9):1154–1166CrossRefGoogle Scholar
  13. 13.
    Mardia KV, Jupp P (2000) Directional Statistics. Wiley, New YorkGoogle Scholar
  14. 14.
    Mardia KV, Sutton TW (1978) A model for cylindrical variables with applications. J Royal Stat Soc Ser B (Methodological) 40(2):229–233zbMATHGoogle Scholar
  15. 15.
    Martin D, Fowlkes C, Tal D, Malik J (2001) A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proceedings of international conference computer vision, pp 416–423Google Scholar
  16. 16.
    Martin DR (2002) An empirical approach to grouping and segmentation. Ph.D. thesis, EECS Department, University of California, BerkeleyGoogle Scholar
  17. 17.
    Mclachlan GJ, Peel D (2000) Finite Mixture Models. Wiley, New YorkGoogle Scholar
  18. 18.
    Meilǎ M (2002) Comparing clusterings: an axiomatic view. In: Proceedings of international conference on machine learning. ACM, New York, pp 577–584Google Scholar
  19. 19.
    Peel D, Mclachlan GJ (2000) Robust mixture modelling using the t distribution. Stat Comput 10:339–348CrossRefGoogle Scholar
  20. 20.
    Roy A, Parui SK, Roy U (2007) A beta mixture model based approach to text extraction from color images. In: Proceedings of international conference on advances in pattern recognition. World Scientific, New York, pp 321–326Google Scholar
  21. 21.
    Roy A, Parui SK, Roy U (2011) Color image segmentation using a semi-wrapped gaussian mixture model. In: proceedings of international conference on pattern recognition and machine intelligence. Springer, Berlin, pp 148–153Google Scholar
  22. 22.
    Roy A, Parui SK, Roy U (2012) A mixture model of circular–linear distributions for color image segmentation. Int J Comput Appl 58(9):6–11Google Scholar
  23. 23.
    Shieh GS, Zheng SR, Shimizu K (2006) A bivariate generalized von Mises distribution with applications to circular genomes. Technical report, Institute of Statistical Science, Academia Sinica, TaiwanGoogle Scholar
  24. 24.
    Unnikrishnan R, Pantofaru C, Hebert M (2007) Toward objective evaluation of image segmentation algorithms. IEEE Trans Pattern Anal Mach Intell 29(6):929–944CrossRefGoogle Scholar
  25. 25.
    Wang Q, Kulkarni SR, Verd\(\acute{u}\) S (2009) Divergence estimation for multidimensional densities via \(k\)-nearest-neighbor distances. IEEE Trans Inf Theory 55(5):2392–2405Google Scholar
  26. 26.
    Yang AY, Wright J, Ma Y, Sastry SS (2008) Unsupervised segmentation of natural images via lossy data compression. Comput Vis Image Underst 110:212–225CrossRefGoogle Scholar

Copyright information

© Springer-Verlag London 2014

Authors and Affiliations

  1. 1.Computer Vision and Pattern Recognition UnitIndian Statistical InstituteKolkataIndia
  2. 2.Department of Computer and System SciencesVisva-Bharati UniversitySantiniketanIndia

Personalised recommendations