Abstract
Model-based clustering is a method that clusters data with an assumption of a statistical model structure. In this paper, we propose a novel model-based hierarchical clustering method for a finite statistical mixture model based on the Fisher distribution. The main foci of the proposed method are: (a) provide efficient solution to estimate the parameters of a Fisher mixture model (FMM); (b) generate a hierarchy of FMMs and (c) select the optimal model. To this aim, we develop a Bregman soft clustering method for FMM. Our model estimation strategy exploits Bregman divergence and hierarchical agglomerative clustering. Whereas, our model selection strategy comprises a parsimony-based approach and an evaluation graph-based approach. We empirically validate our proposed method by applying it on simulated data. Next, we apply the method on real data to perform depth image analysis. We demonstrate that the proposed clustering method can be used as a potential tool for unsupervised depth image analysis.
Similar content being viewed by others
Notes
The bijection is expressed as: \(f(x|\theta )=\text {exp}(-D_{G}(t(x),\eta ))J_{G}(x)\) where \(J_{G}\) is a uniquely determined function. For more details, please see Theorem 3 of Banerjee et al. (2005b).
In order to compare different methods, we used MATLAB implementation provided either by the authors (SPKM and soft-MoVMF) or by standard toolbox (GMM). For the KMDR, we used the available R package skmeans (Buchta et al. 2012).
available at: http://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html
References
Alata, O., Quintard, L.: Is there a best color space for color image characterization or representation based on multivariate Gaussian mixture model? Computer Vis. Image Underst. 113(8), 867–877 (2009)
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: ACM-SIAM Symposium on Discrete algorithms, Society for Industrial and Applied Mathematics, pp. 1027–1035 (2007)
Banerjee, A., Dhillon, I.S., Ghosh, J., Sra, S.: Clustering on the unit hypersphere using von Mises–Fisher distributions. J. Mach. Learn. Res. 6, 1345–1382 (2005a)
Banerjee, A., Merugu, S., Dhillon, I.S., Ghosh, J.: Clustering with bregman divergences. J. Mach. Learn. Res. 6, 1705–1749 (2005b)
Bangert, M., Hennig, P., Oelfke, U.: Using an infinite von Mises–Fisher mixture model to cluster treatment beam directions in external radiation therapy. In: International Conference on Machine Learning and Applications, pp. 746–751 (2010)
Barron, J.T., Malik, J.: Intrinsic scene properties from a single rgb-d image. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 17–24 (2013)
Baudry, J.P., Raftery, A.E., Celeux, G., Lo, K., Gottardo, R.: Combining mixture components for clustering. J. Comput. Graph. Stat. 19(2), 332–353 (2010)
Biernacki, C., Celeux, G., Govaert, G.: Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans. Pattern Anal. Mach. Intell. 22(7), 719–725 (2000)
Biernacki, C., Celeux, G., Govaert, G.: Choosing starting values for the em algorithm for getting the highest likelihood in multivariate Gaussian mixture models. Comput. Stat. Data Anal. 41(3), 561–575 (2003)
Buchta, C., Kober, M., Feinerer, I., Hornik, K.: Spherical k-means clustering. J. Stat. Softw. 50(10), 1–22 (2012)
Burnham, K.P., Anderson, D.R.: Model Selection and Multi-model Inference: A Practical Information-Theoretic Approach. Springer, New York (2002)
Dal Mutto, C., Zanuttigh, P., Cortelazzo, G.M.: Fusion of geometry and color information for scene segmentation. IEEE J. Sel. Top. Signal Process. 6(5), 505–521 (2012a)
Dal Mutto, C., Zanuttigh, P., Cortelazzo, G.M.: Time-of-Flight Cameras and Microsoft Kinect. Springer, New York (2012b)
Dhillon, I.S., Sra, S.: Modeling data using directional distributions. Technical report, TR-03-06, Department of Computer Sciences, The University of Texas at Austin (2003)
Figueiredo, M.A.T., Jain, A.K.: Unsupervised learning of finite mixture models. IEEE Trans. Pattern Anal. Mach. Intell. 24(3), 381–396 (2002)
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Fraley, C., Raftery, A.E.: Model-based clustering, discriminant analysis, and density estimation. J. Am. Stat. Assoc. 97(458), 611–631 (2002)
Fraley, C., Raftery, A.E.: Model-based methods of classification: using the mclust software in chemometrics. J. Stat. Softw. 18(6), 1–13 (2007)
Garcia, V., Nielsen, F.: Simplification and hierarchical representations of mixtures of exponential families. Signal Process. 90(12), 3197–3212 (2010)
Ghosh, S., Sudderth, E.: Nonparametric learning for layered segmentation of natural images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2272–2279 (2012)
Goldberger, J., Roweis, S.T.: Hierarchical clustering of a mixture model. In: Advances in Neural Information Processing Systems, pp. 505–512 (2004)
Gopal, S., Yang, Y.: Von Mises–Fisher clustering models. In: International Conference on Machine Learning, pp. 154–162 (2014)
Gupta, S., Arbelaez, P., Malik, J.: Perceptual organization and recognition of indoor scenes from RGB-D images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 564–571 (2013)
Han, J., Shao, L., Xu, D., Shotton, J.: Enhanced computer vision with microsoft kinect sensor: a review. IEEE Trans. Cybern. 43, 1318–1334 (2013)
Herrera, C., Kannala, J., Heikkilä, J., et al.: Joint depth and color camera calibration with distortion correction. IEEE Trans. Pattern Anal. Mach. Intell. 34(10), 2058–2064 (2012)
Hershey, J.R., Olsen, P.A.: Approximating the kullback leibler divergence between Gaussian mixture models. In: IEEE Conference on Acoustics, Speech and Signal Processing, vol. 4, pp. IV-317 (2007)
Khoshelham, K., Elberink, S.O.: Accuracy and resolution of kinect depth data for indoor mapping applications. Sensors 12(2), 1437–1454 (2012)
Lee, J., Kwak, S., Han, B., Choi, S.: Online video segmentation by bayesian split-merge clustering. In: European Conference on Computer Vision, Springer, pp. 856–869 (2012)
Liu, M., Vemuri, B.C., Amari, S.I., Nielsen, F.: Shape retrieval using hierarchical total Bregman soft clustering. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2407–2419 (2012)
Maitra, R.: Initializing partition-optimization algorithms. IEEE/ACM Trans. Comput. Biol. Bioinform. 6(1), 144–157 (2009)
Maitra, R., Ramler, I.P.: A k-mean-directions algorithm for fast clustering of data on the sphere. J. Comput. Graph. Stat. 19(2), 377–396 (2010)
Mardia, K.V., Jupp, P.E.: Directional Statistics. Wiley, New York (2000)
Martinez, W.L., Martinez, A., Solka, J.: Exploratory Data Analysis with MATLAB, 2nd edn. CRC Press, Boca Raton (2010)
McGraw, T., Vemuri, B., Yezierski, R., Mareci, T.: Segmentation of high angular resolution diffusion MRI modeled as a field of von mises-fisher mixtures. In: European Conference on Computer Vision, Springer, pp. 463–475 (2006)
McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, New York (2004). com
McLachlan, G.J., Krishnan, T.: The EM algorithm and extensions. Wiley Series in Probability and Statistics, 2nd edn. Wiley, Hoboken (2008)
Melnykov, V., Maitra, R.: Finite mixture models and model-based clustering. Stat. Surv. 4, 80–116 (2010)
Moons, T., Vergauwen, M., Van Gool, L.: 3d Reconstruction from multiple images: Part 1: Principles. Now Publishers Inc, Hanover (2009)
Murphy, K.P.: Machine Learning: A Probabilistic Perspective. The MIT Press, Cambridge (2012)
Nielsen, F., Garcia, V.: Statistical exponential families: a digest with flash cards. CoRR abs/0911.4863: http://arxiv.org/abs/0911.4863 (2009)
Ren, X., Bo, L., Fox, D.: RGB-D scene labeling: features and algorithms. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2759–2766 (2012)
Rusu, R.B.: Semantic 3D Object Maps for Everyday Robot Manipulation. Springer, Berlin (2013)
Salvador, S., Chan, P.: Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In: IEEE Conference on Tools with Artificial Intelligence, pp. 576–584 (2004)
Sfikas, G., Nikou, C., Galatsanos, N., Heinrich, C.: Majorization-minimization mixture model determination in image segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2176 (2011)
Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from rgbd images. In: European Conference on Computer Vision, Springer, pp. 746–760 (2012)
Szeliski, R.: Computer Vision: Algorithms and Applications. Springer, London (2011)
Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. Ser. B 63(2), 411–423 (2001)
Vu, D.H.T., Haeb-Umbach, R.: Blind speech separation employing directional statistics in an expectation maximization framework. In: IEEE Conference on Acoustics, Speech, and Signal Processing (2010)
Watson, G.S.: Statistics on Spheres, vol. 6. Wiley, New York (1983)
Watson, G.S.: The theory of concentrated langevin distributions. J. Multivar. Anal. 14(1), 74–82 (1984)
Zhang, Z.: Microsoft kinect sensor and its effect. IEEE MultiMed. 19(2), 4–10 (2012)
Zhao, Q., Hautamaki, V., Fränti, P.: Knee point detection in BIC for detecting the number of clusters. In: Advanced Concepts for Intelligent Vision Systems, Springer, pp. 664–673 (2008)
Zhong, S., Ghosh, J.: A unified framework for model-based clustering. J. Mach. Learn. Res. 4, 1001–1037 (2003)
Acknowledgments
This work has been supported by the research Grants from the ARC6 of région Rhône-Alpes, France.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hasnat, M.A., Alata, O. & Trémeau, A. Model-based hierarchical clustering with Bregman divergences and Fishers mixture model: application to depth image analysis. Stat Comput 26, 861–880 (2016). https://doi.org/10.1007/s11222-015-9576-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11222-015-9576-3