Data Analysis and Classification pp 67-75 | Cite as
Visualization of Model-Based Clustering Structures
Conference paper
First Online:
Abstract
Model-based clustering based on a finite mixture of Gaussian components is an effective method for looking for groups of observations in a dataset. In this paper we propose a dimension reduction method, called MCLUSTSIR, which is able to show clustering structures depending on the selected Gaussian mixture model. The method aims at finding those directions which are able to display both variation in cluster means and variations in cluster covariances. The resulting MCLUSTSIR variables are defined as a linear mapping method which projects the data onto a suitable subspace.
Keywords
Gaussian Mixture Model Kernel Matrix Finite Mixture Uncertainty Boundary Slice Inverse Regression
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
- Banfield, J., & Raftery, A. E. (1993). Model-based Gaussian and non-Gaussian clustering. Biometrics, 49, 803–821.MATHCrossRefMathSciNetGoogle Scholar
- Celeux, G., & Govaert, G. (1995). Gaussian parsimonious clustering models. Pattern Recognition, 28, 781–793.CrossRefGoogle Scholar
- Chang, W. (1983). On using principal components before separating a mixture of two multivariate normal distributions. Applied Statistics, 32(3), 267–275.MATHCrossRefMathSciNetGoogle Scholar
- Cook, R. D. (1998). Regression graphics: Ideas for studying regressions through graphics.New York: Wiley.Google Scholar
- Fraley, C., & Raftery, A. E. (1998). How many clusters? which clustering method? answers via model-based cluster analysis. The Computer Journal, 41, 578–588.MATHCrossRefGoogle Scholar
- Fraley, C., & Raftery, A. E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association, 97(458), 611–631.MATHCrossRefMathSciNetGoogle Scholar
- Fraley, C., & Raftery, A. E. (2006). MCLUST version 3 for R: Normal mixture modeling and model-based clustering (Technical Report 504). Department of Statistics, University of Washington.Google Scholar
- Li, K. C. (1991). Sliced inverse regression for dimension reduction (with discussion). Journal of the American Statistical Association, 86, 316–342.MATHCrossRefMathSciNetGoogle Scholar
- Li, K. C. (2000). High dimensional data analysis via the SIR/PHD approach. Unpublished manuscript. Retrieved from http://www.stat.ucla.edu/∼kcli/sir-PHD.pdf.
- McLachlan, G., & Peel, D. (2000). Finite mixture models. New York: Wiley.MATHCrossRefGoogle Scholar
- Raftery, A. E., & Dean, N. (2006). Variable selection for model-based clustering. Journal of the American Statistical Association, 101(473), 168–178.MATHCrossRefMathSciNetGoogle Scholar
Copyright information
© Springer-Verlag Berlin Heidelberg 2010