An online classification EM algorithm based on the mixture model
- 406 Downloads
Mixture model-based clustering is widely used in many applications. In certain real-time applications the rapid increase of data size with time makes classical clustering algorithms too slow. An online clustering algorithm based on mixture models is presented in the context of a real-time flaw-diagnosis application for pressurized containers which uses data from acoustic emission signals. The proposed algorithm is a stochastic gradient algorithm derived from the classification version of the EM algorithm (CEM). It provides a model-based generalization of the well-known online k-means algorithm, able to handle non-spherical clusters. Using synthetic and real data sets, the proposed algorithm is compared with the batch CEM algorithm and the online EM algorithm. The three approaches generate comparable solutions in terms of the resulting partition when clusters are relatively well separated, but online algorithms become faster as the size of the available observations increases.
KeywordsClustering Mixture model EM CEM Stochastic gradient Exponential family
Unable to display preview. Download preview PDF.
- Biernacki, C., Celeux, G., Govaert, G.: Assessing a mixture model for clustering with the integrated completed likelihood. IEEE PAMI 22, 719–725 (2000) Google Scholar
- Bottou, L.: Une approche théorique de l’apprentissage connexioniste; applications à la reconnaissance de la parole. PhD thesis, Université d’Orsay (1991) Google Scholar
- Bottou, L., Bengio, Y.: Convergence properties of the K-means algorithm. In: Tesauro, G., Touretzky, D., Leen, T. (eds.) Advances in Neural Information Processing Systems, vol. 7, pp. 585–592. MIT, Cambridge (1995) Google Scholar
- MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematics, Statistics and Probability, vol. 1, pp. 281–298 (1967) Google Scholar
- Wang, S., Zhao, Y.: Almost sure convergence of Titterington’s recursive estimator for mixture models. In: IEEE International Symposium on Information Theory, ISIT (2002) Google Scholar