Abstract
A multi-clustering fusion method is presented based on combining several runs of a clustering algorithm resulting in a common partition. More specifically, the results of several independent runs of the same clustering algorithm are appropriately combined to obtain a partition of the data which is not affected by initialization and overcomes the instabilities of clustering methods. Finally, the fusion procedure starts with the clusters produced by the combining part and finds the optimal number of clusters in the data set according to some predefined criteria. The unsupervised multi-clustering method implemented in this work is quite general. There is ample room for the implementation and testing with any existing clustering algorithm that has unstable results. Experiments using both simulated and real data sets indicate that the multi-clustering fusion algorithm is able to partition a set of data points to the optimal number of clusters not constrained to be hyper-spherically shaped.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
A.K. Jain and R.C. Dubes. Algorithms for Clustering Data. Englewood Cliffs, N. J.: Prentice Hall, 1988.
A.K. Jain, R.P.W. Duin, and J. Mao. Statistical pattern recognition: A review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1), 2000.
M. Halkidi, Y. Batistakis, and M. Vazirgiannis. Clustering algorithms and validity measures. In Proceedings of the SSDBM conference, Virginia, USA, July 2001.
J.C. Bezdek and S.K. Pal. Fuzzy Models for Pattern Recognition: Methods that Search for Structures in Data. IEEE CS Press, 1992.
J.C. Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York, 1981.
A.P. Dempster, N.M. Laird, and D.B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Roy. Statist. Soc. B, 39:1–38, 1977.
Vlassis N. and Likas A. A greedy-EM algorithm for Gaussian mixture learning. Technical report, Computer Science Institute, University of Amsterdam, The Netherlands, May 2000.
E. Boundaillier and G. Hebrail. Interactive interpretation of hierarchical clustering. Intell. Data Anal., 2(3), 1998.
A. Fred. Finding Consistent Clusters in Data Partitions. In Proceedings of the Second International Workshop on Multiple Classifier Systems (MCS 2001), LNCS 2096, pages 309–318, Cam bridge, UK, July 2–4 2001. Springer.
E. Dimitriadou, A. Weingessel, and K. Hornik. A voting-merging clustering algorithm. Working Paper 31, SFB ‘Adaptive Information Systems and Modeling in Economics and Management Science’, April 1999.
P. Smyth. Clustering Using Monte Carlo Cross-Validation. In Proceedings Knowledge Discovery and Data Mining, pages 126–133, 1996.
P. Cheeseman and J. Stutz. Bayesian Classification (AutoClass): Theory and Results. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining. AAAI Press/MIT Press, 1996.
D.H. Fisher. Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2:139–172, 1987.
D.S. Frossyniotis and A. Stafylopatis. A Multi-SVM Classification System. In Proceedings of the Second International Workshop on Multiple Classifier Systems (MCS 2001), LNCS 2096, pages 198–207, Cam bridge, UK, July 2–4 2001. Springer.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Frossyniotis, D., Pertselakis, M., Stafylopatis, A. (2002). A Multi-clustering Fusion Algorithm. In: Vlahavas, I.P., Spyropoulos, C.D. (eds) Methods and Applications of Artificial Intelligence. SETN 2002. Lecture Notes in Computer Science(), vol 2308. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46014-4_21
Download citation
DOI: https://doi.org/10.1007/3-540-46014-4_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43472-6
Online ISBN: 978-3-540-46014-5
eBook Packages: Springer Book Archive