Abstract
Clustering is an unsupervised learning and clustering results are often inconsistent and unreliable when different clustering algorithms are used. In this paper we have proposed a clustering ensemble framework, named Object-Neighbourhood Clustering Ensemble (ONCE), to improve the consistency, reliability and quality of the clustering result. The core of the ONCE is a new consensus function that addresses the uncertain agreements between members by taking the neighbourhood relationship between object pairs into account in the similarity matrix. The experiments are carried out on 11 benchmark datasets. The results show that our ensemble method outperforms the co-association method, when the Average linkage is used. Furthermore, the results show that our ensemble method is more accurate than the baseline algorithm, and this indicates that the clustering ensemble method is more consistent and reliable than a single clustering algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research 7, 1–30 (2006)
Fred, A.L., Jain, A.K.: Combining multiple clusterings using evidence accumulation. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(6), 835–850 (2005)
Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Transactions on Knowledge Discovery from Data (TKDD) 1(1), 4 (2007)
Iman, R.L., Davenport, J.M.: Approximations of the critical region of the fbietkan statistic. Communications in Statistics - Theory and Methods 9(6), 571–595 (1980)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys (CSUR) 31(3), 264–323 (1999)
Jarvis, R.A., Patrick, E.A.: Clustering using a similarity measure based on shared near neighbors. IEEE Transactions on Computers 100(11), 1025–1034 (1973)
Lin, Z., Chen, M., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:1009.5055 (2010)
Strehl, A., Ghosh, J.: Cluster ensembles—a knowledge reuse framework for multiple partitions. The Journal of Machine Learning Research 3, 583–617 (2003)
Veenman, C.J., Reinders, M.J.T., Backer, E.: A maximum variance cluster algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(9), 1273–1280 (2002)
Vega-Pons, S., Ruiz-Shulcloper, J., Guerra-Gandón, A.: Weighted association based methods for the combination of heterogeneous partitions. Pattern Recognition Letters 32(16), 2163–2170 (2011)
Wang, X., Yang, C., Zhou, J.: Clustering aggregation by probability accumulation. Pattern Recognition 42(5), 668–675 (2009)
Yi, J., Yang, T., Jin, R., Jain, A.K., Mahdavi, M.: Robust ensemble clustering by matrix completion. In: 2012 IEEE 12th International Conference on Data Mining (ICDM), pp. 1176–1181. IEEE (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Alqurashi, T., Wang, W. (2014). Object-Neighbourhood Clustering Ensemble Method. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2014. IDEAL 2014. Lecture Notes in Computer Science, vol 8669. Springer, Cham. https://doi.org/10.1007/978-3-319-10840-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-10840-7_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10839-1
Online ISBN: 978-3-319-10840-7
eBook Packages: Computer ScienceComputer Science (R0)