Abstract
Multi-view data clustering plays a crucial role in various real-world applications. This kind of data from various domains can exhibit a range of distributions, making it challenging for algorithms to uncover robust patterns. This paper extends the fuzzy k-means clustering algorithm to cluster multi-view data. The objective function includes two additional matrixes to measure the compactness of each view and the importance of individual features. The objective function also includes entropy weights. Experiments on real-life data indicate that the proposed algorithm outperforms current state-of-the-art algorithms. These set of algorithms comprises of clustering techniques that incorporate variable weighting, such as W-k-means [11], LAC [9], and EWKM [13], along with a multiview clustering algorithm called TW-k-means [6]. The evaluation of the algorithms involves measuring their accuracy, as well as comparing their respective running times. A comprehensive discussion on the proposed algorithm’s properties was conducted, where all its parameters were fine-tuned and analyzed in detail.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bickel, S., Scheffer, T.: Multi-view clustering. In: ICDM, vol. 4, pp. 19–26 (2004)
Blaschko, M.B., Lampert, C.H.: Correlational spectral clustering. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2008)
Chan, E.Y., Ching, W.K., Ng, M.K., Huang, J.Z.: An optimization algorithm for clustering using weighted dissimilarity measures. Pattern Recogn. 37(5), 943–952 (2004)
Chao, G., Sun, S., Bi, J.: A survey on multi-view clustering. IEEE Trans. Artif. Intell. 2(2), 146–168 (2021)
Chaudhuri, K., Kakade, S. M., Livescu, K., Sridharan, K.: Multi-view clustering via canonical correlation analysis. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 129–136. ACM (2009)
Chen, X., Xu, X., Huang, J.Z., Ye, Y.: Tw-k-means: automated two-level variable weighting clustering algorithm for multi-view data. IEEE Trans Knowl. Data Eng. 25(4), 932–944 (2013)
De Sa, V. R.: Spectral clustering with two views. In: ICML Workshop on Learning with Multiple Views, pp. 20–27 (2005)
DeSarbo, W.S., Carroll, J.D., Clark, L.A., Green, P.E.: Synthesized clustering: a method for amalgamating alternative clustering bases with differential weighting of variables. Psychometrika 49(1), 57–78 (1984). https://doi.org/10.1007/BF02294206
Domeniconi, C., Gunopulos, D., Ma, S., Yan, B., Al-Razgan, M., Papadopoulos, D.: Locally adaptive metrics for clustering high dimensional data. Data Min. Knowl. Disc. 14(1), 63–97 (2007). https://doi.org/10.1007/s10618-006-0060-8
Fu, L., Lin, P., Vasilakos, A.V., Wang, S.: An overview of recent multi-view clustering. Neurocomputing 402, 148–161 (2020)
Huang, J.Z., Ng, M.K., Rong, H., Li, Z.: Automated variable weighting in k-means type clustering. IEEE Trans. Pattern Anal. Mach. Intell. 27(5), 657–668 (2005)
Huang, Z.: Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Min. Knowl. Disc. 2(3), 283–304 (1998). https://doi.org/10.1023/A:1009769707641
Jing, L., Ng, M.K., Huang, J.Z.: An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE Trans. Knowl. Data Eng. 19(8), 1026–1041 (2007)
Kailing, K., Kriegel, H.-P., Pryakhin, A., Schubert, M.: Clustering multi-represented objects with noise. In: Dai, H., Srikant, R., Zhang, C. (eds.) PAKDD 2004. LNCS (LNAI), vol. 3056, pp. 394–403. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-24775-3_48
Khan, I., Huang, J.Z., Luo, Z., Masud, M.A.: CPLP: an algorithm for tracking the changes of power consumption patterns in load profile data over time. Inf. Sci. 429, 332–348 (2018)
Khan, I., Huang, J.Z., Ivanov, K.: Incremental density-based ensemble clustering over evolving data streams. Neurocomputing 191, 34–43 (2016)
Khan, I., Huang, J.Z., Masud, M.A., Jiang, Q.: Segmentation of factories on electricity consumption behaviors using load profile data. IEEE Access 4, 8394–8406 (2016)
Khan, I., Huang, J.Z., Tung, N.T., Williams, G.: Ensemble clustering of high dimensional data with FastMap projection. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 483–493 (2014)
Khan, I., Ivanov, K., Jiang, Q.: FastMap projection for high-dimensional data: a cluster ensemble approach. Int. J. Database Theory Appl. 9(12), 311–330 (2016)
Khan, I., Luo, Z., Huang, J.Z., Shahzad, W.: Variable weighting in fuzzy k-means clustering to determine the number of clusters. IEEE Trans. Knowl. Data Eng. 32(9), 1838–1853 (2019)
Khan, I., Luo, Z., Shaikh, A.K., Hedjam, R.: Ensemble clustering using extended fuzzy k-means for cancer data analysis. Expert Syst. Appl. 172, 114622 (2021)
Liu, X., Ji, S., Glänzel, W., De Moor, B.: Multiview partitioning via tensor methods. IEEE Trans. Knowl. Data Eng. 25(5), 1056–1069 (2013)
Liu, X., Yu, S., Moreau, Y., De Moor, B., Glänzel, W., Janssens, F.: Hybrid clustering of text mining and bibliometrics applied to journal sets. In: Proceedings of the 2009 SIAM International Conference on Data Mining, pp. 49–60. SIAM (2009)
Long, B., Yu, P.S., Zhang, Z.: A general model for multiple view unsupervised learning. In: Proceedings of the 2008 SIAM International Conference on Data Mining, pp. 822–833. SIAM (2008)
Mucha, P.J., Richardson, T., Macon, K., Porter, M.A., Onnela, J.P.: Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980), 876–878 (2010)
Tzortzis, G.F., Likas, A.C.: Multiple view clustering using a weighted combination of exemplar-based mixture models. IEEE Trans. Neural Netw. 21(12), 1925–1938 (2010)
Wang, Q., Chen, M., Nie, F., Li, X.: Detecting coherent groups in crowd scenes by multi-view clustering. IEEE Trans. Pattern Anal. Mach. Intell. 42, 46–58 (2018)
Zhou, D., Burges, C.J.: Spectral clustering and transductive learning with multiple views. In: Proceedings of the 24th International Conference on Machine Learning, pp. 1159–1166. ACM (2007)
Acknowledgement
This work was supported by the Internal Grant of Sultan Qaboos University (Grant No. IG/SCI/COMP/21/01).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Khan, I., ALghafri, M., Abdessalem, A. (2023). Entropy in Fuzzy k-Means Algorithm for Multi-view Data. In: Daimi, K., Al Sadoon, A. (eds) Proceedings of the 2023 International Conference on Advances in Computing Research (ACR’23). ACR 2023. Lecture Notes in Networks and Systems, vol 700. Springer, Cham. https://doi.org/10.1007/978-3-031-33743-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-33743-7_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33742-0
Online ISBN: 978-3-031-33743-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)