Abstract
Partitioning multi-view data is a recent challenge in clustering methods, which traditionally consider single-view data. In clustering techniques, finding the similarity or distance between objects, handled by metrics in \(\mathbb {R}^{n}\), plays a central role in community detection. Under this framework, different algorithms have been developed where the output relies on an exact distance calculated based on the objects’ features. As feature information might be qualitative data defined in an ambiguous environment, this study offers a new class of metrics, so-called S-distance, as a dual of a fuzzy T-similarity, which successfully produces a collective distance based on all views/observers and provides a more flexible framework to define distance under uncertainty. Besides, most existing approaches handle multi-view clustering by aggregating each view’s clusters or using an iterative optimization method; both are time-consuming. Here, by transforming the multi-view clustering problem into node clustering, we suggest a new approach without iteration for multi-view and multi-observer data. Our proposed method, GMSkNN, uses an attribute-structural similarity relation between nodes to get more coherent clusters. To this end, we first build a k-nearest neighbor (kNN) directed graph using the proposed S-distance, then transform it into an undirected graph based on the neighborhood information of the nodes so that the resultant graph is characterized based on nodes interactions and initial features information of the nodes. Next, a new maximal-clique-based clustering is designed to complete the node partitioning. The proposed clustering algorithm is programmed and tested on synthetic and four real-world datasets using the R software. The clustering results are analyzed based on several indexes. This analysis shows the efficiency of the proposed algorithm compared to the traditional clustering methods.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Figa_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Figb_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Figc_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig12_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig13_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig14_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig15_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig16_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00521-024-09560-x/MediaObjects/521_2024_9560_Fig17_HTML.png)
Similar content being viewed by others
Data availability
The synthetic dataset generated and analyzed during the current study is available online via the following link https://drive.google.com/drive/folders/1kHy7QJxe7rGjkYs5P1iqzh1AtiEAvZJc. Other datasets analyzed during the current study are available on “igraphdata” of R-packages and Kaggle repository via the following link https://www.kaggle.com/datasets/azadehzahedikhameneh/world-happiness-report-pre-covid-vs-covid. The used data are also available from the corresponding author on reasonable request.
References
Mota VC, Damasceno FA, Leite DF (2018) Fuzzy clustering and fuzzy validity measures for knowledge discovery and decision making in agricultural engineering. Comput Electron Agric 150:118–124
Xue Y, Deng Y (2021) Decision making under measure-based granular uncertainty with intuitionistic fuzzy sets. Appl Intell 51(8):6224–6233
Cheung G, Magli E, Tanaka Y, Ng MK (2018) Graph spectral image processing. Proc IEEE 106(5):907–930
Jiao L, Zhao J (2019) A survey on the new generation of deep learning in image processing. IEEE Access 7:172231–172263
Qi C, Zhang J, Jia H, Mao Q, Wang L, Song H (2021) Deep face clustering using residual graph convolutional network. Knowl-Based Syst 211:106561–106568
Wu Y, Duan H, Du S (2015) Multiple fuzzy c-means clustering algorithm in medical diagnosis. Technol Health Care 23(s2):519–527
Maier A, Syben C, Lasser T, Riess C (2019) A gentle introduction to deep learning in medical image processing. Z Med Phys 29(2):86–101
Nie F, Li J, Li X, et al (2017) Self-weighted multiview clustering with multiple graphs. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence (IJCAI-17) self-weighted, pp. 2564–2570
Lin Z, Kang Z, Zhang L, Tian L (2021) Multi-view attributed graph clustering. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2021.3101227
Wang H, Yang Y, Liu B, Fujita H (2019) A study of graph-based system for multi-view clustering. Knowl-Based Syst 163:1009–1019
Wang H, Yang Y, Liu B (2019) GMC: graph-based multi-view clustering. IEEE Trans Knowl Data Eng 32(6):1116–1129
Kang Z, Shi G, Huang S, Chen W, Pu X, Zhou JT, Xu Z (2020) Multi-graph fusion for multi-view spectral clustering. Knowl-Based Syst 189:105102–105111
Zhang X, Liu X (2022) Multiview clustering of adaptive sparse representation based on coupled p systems. Entropy 24(4):568–591
Zhong X, Xu X, Chen X (2022) A clustering and fusion method for large group decision making with double information and heterogeneous experts. Soft Comput 26(5):2451–2463
Lv J, Kang Z, Wang B, Ji L, Xu Z (2021) Multi-view subspace clustering via partition fusion. Inf Sci 560:410–423
Bodenhofer U (2003) Representations and constructions of similarity-based fuzzy orderings. Fuzzy Sets Syst 137(1):113–136
Zahedi Khameneh A, Kilicman A, Md Ali F (2021) Revision of pseudo-ultrametric spaces based on m-polar t-equivalences and its application in decision making. Mathematics 9(11):1232–1250
Khameneh AZ, Kilicman A, Mahad Z (2023) A multi-view clustering algorithm for attributed weighted multi-edge directed networks. Neural Comput Appl 35(10):7779–7800
Palla G, Derenyi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043):814–818
Ertoz L, Steinbach M, Kumar V (2003) Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data. In: Proceedings of the 2003 SIAM international conference on data mining, SIAM, pp. 47–58
Jarvis RA, Patrick EA (1973) Clustering using a similarity measure based on shared near neighbors. IEEE Trans Comput 100(11):1025–1034
Brito MR, Chavez EL, Quiroz AJ, Yukich JE (1997) Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection. Statist Probab Lett 35(1):33–42
Franti P, Virmajoki O, Hautamaki V (2006) Fast agglomerative clustering using a k-nearest neighbor graph. IEEE Trans Pattern Anal Mach Intell 28(11):1875–1881
Dong W, Moses C, Li K (2011) Efficient k-nearest neighbor graph construction for generic similarity measures. In: Proceedings of the 20th international conference on World Wide Web, pp. 577–586
Qin Y, Yu ZL, Wang C-D, Gu Z, Li Y (2018) A novel clustering method based on hybrid k-nearest-neighbor graph. Pattern Recogn 74:1–14
Liu R, Wang H, Yu X (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450:200–226
Kundu PP, Mitra S (2015) Multi-objective optimization of shared nearest neighbor similarity for feature selection. Appl Soft Comput 37:751–762
Zhou S, Xu Z (2018) A novel internal validity index based on the cluster centre and the nearest neighbour cluster. Appl Soft Comput 71:78–88
Zahedi Khameneh A, Kilicman A (2020) m-polar generalization of fuzzy t-ordering relations: an approach to group decision making. Symmetry 13(1):51–71
Jo T, Lee M (2007) The evaluation measure of text clustering for the variable number of clusters. In: International symposium on neural networks, Springer, pp. 871–879
Choudrie J, Patil S, Kotecha K, Matta N, Pappas I (2021) Applying and understanding an advanced, novel deep learning approach: a covid 19, text based, emotions analysis study. Inf Syst Front 23(6):1431–1465
Garcia K, Berton L (2021) Topic detection and sentiment analysis in twitter content related to COVID-19 from Brazil and the USA. Appl Soft Comput 101:107057
Chen T, Wang Y-C (2021) A calibrated piecewise-linear FGM approach for travel destination recommendation during the COVID-19 pandemic. Appl Soft Comput 109:107535
Nimmi K, Janet B, Selvan AK, Sivakumaran N (2022) Pre-trained ensemble model for identification of emotion during COVID-19 based on emergency response support system dataset. Appl Soft Comput 122:108842
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Khameneh, A.Z., Ghaznavi, M., Kilicman, A. et al. A maximal-clique-based clustering approach for multi-observer multi-view data by using k-nearest neighbor with S-pseudo-ultrametric induced by a fuzzy similarity. Neural Comput & Applic 36, 9525–9550 (2024). https://doi.org/10.1007/s00521-024-09560-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-024-09560-x