Abstract
This paper presents an improved cluster validation scheme called two phase cluster validation (TPCV) and aims to estimate the inter closeness and inter separation among the clusters in the cluster set of unsupervised clustering schemes based on probability measure for validating the cluster quality without prior identification. First phase, the TPCV computes the representative cluster centroid of each individual cluster in the cluster set based on standard mean operation and then it estimates the probability of inter closeness of each cluster with other clusters in the cluster set based on cluster centroid. Next phase, it calculates the probability of separation among the clusters in the cluster set based on cluster centroid by distance measure. Experimental results show that the TPCV scheme is simple and effective to estimate the cluster quality by measuring the probability of closeness and separation between the clusters in the result of unsupervised clustering scheme.
Similar content being viewed by others
Change history
11 July 2022
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s12652-022-04305-x
References
Ahmed ST, Sandhya M, Sankar S (2019) A dynamic MooM dataset processing under TelMED protocol design for QoS improvisation of telemedicine environment. J Med Syst 43(8):257
Ahmed ST, Sankar S, Sandhya M (2020) Multi-objective optimal medical data informatics standardization and processing technique for telemedicine via machine learning approach. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-02016-9
Bezdek JC, Pal NR (1998) Some new indexes of cluster validity. IEEE Trans Syst Man Cybern Part B Cybern 28(3):301–315
Calinski T, Harabasz A (1974) A dendrite method for cluster analysis. Commun Stat Theory Method 3(1):1–27
Davies DL, Bouldin DW (1979) Cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1(2):95–104
Dunn JC (1974) Well Separated Clusters and Optimal Fuzzy Partitions. J Cybern 4:95–104
Hartigan JA (1975) Clustering algorithm. Wiley, New York
Yang JH, Lee I (2004) Cluster validity through graph-based boundary analysis. In: International Conference on Information and Knowledge Engineering, pp 204–210
Jain A, Dubes R (1988) Algorithm for clustering data. Prentice Hall, Englewood Cliffs
Wang JS, Chiang JC (2008) A cluster validity measure with outlier detection for support vector clustering. IEEE Trans Syst Man Cybern Part B Cybern 38(1):78–89
Dinakaran K, Suresh RM (2011) Validation techniques to find optimal cluster in gene expression data. Eur J Sci Res 54(3):411–417
Krishnamoorthy R, Sreedhar Kumar S (2013) New inter cluster validation method for unsupervised clustering techniques. In: IEEE International Conference on Communication and Computer Vision 2013 (ICCCV’13), pp 1–5. https://doi.org/10.1109/ICCCV.2013.690674
Madheswaran M, Sreedhar Kumar S (2017) An improved frequency based agglomerative clustering algorithm for detecting distinct clusters on two dimensional dataset. J Eng Technol Res (Acad J) 9(4):30–41. https://doi.org/10.5897/JETR2017.0628
Dash M, Liu H, Scheuerman P, Tam KL (2003) Fast hierarchical clustering and it’s validation. Data Knowl Eng 44:109–138
Manoj RJ, Praveena MA, Vijayakumar K (2019) An ACO–ANN based feature selection algorithm for big data. Cluster Comput 22(2):3953–3960
Kantandzic M (2011) Data mining: concepts, models, methods and algorithms, 2nd edn. IEEE, pp 249–279
Nithya M, Vijayakumar K (2020) Secured segmentation for ICD datasets. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-02009-8
Patil KK, Ahmed ST (2014) Digital telemammography services for rural India, software components and design protocol. In: 2014 International Conference on Advances in Electronics Computers and Communications. IEEE, pp 1–5
Pradeep Mohan Kumar K, Saravanan M, Thenmozhi M, Vijayakumar K (2019) Intrusion detection system based on GA‐fuzzy classifier for detecting malicious attacks. Concurr Comput Pract Exper:e5242
Amorim RCD, Hennig C (2015) Recovering the number of clusters in data sets with noise features using feature rescaling factors. Inf Sci 324:126–145
Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65
Rui XU, Wunsch DC (2009) Clustering. IEEE Press, New York
Sreedhar Kumar S, Madheswaran M, Ravi R (2018) Inherent approach of medical image pixels classification using an improved agglomerative clustering technique. Res J Biotechnol 12(2):115–124
Sreedhar Kumar S, Madheswaran M, Vinutha BA, Manjunath Singh H, Charan KV (2019) A brief survey of unsupervised agglomerative hierarchical clustering schemes. Int J Eng Technol 8(1):29–37. https://doi.org/10.14419/ijet.v8i1.13971
Theodoridis S, Koutroubas K (1999) Pattern recognition. Academic Press, New York
Thouheed Ahmed S, Sandhya M (2019) Real-time biomedical recursive images detection algorithm for Indian telemedicine environment. In: Mallick P, Balas V, Bhoi A, Zobaa A (eds) Cognitive informatics and soft computing. Advances in intelligent systems and computing, vol 768. Springer, Singapore. https://doi.org/10.1007/978-981-13-0617-4_68
Vijayakumar K, Suchitra S, Shri PS (2019) A secured cloud storage auditing with empirical outsourcing of key updates. Int J Reason Based Intell Syst 11(2):109–114
Vijayakumar K, Arun C (2017) Automated risk identification using NLP in cloud based development environments. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-017-0503-7
Xie XL, Beni G (1991) A validity measures for fuzzy clustering. IEEE Trans Pattern Recogn Mach Intell 13(8):1020–1025
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s12652-022-04305-x
About this article
Cite this article
Kumar, S.S., Ahmed, S.T., Vigneshwaran, P. et al. RETRACTED ARTICLE: Two phase cluster validation approach towards measuring cluster quality in unstructured and structured numerical datasets. J Ambient Intell Human Comput 12, 7581–7594 (2021). https://doi.org/10.1007/s12652-020-02487-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-020-02487-w