Skip to main content
Log in

RETRACTED ARTICLE: Two phase cluster validation approach towards measuring cluster quality in unstructured and structured numerical datasets

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

This article was retracted on 11 July 2022

This article has been updated

Abstract

This paper presents an improved cluster validation scheme called two phase cluster validation (TPCV) and aims to estimate the inter closeness and inter separation among the clusters in the cluster set of unsupervised clustering schemes based on probability measure for validating the cluster quality without prior identification. First phase, the TPCV computes the representative cluster centroid of each individual cluster in the cluster set based on standard mean operation and then it estimates the probability of inter closeness of each cluster with other clusters in the cluster set based on cluster centroid. Next phase, it calculates the probability of separation among the clusters in the cluster set based on cluster centroid by distance measure. Experimental results show that the TPCV scheme is simple and effective to estimate the cluster quality by measuring the probability of closeness and separation between the clusters in the result of unsupervised clustering scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Change history

References

  • Ahmed ST, Sandhya M, Sankar S (2019) A dynamic MooM dataset processing under TelMED protocol design for QoS improvisation of telemedicine environment. J Med Syst 43(8):257

    Article  Google Scholar 

  • Ahmed ST, Sankar S, Sandhya M (2020) Multi-objective optimal medical data informatics standardization and processing technique for telemedicine via machine learning approach. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-02016-9

    Article  Google Scholar 

  • Bezdek JC, Pal NR (1998) Some new indexes of cluster validity. IEEE Trans Syst Man Cybern Part B Cybern 28(3):301–315

    Article  Google Scholar 

  • Calinski T, Harabasz A (1974) A dendrite method for cluster analysis. Commun Stat Theory Method 3(1):1–27

    Article  MathSciNet  Google Scholar 

  • Davies DL, Bouldin DW (1979) Cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1(2):95–104

    Google Scholar 

  • Dunn JC (1974) Well Separated Clusters and Optimal Fuzzy Partitions. J Cybern 4:95–104

    Article  MathSciNet  Google Scholar 

  • Hartigan JA (1975) Clustering algorithm. Wiley, New York

    MATH  Google Scholar 

  • Yang JH, Lee I (2004) Cluster validity through graph-based boundary analysis. In: International Conference on Information and Knowledge Engineering, pp 204–210

  • Jain A, Dubes R (1988) Algorithm for clustering data. Prentice Hall, Englewood Cliffs

    MATH  Google Scholar 

  • Wang JS, Chiang JC (2008) A cluster validity measure with outlier detection for support vector clustering. IEEE Trans Syst Man Cybern Part B Cybern 38(1):78–89

    Article  Google Scholar 

  • Dinakaran K, Suresh RM (2011) Validation techniques to find optimal cluster in gene expression data. Eur J Sci Res 54(3):411–417

    Google Scholar 

  • Krishnamoorthy R, Sreedhar Kumar S (2013) New inter cluster validation method for unsupervised clustering techniques. In: IEEE International Conference on Communication and Computer Vision 2013 (ICCCV’13), pp 1–5. https://doi.org/10.1109/ICCCV.2013.690674

  • Madheswaran M, Sreedhar Kumar S (2017) An improved frequency based agglomerative clustering algorithm for detecting distinct clusters on two dimensional dataset. J Eng Technol Res (Acad J) 9(4):30–41. https://doi.org/10.5897/JETR2017.0628

    Article  Google Scholar 

  • Dash M, Liu H, Scheuerman P, Tam KL (2003) Fast hierarchical clustering and it’s validation. Data Knowl Eng 44:109–138

    Article  Google Scholar 

  • Manoj RJ, Praveena MA, Vijayakumar K (2019) An ACO–ANN based feature selection algorithm for big data. Cluster Comput 22(2):3953–3960

    Article  Google Scholar 

  • Kantandzic M (2011) Data mining: concepts, models, methods and algorithms, 2nd edn. IEEE, pp 249–279

  • Nithya M, Vijayakumar K (2020) Secured segmentation for ICD datasets. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-02009-8

    Article  Google Scholar 

  • Patil KK, Ahmed ST (2014) Digital telemammography services for rural India, software components and design protocol. In: 2014 International Conference on Advances in Electronics Computers and Communications. IEEE, pp 1–5

  • Pradeep Mohan Kumar K, Saravanan M, Thenmozhi M, Vijayakumar K (2019) Intrusion detection system based on GA‐fuzzy classifier for detecting malicious attacks. Concurr Comput Pract Exper:e5242

  • Amorim RCD, Hennig C (2015) Recovering the number of clusters in data sets with noise features using feature rescaling factors. Inf Sci 324:126–145

    Article  MathSciNet  Google Scholar 

  • Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65

    Article  Google Scholar 

  • Rui XU, Wunsch DC (2009) Clustering. IEEE Press, New York

    Google Scholar 

  • Sreedhar Kumar S, Madheswaran M, Ravi R (2018) Inherent approach of medical image pixels classification using an improved agglomerative clustering technique. Res J Biotechnol 12(2):115–124

    Google Scholar 

  • Sreedhar Kumar S, Madheswaran M, Vinutha BA, Manjunath Singh H, Charan KV (2019) A brief survey of unsupervised agglomerative hierarchical clustering schemes. Int J Eng Technol 8(1):29–37. https://doi.org/10.14419/ijet.v8i1.13971

    Article  Google Scholar 

  • Theodoridis S, Koutroubas K (1999) Pattern recognition. Academic Press, New York

    Google Scholar 

  • Thouheed Ahmed S, Sandhya M (2019) Real-time biomedical recursive images detection algorithm for Indian telemedicine environment. In: Mallick P, Balas V, Bhoi A, Zobaa A (eds) Cognitive informatics and soft computing. Advances in intelligent systems and computing, vol 768. Springer, Singapore. https://doi.org/10.1007/978-981-13-0617-4_68

    Chapter  Google Scholar 

  • Vijayakumar K, Suchitra S, Shri PS (2019) A secured cloud storage auditing with empirical outsourcing of key updates. Int J Reason Based Intell Syst 11(2):109–114

    Google Scholar 

  • Vijayakumar K, Arun C (2017) Automated risk identification using NLP in cloud based development environments. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-017-0503-7

    Article  Google Scholar 

  • Xie XL, Beni G (1991) A validity measures for fuzzy clustering. IEEE Trans Pattern Recogn Mach Intell 13(8):1020–1025

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Sreedhar Kumar.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s12652-022-04305-x

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kumar, S.S., Ahmed, S.T., Vigneshwaran, P. et al. RETRACTED ARTICLE: Two phase cluster validation approach towards measuring cluster quality in unstructured and structured numerical datasets. J Ambient Intell Human Comput 12, 7581–7594 (2021). https://doi.org/10.1007/s12652-020-02487-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-020-02487-w

Keywords

Navigation