Skip to main content
Log in

Optimal centroids model approach for many-feature data structure prediction

  • Research Paper
  • Published:
Evolutionary Intelligence Aims and scope Submit manuscript

Abstract

Data clustering is a popular machine learning technique widely applied for data structure analysis. Among the major techniques, fuzzy co-clustering (FCoC) is known for the capability of complex data processing such as many features, large size, and uncertainty. In some cases, FCoC has demonstrated superior performance over some traditional clustering methods. However, FCoC has some limitations such as being sensitive to the initial cluster centers and being stuck at locally optimal values. Particle swarm optimization (PSO) is a multidisciplinary optimization algorithm. PSO was used to find a suitable initial cluster centers solution for the clustering algorithms. However, PSO is limited and unclear, especially for algorithms with complex structures and many parameters. In this paper, we propose a new fuzzy co-clustering algorithm by using the optimal solution of the PSO algorithm to find the initial centroids for the FCoC algorithm. We call the new algorithm FCOCM. In our experience, we use symbols and mathematical language to describe the process of finding initial centroids called the optimal centroids model (OCM). Thus, our work proposes two new contributions including the OCM model and the FCOCM algorithm. Experiments were conducted on benchmark data sets from the UCI Machine Learning Repository and the School of the Computing University of Eastern Finland to demonstrate the superior performance of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://archive.ics.uci.edu/ml/index.php.

References

  1. Kennedy J, Eberhart R (1995) Particle swarm optimization. IEEE Int Conf Neural Netw 4:1942–1948

    Google Scholar 

  2. Zeng N, Wang Z, Liu W, Zhang H, Hone K, Liu (2020) X A dynamic neighborhood-based switching particle swarm optimization algorithm, IEEE Transactions on Cybernetics, pp. 1-12

  3. Zeng N, Song D, Li H, You Y, Liu Y, Alsaadic FE (2021) A competitive mechanism integrated multi-objective whale optimization algorithm with differential evolution. Neurocomputing 432:170–182

    Article  Google Scholar 

  4. Luo X, Yuan Y, Chen S, Zeng N, Wang Z (2020) Position-transitional particle swarm optimization-incorporated latent factor analysis, IEEE Transactions on Knowledge and Data Engineering, pp. 1-13,

  5. Rayala SS, Kumar NA (2020) Particle swarm optimization for robot target tracking application, Mater Today: Proc,

  6. Song Y, Zhang F, Liu C (2020) The risk of block chain financial market based on particle swarm optimization. J Computat Appl Math 37015:112667

    Article  MathSciNet  MATH  Google Scholar 

  7. Gao W, Su C (2020) Analysis of earnings forecast of blockchain financial products based on particle swarm optimization. J Comput Appl Math 372:112724

    Article  MathSciNet  MATH  Google Scholar 

  8. Li X, Wu X, Xu S, Qing S, Chang P (2019) A novel complex network community detection approach using discrete particle swarm optimization with particle diversity and mutation. Appl Soft Comput 81:105476

    Article  Google Scholar 

  9. Xiong H, Qiu B, Liu J (2020) An improved multi-swarm particle swarm optimizer for optimizing the electric field distribution of multichannel transcranial magnetic stimulation. Artif Intell Med 104:101790

    Article  Google Scholar 

  10. Junior FEF, Yen GG (2019) Particle swarm optimization of deep neural networks architectures for image classification. Swarm Evolut Comput 49:62–74

    Article  Google Scholar 

  11. Farshi TR, Drake JH, Ozcan E (2020) A multimodal particle swarm optimization-based approach for image segmentation. Expert Syst Appl 1491:113233

    Article  Google Scholar 

  12. Janani R, Vijayarani S (2019) Text document clustering using Spectral Clustering algorithm with Particle Swarm Optimization. Expert Syst Appl 13415:192–200

    Article  Google Scholar 

  13. Cui Z, Zhang J, Di W, Cai X, Chen J (2020) Hybrid many-objective particle swarm optimization algorithm for green coal production problem. Inform Sci 518:256–271

    Article  MathSciNet  Google Scholar 

  14. Hussain MM, Fujimoto N (2020) GPU-based parallel multi-objective particle swarm optimization for large swarms and high dimensional problems. Parallel Comput 92:102589

    Article  MathSciNet  Google Scholar 

  15. Zhou Y, Zhao Z, Cheng D (2020) Cluster structure prediction via revised particle-swarm optimization algorithm. Comput Phys Commun 247:106945

    Article  Google Scholar 

  16. Li Y, Chu X, Tian D, Feng J, Mu W (2021) Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm. Appl Soft Comput 113:107924

    Article  Google Scholar 

  17. Feng Z, Niu W, Zhang R, Wang S, Chenge C (2019) Operation rule derivation of hydropower reservoir by k-means clustering method and extreme learning machine based on particle swarm optimization. J Hydrol 576:229–238

    Article  Google Scholar 

  18. Silva TM, Pimentel BA, Souza RMCR, Oliveira AL (2015) Hybrid methods for fuzzy clustering based on fuzzy c-means and improved particle swarm optimization. Expert Syst Appl 42(17–18):6315–6328

    Article  Google Scholar 

  19. Wu ZH, Wu ZC, Zhang J (2017) An improved FCM algorithm with adaptive weights based on SA-PSO. Neural Comput Appl 28(10):3113–3118

    Article  Google Scholar 

  20. Sharma M, Chhabra JK (2019) Sustainable automatic data clustering using hybrid PSO algorithm with mutation. Sustain Comput: Inf Syst 23:144–157

    Google Scholar 

  21. Li C, Zhou J, Kou P, Xiao J (2012) A novel chaotic particle swarm optimization based fuzzy clustering algorithm. Neurocomputing 83:98–109

    Article  Google Scholar 

  22. Gupta R, Muttoo SK, Pal SK (2017) Fuzzy C-Means clustering and particle swarm optimization based scheme for common service center location allocation. Appl Intell 47(3):624–643

    Article  Google Scholar 

  23. Song Y, Pan S, Liu S, Wei F, Zhou MX, W Qian (2010) Association for the advancement of artificial intelligence, Constrained co-clustering for textual documents, pp 581–586

  24. Selosse M, Jacques J, Biernacki C (2020) Model-based co-clustering for mixed type data. Comput Stat Data Anal 144:1–18

    Article  MathSciNet  MATH  Google Scholar 

  25. Kummamuru K, Dhawale A, Krishnapuram R (2003) Fuzzy co-clustering of documents and keywords. IEEE Int Conf on Fuzzy Syst 2:772–777

    Google Scholar 

  26. Tjhi WC, Chen L (2007) Possibilistic fuzzy co-clustering of large document collections. Pattern Recognit 40(12):3452–3466

    Article  MATH  Google Scholar 

  27. Yan Y, Chen L, Tjhi WC (2013) Fuzzy semi-supervised co-clustering for text documents. Fuzzy Sets Syst 215:74–89

    Article  MathSciNet  Google Scholar 

  28. Hanmandlua M, Verma OP, Susan S, Madasu V (2013) Color segmentation by fuzzy co-clustering of chrominance color features. Neurocomputing 120:235–249

    Article  Google Scholar 

  29. Tjhi WC, Chen L (2008) A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data. Fuzzy Sets Syst 159:371–389

    Article  MathSciNet  MATH  Google Scholar 

  30. Pham VN, Ngo LT, Pedrycz W (2016) Interval-valued fuzzy set approach to fuzzy co-clustering for data classification. Knowl-Based Syst 107:1–13

    Article  Google Scholar 

  31. Nha PV, Long PT, Pedryczc W, Long NT (2021) Feature-reduction fuzzy co-clustering approach for hyper-spectral image analysis. Knowl-Based Syst 216:106549

    Article  Google Scholar 

  32. Mazur JE (2006) Mathematical models and the experimental analysis of behavior. J Exp Anal Behav 85(2):275–291

    Article  Google Scholar 

  33. Hwang C, Rhee FCH (2007) Uncertain fuzzy clustering: interval type-2 fuzzy approach to C-means. IEEE Trans Fuzzy Syst 15(1):107–120

    Article  Google Scholar 

  34. Xing H, He H, Hu D, Jiang T, Yu X (2019) An interval Type-2 fuzzy sets generation method for remote sensing imagery classification. Comput Geosci 133:104287

    Article  Google Scholar 

  35. Olson DL, Delen D (2008) Advanced data mining techniques, Springer ISBN 3-540-76916-1, 1st edition, page 138,

  36. David MWP (2011) Evaluation: from Precision. Recall, F-Measure ROC, Inf, Mark Correl, Mach Learn Technol 2(1):37–63

    Google Scholar 

  37. Jiang M, Luo YP, Yang SY (2007) Stochastic convergence analysis and parameter selection of the standard particle swarm optimization algorithm. Inf Process Lett 102(115):8–16

    Article  MathSciNet  MATH  Google Scholar 

  38. Piotrowski AP, Napiorkowski JJ, Piotrowska AE (2020) Population size in particle swarm optimization. Swarm Evolut Comput 58:100718

    Article  Google Scholar 

  39. FrÁ nti P, Virmajoki O, Hautamáki V (2006) Fast agglomerative clustering using a k-nearest neighbor graph. IEEE Trans Pattern Analy Mach Intell 28(11):1875–1881

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pham Van Nha.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cam Binh, L.T., Nha, P.V. Optimal centroids model approach for many-feature data structure prediction. Evol. Intel. 16, 1353–1367 (2023). https://doi.org/10.1007/s12065-022-00747-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12065-022-00747-6

Keywords

Navigation