K-Means algorithm based on multi-feature-induced order

  • ORIGINAL PAPER
  • Granular Computing

Abstract

The K-Means algorithm is a powerful tool for data analysis, but it faces several challenges when dealing with large multi-feature data. Centroid initialization and centroid determination are two significant hurdles that can reduce its performance. To address these challenges, an enhanced K-Means algorithm based on partial-order relations, the multi-feature induced order K-Means algorithm (OWAK-Means), is developed. It combines a novel centroid initialization based on partial-order relations with a multi-feature induced ordered weighted average (MFIOWA) operator. By using a weighted iteration method based on partial-order relations, the OWAK-Means algorithm initializes centroids with greater precision. The MFIOWA operator is designed based on database indexing theory and a Sigmoid weight function, which improves its information filtering ability. These techniques, combined with an ordered weighted distance metric, make the OWAK-Means algorithm an effective tool for multi-feature data analysis. In a comparative analysis with variants of the K-Means algorithm, the OWAK-Means algorithm shows significant improvement in adjusted Rand score, normalized mutual information, and purity. Statistical tests, comprehensive evaluation methods, and sensitivity analysis indicate that the OWAK-Means algorithm is effective and reliable.
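
For readers unfamiliar with ordered weighted averaging, the following is a minimal sketch of the generic OWA and induced OWA operators on which an operator such as MFIOWA presumably builds. It is not the authors' definition, which this preview does not show. The function names, the `steepness` parameter, and the particular way the sigmoid is turned into weights are assumptions made purely for illustration.

```python
import math


def sigmoid_weights(n, steepness=1.0):
    """Hypothetical weighting scheme (an assumption, not taken from the paper):
    evaluate a decreasing sigmoid at centered positions 0..n-1 and
    normalize so the weights sum to 1."""
    if n <= 0:
        raise ValueError("n must be positive")
    center = (n - 1) / 2.0
    raw = [1.0 / (1.0 + math.exp(steepness * (i - center))) for i in range(n)]
    total = sum(raw)
    return [r / total for r in raw]


def owa(values, weights):
    """Generic OWA: sort the arguments in descending order,
    then take the weighted sum by position."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    ordered = sorted(values, reverse=True)
    return sum(w * v for w, v in zip(weights, ordered))


def induced_owa(pairs, weights):
    """Generic induced OWA: each pair is (order_inducing_value, argument).
    Arguments are reordered by descending order-inducing value,
    not by their own magnitude, before the weighted sum."""
    if len(pairs) != len(weights):
        raise ValueError("pairs and weights must have the same length")
    ordered = sorted(pairs, key=lambda p: p[0], reverse=True)
    return sum(w * arg for w, (_, arg) in zip(weights, ordered))


if __name__ == "__main__":
    w = sigmoid_weights(3)
    print(owa([1.0, 2.0, 3.0], w))
    print(induced_owa([(0.2, 3.0), (0.9, 1.0), (0.5, 2.0)], w))
```

Because these illustrative weights decrease with position, the plain OWA leans toward the larger arguments, while the induced variant leans toward the arguments whose order-inducing values are largest.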

Data availability

Data will be made available on request.

Acknowledgements

This work is supported in part by the Humanities and Social Science Planning Project of the Ministry of Education under Grant 22YJA880051, the Science and Technology Project of Jiangxi Provincial Education Department under Grant GJJ2200535, and the 18th Student Research Project of Jiangxi University of Finance and Economics under Grant 20231016140352946.

Author information

Contributions

Benting Wan: Conception and design of study, Acquisition of data, Analysis and/or interpretation of data, Writing – original draft, Writing – review & editing. Weikang Huang: Analysis and/or interpretation of data, Writing – original draft, Writing – review & editing. Bilivogui Pierre: Analysis and/or interpretation of data, Writing – review & editing. Youyu Cheng: Writing – review & editing. Shufen Zhou: Analysis and/or interpretation of data, Writing – review & editing.

Corresponding author

Correspondence to Benting Wan.

Ethics declarations

Conflict of interest

All authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Wan, B., Huang, W., Pierre, B. et al. K-Means algorithm based on multi-feature-induced order. Granul. Comput. 9, 45 (2024). https://doi.org/10.1007/s41066-024-00470-w

  • DOI: https://doi.org/10.1007/s41066-024-00470-w
