Improving K-Modes Algorithm Considering Frequencies of Attribute Values in Mode

He, Zengyou; Deng, Shengchun; Xu, Xiaofei

doi:10.1007/11596448_23

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3801)

Included in the following conference series:

International Conference on Computational and Information Science

Abstract

In this paper, we present an experimental study on applying a new dissimilarity measure to the k-modes clustering algorithm to improve its clustering accuracy. The measure is based on the idea that the similarity between a data object and a cluster mode is directly proportional to the sum of the relative frequencies of their common values in the mode. Experimental results on real-life datasets show that the modified algorithm is superior to the original k-modes algorithm with respect to clustering accuracy.
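
The abstract does not state the measure as a formula, so the following Python sketch is only one plausible reading of it, not the authors' definition. It assumes that a mismatching attribute contributes 1 to the dissimilarity (as in simple matching) and that a matching attribute contributes 1 minus the relative frequency of the mode's value within its cluster. The function names and the toy data are hypothetical.

```python
from collections import Counter


def mode_with_frequencies(cluster):
    """Return (mode, freqs) for a cluster of equal-length categorical tuples.

    mode[j] is the most common value of attribute j in the cluster and
    freqs[j] is that value's relative frequency within the cluster.
    """
    n = len(cluster)
    mode, freqs = [], []
    for column in zip(*cluster):  # iterate attribute by attribute
        value, count = Counter(column).most_common(1)[0]
        mode.append(value)
        freqs.append(count / n)
    return mode, freqs


def simple_matching(obj, mode):
    """Original k-modes dissimilarity: the number of mismatching attributes."""
    return sum(1 for x, q in zip(obj, mode) if x != q)


def frequency_weighted_dissimilarity(obj, mode, freqs):
    """Assumed frequency-aware variant (an interpretation of the abstract).

    A mismatch costs 1. A match costs 1 - freqs[j], so agreeing with a
    value that dominates the cluster lowers the dissimilarity more than
    agreeing with a value that barely won the vote.
    """
    return sum(1.0 if x != q else 1.0 - f for x, q, f in zip(obj, mode, freqs))


# Toy cluster with two categorical attributes (illustrative data only).
cluster = [("a", "x"), ("a", "y"), ("a", "x"), ("b", "x")]
mode, freqs = mode_with_frequencies(cluster)
print(mode, freqs)
print(simple_matching(("a", "y"), mode))
print(frequency_weighted_dissimilarity(("a", "y"), mode, freqs))
```

Under this reading, an object that matches the mode on every attribute can still have a nonzero dissimilarity when the mode's values are not unanimous in the cluster, which is what lets the measure separate clusters that simple matching would score identically.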

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Harbin Institute of Technology, China
Zengyou He, Shengchun Deng & Xiaofei Xu

Editor information

Editors and Affiliations

Microelectronic Institute, Xidian University, 710071, Xi’an, China
Yue Hao
Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong
Jiming Liu
School of Computer Science and Technology, Xidian University, Xi’an, China
Yuping Wang
Department of Computer Science, Hong Kong Baptist University, Hong Kong
Yiu-ming Cheung
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
Life Science Research Center, School of Electronic Engineering, Xidian University, 710071, Xi’an, Shaanxi, China
Licheng Jiao
Key Laboratory of Computer Networks and Information Security (Ministry of Education), Xidian University, 710071, Xi’an, China
Jianfeng Ma
National Laboratory of Antennas and Microwave Technology, Xidian University, 710071, Xi’an, Shaanxi, P.R. China
Yong-Chang Jiao

Cite this paper

He, Z., Deng, S., Xu, X. (2005). Improving K-Modes Algorithm Considering Frequencies of Attribute Values in Mode. In: Hao, Y., et al. Computational Intelligence and Security. CIS 2005. Lecture Notes in Computer Science, vol 3801. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11596448_23

DOI: https://doi.org/10.1007/11596448_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30818-8
Online ISBN: 978-3-540-31599-5
eBook Packages: Computer Science, Computer Science (R0)
