Skip to main content

Data Clustering Based on an Efficient Hybrid of K-Harmonic Means, PSO and GA

  • Chapter

Part of the book series: Lecture Notes in Computer Science ((TCCI,volume 6660))

Abstract

Clustering is one of the most commonly techniques in Data Mining. Kmeans is one of the most popular clustering techniques due to its simplicity and efficiency. However, it is sensitive to initialization and easily trapped in local optima. K-harmonic means clustering solves the problem of initialization using a built-in boosting function, but it is suffering from running into local optima. Particle Swarm Optimization is a stochastic global optimization technique that is the proper solution to solve this problem. In this paper, PSOKHM not only helps KHM clustering escape from local optima but also overcomes the shortcoming of slow convergence speed of PSO. In this paper, a hybrid data clustering algorithm based on PSO and Genetic algorithm, GSOKHM, is proposed. We investigate local optima method in addition to the global optima in PSO, called LSOKHM. The experimental results on five real datasets indicate that LSOKHM is superior to the GSOKHM algorithm.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hu, G., Zhou, S., Guan, J., Hu, X.: Towards effective document clustering: A constrained K-means based approach. Information Processing & Management 44, 1397–1409 (2008)

    Article  Google Scholar 

  2. Tan, P.N., Steinbach, M., Kumar, V.: Introduction to data mining, pp. 487–559. Addison-Wesley, Boston (2005)

    Google Scholar 

  3. Tjhi, W.C., Chen, L.H.: A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data. Fuzzy Sets and Systems 159, 371–389 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  4. Zhou, H., Liu, Y.H.: Accurate integration of multi-view range images using k-means clustering. Pattern Recognition 41, 152–175 (2008)

    Article  MATH  Google Scholar 

  5. Jain, A.K., Murty, M.N., Flynn, P.j.: Data clustering: A review. ACM Computational Survey 31, 264–323 (1999)

    Article  Google Scholar 

  6. Cui, X., Potok, T.E., Palathingal, P.: Document clustering using Particle Swarm Optimization. In: Proceedings 2005 IEEE Swarm Intelligence Symposium, pp. 185–191 (2005)

    Google Scholar 

  7. Zhang, B., Hsu, M., Dayal, U.: K-harmonic means – a data clustering algorithm. Technical Report HPL-1999-124, Hewlett-Packard Laboratories (1999)

    Google Scholar 

  8. Hammerly, G., Elkan, C.: Alternatives to the k-means algorithm that find better clusterings. In: Proceedings of the 11th International Conference on Information and Knowledge Management, Virginia, USA, pp. 600–607 (2002)

    Google Scholar 

  9. Güngör, Z., Ünler, A.: K-harmonic means data clustering with simulated annealing heuristic. Applied Mathematics and Computation, 199–209 (2007)

    Google Scholar 

  10. Güngör, Z., Ünler, A.: K-harmonic means data clustering with tabu-search method. Applied Mathematical Modelling 32, 1115–1125 (2008)

    Article  MATH  Google Scholar 

  11. Yang, F., Sun, T., Zhang, C.: An efficient hybrid data clustering method based on K-harmonic means and Particle Swarm Optimization. Expert Systems with Applications: An International Journal 36, 9847–9852 (2009)

    Article  Google Scholar 

  12. Jiang, H., Yi, S., Li, J., Yang, F., Hu, X.: Ant clustering algorithm with k-harmonic means clustering. Expert Systems with Applications 37, 8679–8684 (2010)

    Article  Google Scholar 

  13. Chu, S., Roddick, J.: A clustering algorithm using Tabu search approach with simulated annealing for vector quantization. Chinese Journal of Electronics 12, 349–353 (2003)

    Google Scholar 

  14. Huang, C.H., Pan, J.S., Lu, Z.H., Sun, S.H., Hang, H.M.: Vector quantization based on genetic simulated annealing. Signal Processing 81, 1513–1523 (2001)

    Article  MATH  Google Scholar 

  15. Xu, R.: Survey of clustering algorithms. IEEE Transactions on Neural Networks 16, 645–678 (2005)

    Article  Google Scholar 

  16. Kennedy, J., Eberhart, R.C.: Particle swarm optimization. In: Proceedings of the 1995 IEEE International Conference on Neural Networks, pp. 1942–1948. IEEE Press, New Jersey (1985)

    Google Scholar 

  17. Dalli, A.: Adaptation of the F-measure to cluster-based Lexicon quality evaluation. In: EACL (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Danesh, M., Naghibzadeh, M., Totonchi, M.R.A., Danesh, M., Minaei, B., Shirgahi, H. (2011). Data Clustering Based on an Efficient Hybrid of K-Harmonic Means, PSO and GA. In: Nguyen, N.T. (eds) Transactions on Computational Collective Intelligence IV. Lecture Notes in Computer Science(), vol 6660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21884-2_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-21884-2_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21883-5

  • Online ISBN: 978-3-642-21884-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics