Abstract
Hierarchical K-means has got rapid development and wide application because of combining the advantage of high accuracy of hierarchical algorithm and fast convergence of K-means in recent years. Traditional HK clustering algorithm first determines to the initial cluster centers and the number of clusters by agglomerative algorithm, but agglomerative algorithm merges two data objects of minimum distance in dataset every time. Hence, its time complexity can not be acceptable for analyzing huge dataset. In view of the above problem of the traditional HK, this paper proposes a new clustering algorithm iHK. Its basic idea is that the each layer of the N data objects constructs \(\lceil{{N}\over{2}} \rceil \) clusters by running K-means algorithm, and the mean vector of each cluster is used as the input of the next layer. iHK algorithm is tested on many different types of dataset and excellent experimental results are got.
Chapter PDF
Similar content being viewed by others
References
Wang, Y.C.F., Casasent, D.: Hierarchical k-means clustering using new support vector machines for multi-class classification. In: International Joint Conference on IEEE Neural Networks, IJCNN 2006, pp. 3457–3464 (2006)
Arai, K., Barakbah, A.R.: Hierarchical K-means: an algorithm for centroids initialization for K-means. Reports of the Faculty of Science and Engineering 36(1), 25–31 (2007)
Lu, J.F., Tang, J.B., Tang, Z.M., et al.: Hierarchical initialization approach for K-Means clustering. Pattern Recognition Letters 29(6), 787–795 (2008)
Celebi, M.E., Kingravi, H.A.: Deterministic initialization of the k-means algorithm using hierarchical clustering. International Journal of Pattern Recognition and Artificial Intelligence 26(07) (2012)
Archetti, F., Campanelli, P., Fersini, E., et al.: A Hierarchical document clustering environment bases on the induced bisecting k-means. Flexible Query Answering System, pp. 257–269. Springer, Heidelberg (2006)
Liu, Y., Liu, Z.: An improved hierarchical K-means algorithm for web document clustering. In: International Conference on Computer Science and Information Technology, ICCSIT 2008, pp. 606–610. IEEE (2008)
Murthy, V., Vamsidhar, E., Rao, P.S., et al.: Application of hierarchical and K-means techniques in Content based image retrieval. International Journal of Engineering Science and Technology 2(5), 749–755 (2010)
Chen, T.W., Chien, S.Y.: Flexible hardware architecture of hierarchical K-means clustering for large cluster number. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 19(8), 1336–1345 (2011)
Hu, X., Qi, P., Zhang, B.: Hierarchical K-means algorithm for modeling visual area V2 neurons. Neural Information Processing, pp. 373–381. Springer, Heidelberg (2012)
Mantena, G., Anguera, X.: Speed improvements to information retrieval-based dynamic time warping using hierarchical k-means clustering. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8515–8519. IEEE (2013)
Chen, B., He, J., Pellicer, S., et al.: Using Hybrid Hierarchical K-means (HHK) clustering algorithm for protein sequence motif Super-Rule-Tree (SRT) structure construction. International Journal of Data Mining and Bioin-formatics 4(3), 316–330 (2010)
Ghwanmeh, S.H.: Applying Clustering of Hierarchical K-means-like Algorithm on Arabic Language. International Journal of Information Technology 3(3) (2007)
Chehata, N., David, N., Bretar, F.: LIDAR data classification using hierarchical K-means clustering. In: ISPRS Congress, Beijing, vol. 37, pp. 325–330 (2008)
Chen, B., Tai, P.C., Harrison, R., et al.: Novel hybrid hierarchical-K-means clustering method (HK-means) for microarray analysis. In: Computational Systems Bioinformatics Conferences, Workshops and Poster Abstracts, pp. 105–108. IEEE (2005)
Ying, H., Qin, L.X.: Study on PCA based Hierarchical K-means Clustering Algorithm. Control and Automation 6, 68 (2012)
Lamrous, S., Taileb, M.: Divisive hierarchical k-means. In: 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on Computational Intelligence for Modelling, Control and Automation, p. 18. IEEE (2006)
Zhang, L., Cui, W.D., et al.: Hybird clustering algorithm based on partitioning and hierarchical method. Computer Engineering and Applications 46(16), 127–129 (2010)
Chen, B., He, J., Pellicer, S., et al.: Protein Sequence Motif Super-Rule-Tree (SRT) Structure Constructed by Hybrid Hierarchical K-means Clustering Algorithm. In: IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008, pp. 98–103. IEEE (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 IFIP International Federation for Information Processing
About this paper
Cite this paper
Liu, W., Liang, Y., Fan, J., Feng, Z., Cai, Y. (2014). Improved Hierarchical K-means Clustering Algorithm without Iteration Based on Distance Measurement. In: Shi, Z., Wu, Z., Leake, D., Sattler, U. (eds) Intelligent Information Processing VII. IIP 2014. IFIP Advances in Information and Communication Technology, vol 432. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44980-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-662-44980-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44979-0
Online ISBN: 978-3-662-44980-6
eBook Packages: Computer ScienceComputer Science (R0)