Cluster Center Initialization Using Hierarchical Two-Division of a Data Set along Each Dimension

Chen, Guang Hui

doi:10.1007/978-3-642-30126-1_38

Part of the book series: Advances in Intelligent and Soft Computing (AINSC, volume 168)

Abstract

This paper proposes a hierarchical two-division method. At each layer, every parent subset of the data set is divided into two subsets along one dimension; once the two-division process has passed through every dimension, the data set has been divided hierarchically into a series of leaf subsets. The initial cluster centers are then picked from these leaf subsets according to a rule that optimizes the dissimilarities among the initial cluster centers, which yields a new cluster center initialization method. Experiments on real data sets show that the proposed cluster center initialization method is desirable.
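
The abstract gives the outline of the procedure but not its details, so the following Python sketch is only one possible reading of it, not the author's implementation. Two choices are assumptions that the abstract does not state: each subset is cut at its median along the current dimension, and "optimizing the dissimilarities" is approximated by a greedy farthest-first selection among the leaf centroids. The function names (`two_division_leaves`, `initial_centers`) are hypothetical.

```python
from statistics import median


def two_division_leaves(points):
    """Split every subset of the current layer in two, one dimension per layer."""
    subsets = [list(points)]
    for d in range(len(points[0])):
        next_layer = []
        for subset in subsets:
            # Assumption: cut at the median of the subset along dimension d.
            cut = median(p[d] for p in subset)
            low = [p for p in subset if p[d] <= cut]
            high = [p for p in subset if p[d] > cut]
            # A half can be empty when all values coincide; keep non-empty ones.
            next_layer.extend(s for s in (low, high) if s)
        subsets = next_layer
    return subsets


def centroid(subset):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(subset)
    return tuple(sum(p[d] for p in subset) / n for d in range(len(subset[0])))


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def initial_centers(points, k):
    """Pick k mutually distant leaf centroids as initial cluster centers."""
    candidates = [centroid(s) for s in two_division_leaves(points)]
    if k > len(candidates):
        raise ValueError("fewer leaf subsets than requested centers")
    # Assumption: greedy farthest-first selection stands in for the paper's
    # dissimilarity rule. Start from the candidate farthest from the data mean.
    overall = centroid(list(points))
    chosen = [max(candidates, key=lambda c: sq_dist(c, overall))]
    while len(chosen) < k:
        remaining = [c for c in candidates if c not in chosen]
        # Add the candidate whose nearest chosen center is farthest away.
        chosen.append(max(remaining,
                          key=lambda c: min(sq_dist(c, s) for s in chosen)))
    return chosen


if __name__ == "__main__":
    data = [(0, 0), (1, 1), (0, 10), (1, 11),
            (10, 0), (11, 1), (10, 10), (11, 11)]
    print(initial_centers(data, 4))
```

With d dimensions this produces at most 2^d leaf subsets, so the sketch can supply at most that many candidate centers; how the paper handles a larger number of clusters is not stated in the abstract.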

Author information

Authors and Affiliations

Department of Mathematics and Computational Science, Guang Dong University of Business Studies, Guangzhou, China, 510320
Guang Hui Chen

Corresponding author

Correspondence to Guang Hui Chen.

Editor information

Editors and Affiliations

Wuhan Section of ISER Association, Guangshan Road 76, Wuhan, 430072, China, People's Republic
David Jin
Researcher Association, Guangzhou Section, International Science & Education, Jinheng Road, Jinbi Garden 85-1102 144, Guang Zhou, China, People's Republic
Sally Lin

About this paper

Cite this paper

Chen, G.H. (2012). Cluster Center Initialization Using Hierarchical Two-Division of a Data Set along Each Dimension. In: Jin, D., Lin, S. (eds) Advances in Computer Science and Information Engineering. Advances in Intelligent and Soft Computing, vol 168. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30126-1_38

DOI: https://doi.org/10.1007/978-3-642-30126-1_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30125-4
Online ISBN: 978-3-642-30126-1
eBook Packages: Engineering, Engineering (R0)
