Abstract
Since the amount of trajectory data is large and the structure of trajectory data is complex, an improved density-based K-means algorithm was proposed. Firstly, high-density trajectory data points were selected as the initial clustering centers based on the density and increasing the density weight of important points, to perform K-means clustering. Secondly the clustering results were evaluated by the Between-Within Proportion index. Finally, the optimal clustering number and the best clustering were determined according to the clustering results evaluation. Theoretical researches and experimental results showed that the improved algorithm could be better at extracting the trajectory key points. The accuracy of clustering results was 24% points higher than that of the traditional K-means algorithm and 16% points higher than that of the Density-Based Spatial Clustering of Applications with Noise algorithm. The proposed algorithm has a better stability and a higher accuracy in trajectory data clustering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang, Z.C., Yuan, X.R.: Visual analysis of trajectory data. J. Comput.-Aided Des. Comput. Graph. (1), 9–25 (2015)
Khan, S.S., Ahmad, A.: Cluster center initialization algorithm for K-means clustering. Expert Syst. Appl. 25(11), 1293–1302 (2004)
He, Y.B., Liu, X.J., Wang, Z.Q., et al.: Improved K-means algorithm based on global center and nonuniqueness high-density points. J. Comput. Eng. Appl. 52(1), 48–54 (2016)
Zhu, M., Wang, W., Huang, J.: Improved initial cluster center selection in K-means clustering. Eng. Comput. 31(8), 1661–1667 (2014)
Zhang, T., Ma, F.: Improved rough K-means clustering algorithm based on weighted distance measure with Gaussian function. Int. J. Comput. Math. 1–17 (2015)
Zhang, S.Q., Huang, Z.K., Feng, M.: An optimized K-means algorithm. Microelectron. Comput. 32(12), 36–39 (2015)
Capó, M., Pérez, A., Lozano, J.A.: An efficient approximation to the K-means clustering for massive data. Knowl.-Based Syst. 117, 56–69 (2017)
Zhang, S.J., Zhao, H.C.: Algorithm research of optimal cluster number and initial cluster center. J. Appl. Res. Comput. 34(6), 1–5 (2017)
Rodriguez, A., Laio, A.: Machine learning. Clustering by fast search and find of density peaks. Science 344(6191), 1492 (2014)
Rezaee, M.R., Lelieveldt, B.P.F., Reiber, J.H.C.: A new cluster validity index for the fuzzy c-mean. Pattern Recogn. Lett. 19(3–4), 237–246 (1998)
Zhou, S.B., Xu, Z.Y., Tang, X.Q.: Method for determining optimal number of clusters in K-means clustering algorithm. J. Comput. Appl. 46(16), 27–31 (2010)
Acknowledgments
This research was supported by the Fundamental Research Funds for the Universities in Tianjin, Tianjin Chengjian Universities (2016CJ11)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Hao, MW., Dai, HL., Hao, K., Li, C., Zhang, YJ., Song, HN. (2018). Optimization of Density-Based K-means Algorithm in Trajectory Data Clustering. In: Li, C., Mao, S. (eds) Wireless Internet. WiCON 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 230. Springer, Cham. https://doi.org/10.1007/978-3-319-90802-1_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-90802-1_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90801-4
Online ISBN: 978-3-319-90802-1
eBook Packages: Computer ScienceComputer Science (R0)