Optimization of Density-Based K-means Algorithm in Trajectory Data Clustering

Hao, Mei-Wei; Dai, Hua-Lin; Hao, Kun; Li, Cheng; Zhang, Yun-Jie; Song, Hao-Nan

doi:10.1007/978-3-319-90802-1_39

Mei-Wei Hao¹⁷,
Hua-Lin Dai¹⁸,
Kun Hao¹⁷,
Cheng Li¹⁷,
Yun-Jie Zhang¹⁷ &
…
Hao-Nan Song¹⁹

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 230))

Included in the following conference series:

International Wireless Internet Conference

861 Accesses
2 Citations

Abstract

Since the amount of trajectory data is large and the structure of trajectory data is complex, an improved density-based K-means algorithm was proposed. Firstly, high-density trajectory data points were selected as the initial clustering centers based on the density and increasing the density weight of important points, to perform K-means clustering. Secondly the clustering results were evaluated by the Between-Within Proportion index. Finally, the optimal clustering number and the best clustering were determined according to the clustering results evaluation. Theoretical researches and experimental results showed that the improved algorithm could be better at extracting the trajectory key points. The accuracy of clustering results was 24% points higher than that of the traditional K-means algorithm and 16% points higher than that of the Density-Based Spatial Clustering of Applications with Noise algorithm. The proposed algorithm has a better stability and a higher accuracy in trajectory data clustering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Wang, Z.C., Yuan, X.R.: Visual analysis of trajectory data. J. Comput.-Aided Des. Comput. Graph. (1), 9–25 (2015)
Google Scholar
Khan, S.S., Ahmad, A.: Cluster center initialization algorithm for K-means clustering. Expert Syst. Appl. 25(11), 1293–1302 (2004)
Google Scholar
He, Y.B., Liu, X.J., Wang, Z.Q., et al.: Improved K-means algorithm based on global center and nonuniqueness high-density points. J. Comput. Eng. Appl. 52(1), 48–54 (2016)
Google Scholar
Zhu, M., Wang, W., Huang, J.: Improved initial cluster center selection in K-means clustering. Eng. Comput. 31(8), 1661–1667 (2014)
Article Google Scholar
Zhang, T., Ma, F.: Improved rough K-means clustering algorithm based on weighted distance measure with Gaussian function. Int. J. Comput. Math. 1–17 (2015)
Google Scholar
Zhang, S.Q., Huang, Z.K., Feng, M.: An optimized K-means algorithm. Microelectron. Comput. 32(12), 36–39 (2015)
Google Scholar
Capó, M., Pérez, A., Lozano, J.A.: An efficient approximation to the K-means clustering for massive data. Knowl.-Based Syst. 117, 56–69 (2017)
Article Google Scholar
Zhang, S.J., Zhao, H.C.: Algorithm research of optimal cluster number and initial cluster center. J. Appl. Res. Comput. 34(6), 1–5 (2017)
Google Scholar
Rodriguez, A., Laio, A.: Machine learning. Clustering by fast search and find of density peaks. Science 344(6191), 1492 (2014)
Article Google Scholar
Rezaee, M.R., Lelieveldt, B.P.F., Reiber, J.H.C.: A new cluster validity index for the fuzzy c-mean. Pattern Recogn. Lett. 19(3–4), 237–246 (1998)
Article Google Scholar
Zhou, S.B., Xu, Z.Y., Tang, X.Q.: Method for determining optimal number of clusters in K-means clustering algorithm. J. Comput. Appl. 46(16), 27–31 (2010)
Google Scholar

Download references

Acknowledgments

This research was supported by the Fundamental Research Funds for the Universities in Tianjin, Tianjin Chengjian Universities (2016CJ11)

Author information

Authors and Affiliations

College of Computer and Information Engineering, Tianjin Chengjian University, Tianjin, 300384, China
Mei-Wei Hao, Kun Hao, Cheng Li & Yun-Jie Zhang
Computing Center, Tianjin Chengjian University, Tianjin, 300010, China
Hua-Lin Dai
Department of Electrical Engineering, Tsinghua University, Beijing, 10000, China
Hao-Nan Song

Authors

Mei-Wei Hao
View author publications
You can also search for this author in PubMed Google Scholar
Hua-Lin Dai
View author publications
You can also search for this author in PubMed Google Scholar
Kun Hao
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hao-Nan Song
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mei-Wei Hao .

Editor information

Editors and Affiliations

Electrical and Computer Engineering, Memorial University, St. John’s, Newfoundland and Labrador, Canada
Cheng Li
Electrical and Computer Engineering, Auburn University, Auburn, Alabama, USA
Shiwen Mao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hao, MW., Dai, HL., Hao, K., Li, C., Zhang, YJ., Song, HN. (2018). Optimization of Density-Based K-means Algorithm in Trajectory Data Clustering. In: Li, C., Mao, S. (eds) Wireless Internet. WiCON 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 230. Springer, Cham. https://doi.org/10.1007/978-3-319-90802-1_39

Download citation

DOI: https://doi.org/10.1007/978-3-319-90802-1_39
Published: 13 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90801-4
Online ISBN: 978-3-319-90802-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics