Abstract
Recent developments in the 5th generation wireless communication system have heightened the need for the propagation characteristics and modeling of wireless channels. As the propagation characteristics and variation rules of radio waves in different scenes, frequency points and bandwidth are all hidden in the channel test massive data that have the big data features, it is necessary to carry out effective data cleaning methods to make better use of test data. This paper analyzes and compares a variety of data cleaning methods first, then designs a data cleaning strategy according to the characteristics of wireless channel test data. Finally, the effectiveness of the data cleaning strategy is verified through simulation. This paper provided significant theoretical and technical support for the wireless environment reconstruction and model construction in the big data era.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Molisch, A.F.: Wireless Communications. Wiley, Hoboken (2011)
Manyika, J., Chui, M., Brown, B., et al.: Big data: the next frontier for innovation, competition, and productivity (2011)
Wu, X., Zhu, X., Wu, G., Ding, W.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 97–107 (2014)
Zhang, J., Zhang, P., Tian, L., et al.: A wireless channel modeling method based on big data mining. CN 106126807A, 16 November 2016
Ganti, V., Sarma, A.D.: Data cleaning: a practical perspective. In: Data Cleaning: A Practical Perspective. Morgan & Claypool (2013)
Galhardas, H., Florescu, D.: An extensible framework for data cleaning. Technical report, Institute National de Recherche en Informatique et en Automatique (1999)
Maletic, J.I., Marcus, A.: Data cleansing: beyond integrity analysis. In: IQ 2000. Division of Computer, Science, 23 June 2000
Hernandez, M., Stolfo, S.: The merge/purge problem for large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 127–138, May 1995
Lee, M.L., Ling, T.W., Low, W.L., et al.: IntelliClean: a knowledge—based intelligent data cleaner. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 290–294 (2000)
Virmani, D., Arora, P., Sethi, E., Sharma, N.: Variegated data swabbing: an improved purge approach for data cleaning. In: 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence, Noida, pp. 226–230 (2017)
McCallum, A., Nigam, K., Ungar, L.: Efficient clustering of high-dimensional data sets with application to reference matching. In: Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining, pp. 169–178 (2000)
Fayyad, U., Piatetsky-shapiro, G., Smyth, P., et al.: A statistical perspective on knowledge discovery in databases. In: Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, Cambridge (1996)
He, L., Wu, L., Cai, Y.: Summary of clustering algorithm in data mining. Comput. Appl. Res. (01), 10–13 (2007)
Tang, Y., Zhong, D., Yan, X.: Data cleaning technology based on clustering model. Comput. Appl. (05), 118–121 (2004)
Duda, R.O., Hart, P.E., Stork, D.H.: Pattern Classification, 2nd edn. Wiley, Hoboken (2000)
Hernandez, M.A., Stolfo, J.S.: Real—world data is dirty: data c1eaning and the merge/purge problem. J. Data Min. Knowl. Discov. 2(1), 9–37 (1998)
Marcus, A., Maletic, J.I.: Utilizing association rules for the identification of errors in data. Technical report C& 00-04
Kokaram, A.C., Morris, R.D., Fitzgerald, W.J., Rayner, P.J.W.: Interpolation of missing data in image sequences. IEEE Trans. Image Process. 4(11), 1509–1519 (1995)
Acknowledgment
The research was supported by the Beijing Municipal Natural Science Foundation-Haidian Original Innovation Foundation (No. L172030), Fundamental Research Funds for the Central Universities under grant 2018JBZ102 and Beijing Nova Program Interdisciplinary Cooperation Project (Z191100001119016).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhuang, L., Liu, L., Dong, S., Fan, Y., Zhang, J. (2020). Research on High Reliable Wireless Channel Data Cleaning Method. In: Chen, X., Yan, H., Yan, Q., Zhang, X. (eds) Machine Learning for Cyber Security. ML4CS 2020. Lecture Notes in Computer Science(), vol 12487. Springer, Cham. https://doi.org/10.1007/978-3-030-62460-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-62460-6_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62459-0
Online ISBN: 978-3-030-62460-6
eBook Packages: Computer ScienceComputer Science (R0)