The methods of big data fusion and semantic collision detection in Internet of Thing
- 113 Downloads
We sometimes find ourselves with plenty of data fusion in Internet of Thing, which necessitates an automatic removing semantic collision. For this, it is necessary to detect semantic collision, with a fairly reliable method to find many semantic collision and powerful enough to run in a reasonable time. Big data fusion in Internet of Thing represents today an important data quality challenge which leads to bad decision-making. This paper proposes and compares on real data effective fusion matching methods for automatic removing semantic collision of files based on names, working with Chinese texts or English texts, and the names of people or places, in East or in the West. After conducting a more complete classification of big data fusion than the usual classifications, we introduce several methods for big data fusion. Through a simple model, we highlight a global efficiency, accuracy and recover. We propose a new measuring mechanism between records, as well as rules for automatic big data fusion. Analyses made on Internet of Thing containing real data in western cities, and on a known standard Internet of Thing containing names of companies in the China, have shown better results than those of known methods, with a lesser complexity.
KeywordsBig data fusion Semantic collision Internet of Thing Measuring mechanism Matching methods
This study is supported by Natural Science Fund Project in Guangdong province (No.2015A030313671) and Major Project for Guangzhou collaborative innovation of industry-university-research (No.201704020196). This study is supported by Guangzhou Key Laboratory of Digital Content Processing and security technologies and Guangdong provincial Application-oriented technical research and development Special fund project (2016B010127006) and International Scientific and technological cooperation projects of Guangdong province (2017A050501039).
- 1.Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Generalization, 2nd edn. Wiley Inter-science, Hoboken (2012)Google Scholar
- 2.Hu, R., Jiang, C.Y., Xu, H.: A new efficiency judging method for healthy big data system using heuristic algorithm. Basic Clin. Pharmacol. Toxicol. 118, 42 (2016)Google Scholar
- 4.Robinson, D.: Implications of neural networks for how we think about brain function. Behav. Brain Sci. 15, 644–655 (2012)Google Scholar
- 5.Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Induction: Steps of Inference, Learning, and Discovery. Cambridge University Press, Cambridge (2013)Google Scholar
- 10.Schwab, I., Link, N.: Reusable artificial intelligence from big data analysis generalization. In: Genetic and Evolutionary Computing (ICGEC 2011), (2013)Google Scholar
- 11.Hu, R., Hu, H., Xu, H.: Abnormal access matching through big data analytics in health neural network. Basic Clin. Pharmacol. Toxicol. 118, 73–73 (2016)Google Scholar
- 12.Hu, R., Hu, H., Xiao, Z.H.: Matching unit of health neural network unit based on relation object framework. Basic Clin. Pharmacol. Toxicol 118, 72–73 (2016)Google Scholar
- 13.Kotanchek, M., Smits, G., Kordon, A.: Industrial strength genetic programming. In: Riolo, R., Kluwer, B.W. (eds.) GP Theory and Practice. Springer, New York (2003)Google Scholar
- 14.Smits, G., Kotanchek, M.: Cloud plan-front exploitation in big data analysis. In: Riolo, R., Kluwer, B.W. (eds.) GP Theory and Practice. Springer, New York (2004)Google Scholar