Abstract
It is desired in the Internet of Things (IoT) networks to apply natural language processing (NLP) technology to complete the information exchange tasks such as text summary or text classification between IoT devices. To achieve higher precision for the NLP of Chinese sentences, in this paper, we propose to utilize the deep neural network (DNN) to compute the semantic similarity of Chinese sentences. The proposed DNN consists of the input layer, the semantic generation layer, the concat layer, the dropout layer, the hidden layer, and the output layer. We propose to train the intelligent semantic similarity calculator sequentially to extract the semantic feature and the context information feature. After the offline training, the resultant configured intelligent semantic similarity calculator could evaluate the semantic similarity of Chinese sentences. Furthermore, we provide numerical analysis to demonstrate the improved similarity calculation precision and the consistency of the calculation accuracy in different fields.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arora, S., Liang, Y., Ma, T.: A simple but tough-to-beat baseline for sentence embeddings. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings, Toulon, France (2019)
Bshoter, J.: Text-matching. https://github.com/BshoterJ/Text-Matching
Gunes, E., Dragomir, R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22(1), 457–479 (2011)
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, Minneapolis, Minnesota. Association for Computational Linguistics, Germany, January 2019
Ko, Y., Park, J., Seo, J.: Automatic text categorization using the importance of sentences. In: Proceedings of the 19th International Conference on Computational Linguistics, COLING’02, USA, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)
Li, X., Meng, Y., Sun, X., Han, Q., Yuan, A., Li, J.: Is word segmentation necessary for deep learning of Chinese representations? In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, pp. 3242–3252. Association for Computational Linguistics, July 2019
Liu, X., et al.: LCQMC: a large-scale Chinese question matching corpus. In: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA. Association for Computational Linguistics, Germany, July 2018
Sepp, H., Jürgen, S.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Sun, Y., et al.: Ernie: Enhanced representation through knowledge integration (2019). https://arxiv.org/abs/1904.09223v1
Varelas, G., Voutsakis, E., Raftopoulou, P., Petrakis, E.G., Milios, E.E.: Semantic similarity methods in wordnet and their application to information retrieval on the web. In: Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management, WIDM’05, Bremen, Germany, pp. 10–16. Association for Computing Machinery, New York (2005)
Wei, J., et al.: Nezha: Neural contextualized representation for Chinese language understanding (2019). https://arxiv.org/abs/1909.00204
Zhao, Q., Qi, J.: A method for calculating the similarity of short texts based on semantic and syntactic structure. Comput. Eng. Sci. 40(283), 145–152 (2018)
Acknowledgements
This work was supported in part by the State Key Program of National Social Science of China (No. 18AZD035), the Key Research & Development and Transformation Plan of Science and Technology Program for Tibet Autonomous Region (No. XZ201901-GB-16), the Special Fund from the Central Finance to Support the Development of Local Universities (No. ZFYJY201902001) and the National Natural Science Foundation of China (No. 71964030).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Ye, J., Zhang, L., Lan, P., He, H., Yang, D., Wu, Z. (2021). Improved Intelligent Semantics Based Chinese Sentence Similarity Computing for Natural Language Processing in IoT. In: Li, B., Li, C., Yang, M., Yan, Z., Zheng, J. (eds) IoT as a Service. IoTaaS 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 346. Springer, Cham. https://doi.org/10.1007/978-3-030-67514-1_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-67514-1_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67513-4
Online ISBN: 978-3-030-67514-1
eBook Packages: Computer ScienceComputer Science (R0)