Skip to main content

Improved Intelligent Semantics Based Chinese Sentence Similarity Computing for Natural Language Processing in IoT

  • Conference paper
  • First Online:
IoT as a Service (IoTaaS 2020)

Abstract

It is desired in the Internet of Things (IoT) networks to apply natural language processing (NLP) technology to complete the information exchange tasks such as text summary or text classification between IoT devices. To achieve higher precision for the NLP of Chinese sentences, in this paper, we propose to utilize the deep neural network (DNN) to compute the semantic similarity of Chinese sentences. The proposed DNN consists of the input layer, the semantic generation layer, the concat layer, the dropout layer, the hidden layer, and the output layer. We propose to train the intelligent semantic similarity calculator sequentially to extract the semantic feature and the context information feature. After the offline training, the resultant configured intelligent semantic similarity calculator could evaluate the semantic similarity of Chinese sentences. Furthermore, we provide numerical analysis to demonstrate the improved similarity calculation precision and the consistency of the calculation accuracy in different fields.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arora, S., Liang, Y., Ma, T.: A simple but tough-to-beat baseline for sentence embeddings. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings, Toulon, France (2019)

    Google Scholar 

  2. Bshoter, J.: Text-matching. https://github.com/BshoterJ/Text-Matching

  3. Gunes, E., Dragomir, R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22(1), 457–479 (2011)

    Google Scholar 

  4. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, Minneapolis, Minnesota. Association for Computational Linguistics, Germany, January 2019

    Google Scholar 

  5. Ko, Y., Park, J., Seo, J.: Automatic text categorization using the importance of sentences. In: Proceedings of the 19th International Conference on Computational Linguistics, COLING’02, USA, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)

    Google Scholar 

  6. Li, X., Meng, Y., Sun, X., Han, Q., Yuan, A., Li, J.: Is word segmentation necessary for deep learning of Chinese representations? In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, pp. 3242–3252. Association for Computational Linguistics, July 2019

    Google Scholar 

  7. Liu, X., et al.: LCQMC: a large-scale Chinese question matching corpus. In: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA. Association for Computational Linguistics, Germany, July 2018

    Google Scholar 

  8. Sepp, H., Jürgen, S.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)

    Article  Google Scholar 

  9. Sun, Y., et al.: Ernie: Enhanced representation through knowledge integration (2019). https://arxiv.org/abs/1904.09223v1

  10. Varelas, G., Voutsakis, E., Raftopoulou, P., Petrakis, E.G., Milios, E.E.: Semantic similarity methods in wordnet and their application to information retrieval on the web. In: Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management, WIDM’05, Bremen, Germany, pp. 10–16. Association for Computing Machinery, New York (2005)

    Google Scholar 

  11. Wei, J., et al.: Nezha: Neural contextualized representation for Chinese language understanding (2019). https://arxiv.org/abs/1909.00204

  12. Zhao, Q., Qi, J.: A method for calculating the similarity of short texts based on semantic and syntactic structure. Comput. Eng. Sci. 40(283), 145–152 (2018)

    Google Scholar 

Download references

Acknowledgements

This work was supported in part by the State Key Program of National Social Science of China (No. 18AZD035), the Key Research & Development and Transformation Plan of Science and Technology Program for Tibet Autonomous Region (No. XZ201901-GB-16), the Special Fund from the Central Finance to Support the Development of Local Universities (No. ZFYJY201902001) and the National Natural Science Foundation of China (No. 71964030).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhiqiang Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ye, J., Zhang, L., Lan, P., He, H., Yang, D., Wu, Z. (2021). Improved Intelligent Semantics Based Chinese Sentence Similarity Computing for Natural Language Processing in IoT. In: Li, B., Li, C., Yang, M., Yan, Z., Zheng, J. (eds) IoT as a Service. IoTaaS 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 346. Springer, Cham. https://doi.org/10.1007/978-3-030-67514-1_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-67514-1_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-67513-4

  • Online ISBN: 978-3-030-67514-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics