Information Extraction and Similarity Computation for Semi-/Un-Structured Sentences from the Cyberdata

  • Conference paper
Cyberspace Data and Intelligence, and Cyber-Living, Syndrome, and Health (CyberDI 2019, CyberLife 2019)

Abstract

As networks have become more widespread and faster, social network applications play an increasingly important role in people's social lives. People express opinions and ask questions on social software, and the resulting volume of data has driven researchers to propose various algorithms that extract the information in sentences and classify them. In this paper, we propose a novel method of sentence similarity computation whose purpose is to extract the syntactic and semantic information of semi-structured and structured sentences and to calculate their similarity. We mainly consider the subject, predicate, and object of each sentence pair, and use the Stanford parser to classify Dependency Relation Triples in order to calculate the syntactic and semantic similarity between two sentences. Extensive simulations demonstrate that our method outperforms other state-of-the-art methods in terms of correlation coefficient and mean deviation.
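The idea of comparing two sentences through their dependency relation triples can be illustrated with a minimal sketch. It is not the method of the paper, whose details are not given in this abstract. The triples are written by hand as a stand-in for parser output, the relation labels (`nsubj`, `dobj`) are assumed, and `word_sim`, `triple_sim`, `directed_sim`, and `sentence_sim` are hypothetical helper names. The exact-match `word_sim` is a placeholder for whatever lexical similarity measure a real system would use.

```python
from typing import List, Tuple

# A dependency triple: (relation label, head word, dependent word).
# Hand-written here; a real pipeline would obtain these from a parser.
Triple = Tuple[str, str, str]


def word_sim(a: str, b: str) -> float:
    """Placeholder lexical similarity: 1.0 for identical words, else 0.0."""
    return 1.0 if a.lower() == b.lower() else 0.0


def triple_sim(t1: Triple, t2: Triple) -> float:
    """Score two triples only when they share the same relation label."""
    if t1[0] != t2[0]:
        return 0.0
    return 0.5 * (word_sim(t1[1], t2[1]) + word_sim(t1[2], t2[2]))


def directed_sim(src: List[Triple], dst: List[Triple]) -> float:
    """Average, over src, of each triple's best-matching score in dst."""
    if not src or not dst:
        return 0.0
    return sum(max(triple_sim(s, d) for d in dst) for s in src) / len(src)


def sentence_sim(a: List[Triple], b: List[Triple]) -> float:
    """Symmetric similarity: mean of the two directed scores."""
    return 0.5 * (directed_sim(a, b) + directed_sim(b, a))


# "The dog chased the cat" vs. "The dog chased the mouse" (assumed triples).
s1 = [("nsubj", "chased", "dog"), ("dobj", "chased", "cat")]
s2 = [("nsubj", "chased", "dog"), ("dobj", "chased", "mouse")]
print(sentence_sim(s1, s2))  # 0.75
```

In this toy pair the subject triples match fully and the object triples match only on the predicate, which yields 0.75. Replacing `word_sim` with a graded semantic measure would let near-synonyms contribute partial credit instead of zero.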

Acknowledgments

This work is supported by “the Fundamental Research Funds for the Central Universities” of China University of Petroleum (East China) (Grant No. 18CX02139A), the Shandong Provincial Natural Science Foundation, China (Grant No. ZR2014FQ018). The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.

Author information

Corresponding author

Correspondence to Weishan Zhang.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Zhang, P., Huang, X., Zhang, L., Zhang, W. (2019). Information Extraction and Similarity Computation for Semi-/Un-Structured Sentences from the Cyberdata. In: Ning, H. (eds) Cyberspace Data and Intelligence, and Cyber-Living, Syndrome, and Health. CyberDI 2019, CyberLife 2019. Communications in Computer and Information Science, vol 1137. Springer, Singapore. https://doi.org/10.1007/978-981-15-1922-2_3

  • DOI: https://doi.org/10.1007/978-981-15-1922-2_3

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-1921-5

  • Online ISBN: 978-981-15-1922-2

  • eBook Packages: Computer Science, Computer Science (R0)
