Mining Scholarly Publications for Scientific Knowledge Graph Construction

  • Davide Buscaldi
  • Danilo DessìEmail author
  • Enrico Motta
  • Francesco Osborne
  • Diego Reforgiato Recupero
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11762)


In this paper, we present a preliminary approach that uses a set of NLP and Deep Learning methods for extracting entities and relationships from research publications and then integrates them in a Knowledge Graph. More specifically, we (i) tackle the challenge of knowledge extraction by employing several state-of-the-art Natural Language Processing and Text Mining tools, (ii) describe an approach for integrating entities and relationships generated by these tools, and (iii) analyse an automatically generated Knowledge Graph including 10, 425 entities and 25, 655 relationships in the field of Semantic Web.


  1. 1.
    Angeli, G., Premkumar, M.J.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the ACL and the 7th IJCNLP, vol. 1, pp. 344–354 (2015)Google Scholar
  2. 2.
    Auer, S., Kovtun, V., Prinz, M., Kasprzik, A., Stocker, M., Vidal, M.E.: Towards a knowledge graph for science. In: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, p. 1. ACM (2018)Google Scholar
  3. 3.
    Dessì, D., Reforgiato Recupero, D., Fenu, G., Consoli, S.: A recommender system of medical reports leveraging cognitive computing and frame semantics. In: Tsihrintzis, G.A., Sotiropoulos, D.N., Jain, L.C. (eds.) Machine Learning Paradigms. ISRL, vol. 149, pp. 7–30. Springer, Cham (2019). Scholar
  4. 4.
    Ehrlinger, L., Wöß, W.: Towards a definition of knowledge graphs. In: SEMANTiCS (Posters, Demos, SuCCESS), vol. 48 (2016)Google Scholar
  5. 5.
    Luan, Y., He, L., Ostendorf, M., Hajishirzi, H.: Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction. In: Proceedings of the EMNLP 2018 Conference, pp. 3219–3232 (2018)Google Scholar
  6. 6.
    Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)CrossRefGoogle Scholar
  7. 7.
    Nuzzolese, A.G., Gentile, A.L., Presutti, V., Gangemi, A.: Conference linked data: the ScholarlyData project. In: Groth, P., et al. (eds.) ISWC 2016. LNCS, vol. 9982, pp. 150–158. Springer, Cham (2016). Scholar
  8. 8.
    Peroni, S., Shotton, D., Vitali, F.: One year of the OpenCitations corpus. In: d’Amato, C., et al. (eds.) ISWC 2017. LNCS, vol. 10588, pp. 184–192. Springer, Cham (2017). Scholar
  9. 9.
    Salatino, A.A., Thanapalasingam, T., Mannocci, A., Osborne, F., Motta, E.: Classifying research papers with the computer science ontology. In: ISWC (P&D/Industry/BlueSky). CEUR Workshop Proceedings, vol. 2180 (2018)Google Scholar
  10. 10.
    Salatino, A.A., Thanapalasingam, T., Mannocci, A., Osborne, F., Motta, E.: The computer science ontology: a large-scale taxonomy of research areas. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11137, pp. 187–205. Springer, Cham (2018). Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Davide Buscaldi
    • 1
  • Danilo Dessì
    • 2
    Email author
  • Enrico Motta
    • 3
  • Francesco Osborne
    • 3
  • Diego Reforgiato Recupero
    • 2
  1. 1.ParisFrance
  2. 2.CagliariItaly
  3. 3.Milton KeynesUK

Personalised recommendations