Skip to main content

Knowledge Engineering in the Era of Artificial Intelligence

  • Conference paper
  • First Online:
Advances in Databases and Information Systems (ADBIS 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13985))

Included in the following conference series:

  • 308 Accesses

Abstract

Knowledge engineering with respect to knowledge graphs and graph data in general is becoming a more and more essential component of intelligent systems. Such systems benefit from the wealth of structured knowledge, which does not only include native graph data, such as social networks or Linked Data on the Web, but also general knowledge describing particular topics of interest. Furthermore, the flexibility of the graph model and its ability to store data relationships explicitly enables the integration and exploitation of data from very diverse sources. Hence, to truly exploit their potential, it becomes crucial to provide intelligent systems with verifiable knowledge, reliable facts, patterns, and a deeper understanding of the underlying domains. This paper will therefore chart a number of current challenges in knowledge engineering and discuss opportunities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://openai.com/chatgpt

  2. 2.

    https://www.w3.org/RDF/

  3. 3.

    In RDF edges are represented as triples (subject, predicate, object), where subject and object represent a pair of nodes and the predicate describes the relationship (edge label) between them.

  4. 4.

    https://www.w3.org/TR/rdf-sparql-query/

  5. 5.

    http://cas.lod-cloud.net/

  6. 6.

    http://www.w3.org/TR/owl2-overview/

  7. 7.

    http://www.w3.org/TR/rdf-schema/

  8. 8.

    https://www.w3.org/TR/shacl/

  9. 9.

    https://w3c.github.io/rdf-star/

  10. 10.

    https://www.w3.org/TR/prov-o/

References

  1. Abedjan, Z., Golab, L., Naumann, F., Papenbrock, T.: Data Profiling. Morgan & Claypool Publishers, Synthesis Lectures on Data Management (2018)

    Google Scholar 

  2. Abuoda, G., Dell’Aglio, D., Keen, A., Hose, K.: Transforming RDF-star to property graphs: a preliminary analysis of transformation approaches. In: QuWeDa@ISWC. CEUR Workshop Proceedings, vol. 3279, pp. 17–32. CEUR-WS.org (2022)

    Google Scholar 

  3. Aebeloe, C., Keles, I., Montoya, G., Hose, K.: Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns. CoRR abs/ arXiv: 2002.09172 (2020)

  4. Aebeloe, C., Montoya, G., Hose, K.: A decentralized architecture for sharing and querying semantic data. In: Hitzler, P., et al. (eds.) ESWC 2019. LNCS, vol. 11503, pp. 3–18. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-21348-0_1

    Chapter  Google Scholar 

  5. Aebeloe, C., Montoya, G., Hose, K.: Decentralized indexing over a network of RDF peers. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 3–20. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_1

    Chapter  Google Scholar 

  6. Aebeloe, C., Montoya, G., Hose, K.: ColChain: collaborative linked data networks. In: WWW, pp. 1385–1396. ACM / IW3C2 (2021)

    Google Scholar 

  7. Aebeloe, C., Montoya, G., Hose, K.: The Lothbrok approach for SPARQL Query Optimization over Decentralized Knowledge Graphs. Semantic Web J. (2023)

    Google Scholar 

  8. Angles, R., et al.: PG-schema: schemas for property graphs. Proc. ACM Manag. Data 1(2), 198:1–198:25 (2023)

    Google Scholar 

  9. Angles, R., Thakkar, H., Tomaszuk, D.: Mapping RDF databases to property graph databases. IEEE Access 8, 86091–86110 (2020)

    Article  Google Scholar 

  10. Azzam, A., Aebeloe, C., Montoya, G., Keles, I., Polleres, A., Hose, K.: WiseKG: balanced access to web knowledge graphs. In: WWW, pp. 1422–1434 (2021)

    Google Scholar 

  11. Bang, Y., et al.: A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity. CoRR abs/ arXiv: 2302.04023 (2023)

  12. Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Sci. Am. 284(5), 28–37 (2001)

    Article  Google Scholar 

  13. Carroll, J.J., Bizer, C., Hayes, P.J., Stickler, P.: Named graphs. J. Web Semant. 3(4), 247–267 (2005)

    Article  Google Scholar 

  14. Chu, X., Ilyas, I.F., Krishnan, S., Wang, J.: Data cleaning: overview and emerging challenges. In: SIGMOD Conference, pp. 2201–2206. ACM (2016)

    Google Scholar 

  15. Deutsch, A., et al.: Graph pattern matching in GQL and SQL/PGQ. In: SIGMOD Conference, pp. 2246–2258. ACM (2022)

    Google Scholar 

  16. Dimou, A., Sande, M.V., Colpaert, P., Verborgh, R., Mannens, E., de Walle, R.V.: RML: a generic language for integrated rdf mappings of heterogeneous data. In: Proceedings of the Workshop on Linked Data on the Web co-located with the 23rd International World Wide Web Conference (WWW 2014), Seoul, Korea, April 8, 2014. CEUR Workshop Proceedings, vol. 1184. CEUR-WS.org (2014). https://ceur-ws.org/Vol-1184/ldow2014_paper_01.pdf

  17. Galárraga, L., Hose, K., Schenkel, R.: Partout: a distributed engine for efficient RDF processing. In: WWW (Companion Volume), pp. 267–268. ACM (2014)

    Google Scholar 

  18. Galárraga, L., Ahlstrøm, K., Hose, K., Pedersen, T.B.: Answering provenance-aware queries on RDF data cubes under memory budgets. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 547–565. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_32

    Chapter  Google Scholar 

  19. Galárraga, L., Mathiassen, K.A.M., Hose, K.: QBOAirbase: the european air quality database as an RDF cube. In: ISWC (Posters, Demos & Industry Tracks). CEUR Workshop Proceedings, vol. 1963. CEUR-WS.org (2017)

    Google Scholar 

  20. Galárraga, L., Teflioudi, C., Hose, K., Suchanek, F.M.: Fast rule mining in ontological knowledge bases with AMIE+. VLDB J. 24(6), 707–730 (2015)

    Article  Google Scholar 

  21. Geerts, F., Unger, T., Karvounarakis, G., Fundulaki, I., Christophides, V.: Algebraic structures for capturing the provenance of SPARQL queries. J. ACM 63(1), 7:1–7:63 (2016)

    Google Scholar 

  22. Group RW: R2RML: RDB to RDF Mapping Language. http://www.w3.org/TR/r2rml/, http://www.w3.org/2001/sw/rdb2rdf/ (2014)

  23. Gür, N., Pedersen, T.B., Zimányi, E., Hose, K.: A foundation for spatial data warehouses on the semantic web. Semantic Web 9(5), 557–587 (2018)

    Article  Google Scholar 

  24. Hansen, E.R., Lissandrini, M., Ghose, A., Løkke, S., Thomsen, C., Hose, K.: Transparent integration and sharing of life cycle sustainability data with provenance. In: ISWC, pp. 378–394 (2020)

    Google Scholar 

  25. Harth, A., Hose, K., Schenkel, R. (eds.): Linked Data Management. Chapman and Hall/CRC (2014)

    Google Scholar 

  26. Helali, M., Vashisth, S., Carrier, P., Hose, K., Mansour, E.: Linked Data Science Powered by Knowledge Graphs. CoRR abs/ arXiv: 2303.02204 (2023)

  27. Heling, L., Acosta, M.: Federated SPARQL query processing over heterogeneous linked data fragments. In: WWW, pp. 1047–1057. ACM (2022)

    Google Scholar 

  28. Hernández, D., Galárraga, L., Hose, K.: Computing how-provenance for SPARQL queries via query rewriting. Proc. VLDB Endow. 14(13), 3389–3401 (2021)

    Article  Google Scholar 

  29. Hoffart, J., Suchanek, F.M., Berberich, K., Lewis-Kelham, E., de Melo, G., Weikum, G.: YAGO2: exploring and querying world knowledge in time, space, context, and many languages. In: WWW (Companion Volume), pp. 229–232. ACM (2011)

    Google Scholar 

  30. Hose, K., Schenkel, R.: Towards benefit-based RDF source selection for SPARQL queries. In: SWIM, p. 2. ACM (2012)

    Google Scholar 

  31. Hose, K., Schenkel, R.: WARP: workload-aware replication and partitioning for RDF. In: ICDE Workshops, pp. 1–6. IEEE Computer Society (2013)

    Google Scholar 

  32. Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: Towards exploratory OLAP over linked open data – a case study. In: Castellanos, M., Dayal, U., Pedersen, T.B., Tatbul, N. (eds.) BIRTE 2013-2014. LNBIP, vol. 206, pp. 114–132. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-46839-5_8

    Chapter  Google Scholar 

  33. Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: processing aggregate queries in a federation of SPARQL endpoints. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 269–285. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18818-8_17

    Chapter  Google Scholar 

  34. Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: Optimizing aggregate SPARQL queries using materialized RDF views. In: Groth, P., et al. (eds.) ISWC 2016. LNCS, vol. 9981, pp. 341–359. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46523-4_21

    Chapter  Google Scholar 

  35. Idreos, S., et al.: Design continuums and the path toward self-designing key-value stores that know and learn. In: CIDR. www.cidrdb.org (2019)

  36. Jakobsen, A.L., Montoya, G., Hose, K.: How diverse are federated query execution plans really? In: Hitzler, P., et al. (eds.) ESWC 2019. LNCS, vol. 11762, pp. 105–110. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32327-1_21

    Chapter  Google Scholar 

  37. Jakobsen, K.A., Andersen, A.B., Hose, K., Pedersen, T.B.: Optimizing RDF data cubes for efficient processing of analytical queries. In: COLD. CEUR Workshop Proceedings, vol. 1426. CEUR-WS.org (2015)

    Google Scholar 

  38. Kaoudi, Z., et al.: Atlas: Storing, updating and querying RDF(S) data on top of DHTs. J. Web Semant. 8(4), 271–277 (2010)

    Article  Google Scholar 

  39. Keles, I., Hose, K.: Skyline queries over knowledge graphs. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 293–310. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_17

    Chapter  Google Scholar 

  40. Khayatbashi, S., Ferrada, S., Hartig, O.: Converting property graphs to RDF: a preliminary study of the practical impact of different mappings. In: GRADES-NDA@SIGMOD, pp. 10:1–10:9. ACM (2022)

    Google Scholar 

  41. Lassila, O., et al.: The OneGraph vision: challenges of breaking the graph model lock-in. Semantic Web 14(1), 125–134 (2023)

    Article  Google Scholar 

  42. Lehmann, J., et al.: Dbpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web 6(2), 167–195 (2015)

    Article  Google Scholar 

  43. Lissandrini, M., Hose, K., Pedersen, T.B.: Example-driven exploratory analytics over knowledge graphs. In: EDBT, pp. 105–117. OpenProceedings.org (2023)

    Google Scholar 

  44. Lissandrini, M., Mottin, D., Hose, K., Pedersen, T.B.: Knowledge graph exploration systems: are we lost? In: CIDR (2022). www.cidrdb.org

  45. Lissandrini, M., Mottin, D., Palpanas, T., Velegrakis, Y.: Data Exploration Using Example-Based Methods. Morgan & Claypool Publishers, Synthesis Lectures on Data Management (2018)

    Google Scholar 

  46. Mansour, E., Srinivas, K., Hose, K.: Federated data science to break down silos. SIGMOD Rec. 50(4), 16–22 (2021)

    Article  Google Scholar 

  47. Montoya, G., Aebeloe, C., Hose, K.: Towards efficient query processing over heterogeneous RDF interfaces. In: ISWC (Best Workshop Papers). Studies on the Semantic Web, vol. 36, pp. 39–53. IOS Press (2018)

    Google Scholar 

  48. Montoya, G., Keles, I., Hose, K.: Analysis of the effect of query shapes on performance over LDF interfaces. In: QuWeDa@ISWC. CEUR Workshop Proceedings, vol. 2496, pp. 51–66. CEUR-WS.org (2019)

    Google Scholar 

  49. Montoya, G., Skaf-Molli, H., Hose, K.: The Odyssey approach for optimizing federated SPARQL queries. In: d’Amato, C., et al. (eds.) ISWC 2017. LNCS, vol. 10587, pp. 471–489. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68288-4_28

    Chapter  Google Scholar 

  50. Nargesian, F., Pu, K.Q., Bashardoost, B.G., Zhu, E., Miller, R.J.: Data lake organization. IEEE Trans. Knowl. Data Eng. 35(1), 237–250 (2023)

    Google Scholar 

  51. Nath, R.P.D., Hose, K., Pedersen, T.B., Romero, O.: SETL: a programmable semantic extract-transform-load framework for semantic data warehouses. Inf. Syst. 68, 17–43 (2017)

    Article  Google Scholar 

  52. Nguyen, V., Bodenreider, O., Sheth, A.P.: Don’t like RDF reification?: making statements about statements using singleton property. In: WWW (2014)

    Google Scholar 

  53. Noy, N.F., Gao, Y., Jain, A., Narayanan, A., Patterson, A., Taylor, J.: Industry-scale knowledge graphs: lessons and challenges. Commun. ACM 62(8), 36–43 (2019)

    Article  Google Scholar 

  54. Özsu, M.T., Valduriez, P.: Principles of Distributed Database Systems, 4th edn. Springer (2020). https://doi.org/10.1007/978-3-030-26253-2

  55. Pelgrin, O., Galárraga, L., Hose, K.: Towards fully-fledged archiving for RDF datasets. Semantic Web 12(6), 903–925 (2021)

    Article  Google Scholar 

  56. Rabbani, K., Lissandrini, M., Hose, K.: SHACL and ShEx in the wild: a community survey on validating shapes generation and adoption. In: WWW (Companion Volume), pp. 260–263. ACM (2022)

    Google Scholar 

  57. Rabbani, K., Lissandrini, M., Hose, K.: Extraction of validating shapes from very large knowledge graphs. Proc. VLDB Endow. 16(5), 1023–1032 (2023)

    Article  Google Scholar 

  58. Rabbani, K., Lissandrini, M., Hose, K.: SHACTOR: improving the quality of large-scale knowledge graphs with validating shapes. In: SIGMOD Conference Companion, pp. 151–154. ACM (2023)

    Google Scholar 

  59. Sagi, T., Lissandrini, M., Pedersen, T.B., Hose, K.: A design space for RDF data representations. VLDB J. 31(2), 347–373 (2022)

    Article  Google Scholar 

  60. Sakr, S., et al.: The future is big graphs: a community view on graph processing systems. Commun. ACM 64(9), 62–71 (2021)

    Article  Google Scholar 

  61. Schwarte, A., Haase, P., Hose, K., Schenkel, R., Schmidt, M.: FedX: a federation layer for distributed query processing on linked open data. In: Antoniou, G., et al. (eds.) ESWC 2011. LNCS, vol. 6644, pp. 481–486. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21064-8_39

    Chapter  Google Scholar 

  62. Schwarte, A., Haase, P., Hose, K., Schenkel, R., Schmidt, M.: FedX: optimization techniques for federated query processing on linked data. In: Aroyo, L., et al. (eds.) ISWC 2011. LNCS, vol. 7031, pp. 601–616. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25073-6_38

    Chapter  Google Scholar 

  63. Taelman, R., Mahieu, T., Vanbrabant, M., Verborgh, R.: Optimizing storage of RDF archives using bidirectional delta chains. Semantic Web (2021)

    Google Scholar 

  64. Varadarajan, R., Bharathan, V., Cary, A., Dave, J., Bodagala, S.: DBDesigner: a customizable physical design tool for vertica analytic database. In: ICDE, pp. 1084–1095. IEEE Computer Society (2014)

    Google Scholar 

  65. Vrandecic, D., Krötzsch, M.: Wikidata: a free collaborative knowledgebase. Commun. ACM 57(10), 78–85 (2014)

    Article  Google Scholar 

  66. Zervakis, L., Setty, V., Tryfonopoulos, C., Hose, K.: Efficient continuous multi-query processing over graph streams. In: EDBT, pp. 13–24. OpenProceedings.org (2020)

    Google Scholar 

  67. Zhang, C., Bonifati, A., Özsu, M.T.: An overview of reachability indexes on graphs. In: SIGMOD Conference Companion, pp. 61–68. ACM (2023)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katja Hose .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hose, K. (2023). Knowledge Engineering in the Era of Artificial Intelligence. In: Abelló, A., Vassiliadis, P., Romero, O., Wrembel, R. (eds) Advances in Databases and Information Systems. ADBIS 2023. Lecture Notes in Computer Science, vol 13985. Springer, Cham. https://doi.org/10.1007/978-3-031-42914-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-42914-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-42913-2

  • Online ISBN: 978-3-031-42914-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics