Abstract
Knowledge engineering with respect to knowledge graphs and graph data in general is becoming a more and more essential component of intelligent systems. Such systems benefit from the wealth of structured knowledge, which does not only include native graph data, such as social networks or Linked Data on the Web, but also general knowledge describing particular topics of interest. Furthermore, the flexibility of the graph model and its ability to store data relationships explicitly enables the integration and exploitation of data from very diverse sources. Hence, to truly exploit their potential, it becomes crucial to provide intelligent systems with verifiable knowledge, reliable facts, patterns, and a deeper understanding of the underlying domains. This paper will therefore chart a number of current challenges in knowledge engineering and discuss opportunities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
In RDF edges are represented as triples (subject, predicate, object), where subject and object represent a pair of nodes and the predicate describes the relationship (edge label) between them.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Abedjan, Z., Golab, L., Naumann, F., Papenbrock, T.: Data Profiling. Morgan & Claypool Publishers, Synthesis Lectures on Data Management (2018)
Abuoda, G., Dell’Aglio, D., Keen, A., Hose, K.: Transforming RDF-star to property graphs: a preliminary analysis of transformation approaches. In: QuWeDa@ISWC. CEUR Workshop Proceedings, vol. 3279, pp. 17–32. CEUR-WS.org (2022)
Aebeloe, C., Keles, I., Montoya, G., Hose, K.: Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns. CoRR abs/ arXiv: 2002.09172 (2020)
Aebeloe, C., Montoya, G., Hose, K.: A decentralized architecture for sharing and querying semantic data. In: Hitzler, P., et al. (eds.) ESWC 2019. LNCS, vol. 11503, pp. 3–18. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-21348-0_1
Aebeloe, C., Montoya, G., Hose, K.: Decentralized indexing over a network of RDF peers. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 3–20. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_1
Aebeloe, C., Montoya, G., Hose, K.: ColChain: collaborative linked data networks. In: WWW, pp. 1385–1396. ACM / IW3C2 (2021)
Aebeloe, C., Montoya, G., Hose, K.: The Lothbrok approach for SPARQL Query Optimization over Decentralized Knowledge Graphs. Semantic Web J. (2023)
Angles, R., et al.: PG-schema: schemas for property graphs. Proc. ACM Manag. Data 1(2), 198:1–198:25 (2023)
Angles, R., Thakkar, H., Tomaszuk, D.: Mapping RDF databases to property graph databases. IEEE Access 8, 86091–86110 (2020)
Azzam, A., Aebeloe, C., Montoya, G., Keles, I., Polleres, A., Hose, K.: WiseKG: balanced access to web knowledge graphs. In: WWW, pp. 1422–1434 (2021)
Bang, Y., et al.: A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity. CoRR abs/ arXiv: 2302.04023 (2023)
Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Sci. Am. 284(5), 28–37 (2001)
Carroll, J.J., Bizer, C., Hayes, P.J., Stickler, P.: Named graphs. J. Web Semant. 3(4), 247–267 (2005)
Chu, X., Ilyas, I.F., Krishnan, S., Wang, J.: Data cleaning: overview and emerging challenges. In: SIGMOD Conference, pp. 2201–2206. ACM (2016)
Deutsch, A., et al.: Graph pattern matching in GQL and SQL/PGQ. In: SIGMOD Conference, pp. 2246–2258. ACM (2022)
Dimou, A., Sande, M.V., Colpaert, P., Verborgh, R., Mannens, E., de Walle, R.V.: RML: a generic language for integrated rdf mappings of heterogeneous data. In: Proceedings of the Workshop on Linked Data on the Web co-located with the 23rd International World Wide Web Conference (WWW 2014), Seoul, Korea, April 8, 2014. CEUR Workshop Proceedings, vol. 1184. CEUR-WS.org (2014). https://ceur-ws.org/Vol-1184/ldow2014_paper_01.pdf
Galárraga, L., Hose, K., Schenkel, R.: Partout: a distributed engine for efficient RDF processing. In: WWW (Companion Volume), pp. 267–268. ACM (2014)
Galárraga, L., Ahlstrøm, K., Hose, K., Pedersen, T.B.: Answering provenance-aware queries on RDF data cubes under memory budgets. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 547–565. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_32
Galárraga, L., Mathiassen, K.A.M., Hose, K.: QBOAirbase: the european air quality database as an RDF cube. In: ISWC (Posters, Demos & Industry Tracks). CEUR Workshop Proceedings, vol. 1963. CEUR-WS.org (2017)
Galárraga, L., Teflioudi, C., Hose, K., Suchanek, F.M.: Fast rule mining in ontological knowledge bases with AMIE+. VLDB J. 24(6), 707–730 (2015)
Geerts, F., Unger, T., Karvounarakis, G., Fundulaki, I., Christophides, V.: Algebraic structures for capturing the provenance of SPARQL queries. J. ACM 63(1), 7:1–7:63 (2016)
Group RW: R2RML: RDB to RDF Mapping Language. http://www.w3.org/TR/r2rml/, http://www.w3.org/2001/sw/rdb2rdf/ (2014)
Gür, N., Pedersen, T.B., Zimányi, E., Hose, K.: A foundation for spatial data warehouses on the semantic web. Semantic Web 9(5), 557–587 (2018)
Hansen, E.R., Lissandrini, M., Ghose, A., Løkke, S., Thomsen, C., Hose, K.: Transparent integration and sharing of life cycle sustainability data with provenance. In: ISWC, pp. 378–394 (2020)
Harth, A., Hose, K., Schenkel, R. (eds.): Linked Data Management. Chapman and Hall/CRC (2014)
Helali, M., Vashisth, S., Carrier, P., Hose, K., Mansour, E.: Linked Data Science Powered by Knowledge Graphs. CoRR abs/ arXiv: 2303.02204 (2023)
Heling, L., Acosta, M.: Federated SPARQL query processing over heterogeneous linked data fragments. In: WWW, pp. 1047–1057. ACM (2022)
Hernández, D., Galárraga, L., Hose, K.: Computing how-provenance for SPARQL queries via query rewriting. Proc. VLDB Endow. 14(13), 3389–3401 (2021)
Hoffart, J., Suchanek, F.M., Berberich, K., Lewis-Kelham, E., de Melo, G., Weikum, G.: YAGO2: exploring and querying world knowledge in time, space, context, and many languages. In: WWW (Companion Volume), pp. 229–232. ACM (2011)
Hose, K., Schenkel, R.: Towards benefit-based RDF source selection for SPARQL queries. In: SWIM, p. 2. ACM (2012)
Hose, K., Schenkel, R.: WARP: workload-aware replication and partitioning for RDF. In: ICDE Workshops, pp. 1–6. IEEE Computer Society (2013)
Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: Towards exploratory OLAP over linked open data – a case study. In: Castellanos, M., Dayal, U., Pedersen, T.B., Tatbul, N. (eds.) BIRTE 2013-2014. LNBIP, vol. 206, pp. 114–132. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-46839-5_8
Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: processing aggregate queries in a federation of SPARQL endpoints. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 269–285. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18818-8_17
Ibragimov, D., Hose, K., Pedersen, T.B., Zimányi, E.: Optimizing aggregate SPARQL queries using materialized RDF views. In: Groth, P., et al. (eds.) ISWC 2016. LNCS, vol. 9981, pp. 341–359. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46523-4_21
Idreos, S., et al.: Design continuums and the path toward self-designing key-value stores that know and learn. In: CIDR. www.cidrdb.org (2019)
Jakobsen, A.L., Montoya, G., Hose, K.: How diverse are federated query execution plans really? In: Hitzler, P., et al. (eds.) ESWC 2019. LNCS, vol. 11762, pp. 105–110. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32327-1_21
Jakobsen, K.A., Andersen, A.B., Hose, K., Pedersen, T.B.: Optimizing RDF data cubes for efficient processing of analytical queries. In: COLD. CEUR Workshop Proceedings, vol. 1426. CEUR-WS.org (2015)
Kaoudi, Z., et al.: Atlas: Storing, updating and querying RDF(S) data on top of DHTs. J. Web Semant. 8(4), 271–277 (2010)
Keles, I., Hose, K.: Skyline queries over knowledge graphs. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 293–310. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_17
Khayatbashi, S., Ferrada, S., Hartig, O.: Converting property graphs to RDF: a preliminary study of the practical impact of different mappings. In: GRADES-NDA@SIGMOD, pp. 10:1–10:9. ACM (2022)
Lassila, O., et al.: The OneGraph vision: challenges of breaking the graph model lock-in. Semantic Web 14(1), 125–134 (2023)
Lehmann, J., et al.: Dbpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web 6(2), 167–195 (2015)
Lissandrini, M., Hose, K., Pedersen, T.B.: Example-driven exploratory analytics over knowledge graphs. In: EDBT, pp. 105–117. OpenProceedings.org (2023)
Lissandrini, M., Mottin, D., Hose, K., Pedersen, T.B.: Knowledge graph exploration systems: are we lost? In: CIDR (2022). www.cidrdb.org
Lissandrini, M., Mottin, D., Palpanas, T., Velegrakis, Y.: Data Exploration Using Example-Based Methods. Morgan & Claypool Publishers, Synthesis Lectures on Data Management (2018)
Mansour, E., Srinivas, K., Hose, K.: Federated data science to break down silos. SIGMOD Rec. 50(4), 16–22 (2021)
Montoya, G., Aebeloe, C., Hose, K.: Towards efficient query processing over heterogeneous RDF interfaces. In: ISWC (Best Workshop Papers). Studies on the Semantic Web, vol. 36, pp. 39–53. IOS Press (2018)
Montoya, G., Keles, I., Hose, K.: Analysis of the effect of query shapes on performance over LDF interfaces. In: QuWeDa@ISWC. CEUR Workshop Proceedings, vol. 2496, pp. 51–66. CEUR-WS.org (2019)
Montoya, G., Skaf-Molli, H., Hose, K.: The Odyssey approach for optimizing federated SPARQL queries. In: d’Amato, C., et al. (eds.) ISWC 2017. LNCS, vol. 10587, pp. 471–489. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68288-4_28
Nargesian, F., Pu, K.Q., Bashardoost, B.G., Zhu, E., Miller, R.J.: Data lake organization. IEEE Trans. Knowl. Data Eng. 35(1), 237–250 (2023)
Nath, R.P.D., Hose, K., Pedersen, T.B., Romero, O.: SETL: a programmable semantic extract-transform-load framework for semantic data warehouses. Inf. Syst. 68, 17–43 (2017)
Nguyen, V., Bodenreider, O., Sheth, A.P.: Don’t like RDF reification?: making statements about statements using singleton property. In: WWW (2014)
Noy, N.F., Gao, Y., Jain, A., Narayanan, A., Patterson, A., Taylor, J.: Industry-scale knowledge graphs: lessons and challenges. Commun. ACM 62(8), 36–43 (2019)
Özsu, M.T., Valduriez, P.: Principles of Distributed Database Systems, 4th edn. Springer (2020). https://doi.org/10.1007/978-3-030-26253-2
Pelgrin, O., Galárraga, L., Hose, K.: Towards fully-fledged archiving for RDF datasets. Semantic Web 12(6), 903–925 (2021)
Rabbani, K., Lissandrini, M., Hose, K.: SHACL and ShEx in the wild: a community survey on validating shapes generation and adoption. In: WWW (Companion Volume), pp. 260–263. ACM (2022)
Rabbani, K., Lissandrini, M., Hose, K.: Extraction of validating shapes from very large knowledge graphs. Proc. VLDB Endow. 16(5), 1023–1032 (2023)
Rabbani, K., Lissandrini, M., Hose, K.: SHACTOR: improving the quality of large-scale knowledge graphs with validating shapes. In: SIGMOD Conference Companion, pp. 151–154. ACM (2023)
Sagi, T., Lissandrini, M., Pedersen, T.B., Hose, K.: A design space for RDF data representations. VLDB J. 31(2), 347–373 (2022)
Sakr, S., et al.: The future is big graphs: a community view on graph processing systems. Commun. ACM 64(9), 62–71 (2021)
Schwarte, A., Haase, P., Hose, K., Schenkel, R., Schmidt, M.: FedX: a federation layer for distributed query processing on linked open data. In: Antoniou, G., et al. (eds.) ESWC 2011. LNCS, vol. 6644, pp. 481–486. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21064-8_39
Schwarte, A., Haase, P., Hose, K., Schenkel, R., Schmidt, M.: FedX: optimization techniques for federated query processing on linked data. In: Aroyo, L., et al. (eds.) ISWC 2011. LNCS, vol. 7031, pp. 601–616. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25073-6_38
Taelman, R., Mahieu, T., Vanbrabant, M., Verborgh, R.: Optimizing storage of RDF archives using bidirectional delta chains. Semantic Web (2021)
Varadarajan, R., Bharathan, V., Cary, A., Dave, J., Bodagala, S.: DBDesigner: a customizable physical design tool for vertica analytic database. In: ICDE, pp. 1084–1095. IEEE Computer Society (2014)
Vrandecic, D., Krötzsch, M.: Wikidata: a free collaborative knowledgebase. Commun. ACM 57(10), 78–85 (2014)
Zervakis, L., Setty, V., Tryfonopoulos, C., Hose, K.: Efficient continuous multi-query processing over graph streams. In: EDBT, pp. 13–24. OpenProceedings.org (2020)
Zhang, C., Bonifati, A., Özsu, M.T.: An overview of reachability indexes on graphs. In: SIGMOD Conference Companion, pp. 61–68. ACM (2023)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hose, K. (2023). Knowledge Engineering in the Era of Artificial Intelligence. In: Abelló, A., Vassiliadis, P., Romero, O., Wrembel, R. (eds) Advances in Databases and Information Systems. ADBIS 2023. Lecture Notes in Computer Science, vol 13985. Springer, Cham. https://doi.org/10.1007/978-3-031-42914-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-42914-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42913-2
Online ISBN: 978-3-031-42914-9
eBook Packages: Computer ScienceComputer Science (R0)