RML and FnO: Shaping DBpedia Declaratively

  • Ben De Meester
  • Wouter Maroy
  • Anastasia Dimou
  • Ruben Verborgh
  • Erik Mannens
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10577)

Abstract

DBpedia data is largely generated by extracting and parsing the wikitext from the infoboxes of Wikipedia. This generation process is handled by the DBpedia Extraction Framework (DBpedia EF). This framework currently consists of data transformations, a series of custom hard-coded steps which parse the wikitext, and schema transformations, which model the resulting RDF data. Therefore, applying changes to the resulting RDF data requires both Semantic Web expertise and development within the DBpedia EF. As such, the current DBpedia data is being shaped by a small number of core developers. However, by describing both schema and data transformations declaratively, we shape and generate DBpedia data using solely declarations, separating the concerns of implementation and modeling. The development of parsing functions is decoupled from the DBpedia EF, and other data transformation functions can easily be integrated during DBpedia data generation. This demo showcases an interactive Web application that allows non-technical users to (re-)shape the DBpedia data and use external data transformation functions, solely by editing a mapping document via HTML controls.
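The separation described in the abstract can be illustrated with a minimal sketch. It is not RML or FnO syntax and not code from the DBpedia EF: the names `FUNCTIONS`, `MAPPING`, `generate_triples`, the `ex:` predicates, and the sample infobox are all hypothetical, chosen only to show a mapping held as data while the transformation functions live in a separate registry.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical sketch: none of these names come from RML, FnO, or the DBpedia EF.

# Data transformations: plain functions registered under a name, so they can be
# developed and replaced independently of the mapping.
FUNCTIONS: Dict[str, Callable[[str], str]] = {
    # Removes leading/trailing whitespace and square brackets, e.g. "[[London]]".
    "strip_wiki_link": lambda value: value.strip().strip("[]"),
    "identity": lambda value: value,
}

# Schema transformation: a declarative mapping from infobox keys to
# (predicate, function name) pairs. Editing this data reshapes the output
# without touching any code.
MAPPING: Dict[str, Tuple[str, str]] = {
    "birth_place": ("ex:birthPlace", "strip_wiki_link"),
    "name": ("ex:name", "identity"),
}


def generate_triples(
    subject: str,
    infobox: Dict[str, str],
    mapping: Dict[str, Tuple[str, str]],
    functions: Dict[str, Callable[[str], str]],
) -> List[Tuple[str, str, str]]:
    """Apply each mapping rule to the infobox and collect (s, p, o) triples."""
    triples = []
    for key, (predicate, function_name) in mapping.items():
        if key in infobox:
            value = functions[function_name](infobox[key])
            triples.append((subject, predicate, value))
    return triples


if __name__ == "__main__":
    sample = {"name": "Ada", "birth_place": "[[London]]"}
    for triple in generate_triples("ex:Ada", sample, MAPPING, FUNCTIONS):
        print(triple)
```

In this sketch, changing a predicate in `MAPPING` or registering an extra entry in `FUNCTIONS` alters the generated triples while `generate_triples` stays untouched, which is the kind of split between modeling and implementation the abstract describes.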

Keywords

DBpedia, Data transformations, FnO, Linked Data generation, RML

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Ben De Meester (1)
  • Wouter Maroy (1)
  • Anastasia Dimou (1)
  • Ruben Verborgh (1)
  • Erik Mannens (1)

  1. IDLab, Department of Electronics and Information Systems, Ghent University – imec, Ghent, Belgium
