The Metadata Ecosystem of DataID

  • Markus Freudenberg
  • Martin Brümmer
  • Jessika Rücknagel
  • Robert Ulrich
  • Thomas Eckart
  • Dimitris Kontokostas
  • Sebastian Hellmann
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 672)


The rapid increase of data produced in a data-centric economy emphasises the need for rich metadata descriptions of datasets, covering many domains and scenarios. While multiple metadata formats describe datasets for specific purposes, exchanging metadata between them is often difficult. More general approaches to domain-independent description often lack the precision needed in domain-specific use cases. This paper introduces the multilayer ontology of DataID, which provides semantically rich metadata for complex datasets. In particular, we focus on the extensibility of its core model and its interoperability with foreign ontologies and other metadata formats. As a proof of concept, we present a way to describe Data Management Plans (DMP) of research projects alongside the metadata of their datasets, repositories and involved agents.


Link Open Data, Provenance Information, Metadata Schema, Core Ontology, Application Profile
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This research has received funding through grants from the H2020 EU projects ALIGNED (GA 644055) and FREME (GA-644771) as well as the Smart Data Web project (GA-01MD15010B).



Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Markus Freudenberg (1, corresponding author)
  • Martin Brümmer (2)
  • Jessika Rücknagel (3)
  • Robert Ulrich (4)
  • Thomas Eckart (5)
  • Dimitris Kontokostas (1)
  • Sebastian Hellmann (1)

  1. Institut für Angewandte Informatik (InfAI), AKSW/KILT, Universität Leipzig, Leipzig, Germany
  2. eccenca GmbH, Leipzig, Germany
  3. re3data, Göttingen, Germany
  4. re3data, Karlsruhe, Germany
  5. Abteilung Automatische Sprachverarbeitung, Universität Leipzig, Leipzig, Germany
