Skip to main content
Log in

Modeling and querying provenance by extending CIDOC CRM

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

This paper elaborates on the problem of modeling provenance for both physical and digital objects. In particular it discusses provenance according to OAIS (ISO 14721:2003) and how it relates with the conceptualization of CIDOC CRM ontology (ISO 21127:2006). Subsequently it introduces an extension of the CIDOC CRM ontology, able to capture the modeling and the query requirements regarding the provenance of digital objects. Over this extension the paper provides a number of indicative examples of modeling provenance in various domains. Subsequently, it introduces a number of indicative provenance query templates, and finally it describes an implementation using Semantic Web technologies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. CASPAR (Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval), FP6-2005-IST-033572. http://www.casparpreserves.eu/

  2. SPARQL Query Language for RDF. W3C Candidate Recommendation, 6 April 2006. http://www.w3.org/TR/rdf-sparql-query/

  3. Allard, P., Ferre’, S.: Dynamic taxonomies for the semantic web. In: Proccedings of FIND’2008 (at DEXA’08), Turin, Italy, September 2008

  4. Brickley, D., Guha, R.V.: Resource description framework (RDF) schema specification: proposed recommendation. W3C, March 1999. http://www.w3.org/TR/1999/PR-rdf-schema-19990303

  5. Brown, J.L., Ferner, C.S., Hudson, T.C., Stapleton, A.E., Vetter, R.J., Carland, T., Martin, A., Martin, J., Rawls, A., Shipman, W.J., et al.: Gridnexus: a grid services scientific workflow system. Int. J. Comput. Inf. Sci. 6(2), 77–82 (2005)

    Google Scholar 

  6. Buneman, P., Khanna, S., Tajima, K., Tan, W.C.: Archiving scientific data. ACM Trans. Database Syst. 29(1), 2–42 (2004)

    Article  Google Scholar 

  7. Buneman, P., Tan, W.C.: Provenance in databases. In: Proccedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp. 1171–1173. ACM, New York (2007)

    Chapter  Google Scholar 

  8. Carroll, J.J., Bizer, C., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: Proccedings of the 14th International Conference on World Wide Web, pp. 613–622. ACM, New York (2005)

    Chapter  Google Scholar 

  9. XFDU development site. http://sindbad.gsfc.nasa.gov/xfdu

  10. Doerr, M., Crofts, N.: Electronic communication on diverse data—the role of an object-oriented CIDOC reference model. In: Proccedings of CIDOC’98, Melbourne, October 1998. http://www.ics.forth.gr/proj/isst/Publications/Conference_Proc.html

  11. Eltabakh, M.Y., Aref, W.G., Elmagarmid, A.K., Ouzzani, M., Laura-Silva, Y.: Supporting annotations on relations. In: 12th International Conference on Extending Database Technology (EDBT 2009), Saint-Petersburg, Russia, March 2009

  12. Factor, M., Henis, E., Naor, D., Rabinovici-Cohen, S., Reshef, P., Ronen, S., Michetti, G., Guercio, M.: Authenticity and provenance in long term digital preservation: modeling and implementation in preservation aware storage. In: Proccedings of the USENIX First Workshop on the Theory and Practice of Provenance (TaPP), San Francisco, USA, February 2009

  13. Flouris, G., Fundulaki, I., Pediaditis, P., Theoharis, Y., Christophides, V.: Coloring rdf triples to capture provenance. In: Proccedings of the 8th International Semantic Web Conference (ISWC’09), October 2009

  14. SAFE (Standard Archive Format for Europe). http://earth.esa.int/safe/

  15. FORTH-ICS/ISL. The CIDOC conceptual reference model for digital objects (2008). http://cidoc.ics.forth.gr/rdfs/caspar/cidoc_digital2.3.rdfs

  16. Hildebrand, M., van Ossenbruggen, J., Hardman, L.: /facet: a browser for heterogeneous semantic web repositories. In: Lecture Notes in Computer Science, vol. 4273, p. 272. Springer, Berlin (2006)

    Google Scholar 

  17. Hunter, J., Cheung, K.: Provenance explorer-a graphical interface for constructing scientific publication packages from provenance trails. Int. J. Digit. Libr. 7(1), 99–107 (2007)

    Article  Google Scholar 

  18. International organization for standardization: OAIS: open archival information system—reference model (2003). Ref. No. ISO 14721:2003

  19. International organization for standardization: The CIDOC conceptual reference model (2006). Ref. No. ISO 21127:2006. http://cidoc.ics.forth.gr/

  20. Karvounarakis, G., Christophides, V., Plexousakis, D.: RQL: a declarative query language for RDF. In: Eleventh International World Wide Web Conference (WWW), Hawaii, USA, May 2002

  21. Kondylakis, H., Analyti, A., Plexousakis, D.: Quete: ontology-based query system for distributed sources. In: Advances in Databases and Information Systems (ADBIS 2007), pp. 359–375. Springer 2007

  22. DEDSL Language (Data Entity Dictionary Specification Language). http://east.cnes.fr/english/page_dedsl.html

  23. SWRL (Semantic Web Rule Language). http://www.w3.org/submission/swrl/ (2004)

  24. Lorie, R.A.: Long term preservation of digital information. In: Proccedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 346–352 (2001)

  25. Lucas, A.: XFDU packaging contribution to an implementation of the OAIS reference model. In: Proccedings of the International Conference PV’2007 (Ensuring the Long-Term Preservation and Value Adding to Scientific and Technical Data), Edinburgh, November 2005

  26. Magiridou, M., Sahtouris, S., Christophides, V., Koubarakis, M.: RUL: a declarative update language for RDF. In: Proccedings of the 4th International Conference on the Semantic Web (ISWC-2005), Galway, Ireland, November 2005

  27. Majithia, S., Shields, M.S., Taylor, I.J., Wang, I.: Triana: a graphical web service composition and execution toolkit. In: Proccedings of the IEEE International Conference on Web Services (ICWS’04), San Diego, California, USA, July 2004

  28. Marketakis, Y., Tzanakis, M., Tzitzikas, Y.: PreScan: towards automating the preservation of digital objects. In: Proccedings of the International Conference on Management of Emergent Digital Ecosystems (MEDES’09), Lyon, France, October 2009

  29. Mikroyannidis, A., Bee, O., Ng, K., Giaretta, D.: Ontology-based temporal modelling of provenance information. In: Proccedings of Electrotechnical Conference, MELECON 2008. The 14th IEEE Mediterranean, Tenerife, Spain, May 2008, pp. 176–181

  30. Mäkelä, E., Hyvönen, E., Saarela, S.: Ontogator—a semantic view-based search engine service for web applications. In: International Semantic Web Conference. Lecture Notes in Computer Science, vol. 4273. Springer, Berlin (2006)

    Google Scholar 

  31. Moreau, L., Freire, J., Myers, J., Futrelle, J., Paulson, P.: The open provenance model. University of Southampton (2007)

  32. Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M.R., Wipat, A., et al.: Taverna: a tool for the composition and enactment of bioinformatics workflows (2004)

  33. Oren, E., Delbru, R., Decker, S.: Extending faceted navigation for RDF data. In: Lecture Notes in Computer Science, vol. 4273, p. 559. Springer, Berlin (2006)

    Google Scholar 

  34. Sacco, G.M., Tzitzikas, Y. (eds.): Dynamic Taxonomies and Faceted Search: Theory, Practise and Experience. Springer, Berlin (2009)

    Google Scholar 

  35. Srivastava, D., Velegrakis, Y.: Using queries to associate metadata with data. In ICDE, pp. 1451–1453 (2007)

  36. EAST Language (Enhanced Ada Subse T). http://east.cnes.fr/english/page_east.html

  37. Theoharis, Y., Christophides, V., Karvounarakis, G.G.: Benchmarking database representations of RDF/S stores. In: Proccedings of the 4th International Semantic Web Conference (ISWC’05). Springer, Berlin (2005)

    Google Scholar 

  38. Tzitzikas, Y., Kotzinos, D., Theoharis, Y.: On ranking RDF schema elements (and its application in visualization). J. Univers. Comput. Sci. 13(12), 1854–1880 (2007)

    Google Scholar 

  39. Tzitzikas, Y., Theoharis, Y., Andreou, D.: On storage policies for semantic web repositories that support versioning. In: Proccedings of the 5th European Semantic Web Conference (ESWC’08), Tenerife, Spain, June 2008, pp. 705–719. Springer, Berlin (2008)

    Google Scholar 

  40. van der Hoeven, J.R., van Diessen, R.J., van der Meer, K.: Development of a universal virtual computer (UVC) for long-term preservation of digital objects. J. Inf. Sci. 31(3), 196 (2005)

    Article  Google Scholar 

  41. Watkins, E.R., Nicole, D.A.: Named graphs as a mechanism for reasoning about provenance. In: Proccedings of the 8th Asia-Pacific Web Conf. (APWeb’2006), Harbin, China, pp. 943–948 (2006)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yannis Tzitzikas.

Additional information

Communicated by Walid G. Aref and Ouzzani Mourad.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Theodoridou, M., Tzitzikas, Y., Doerr, M. et al. Modeling and querying provenance by extending CIDOC CRM. Distrib Parallel Databases 27, 169–210 (2010). https://doi.org/10.1007/s10619-009-7059-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-009-7059-2

Keywords

Navigation