Abstract
This paper elaborates on the problem of modeling provenance for both physical and digital objects. In particular, it discusses provenance according to OAIS (ISO 14721:2003) and how it relates to the conceptualization of the CIDOC CRM ontology (ISO 21127:2006). It then introduces an extension of the CIDOC CRM ontology that captures the modeling and query requirements for the provenance of digital objects. Based on this extension, the paper provides indicative examples of modeling provenance in various domains, introduces a number of indicative provenance query templates, and finally describes an implementation using Semantic Web technologies.
Additional information
Communicated by Walid G. Aref and Ouzzani Mourad.
About this article
Cite this article
Theodoridou, M., Tzitzikas, Y., Doerr, M. et al. Modeling and querying provenance by extending CIDOC CRM. Distrib Parallel Databases 27, 169–210 (2010). https://doi.org/10.1007/s10619-009-7059-2
DOI: https://doi.org/10.1007/s10619-009-7059-2