Abstract
The amounts of digital information are growing in size and complexity. With the emergence of distributed services over internet and the booming of electronic exchanges, the need to identify information origins and its lifecycle history becomes essential. Essential because it’s the only factor ensuring information integrity and probative value. That’s why in different areas like government, commerce, medicine and science, tracking data origins is essential and can serve for informational, quality, forensics, regulatory compliance, rights protection and intellectual property purposes. Managing information provenance is a complex task and it has been extensively treated in databases, file system and scientific workflows. However, provenance in the cloud is a more challenging task due to specific problems related to the cloud added to the traditional ones.
Chapter PDF
Similar content being viewed by others
Keywords
- Cloud Computing
- Electronic Document
- Data Provenance
- Information Provenance
- Open Archival Information System
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Agrawal, P., Benjelloun, O., Sarma, A.D., Hayworth, C., Shubha, U., Nabar, C.U., Sugihara, T., Widom, J.: ULDBs: Databases with Uncertainty and Lineage. In: Trio: A System for Data, Uncertainty, and Lineage. VLDB 2006, pp. 1151–1154 (2006)
Bhagwat, D., Chiticariu, L., Tan, W.C., Vijayvargiya, G.: An Annotation Management System for Relational Databases. In: VLDB, pp. 900–911 (2004)
Braun, U., Shinnar, A., Seltzer, M.: Securing provenance. In: Third USENIX Workshop on Hot Topics in Security (HotSec) (July 2008)
Buneman, P., Khanna, S., Tan, W.C.: Why and Where: A Characterization of Data Provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 316–330. Springer, Heidelberg (2000)
Cameron, G.: Provenance and Pragmatics. In: Workshop on Data Provenance and Annotation (2003)
Cheney, J., Chong, S., Foster, N., Seltzer, M., Vansummeren, S.: Provenance: A Future History. In: International Conference on Object Oriented Programming, Systems, Languages and Applications, pp. 957–964 (2009)
Da Silva, P.P., McGuinness, D.L., McCool, R.: Knowledge Provenance Infrastructure. IEEE Data Engineering Bulletin 26, 26–32 (2003)
Davidson, S., Cohen-Boulakia, S., Eyal, A., Ludascher, B., McPhillips, T., Bowers, S., Freire, J.: Provenance in Scientific Workflow Systems. IEEE Data Engineering Bulletin 32, 44–50 (2007)
Goble, C.: Position Statement: Musings on Provenance, Workow and Semantic Web Annotations for Bioinformatics. In: Workshop on Data Derivation and Provenance (2002)
Grand challenges in computing research conference (2008), UK Computing Society: http://www.ukcrc.org.uk/press/news/challenge08/gccr08final.cfm
Hasan, R., Yurcik, W., Myagmar, S.: The Evolution of Storage Service Providers: Techniques and Challenges to Outsourcing Storage. In: Proceedings of the 2005 ACM workshop on Storage Security and Survivability (2005)
Hassan, R., Sion, R., Winslett, M.: Preventing History Forgery with Secure Provenance. ACM Transactions on Storage (2009)
Hassan, R., Sion, R., Winslett, M.: Remembrance: The Unbearable Sentience of Being Digital. In: Fourth Biennial Conference on Innovative Data Systems Research (2009)
INFOSEC Research Council (IRC) Hard problem list. Technical report (November 2005), http://www.cyber.st.dhs.gov/docs/IRC_Hard_Problem_List.pdf
ISO 14721:2003. Space data and information transfer systems - Open Archival Information System Reference model (OAIS), http://www.iso.org
Miles, S., Groth, P.T., Munroe, S., Jiang, S., Assandri, T., Moreau, L.: Extracting causal graphs from an open provenance data model. Concurrency and Computation: Practice and Experience 20(5), 577–586 (2008)
MoReq2 specifications. Model Requirements for the management of electronic records Update and Extension (2008), http://www.moreq2.eu
Muniswamy-Reddy, K.K., Holland, D.A., Braun, U., Seltzer, M.: Provenance-aware storage systems. In: USENIX Annual Technical Conference, General Track, pp. 43–56 (2006)
NF Z42-013. Electronic archival storage-Specifications relative to the design and operation of information processing systems in view of ensuring the storage and integrity of the recording stored in these systems, http://www.boutique.afnor.org
Sar, C., Cao, P.: Lineage file system. Technical Report (January 2005), http://crypto.stanford.edu/~cao/lineage
Simmhan, Y., Plale, B., Gannon, D.: A survey of data provenance in e-science. SIGMOD Record (Special section on scientific workflows) 34(3), 31–36 (2005)
ISO Standard for using PDF format for the long-term archiving of electronic documents ISO-19005-1 - Document management - Electronic document file format for long-term preservation - Part 1: Use of PDF 1.4 (PDF/A-1), http://www.pdfa.org
Moreau, L., Plale, B., Miles, S., Goble, C., Missier, P., Barga, R., Simmhan, Y., Futrelle, J., McGrath, R.E., Myers, J., Paulson, P., Bowers, S., Ludaescher, B., Kwasnikowska, N., Van den Bussche, J., Ellkvist, T., Freire, J., Groth, P.: The Open Provenance Model (v1.01) specifications. Future Generation Computer Systems (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sakka, M.A., Defude, B., Tellez, J. (2010). Document Provenance in the Cloud: Constraints and Challenges. In: Aagesen, F.A., Knapskog, S.J. (eds) Networked Services and Applications - Engineering, Control and Management. EUNICE 2010. Lecture Notes in Computer Science, vol 6164. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13971-0_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-13971-0_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13970-3
Online ISBN: 978-3-642-13971-0
eBook Packages: Computer ScienceComputer Science (R0)