Abstract
Consciously collecting (research) data and respecting privacy aspects are two contradictions, which seem to be mutually exclusive at first moment. However, this does not have to be the case. But before we can address this conflict and its resolution, we want to understand what the terms privacy, provenance, and research data management actually mean. We are not interested in the formal definitions but in the community’s understanding of these terms. We have the intention to explore how far the theoretical definitions are known in science and economy. Hence, we interviewed 20 people – scientists and non-scientists – and evaluated their answers for discussing the relevance of combining provenance and privacy in the field of research data management. We discovered that provenance is generally understood as the origin of data or physical objects, and privacy often refers to the protection of personal data. We found that all participants have a very good understanding of their own research data, which in most cases is based on a well-developed research data management. Nevertheless, there is still some uncertainty, especially in the area of provenance and privacy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Auge, T., Heuer, A.: ProSA—using the CHASE for provenance management. In: Welzer, T., Eder, J., Podgorelec, V., Kamišalić Latifić, A. (eds.) ADBIS 2019. LNCS, vol. 11695, pp. 357–372. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28730-6_22
Auge, T., Scharlau, N., Heuer, A.: Privacy aspects of provenance queries. In: Glavic, B., Braganholo, V., Koop, D. (eds.) IPAW 2020-2021. LNCS, vol. 12839, pp. 218–221. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-80960-7_15
Auge, T.: Extended provenance management for data science applications. In: PhD@VLDB, CEUR Workshop Proceedings, vol. 2652 (2020). CEUR-WS.org
Dwork, C.: Differential privacy. In: Bugliesi, M., Preneel, B., Sassone, V., Wegener, I. (eds.) ICALP 2006. LNCS, vol. 4052, pp. 1–12. Springer, Heidelberg (2006). https://doi.org/10.1007/11787006_1
Herschel, M., Diestelkämper, R., Ben Lahmar, H.: A survey on provenance: What for? What form? What from? VLDB J. 26(6), 881–906 (2017). https://doi.org/10.1007/s00778-017-0486-1
Samarati, P.: Protecting respondents’ identities in microdata release. IEEE Trans. Knowl. Data Eng. 13(6), 1010–1027 (2001)
Sweeney, L.: Simple Demographics Often Identify People Uniquely. Carnegie Mellon University, School of Computer Science, Data Privacy Lab White Paper Series LIDAP-WP4. Pittsburgh, PA (2000)
Acknowledgements
Thanks to all interview partners for their time as well as their exhaustive answers. Thanks also to Tom Ettrich for proofreading our article.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Auge, T., Scharlau, N., Heuer, A. (2021). Provenance and Privacy in ProSA. In: Kotsis, G., et al. Database and Expert Systems Applications - DEXA 2021 Workshops. DEXA 2021. Communications in Computer and Information Science, vol 1479. Springer, Cham. https://doi.org/10.1007/978-3-030-87101-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-87101-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87100-0
Online ISBN: 978-3-030-87101-7
eBook Packages: Computer ScienceComputer Science (R0)