Priority-Based k-Anonymity Accomplished by Weighted Generalisation Structures

Stark, Konrad; Eder, Johann; Zatloukal, Kurt

doi:10.1007/11823728_38

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4081)

Abstract

Biobanks are gaining in importance by storing large collections of patients' clinical data (e.g. disease history, laboratory parameters, diagnosis, lifestyle) together with biological materials such as tissue samples, blood or other body fluids. When these patient-specific data are released for medical studies, privacy protection has to be guaranteed for ethical and legal reasons. k-anonymity may be used to ensure privacy by generalising and suppressing attributes so that the released data contain enough data twins to mask patients' identities. However, data transformation techniques like generalisation may produce anonymised data that are unusable for medical studies because some attributes become too coarse-grained. We propose a priority-driven anonymisation technique that allows the degree of acceptable information loss to be specified for each attribute separately. We use generalisation and suppression of attributes together with a weighting scheme for quantifying generalisation steps. Our approach handles both numerical and categorical attributes and provides a data anonymisation based on priorities and weights. The anonymisation algorithm described in this paper has been implemented and tested on a carcinoma data set. We discuss some general privacy-protecting methods for medical data and show some medically relevant use cases that benefit from our anonymisation technique.
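The idea of weighting generalisation steps per attribute can be illustrated with a minimal Python sketch. It is not the algorithm from the paper, whose details are not given in the abstract. It assumes two made-up generalisation structures (age: exact value, decade, suppressed; diagnosis: full label, top-level class, suppressed), a hypothetical loss measure that multiplies each attribute's weight by the fraction of its hierarchy that was climbed, and an exhaustive search over all level combinations. The records, attribute names and weights are invented for illustration.

```python
from collections import Counter
from itertools import product

# Hypothetical generalisation structures: index 0 keeps the original value,
# each higher index is one generalisation step, the last one is suppression.
HIERARCHIES = {
    "age": [
        lambda a: a,
        lambda a: f"{a // 10 * 10}-{a // 10 * 10 + 9}",
        lambda a: "*",
    ],
    "diagnosis": [
        lambda d: d,
        lambda d: d.split("/")[0],
        lambda d: "*",
    ],
}

# Made-up example records, not taken from the paper's carcinoma data set.
RECORDS = [
    {"age": 41, "diagnosis": "carcinoma/hepatocellular"},
    {"age": 47, "diagnosis": "carcinoma/cholangio"},
    {"age": 52, "diagnosis": "carcinoma/hepatocellular"},
    {"age": 58, "diagnosis": "carcinoma/cholangio"},
]


def generalise(records, levels):
    """Apply the chosen generalisation level of every attribute to every record."""
    return [
        tuple(HIERARCHIES[attr][levels[attr]](rec[attr]) for attr in HIERARCHIES)
        for rec in records
    ]


def is_k_anonymous(rows, k):
    """True if every combination of values occurs at least k times."""
    return all(count >= k for count in Counter(rows).values())


def weighted_loss(levels, weights):
    """Assumed loss: weight times the fraction of the hierarchy climbed, summed."""
    return sum(
        weights[attr] * levels[attr] / (len(HIERARCHIES[attr]) - 1)
        for attr in HIERARCHIES
    )


def cheapest_generalisation(records, k, weights):
    """Exhaustively search all level combinations for the k-anonymous one
    with the lowest weighted loss. Returns (loss, levels) or None."""
    attrs = list(HIERARCHIES)
    best = None
    for combo in product(*(range(len(HIERARCHIES[a])) for a in attrs)):
        levels = dict(zip(attrs, combo))
        if is_k_anonymous(generalise(records, levels), k):
            loss = weighted_loss(levels, weights)
            if best is None or loss < best[0]:
                best = (loss, levels)
    return best


# A high weight on diagnosis keeps it precise and sacrifices age instead.
print(cheapest_generalisation(RECORDS, 2, {"age": 1.0, "diagnosis": 3.0}))
# A high weight on age generalises both attributes by one step.
print(cheapest_generalisation(RECORDS, 2, {"age": 3.0, "diagnosis": 1.0}))
```

With these toy records, changing the weights changes which attribute is coarsened, which is the kind of per-attribute control over information loss that the abstract describes. Exhaustive search is only feasible for a handful of attributes and is used here purely to keep the sketch short.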

Author information

Authors and Affiliations

Institute of Pathology, Medical University Graz, Auenbruggerplatz 25, A-8036, Graz
Konrad Stark & Kurt Zatloukal
Department of Knowledge and Business Engineering, University of Vienna, Rathausstraße 19/9, A-1010, Wien
Johann Eder

Editor information

Editors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstr. 9-11/188, A-1040, Wien, Austria
A Min Tjoa
Department of Software and Computing Systems, University of Alicante, Spain
Juan Trujillo

About this paper

Cite this paper

Stark, K., Eder, J., Zatloukal, K. (2006). Priority-Based k-Anonymity Accomplished by Weighted Generalisation Structures. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2006. Lecture Notes in Computer Science, vol 4081. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823728_38

DOI: https://doi.org/10.1007/11823728_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37736-8
Online ISBN: 978-3-540-37737-5
eBook Packages: Computer Science; Computer Science (R0)
