Abstract
This paper assesses disclosure risk in statistical databases using a probabilistic approach. In particular, we consider the problem of auditing databases that support statistical sum/count/mean/max/min queries, with the goal of protecting the privacy of sensitive boolean data. We provide both a theoretical framework for evaluating the disclosure risk and a tool for controlling and managing it.
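To make the idea of probabilistic auditing on boolean data concrete, the following is a minimal illustrative sketch and not the method developed in the paper. It assumes independent records with a common prior probability of being 1, enumerates every boolean assignment consistent with the answered queries, and reports the posterior probability that each record equals 1. The names `posterior_given_queries` and `is_safe`, the `prior` parameter, and the `threshold` value are all hypothetical choices made for this example.

```python
from itertools import product

# Hypothetical helper: maps a query kind to the aggregate it computes.
AGGREGATES = {"sum": sum, "max": max, "min": min}


def posterior_given_queries(n, prior, queries):
    """Return P(x_i = 1 | answers) for each of n boolean records.

    Assumption: records are independent a priori, each equal to 1
    with probability `prior`. `queries` is a list of
    (kind, indices, answer) tuples with kind in {"sum", "max", "min"}.
    Brute force over 2**n assignments, so only suitable for small n.
    """
    total = 0.0
    weight_one = [0.0] * n
    for world in product((0, 1), repeat=n):
        # Discard assignments that contradict any answered query.
        if any(AGGREGATES[kind](world[i] for i in idx) != answer
               for kind, idx, answer in queries):
            continue
        # Prior weight of this assignment under independence.
        w = 1.0
        for bit in world:
            w *= prior if bit else 1.0 - prior
        total += w
        for i, bit in enumerate(world):
            if bit:
                weight_one[i] += w
    if total == 0.0:
        raise ValueError("no assignment is consistent with the answers")
    return [w / total for w in weight_one]


def is_safe(posteriors, threshold=0.9):
    """Hypothetical safety rule: no record's value is known with
    probability above `threshold`."""
    return all(max(p, 1.0 - p) <= threshold for p in posteriors)


# One sum query over three records leaves every value uncertain.
after_one = posterior_given_queries(3, 0.5, [("sum", (0, 1, 2), 2)])

# A second, overlapping sum query pins down the third record.
after_two = posterior_given_queries(
    3, 0.5, [("sum", (0, 1, 2), 2), ("sum", (0, 1), 1)]
)
print(after_one, is_safe(after_one))
print(after_two, is_safe(after_two))
```

In this toy setting an auditor could refuse the second query because answering it would push one posterior to certainty. Exhaustive enumeration scales exponentially with the number of records, which is one reason structured probabilistic models are of interest for this problem.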
Cite this article
Cavallo, B., Canfora, G. A probabilistic approach for disclosure risk assessment in statistical databases. Qual Quant 50, 729–749 (2016). https://doi.org/10.1007/s11135-015-0173-5
DOI: https://doi.org/10.1007/s11135-015-0173-5