Abstract
In this paper we describe a new procedure that is capable of ensuring that the marginal distributions of attributes in microdata masked with a masking mechanism end up being the same as the marginal distributions of attributes in the original data. We illustrate the application of the new procedure using several commonly used masking mechanisms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cario, M.C., Nelson, B.L.: Modeling and Generating Random Vectors with arbitrary Marginal Distributions and Correlation Matrix. Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL (1997)
Chen, H.: Initialization for NORTA: Generation of Random Vectors with Specified Marginals and Correlations. INFORMS Journal on Computing 13, 312–331 (2001)
Domingo-Ferrer, J.: Non-Perturbative Masking Methods. In: Liu, L., Ozsu, M.T. (eds.) Encyclopedia of Database Systems, pp. 1912–1913. Springer, US (2009)
Domingo-Ferrer, J., González-Nicolás, U.: Hybrid Microdata Using Microaggregation. Information Science 180, 2834–2844 (2010)
Domingo-Ferrer, J., Torra, V.: A Quantitative Comparison of Disclosure Control Methods for Microdata. In: Doyle, P., Lane, J.I., Theeuwes, J.J.M., Zayatz, L.V. (eds.) Confidentiality, Disclosure, and Data Access: Theory and Practical Applications for Statistical Agencies, pp. 111–133. Elsevier, Amsterdam (2001)
Domingo-Ferrer, J., Sanchez, D., Rufian-Torrell, G.: Anonymization of Nominal Data Based on Semantic Marginality. Information Science 242, 35–48 (2013)
Drechsler, J., Bender, S., Rassler, S.: Comparing Fully and Partially Synthetic Datasets for Statistical Disclosure Control in the German IAB Establishment Panel. Transactions on Data Privacy 1, 105–130 (2008)
Hundepool, A., Domingo-Ferrer, J., Franconi, L., Giessing, S., Nordholt, E.S., Spicer, K., de Wolf, P.-P.: Statistical Disclosure Control. John Wiley & Sons, West Sussex (2012)
Moore, R.A.: Controlled Data Swapping for Masking Public Use Microdata Sets. Washington DC: Research report series (RR96/04), Statistical Research Division, US Census Bureau (1996)
Muralidhar, K., Sarathy, R.: Data Shuffling: A New Masking Approach for Numerical Data. Management Scienc 52, 658–670 (2006)
Nin, J., Herranz, J., Torra, V.: Rethinking Rank Swapping to Decrease Disclosure Risk. Data & Knowledge Engineering 64, 346–364 (2008)
Raghunathan, T.E., Reiter, J.P., Rubin, D.B.: Multiple Imputation for Statistical Disclosure Limitation. Journal of Official Statistics 19, 1–16 (2003)
Sebé, F., Domingo-Ferrer, J., Mateo-Sanz, J.M., Torra, V.: Post-Masking Optimization of the Tradeoff between Information Loss and Disclosure Risk in Masked Microdata. In: Domingo-Ferrer, J. (ed.) Inference Control in Statistical Databases, pp. 163–171. Springer, Heidelberg (2002)
Simpson, G.R.: The 2000 Count: Bureau Blurs Data to Keep Names Confidential. Wall Street Journal 14, B1–B2 (2001)
Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institut de Statistique de L’Université de Paris 8, 229–231 (1959)
Soria-Comas, J., Domingo-Ferrer, J.: Probabilistic k-anonymity through microaggregation and data swapping. In: IEEE International Conference on Fuzzy Systems, pp. 1–8. IEEE, Brisbane (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Muralidhar, K., Sarathy, R., Domingo-Ferrer, J. (2014). Reverse Mapping to Preserve the Marginal Distributions of Attributes in Masked Microdata. In: Domingo-Ferrer, J. (eds) Privacy in Statistical Databases. PSD 2014. Lecture Notes in Computer Science, vol 8744. Springer, Cham. https://doi.org/10.1007/978-3-319-11257-2_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-11257-2_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11256-5
Online ISBN: 978-3-319-11257-2
eBook Packages: Computer ScienceComputer Science (R0)