Abstract
The aim of a project initiated by the International Household Survey Network (IHSN, www.ihsn.org ) is to integrate the C++ code they developed to the R package sdcMicro. The methods for microdata perturbation in the R-package sdcMicro are now all based on computational fast C++ code. The paper describes how this integration was done and describes the methods ready to be used. Finally, we give an outline of on-going and further developments which are funded by the IHSN and Google.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Todorov, V., Templ, M.: R in the statistical office: Part 2. Working paper, 2, United Nations Industrial Development (in press, 2012)
R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2012) ISBN 3-900051-07-0
Templ, M.: sdcMicro: Statistical Disclosure Control methods for the generation of public- and scientific-use files. Manual and Package (2012), R package version 3.1.1
Templ, M.: Statistical disclosure control for microdata using the R-package sdcMicro. Transactions on Data Privacy 1(2), 67–85 (2008)
Templ, M., Petelin, T.: A graphical user interface for microdata protection which provides reproducibility and interactions: the sdcMicro GUI. Transactions on Data Privacy 2, 207–223 (2009)
Templ, M., Meindl, B.: Practical applications in statistical disclosure control using R. In: Nin, J., Herranz, J. (eds.) Privacy and Anonymity in Information Management Systems. Advanced Information and Knowledge Processing, pp. 31–62. Springer, London (2010), doi:10.1007/978-1-84996-238-4_3
Meindl, B., Templ, M., Kowarik, A.: Drafting of guidelines for the anonymization of microdata using R-package sdcMicro. Project: Relative to the Testing of SDC Algorithms and Provision of Practical SDC Deliverable 3, Reference 0500000897, MEHLB (2011), data analysis OG (2012)
Manning, A., Haglin, D., Keane, J.: A recursive search algorithm for statistical disclosure assessment. Data Mining and Knowledge Discovery 16, 165–196 (2008), doi:10.1007/s10618-007-0078-6
Eddelbuettel, D., François, R.: Rcpp: Seamless R and C++ integration. Journal of Statistical Software 40(8), 1–18 (2011)
Eddelbuettel, D., Francois, R., with contributions by Bates, D., Chambers, J.: Rcpp: Seamless R and C++ Integration (2011), R package version 0.9.2
Implementing the Post Randomisation method to the Individual Sample of Anonymised Records (SAR) from the 2001 Census. In: Meeting on the Samples of Anonymised Records from the 2001 Census (2004)
Alfons, A., Kraft, S., Templ, M., Filzmoser, P.: Simulation of close-to-reality population data for household surveys with application to EU-SILC. Statistical Methods & Applications 20(3), 383–407 (2011)
Templ, M.: Estimators and model predictions from the structural earnings survey for benchmarking statistical disclosure methods. Technical report, E107 - Institut für Statistik und Wahrscheinlichkeitstheorie; Technische Universität Wien (2011)
Templ, M., Alfons, A.: Standard methods for point estimation of social inclusion indicators using the R package laeken. Research Report CS-2011-1, Department of Statistics and Probability Theory, Vienna University of Technology (2011)
Templ, M., Meindl, B.: Robustification of Microdata Masking Methods and the Comparison with Existing Methods. In: Domingo-Ferrer, J., Saygın, Y. (eds.) PSD 2008. LNCS, vol. 5262, pp. 113–126. Springer, Heidelberg (2008), doi:10.1007/978-3-540-87471-3_15.
Manning, A., Haglin, D.: A new algorithm for finding minimal sample uniques for use in statistical disclosure assessment. In: ICDM, pp. 290–297. IEEE Computer Society (2005)
Kooiman, P., Willenbourg, L., Gouweleeuw, J.: A method for disclosure limitation of microdata. Technical report, Research paper 9705, Statistics Netherlands, Voorburg (2002)
Gouweleeuw, J., Kooiman, P., Willenborg, L., De Wolf, P.P.: Post randomisation for statistical disclosure control: Theory and implementation. Journal of Official Statistics 14(4), 463–478 (1998)
Domingo-Ferrer, J., Mateo-Sanz, J.M.: Practical data-oriented microaggregation for statistical disclosure control. IEEE Trans. on Knowl. and Data Eng. 14(1), 189–201 (2002)
Dalenius, T., Reiss, S.: Data-swapping: A technique for disclosure control. In: Proceedings of the Section on Survey Research Methods, vol. 6, pp. 73–85. American Statistical Association (1982)
Moore, J.: Controlled data-swapping techniques for masking public use microdata. Statistical Research Division Report Series RR 96-04, U.S. Bureau of the Census (1996)
Benedetti, R., Franconi, L.: Statistical and technological solutions for controlled data dissemination. In: Pre-Proceedings of New Techniques and Technologies for Statistics, pp. 225–232 (1998)
Capobianchi, A., Polettini, S., Lucarelli, M.: Strategy for the implementation of individual risk methodology into μ-ARGUS. Technical report, Report for the CASC project. No: 1.2-D1 (2001)
Fox, J.: The R Commander: A basic statistics graphical user interface to R. Journal of Statistical Software 14(9), 1–42 (2005)
Templ, M., Alfons, A.: Disclosure Risk of Synthetic Population Data with Application in the Case of EU-SILC. In: Domingo-Ferrer, J., Magkos, E. (eds.) PSD 2010. LNCS, vol. 6344, pp. 174–186. Springer, Heidelberg (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kowarik, A., Templ, M., Meindl, B., Fonteneau, F., Prantner, B. (2012). Testing of IHSN C++ Code and Inclusion of New Methods into sdcMicro. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-33627-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33626-3
Online ISBN: 978-3-642-33627-0
eBook Packages: Computer ScienceComputer Science (R0)