Skip to main content

Testing of IHSN C++ Code and Inclusion of New Methods into sdcMicro

  • Conference paper
Privacy in Statistical Databases (PSD 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7556))

Included in the following conference series:

  • 927 Accesses

Abstract

The aim of a project initiated by the International Household Survey Network (IHSN, www.ihsn.org ) is to integrate the C++ code they developed to the R package sdcMicro. The methods for microdata perturbation in the R-package sdcMicro are now all based on computational fast C++ code. The paper describes how this integration was done and describes the methods ready to be used. Finally, we give an outline of on-going and further developments which are funded by the IHSN and Google.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Todorov, V., Templ, M.: R in the statistical office: Part 2. Working paper, 2, United Nations Industrial Development (in press, 2012)

    Google Scholar 

  2. R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2012) ISBN 3-900051-07-0

    Google Scholar 

  3. Templ, M.: sdcMicro: Statistical Disclosure Control methods for the generation of public- and scientific-use files. Manual and Package (2012), R package version 3.1.1

    Google Scholar 

  4. Templ, M.: Statistical disclosure control for microdata using the R-package sdcMicro. Transactions on Data Privacy 1(2), 67–85 (2008)

    MathSciNet  Google Scholar 

  5. Templ, M., Petelin, T.: A graphical user interface for microdata protection which provides reproducibility and interactions: the sdcMicro GUI. Transactions on Data Privacy 2, 207–223 (2009)

    MathSciNet  Google Scholar 

  6. Templ, M., Meindl, B.: Practical applications in statistical disclosure control using R. In: Nin, J., Herranz, J. (eds.) Privacy and Anonymity in Information Management Systems. Advanced Information and Knowledge Processing, pp. 31–62. Springer, London (2010), doi:10.1007/978-1-84996-238-4_3

    Chapter  Google Scholar 

  7. Meindl, B., Templ, M., Kowarik, A.: Drafting of guidelines for the anonymization of microdata using R-package sdcMicro. Project: Relative to the Testing of SDC Algorithms and Provision of Practical SDC Deliverable 3, Reference 0500000897, MEHLB (2011), data analysis OG (2012)

    Google Scholar 

  8. Manning, A., Haglin, D., Keane, J.: A recursive search algorithm for statistical disclosure assessment. Data Mining and Knowledge Discovery 16, 165–196 (2008), doi:10.1007/s10618-007-0078-6

    Article  MathSciNet  Google Scholar 

  9. Eddelbuettel, D., François, R.: Rcpp: Seamless R and C++ integration. Journal of Statistical Software 40(8), 1–18 (2011)

    Google Scholar 

  10. Eddelbuettel, D., Francois, R., with contributions by Bates, D., Chambers, J.: Rcpp: Seamless R and C++ Integration (2011), R package version 0.9.2

    Google Scholar 

  11. Implementing the Post Randomisation method to the Individual Sample of Anonymised Records (SAR) from the 2001 Census. In: Meeting on the Samples of Anonymised Records from the 2001 Census (2004)

    Google Scholar 

  12. Alfons, A., Kraft, S., Templ, M., Filzmoser, P.: Simulation of close-to-reality population data for household surveys with application to EU-SILC. Statistical Methods & Applications 20(3), 383–407 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  13. Templ, M.: Estimators and model predictions from the structural earnings survey for benchmarking statistical disclosure methods. Technical report, E107 - Institut für Statistik und Wahrscheinlichkeitstheorie; Technische Universität Wien (2011)

    Google Scholar 

  14. Templ, M., Alfons, A.: Standard methods for point estimation of social inclusion indicators using the R package laeken. Research Report CS-2011-1, Department of Statistics and Probability Theory, Vienna University of Technology (2011)

    Google Scholar 

  15. Templ, M., Meindl, B.: Robustification of Microdata Masking Methods and the Comparison with Existing Methods. In: Domingo-Ferrer, J., Saygın, Y. (eds.) PSD 2008. LNCS, vol. 5262, pp. 113–126. Springer, Heidelberg (2008), doi:10.1007/978-3-540-87471-3_15.

    Chapter  Google Scholar 

  16. Manning, A., Haglin, D.: A new algorithm for finding minimal sample uniques for use in statistical disclosure assessment. In: ICDM, pp. 290–297. IEEE Computer Society (2005)

    Google Scholar 

  17. Kooiman, P., Willenbourg, L., Gouweleeuw, J.: A method for disclosure limitation of microdata. Technical report, Research paper 9705, Statistics Netherlands, Voorburg (2002)

    Google Scholar 

  18. Gouweleeuw, J., Kooiman, P., Willenborg, L., De Wolf, P.P.: Post randomisation for statistical disclosure control: Theory and implementation. Journal of Official Statistics 14(4), 463–478 (1998)

    Google Scholar 

  19. Domingo-Ferrer, J., Mateo-Sanz, J.M.: Practical data-oriented microaggregation for statistical disclosure control. IEEE Trans. on Knowl. and Data Eng. 14(1), 189–201 (2002)

    Article  Google Scholar 

  20. Dalenius, T., Reiss, S.: Data-swapping: A technique for disclosure control. In: Proceedings of the Section on Survey Research Methods, vol. 6, pp. 73–85. American Statistical Association (1982)

    Google Scholar 

  21. Moore, J.: Controlled data-swapping techniques for masking public use microdata. Statistical Research Division Report Series RR 96-04, U.S. Bureau of the Census (1996)

    Google Scholar 

  22. Benedetti, R., Franconi, L.: Statistical and technological solutions for controlled data dissemination. In: Pre-Proceedings of New Techniques and Technologies for Statistics, pp. 225–232 (1998)

    Google Scholar 

  23. Capobianchi, A., Polettini, S., Lucarelli, M.: Strategy for the implementation of individual risk methodology into μ-ARGUS. Technical report, Report for the CASC project. No: 1.2-D1 (2001)

    Google Scholar 

  24. Fox, J.: The R Commander: A basic statistics graphical user interface to R. Journal of Statistical Software 14(9), 1–42 (2005)

    Google Scholar 

  25. Templ, M., Alfons, A.: Disclosure Risk of Synthetic Population Data with Application in the Case of EU-SILC. In: Domingo-Ferrer, J., Magkos, E. (eds.) PSD 2010. LNCS, vol. 6344, pp. 174–186. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kowarik, A., Templ, M., Meindl, B., Fonteneau, F., Prantner, B. (2012). Testing of IHSN C++ Code and Inclusion of New Methods into sdcMicro. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33627-0_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33626-3

  • Online ISBN: 978-3-642-33627-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics