Skip to main content

Statistical Disclosure Control in Tabular Data

  • Chapter
  • First Online:

Part of the book series: Advanced Information and Knowledge Processing ((AI&KP))

Abstract

Data disseminated by National Statistical Agencies (NSAs) can be classified as either microdata or tabular data. Tabular data are obtained from microdata by crossing one or more categorical variables. Although cell tables provide aggregated information, they also need to be protected. This chapter is a short introduction to tabular data protection. It contains three main sections. The first one shows the different types of tables that can be obtained and how they are modeled. The second describes the practical rules for detection of sensitive cells that are used by NSAs. Finally, an overview of protection methods is provided, with a particular focus on two of them: “cell suppression problem” and “controlled tabular adjustment.”

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ahuja R.K., Magnanti T.L., and Orlin J.B. Network Flows. Theory, Algorithms and Applications, Prentice Hall, Upper Saddle River, NJ, 1993.

    MATH  Google Scholar 

  2. Bacharach M. Matrix rounding problems, Management Science, 9:732–742, 1966.

    Article  MathSciNet  Google Scholar 

  3. Benders J.F. Partitioning procedures for solving mixed-variables programming problems, Computational Management Science, 2:3–19, 2005. English translation of the original paper appeared in Numerische Mathematik, 4:238–252 (1962).

    Google Scholar 

  4. Carvalho F.D., Dellaert N.P., and Osório M.D. Statistical disclosure in two-dimensional tables: general tables. Journal of the American Statistical Association, 89:1547–1557, 1994.

    Article  MathSciNet  MATH  Google Scholar 

  5. Castro J. Network flows heuristics for complementary cell suppression: an empirical evaluation and extensions, Lecture Notes in Computer Science, 2316:59–73, 2002.

    Article  Google Scholar 

  6. Castro J. A fast network flows heuristic for cell suppression in positive tables. Lecture Notes in Computer Science, 3050:136–148, 2004.

    Article  Google Scholar 

  7. Castro J. Quadratic interior-point methods in statistical disclosure control. Computational Management Science, 2(2):107–121, 2005.

    Article  MathSciNet  MATH  Google Scholar 

  8. Castro J. Minimum-distance controlled perturbation methods for large-scale tabular data protection. European Journal of Operational Research, 171:39–52, 2006.

    Article  MathSciNet  MATH  Google Scholar 

  9. Castro J. A shortest-paths heuristic for statistical data protection in positive tables. INFORMS Journal on Computing, 19(4):520–533, 2007.

    Article  Google Scholar 

  10. Castro J. An interior-point approach for primal block-angular problems. Computational Optimization and Applications, 36:195–219, 2007.

    Article  MathSciNet  MATH  Google Scholar 

  11. Castro J. and Baena D. Using a mathematical programming modeling language for optimal CTA. Lecture Notes in Computer Science, 5262:1–12, 2008.

    Article  Google Scholar 

  12. Castro J. and Giessing S. Testing variants of minimum distance controlled tabular adjustment. In: Monographs of Official Statistics. Work session on Statistical Data Confidentiality, Eurostat-Office for Official Publications of the European Communities, Luxembourg, pp. 333–343, 2006. ISBN 92-79-01108-1.

    Google Scholar 

  13. Castro J., González A., and Baena D. User’s and programmer’s manual of the RCTA package, Technical Report DR 2009/01, Dept. of Statistics and Operations Research, Universitat Politécnica de Catalunya, 2009.

    Google Scholar 

  14. Cox L.H. Network models for complementary cell suppression. Journal of the American Statistical Association, 90:1453–1462, 1995.

    Article  MATH  Google Scholar 

  15. Cox L.H. and George J.A. Controlled rounding for tables with subtotals. Annals of Operations Research, 20:141–157, 1989.

    Article  MathSciNet  MATH  Google Scholar 

  16. Cox L.H., Kelly J.P., and Patil R. Computational aspects of controlled tabular adjustment: algorithm and analysis. In: The Next Wave in Computer, Optimization and Decision Technologies, (eds. B. Golden, S. Raghavan, and E. Wassil), Kluwer, Boston, MA, pp. 45–59, 2002.

    Google Scholar 

  17. Dandekar R.A. and Cox L.H. Synthetic tabular data: An alternative to complementary cell suppression, manuscript, Energy Information Administration, US Department of Energy, Washington, DC, 2002.

    Google Scholar 

  18. Dellaert N.P. and Luijten W.A. Statistical disclosure in general three-dimensional tables. Statistica Neerlandica, 53:197–221, 1999.

    Article  MathSciNet  MATH  Google Scholar 

  19. de Wolf P.P. HiTaS: A heuristic approach to cell suppression in hierarchical tables. Lecture Notes in Computer Science, 2316:74–82, 2002.

    Article  Google Scholar 

  20. Domingo-Ferrer J. and Franconi L. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 4302, Springer, Berlin, 2006.

    Google Scholar 

  21. Domingo-Ferrer J. and Saigin Y. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 5262, Springer, Berlin, 2008.

    MATH  Google Scholar 

  22. Domingo-Ferrer J. and Torra V. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 3050, Springer, Berlin, 2004.

    Google Scholar 

  23. Fischetti M. and Salazar-González J.J. Models and algorithms for the 2-dimensional cell suppression problem in statistical disclosure control. Mathematical Programming, 84:283–312, 1999.

    MathSciNet  MATH  Google Scholar 

  24. Fischetti M. and Salazar-González J.J. Solving the cell suppression problem on tabular data with linear constraints. Management Science, 47:1008–1026, 2001.

    Article  MATH  Google Scholar 

  25. Giessing S. and Repsilber D. Tools and strategies to protect multiple tables with the GHQUAR cell suppression engine. Lecture Notes in Computer Science, 2316:181–192, 2002.

    Article  MathSciNet  Google Scholar 

  26. Hundepool A., van de Wetering A., Ramaswamy R., de Wolf P.P., Giessing S., Fischetti M., Salazar-González J.J., Castro J., and Lowthian P. τ-Argus User’s Manual, Statistics Netherlands, The Netherlands 2007.

    Google Scholar 

  27. Hundepool A., Domingo-Ferrer J., Franconi L., Giessing S., Lenz R., Longhurst J., Schulte-Nordholt E., Giovanni Seri, and de Wolf P.P. Handbook on Statistical Disclosure Control, CENEX SDC. Available on-line at http://neon.vb.cbs.nl/casc/.\SDC_Handbook.pdf, 2007. Accessed date: 06/04/2010.

  28. Kelly J.P., Golden B.L., and Assad A.A. Cell suppression: Disclosure protection for sensitive tabular data. Networks, 22:28–55, 1992.

    Article  Google Scholar 

  29. Robertson D.A. and Ethier R. Cell suppression: Experience and theory. Lecture Notes in Computer Science, 2316:8–20, 2002.

    Article  MathSciNet  Google Scholar 

  30. Salazar-González J.J. Controlled rounding and cell perturbation: Statistical disclosure limitation methods for tabular data. Mathematical Programming, 105:583–603, 2006.

    Article  MathSciNet  MATH  Google Scholar 

  31. Salazar-González J.J. Statistical confidentiality: Optimization techniques to protect tables. Computers and Operations Research, 35:1638–1651, 2008.

    Article  Google Scholar 

  32. Willenborg L. and de Waal T. (eds.). Elements of Statistical Disclosure Control Lecture Notes in Statistics, vol 155, Springer, New York, 2000.

    Google Scholar 

Download references

Acknowledgments

This work has been supported by grant MTM2006-05550 of the Spanish Ministry of Science and Education.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jordi Castro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer London

About this chapter

Cite this chapter

Castro, J. (2010). Statistical Disclosure Control in Tabular Data. In: Nin, J., Herranz, J. (eds) Privacy and Anonymity in Information Management Systems. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84996-238-4_6

Download citation

  • DOI: https://doi.org/10.1007/978-1-84996-238-4_6

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84996-237-7

  • Online ISBN: 978-1-84996-238-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics