Abstract
Data disseminated by National Statistical Agencies (NSAs) can be classified as either microdata or tabular data. Tabular data are obtained from microdata by crossing one or more categorical variables. Although cell tables provide aggregated information, they also need to be protected. This chapter is a short introduction to tabular data protection. It contains three main sections. The first one shows the different types of tables that can be obtained and how they are modeled. The second describes the practical rules for detection of sensitive cells that are used by NSAs. Finally, an overview of protection methods is provided, with a particular focus on two of them: “cell suppression problem” and “controlled tabular adjustment.”
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ahuja R.K., Magnanti T.L., and Orlin J.B. Network Flows. Theory, Algorithms and Applications, Prentice Hall, Upper Saddle River, NJ, 1993.
Bacharach M. Matrix rounding problems, Management Science, 9:732–742, 1966.
Benders J.F. Partitioning procedures for solving mixed-variables programming problems, Computational Management Science, 2:3–19, 2005. English translation of the original paper appeared in Numerische Mathematik, 4:238–252 (1962).
Carvalho F.D., Dellaert N.P., and Osório M.D. Statistical disclosure in two-dimensional tables: general tables. Journal of the American Statistical Association, 89:1547–1557, 1994.
Castro J. Network flows heuristics for complementary cell suppression: an empirical evaluation and extensions, Lecture Notes in Computer Science, 2316:59–73, 2002.
Castro J. A fast network flows heuristic for cell suppression in positive tables. Lecture Notes in Computer Science, 3050:136–148, 2004.
Castro J. Quadratic interior-point methods in statistical disclosure control. Computational Management Science, 2(2):107–121, 2005.
Castro J. Minimum-distance controlled perturbation methods for large-scale tabular data protection. European Journal of Operational Research, 171:39–52, 2006.
Castro J. A shortest-paths heuristic for statistical data protection in positive tables. INFORMS Journal on Computing, 19(4):520–533, 2007.
Castro J. An interior-point approach for primal block-angular problems. Computational Optimization and Applications, 36:195–219, 2007.
Castro J. and Baena D. Using a mathematical programming modeling language for optimal CTA. Lecture Notes in Computer Science, 5262:1–12, 2008.
Castro J. and Giessing S. Testing variants of minimum distance controlled tabular adjustment. In: Monographs of Official Statistics. Work session on Statistical Data Confidentiality, Eurostat-Office for Official Publications of the European Communities, Luxembourg, pp. 333–343, 2006. ISBN 92-79-01108-1.
Castro J., González A., and Baena D. User’s and programmer’s manual of the RCTA package, Technical Report DR 2009/01, Dept. of Statistics and Operations Research, Universitat Politécnica de Catalunya, 2009.
Cox L.H. Network models for complementary cell suppression. Journal of the American Statistical Association, 90:1453–1462, 1995.
Cox L.H. and George J.A. Controlled rounding for tables with subtotals. Annals of Operations Research, 20:141–157, 1989.
Cox L.H., Kelly J.P., and Patil R. Computational aspects of controlled tabular adjustment: algorithm and analysis. In: The Next Wave in Computer, Optimization and Decision Technologies, (eds. B. Golden, S. Raghavan, and E. Wassil), Kluwer, Boston, MA, pp. 45–59, 2002.
Dandekar R.A. and Cox L.H. Synthetic tabular data: An alternative to complementary cell suppression, manuscript, Energy Information Administration, US Department of Energy, Washington, DC, 2002.
Dellaert N.P. and Luijten W.A. Statistical disclosure in general three-dimensional tables. Statistica Neerlandica, 53:197–221, 1999.
de Wolf P.P. HiTaS: A heuristic approach to cell suppression in hierarchical tables. Lecture Notes in Computer Science, 2316:74–82, 2002.
Domingo-Ferrer J. and Franconi L. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 4302, Springer, Berlin, 2006.
Domingo-Ferrer J. and Saigin Y. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 5262, Springer, Berlin, 2008.
Domingo-Ferrer J. and Torra V. (eds.). Privacy in Statistical Databases Lecture Notes in Computer Science, vol. 3050, Springer, Berlin, 2004.
Fischetti M. and Salazar-González J.J. Models and algorithms for the 2-dimensional cell suppression problem in statistical disclosure control. Mathematical Programming, 84:283–312, 1999.
Fischetti M. and Salazar-González J.J. Solving the cell suppression problem on tabular data with linear constraints. Management Science, 47:1008–1026, 2001.
Giessing S. and Repsilber D. Tools and strategies to protect multiple tables with the GHQUAR cell suppression engine. Lecture Notes in Computer Science, 2316:181–192, 2002.
Hundepool A., van de Wetering A., Ramaswamy R., de Wolf P.P., Giessing S., Fischetti M., Salazar-González J.J., Castro J., and Lowthian P. τ-Argus User’s Manual, Statistics Netherlands, The Netherlands 2007.
Hundepool A., Domingo-Ferrer J., Franconi L., Giessing S., Lenz R., Longhurst J., Schulte-Nordholt E., Giovanni Seri, and de Wolf P.P. Handbook on Statistical Disclosure Control, CENEX SDC. Available on-line at http://neon.vb.cbs.nl/casc/.\SDC_Handbook.pdf, 2007. Accessed date: 06/04/2010.
Kelly J.P., Golden B.L., and Assad A.A. Cell suppression: Disclosure protection for sensitive tabular data. Networks, 22:28–55, 1992.
Robertson D.A. and Ethier R. Cell suppression: Experience and theory. Lecture Notes in Computer Science, 2316:8–20, 2002.
Salazar-González J.J. Controlled rounding and cell perturbation: Statistical disclosure limitation methods for tabular data. Mathematical Programming, 105:583–603, 2006.
Salazar-González J.J. Statistical confidentiality: Optimization techniques to protect tables. Computers and Operations Research, 35:1638–1651, 2008.
Willenborg L. and de Waal T. (eds.). Elements of Statistical Disclosure Control Lecture Notes in Statistics, vol 155, Springer, New York, 2000.
Acknowledgments
This work has been supported by grant MTM2006-05550 of the Spanish Ministry of Science and Education.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer London
About this chapter
Cite this chapter
Castro, J. (2010). Statistical Disclosure Control in Tabular Data. In: Nin, J., Herranz, J. (eds) Privacy and Anonymity in Information Management Systems. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84996-238-4_6
Download citation
DOI: https://doi.org/10.1007/978-1-84996-238-4_6
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-84996-237-7
Online ISBN: 978-1-84996-238-4
eBook Packages: Computer ScienceComputer Science (R0)