Skip to main content

A Computational Evaluation of Optimization Solvers for CTA

  • Conference paper
Privacy in Statistical Databases (PSD 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7556))

Included in the following conference series:

  • 918 Accesses

Abstract

Minimum-distance controlled tabular adjustment methods (CTA), an its variants, are considered an emerging perturbative approach for tabular data protection. Given a table to be protected, the purpose of CTA is to find the closest table that guarantees protection levels for the sensitive cells. We consider the most general CTA formulation which includes binary variables, thus providing protected tables with a higher data utility, at the expense of a larger solution time. The resulting model is a Mixed Integer Linear Problem (MILP). The purpose of this work is twofold. First, it presents and describes the main features of a package for CTA which is linked to both commercial (Cplex and Xpress) and open-source (Glpk, Cbc and Symphony ) MILP solvers. The particular design of the package allows easy integration with additional solvers. The second objective is to perform a computational evaluation of the above two commercial and three open-source MILP solvers for CTA, using both standard instances in the literature and real-world ones. Users of tabular data confidentiality techniques in National Statistical Agencies may find this information useful for the trade-off between the (more efficient but expensive) commercial and the (slower but free) open-source MILP solvers.

Supported by grants MTM2009-08747 of the Spanish Ministry of Science and Innovation, SGR-2009-1122 of the Government of Catalonia, and INFRA-2010-262608 of the European Union.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Castro, J.: Minimum-distance controlled perturbation methods for large-scale tabular data protection. European Journal of Operational Research 171, 39–52 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  2. Castro, J.: A shortest paths heuristic for statistical disclosure control in positive tables. INFORMS Journal on Computing 19, 520–533 (2007)

    Article  Google Scholar 

  3. Castro, J.: Extending controlled tabular adjustment for non-additive tabular data with negative protection levels. Statistics and Operations Research Transactions–SORT 35, 3–20 (2011)

    Google Scholar 

  4. Castro, J.: Recent advances in optimization techniques for statistical tabular data protection. European Journal of Operational Research 216, 257–269 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  5. Castro, J., Giessing, S.: Testing variants of minimum distance controlled tabular adjustment. In: Monographs of Official Statistics, pp. 333–343. Eurostat-Office for Official Publications of the European Communities, Luxembourg (2006)

    Google Scholar 

  6. Castro, J., González, J.A., Baena, D.: User’s and programmer’s manual of the RCTA package. Technical Report DR 2009-01, Dept. of Statistics and Operations Research, Universitat Politècnica de Catalunya (2009)

    Google Scholar 

  7. Castro, J., González, J.A.: A Tool for Analyzing and Fixing Infeasible RCTA Instances. In: Domingo-Ferrer, J., Magkos, E. (eds.) PSD 2010. LNCS, vol. 6344, pp. 17–28. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  8. Castro, J., González, J.A.: Present and future research on controlled tabular adjustment. In: Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality (2011), http://www.unece.org/fileadmin/DAM/stats/documents/ece/ces/ge.46/2011/48_Castro-Gonzalez.pdf

  9. Dandekar, R.A., Cox, L.H.: Synthetic tabular Data: an alternative to complementary cell suppression, manuscript, Energy Information Administration, U.S. (2002)

    Google Scholar 

  10. Fourer, R., Gay, D.M., Kernighan, D.W.: AMPL: A Modeling Language for Mathematical Programming. Duxbury Press (2002)

    Google Scholar 

  11. Giessing, S., Hundepool, A., Castro, J.: Rounding methods for protecting EU-aggregates. In: Eurostat Methodologies and Working Papers. Worksession on Statistical Data Confidentiality, pp. 255–264. Eurostat-Office for Official Publications of the European Communities, Luxembourg (2009)

    Google Scholar 

  12. Hundepool, A., Domingo-Ferrer, J., Franconi, L., Giessing, S., Lenz, R., Naylor, J., Schulte-Nordholt, E., Seri, G., de Wolf, P.P.: Handbook on Statistical Disclosure Control (v. 1.2), Network of Excellence in the European Statistical System in the field of Statistical Disclosure Control (2010), http://neon.vb.cbs.nl/casc/SDC_Handbook.pdf

  13. Kelly, J.P., Golden, B.L., Assad, A.A.: Cell suppression: disclosure protection for sensitive tabular data. Networks 22, 28–55 (1992)

    Article  Google Scholar 

  14. Lougee-Heimer, R.: The Common Optimization INterface for Operations Research. IBM Journal of Research and Development 47, 57–66 (2003)

    Article  Google Scholar 

  15. Stroustrup, B.: The C++ Programming Language. Addison-Wesley (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Castro, J. (2012). A Computational Evaluation of Optimization Solvers for CTA. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33627-0_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33626-3

  • Online ISBN: 978-3-642-33627-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics