Skip to main content

A CTA Model Based on the Huber Function

  • Conference paper
Privacy in Statistical Databases (PSD 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8744))

Included in the following conference series:

Abstract

Minimum distance controlled tabular adjustment (CTA) is an emerging perturbative method of statistical disclosure control for tabular data. The goal of CTA is to find the closest safe table to some original tabular data with sensitive information. Closeness is usually measured by ℓ1 or ℓ2 distances. Distance ℓ1 provides solutions with a smaller ℓ0 norm than ℓ2 (i.e., with a lesser number of changes with respect to the original table). However the optimization problem formulated with ℓ2 requires half the number of variables than that for ℓ1, and it is more efficiently solved. In this work a pseudo-Huber function (which is a continuous nonlinear approximation of the Huber function) is considered to measure the distance between the original and protected tables. This pseudo-Huber function approximates ℓ1 but can be formulated with the same number of variables than ℓ2. It results in a nonlinear convex optimization problem which, theoretically, can be solved in polynomial time. Some preliminary results using the Huber-CTA model are reported.

Supported by grants MTM2012-31440 of the Spanish Ministry of Economy and Competitiveness, SGR-2014-542 of the Government of Catalonia, and DwB INFRA-2010-262608 of the FP7 European Union Program.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Andersen, E.D.: Personal communication (2014)

    Google Scholar 

  2. Andersen, E.D., Andersen, K.D.: The MOSEK interior point optimizer for linear programming: an implementation of the homogeneous algorithm. In: Frenk, H., Roos, K., Terlaky, T., Zhang, S. (eds.) High Performance Optimization, pp. 197–232. Kluwer (2000)

    Google Scholar 

  3. Castro, J.: Minimum-distance controlled perturbation methods for large-scale tabular data protection. European Journal of Operational Research 171, 39–52 (2006)

    Article  MATH  MathSciNet  Google Scholar 

  4. Castro, J.: An interior-point approach for primal block-angular problems. Computational Optimization and Applications 36, 195–219 (2007)

    Article  MATH  MathSciNet  Google Scholar 

  5. Castro, J.: Recent advances in optimization techniques for statistical tabular data protection. European Journal of Operational Research 216, 257–269 (2012)

    Article  MATH  MathSciNet  Google Scholar 

  6. Castro, J.: On assessing the disclosure risk of controlled adjustment methods for statistical tabular data. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 20, 921–941 (2012)

    Article  MathSciNet  Google Scholar 

  7. Castro, J., Cuesta, J.: Quadratic regularizations in an interior-point method for primal block-angular problems. Mathematical Programming 130, 415–445 (2011)

    Article  MATH  MathSciNet  Google Scholar 

  8. Castro, J., Cuesta, J.: Solving L 1-CTA in 3D tables by an interior-point method for primal block-angular problems. TOP 21, 25–47 (2013)

    Article  MATH  MathSciNet  Google Scholar 

  9. Castro, J., Frangioni, A., Gentile, C.: Perspective reformulations of the CTA problem with L 2 distances. Operations Research (in press, 2014)

    Google Scholar 

  10. Castro, J., Giessing, S.: Testing variants of minimum distance controlled tabular adjustment. Monographs of Official Statistics, pp. 333–343. Eurostat-Office for Official Publications of the European Communities, Luxembourg (2006)

    Google Scholar 

  11. Castro, J., González, J.A.: Assessing the information loss of controlled adjustment methods in two-way tables. In: Domingo-Ferrer, J. (ed.) PSD 2014. LNCS, vol. 8744, pp. 11–23. Springer, Heidelberg (2014)

    Google Scholar 

  12. Castro, J., González, J.A.: A fast CTA method without the complicating binary decisions. Documents of the Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality, Statistics Canada, Ottawa, pp. 1–7 (2013)

    Google Scholar 

  13. Castro, J., González, J.A.: A multiobjective LP approach for controlled tabular adjustment in statistical disclosure control. Working paper, Dept. of Statistics and Operations Research, Universitat Politècnica de Catalunya (2014)

    Google Scholar 

  14. Dandekar, R.A., Cox, L.H.: Synthetic tabular Data: an alternative to complementary cell suppression, manuscript, Energy Information Administration, U.S. (2002)

    Google Scholar 

  15. Fountoulakis, K., Gondzio, J.: A second-order method for strongly convex L1-regularization problems. Technical Report ERGO-14-005, School of Mathematics, The University of Edinburgh (2014)

    Google Scholar 

  16. Huber, P.J.: Robust estimation of a location parameter. Annals of Statistics 53, 73–101 (1964)

    Article  Google Scholar 

  17. Hundepool, A., Domingo-Ferrer, J., Franconi, L., Giessing, S., Schulte-Nordholt, E., Spicer, K., de Wolf, P.P.: Statistical Disclosure Control. Wiley, Chichester (2012)

    Book  Google Scholar 

  18. Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course. Kluwer, Boston (2004)

    Book  Google Scholar 

  19. Nesterov, Y., Nemirovskii, A.: Interior-Point Polynomial Algorithms in Convex Programming. SIAM, Philadelphia (1994)

    Book  MATH  Google Scholar 

  20. Wright, S.J.: Primal-Dual Interior-Point Methods. SIAM, Philadelphia (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Castro, J. (2014). A CTA Model Based on the Huber Function. In: Domingo-Ferrer, J. (eds) Privacy in Statistical Databases. PSD 2014. Lecture Notes in Computer Science, vol 8744. Springer, Cham. https://doi.org/10.1007/978-3-319-11257-2_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-11257-2_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11256-5

  • Online ISBN: 978-3-319-11257-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics