Abstract
Minimum distance controlled tabular adjustment (CTA) is an emerging perturbative method of statistical disclosure control for tabular data. The goal of CTA is to find the closest safe table to some original tabular data with sensitive information. Closeness is usually measured by ℓ1 or ℓ2 distances. Distance ℓ1 provides solutions with a smaller ℓ0 norm than ℓ2 (i.e., with a lesser number of changes with respect to the original table). However the optimization problem formulated with ℓ2 requires half the number of variables than that for ℓ1, and it is more efficiently solved. In this work a pseudo-Huber function (which is a continuous nonlinear approximation of the Huber function) is considered to measure the distance between the original and protected tables. This pseudo-Huber function approximates ℓ1 but can be formulated with the same number of variables than ℓ2. It results in a nonlinear convex optimization problem which, theoretically, can be solved in polynomial time. Some preliminary results using the Huber-CTA model are reported.
Supported by grants MTM2012-31440 of the Spanish Ministry of Economy and Competitiveness, SGR-2014-542 of the Government of Catalonia, and DwB INFRA-2010-262608 of the FP7 European Union Program.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Andersen, E.D.: Personal communication (2014)
Andersen, E.D., Andersen, K.D.: The MOSEK interior point optimizer for linear programming: an implementation of the homogeneous algorithm. In: Frenk, H., Roos, K., Terlaky, T., Zhang, S. (eds.) High Performance Optimization, pp. 197–232. Kluwer (2000)
Castro, J.: Minimum-distance controlled perturbation methods for large-scale tabular data protection. European Journal of Operational Research 171, 39–52 (2006)
Castro, J.: An interior-point approach for primal block-angular problems. Computational Optimization and Applications 36, 195–219 (2007)
Castro, J.: Recent advances in optimization techniques for statistical tabular data protection. European Journal of Operational Research 216, 257–269 (2012)
Castro, J.: On assessing the disclosure risk of controlled adjustment methods for statistical tabular data. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 20, 921–941 (2012)
Castro, J., Cuesta, J.: Quadratic regularizations in an interior-point method for primal block-angular problems. Mathematical Programming 130, 415–445 (2011)
Castro, J., Cuesta, J.: Solving L 1-CTA in 3D tables by an interior-point method for primal block-angular problems. TOP 21, 25–47 (2013)
Castro, J., Frangioni, A., Gentile, C.: Perspective reformulations of the CTA problem with L 2 distances. Operations Research (in press, 2014)
Castro, J., Giessing, S.: Testing variants of minimum distance controlled tabular adjustment. Monographs of Official Statistics, pp. 333–343. Eurostat-Office for Official Publications of the European Communities, Luxembourg (2006)
Castro, J., González, J.A.: Assessing the information loss of controlled adjustment methods in two-way tables. In: Domingo-Ferrer, J. (ed.) PSD 2014. LNCS, vol. 8744, pp. 11–23. Springer, Heidelberg (2014)
Castro, J., González, J.A.: A fast CTA method without the complicating binary decisions. Documents of the Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality, Statistics Canada, Ottawa, pp. 1–7 (2013)
Castro, J., González, J.A.: A multiobjective LP approach for controlled tabular adjustment in statistical disclosure control. Working paper, Dept. of Statistics and Operations Research, Universitat Politècnica de Catalunya (2014)
Dandekar, R.A., Cox, L.H.: Synthetic tabular Data: an alternative to complementary cell suppression, manuscript, Energy Information Administration, U.S. (2002)
Fountoulakis, K., Gondzio, J.: A second-order method for strongly convex L1-regularization problems. Technical Report ERGO-14-005, School of Mathematics, The University of Edinburgh (2014)
Huber, P.J.: Robust estimation of a location parameter. Annals of Statistics 53, 73–101 (1964)
Hundepool, A., Domingo-Ferrer, J., Franconi, L., Giessing, S., Schulte-Nordholt, E., Spicer, K., de Wolf, P.P.: Statistical Disclosure Control. Wiley, Chichester (2012)
Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course. Kluwer, Boston (2004)
Nesterov, Y., Nemirovskii, A.: Interior-Point Polynomial Algorithms in Convex Programming. SIAM, Philadelphia (1994)
Wright, S.J.: Primal-Dual Interior-Point Methods. SIAM, Philadelphia (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Castro, J. (2014). A CTA Model Based on the Huber Function. In: Domingo-Ferrer, J. (eds) Privacy in Statistical Databases. PSD 2014. Lecture Notes in Computer Science, vol 8744. Springer, Cham. https://doi.org/10.1007/978-3-319-11257-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-11257-2_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11256-5
Online ISBN: 978-3-319-11257-2
eBook Packages: Computer ScienceComputer Science (R0)