Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure

Zhang, Lei; Wang, Lingyu; Jajodia, Sushil; Brodsky, Alexander

doi:10.1007/978-3-540-70567-3_15

Lei Zhang¹,
Lingyu Wang²,
Sushil Jajodia¹ &
…
Alexander Brodsky¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5094))

Included in the following conference series:

IFIP Annual Conference on Data and Applications Security and Privacy

1229 Accesses
4 Citations

Abstract

When generalization algorithms are known to the public, an adversary can obtain a more precise estimation of the secret table than what can be deduced from the disclosed generalization result. Therefore, whether a generalization algorithm can satisfy a privacy property should be judged based on such an estimation. In this paper, we show that the computation of the estimation is inherently a recursive process that exhibits a high complexity when generalization algorithms take a straightforward inclusive strategy. To facilitate the design of more efficient generalization algorithms, we suggest an alternative exclusive strategy, which adopts a seemingly drastic approach to eliminate the need for recursion. Surprisingly, the data utility of the two strategies are actually not comparable and the exclusive strategy can provide better data utility in certain cases.

Download to read the full chapter text

Chapter PDF

Partial Domain Theories for Privacy

On-Average KL-Privacy and Its Equivalence to Generalization for Max-Entropy Mechanisms

Correcting Finite Sampling Issues in Entropy l-diversity

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Dobra, A., Feinberg, S.E.: Bounding entries in multi-way contingency tables given a set of marginal totals. In: Foundations of Statistical Inference: Proceedings of the Shoresh Conference 2000, Springer, Heidelberg (2003)
Google Scholar
Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: l-diversity: Privacy beyond k-anonymity. In: Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE 2006) (2006)
Google Scholar
Slavkovic, A., Feinberg, S.E.: Bounds for cell entries in two-way tables given conditional relative frequencies. Privacy in Statistical Databases (2004)
Google Scholar
Dobkin, D.P., Jones, A.K., Lipton, R.J.: Secure databases: Protection against user influence. ACM TODS 4(1), 76–96 (1979)
Article Google Scholar
Du, Y., Xia, T., Tao, Y., Zhang, D., Zhu, F.: On multidimensional k-anonymity with local recoding generalization
Google Scholar
Chin, F.: Security problems on inference control for sum, max, and min queries. J.ACM 33(3), 451–464 (1986)
Article MathSciNet Google Scholar
Aggarwal, G., Feder, T., Kenthapadi, K., Motwani, R., Panigrahy, R., Thomas, D., Zhu, A.: k-anonymity: Algorithms and hardness. Technical report, Stanford University (2004)
Google Scholar
Miklau, G., Suciu, D.: A formal analysis of information disclosure in data exchange. In: SIGMOD (2004)
Google Scholar
Duncan, G.T., Feinberg, S.E.: Obtaining information while preserving privacy: A markov perturbation method for tabular data. In: Joint Statistical Meetings, Anaheim, CA (1997)
Google Scholar
Fellegi, I.P.: On the question of statistical confidentiality. Journal of the American Statistical Association 67(337), 7–18 (1993)
Article MATH Google Scholar
Byun, J., Bertino, E.: Micro-views, or on how to protect privacy while enhancing data usability: concepts and challenges. SIGMOD Record 35(1), 9–13 (2006)
Article Google Scholar
Kleinberg, J., Papadimitriou, C., Raghavan, P.: Auditing boolean attributes. In: PODS (2000)
Google Scholar
Schlorer, J.: Identification and retrieval of personal records from a statistical bank. Methods Info. Med. (1975)
Google Scholar
Kenthapadi, K., Mishra, N., Nissim, K.: Simulatable auditing. In: PODS (2005)
Google Scholar
LeFevre, K., DeWitt, D., Ramakrishnan, R.: Incognito: Efficient fulldomain k-anonymity. In: SIGMOD (2005)
Google Scholar
Cox, L.H.: Solving confidentiality protection problems in tabulations using network optimization: A network model for cell suppression in the u.s. economic censuses. In: Proceedings of the Internatinal Seminar on Statistical Confidentiality (1982)
Google Scholar
Cox, L.H.: New results in disclosure avoidance for tabulations. In: International Statistical Institute Proceedings (1987)
Google Scholar
Cox, L.H.: Suppression, methodology and statistical disclosure control. J. of the American Statistical Association (1995)
Google Scholar
Li, N., Li, T., Venkatasubramanian, S.: t-closeness: Privacy beyond k-anonymity and l-diversity. In: ICDE (2007)
Google Scholar
Sweeney, L.: k-anonymity: a model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-based Systems 10(5), 557–570 (2002)
Article MathSciNet MATH Google Scholar
Meyerson, A., Williams, R.: On the complexity of optimal k-anonymity. In: ACM PODS (2004)
Google Scholar
Adam, N.R., Wortmann, J.C.: Security-control methods for statistical databases: A comparative study. ACM Comput. Surv. 21(4), 515–556 (1989)
Article Google Scholar
Diaconis, P., Sturmfels, B.: Algebraic algorithms for sampling from conditional distributions. Annals of Statistics (1998)
Google Scholar
Samarati, P.: Protecting respondents identities in microdata release. In: IEEE TKDE, pp. 1010–1027 (2001)
Google Scholar
Samarati, P., Sweeney, L.: Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. Technical report, CMU, SRI (1998)
Google Scholar
Bayardo, R.J., Agrawal, R.: Data privacy through optimal k-anonymization. In: ICDE (2005)
Google Scholar
Chawla, S., Dwork, C., McSherry, F., Smith, A., Wee, H.: Toward privacy in public databases. In: Theory of Cryptography Conference (2005)
Google Scholar
Dalenius, T., Reiss, S.: Data swapping: A technique for disclosure control. Journal of Statistical Planning and Inference 6, 73–85 (1982)
Article MathSciNet MATH Google Scholar
Xiao, X., Tao, Y.: Personalized privacy preservation. In: SIGMOD (2006)
Google Scholar
Zhang, L., Jajodia, S., Brodsky, A.: Information disclosure under realistic assumptions: Privacy versus optimality. In: ACM Conference on Computer and Communications Security (CCS) (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Secure Information Systems, George Mason University, Fairfax, VA 22030, USA
Lei Zhang, Sushil Jajodia & Alexander Brodsky
Concordia Institute for Information Systems Engineering, Concordia University, Montreal, QC H3G 1M8, Canada
Lingyu Wang

Authors

Lei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lingyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Sushil Jajodia
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Brodsky
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Vijay Atluri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, L., Wang, L., Jajodia, S., Brodsky, A. (2008). Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure. In: Atluri, V. (eds) Data and Applications Security XXII. DBSec 2008. Lecture Notes in Computer Science, vol 5094. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70567-3_15

Download citation

DOI: https://doi.org/10.1007/978-3-540-70567-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70566-6
Online ISBN: 978-3-540-70567-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure

Abstract

Chapter PDF

Similar content being viewed by others

Partial Domain Theories for Privacy

On-Average KL-Privacy and Its Equivalence to Generalization for Max-Entropy Mechanisms

Correcting Finite Sampling Issues in Entropy l-diversity

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure

Abstract

Chapter PDF

Similar content being viewed by others

Partial Domain Theories for Privacy

On-Average KL-Privacy and Its Equivalence to Generalization for Max-Entropy Mechanisms

Correcting Finite Sampling Issues in Entropy l-diversity

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation