Toward Privacy in Public Databases

  • Shuchi Chawla
  • Cynthia Dwork
  • Frank McSherry
  • Adam Smith
  • Hoeteck Wee
Conference paper

DOI: 10.1007/978-3-540-30576-7_20

Part of the Lecture Notes in Computer Science book series (LNCS, volume 3378)
Cite this paper as:
Chawla S., Dwork C., McSherry F., Smith A., Wee H. (2005) Toward Privacy in Public Databases. In: Kilian J. (eds) Theory of Cryptography. TCC 2005. Lecture Notes in Computer Science, vol 3378. Springer, Berlin, Heidelberg

Abstract

We initiate a theoretical study of the census problem. Informally, in a census individual respondents give private information to a trusted party (the census bureau), who publishes a sanitized version of the data. There are two fundamentally conflicting requirements: privacy for the respondents and utility of the sanitized data. Unlike in the study of secure function evaluation, in which privacy is preserved to the extent possible given a specific functionality goal, in the census problem privacy is paramount; intuitively, things that cannot be learned “safely” should not be learned at all.

An important contribution of this work is a definition of privacy (and privacy compromise) for statistical databases, together with a method for describing and comparing the privacy offered by specific sanitization techniques. We obtain several privacy results using two different sanitization techniques, and then show how to combine them via cross training. We also obtain two utility results involving clustering.

Download to read the full conference paper text

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Shuchi Chawla
    • 1
  • Cynthia Dwork
    • 2
  • Frank McSherry
    • 2
  • Adam Smith
    • 3
  • Hoeteck Wee
    • 4
  1. 1.Carnegie Mellon University 
  2. 2.Microsoft Research SVC 
  3. 3.Weizmann Institute of Science 
  4. 4.University of CaliforniaBerkeley

Personalised recommendations