Theory of Computing Systems

, Volume 57, Issue 4, pp 1159–1201 | Cite as

Lower Bounds on the Error of Query Sets Under the Differentially-Private Matrix Mechanism

Article
  • 123 Downloads

Abstract

A common goal of privacy research is to release synthetic data that satisfies a formal privacy guarantee and can be used by an analyst in place of the original data. To achieve reasonable accuracy, a synthetic data set must be tuned to support a specified set of queries accurately, sacrificing fidelity for other queries. This work considers methods for producing synthetic data under differential privacy and investigates what makes a set of queries “easy” or “hard” to answer. We consider this issue in the particular case of answering sets of linear counting queries using the matrix mechanism (Li et al. 2010), a recent differentially-private mechanism that can reduce error by adding complex correlated noise adapted to a specified workload. Our main result is a novel lower bound on the minimum total error required to simultaneously release answers to a set of workload queries when using the matrix mechanism. The bound reveals that the hardness of a query workload is related to the spectral properties of the workload when it is represented in matrix form. Under (𝜖, δ)-differential privacy, we prove that this bound is tight for many common workloads such as the set of all predicate queries and the set of all k-way marginals. Our empirical study also indicates this bound is close-to-tight on workloads consisting of random interval queries or random marginals.

Keywords

Differential privacy Linear query Minimal error Spectral decomposition Lower bound 

References

  1. 1.
    Ács, G., Castelluccia, C., Chen, R.: Differentially private histogram publishing through lossy compression. In: ICDM, pp. 1–10 (2012)Google Scholar
  2. 2.
    Barak, B., Chaudhuri, K., Dwork, C., Kale, S., McSherry, F., Talwar, K.: Privacy, accuracy, and consistency too: A holistic solution to contingency table release. In: PODS (2007)Google Scholar
  3. 3.
    Ben-Israel, A., Greville, T.: Generalized inverses: Theory and applications, vol. 15. Springer (2003)Google Scholar
  4. 4.
    Bhaskara, A., Dadush, D., Krishnaswamy, R., Talwar, K.: Unconditional differentially private mechanisms for linear queries. In: STOC, pp. 1269–1284, New York, NY, USA (2012)Google Scholar
  5. 5.
    Blum, A., Ligett, K., Roth, A.: A learning theory approach to non-interactive database privacy. In: STOC, pp. 609–618 (2008)Google Scholar
  6. 6.
    Cormode, G., Procopiuc, M., Shen, E., Srivastava, D., Yu, T.: Differentially private spatial decompositions. ICDE, 20–31 (2012)Google Scholar
  7. 7.
    Ding, B., Winslett, M., Han, J., Li, Z.: Differentially private data cubes: optimizing noise sources and consistency. In: SIGMOD, pp. 217–228 (2011)Google Scholar
  8. 8.
    Dinur, I., Nissim, K.: Revealing information while preserving privacy. In: PODS, pp. 202–210 (2003)Google Scholar
  9. 9.
    Dwork, C., Kenthapadi, K., McSherry, F., Mironov, I., Naor, M.: Our data, ourselves: Privacy via distributed noise generation. In: EUROCRYPT, pp. 486–503 (2006)Google Scholar
  10. 10.
    Dwork, C., McSherry, F., Nissim, K., Smith, A.: Calibrating noise to sensitivity in private data analysis. In: TCC, pp. 265–284 (2006)Google Scholar
  11. 11.
    Dwork, C., Naor, M., Reingold, O., Rothblum, G., Vadhan, S.: On the complexity of differentially private data release: efficient algorithms and hardness results. In: STOC, pp. 381–390 (2009)Google Scholar
  12. 12.
    Dwork, C., Rothblum, G.N., Vadhan, S.P.: Boosting and differential privacy. In: FOCS, pp. 51–60 (2010)Google Scholar
  13. 13.
    Fawaz, N., Muthukrishnan, S., Nikolov, A.: Nearly optimal private convolution. In: ESA, pp. 445–456 (2013)Google Scholar
  14. 14.
    Fulton, W.: Eigenvalues, invariant factors, highest weights, and schubert calculus. Bulletin of the AMS 37(3), 209–250 (2000)MATHMathSciNetCrossRefGoogle Scholar
  15. 15.
    Gray, R.M.: Toeplitz and circulant matrices: A review. Now Pub (2006)Google Scholar
  16. 16.
    Gupta, A., Roth, A., Ullman, J.: Iterative constructions and private data release. In: TCC, pp. 339–356 (2012)Google Scholar
  17. 17.
    Hardt, M., Ligett, K., McSherry, F.: A simple and practical algorithm for differentially private data release. In: NIPS, pp. 2348–2356 (2012)Google Scholar
  18. 18.
    Hardt, M., Rothblum, G.: A multiplicative weights mechanism for privacy-preserving data analysis. In: FOCS, pp. 61–70 (2010)Google Scholar
  19. 19.
    Hardt, M., Talwar, K.: On the geometry of differential privacy. In: STOC, pp. 705–714 (2010)Google Scholar
  20. 20.
    Hay, M., Rastogi, V., Miklau, G., Suciu, D.: Boosting the accuracy of differentially-private histograms through consistency. PVLDB 3(1-2), 1021–1032 (2010)Google Scholar
  21. 21.
    Kasiviswanathan, S., Rudelson, M., Smith, A., Ullman, J.: The price of privately releasing contingency tables and the spectra of random matrices with correlated rows. In: STOC, pp. 775–784 (2010)Google Scholar
  22. 22.
    Li, C., Hay, M., Rastogi, V., Miklau, G., McGregor, A.: Optimizing linear counting queries under differential privacy. In: PODS, pp. 123–134 (2010)Google Scholar
  23. 23.
    Li, C., Miklau, G.: An adaptive mechanism for accurate query answering under differential privacy. PVLDB 5(6), 514–525 (2012)Google Scholar
  24. 24.
    Li, Y.D., Zhang, Z., Winslett, M., Yang, Y.: Compressive mechanism: utilizing sparse representation in differential privacy. In: WPES, pp. 177–182 (2011)Google Scholar
  25. 25.
    McSherry, F., Mironov, I.: Differentially Private Recommender Systems: Building Privacy into the Netflix Prize Contenders. In: SIGKDD, pp. 627–636 (2009)Google Scholar
  26. 26.
    Nikolov, A., Talwar, K., Zhang, L.: The geometry of differential privacy: the sparse and approximate cases. In: Proceedings of the 45th annual ACM symposium on Symposium on theory of computing, STOC ’13, pp. 351–360 (2013)Google Scholar
  27. 27.
    Nissim, K., Raskhodnikova, S., Smith, A.: Smooth sensitivity and sampling in private data analysis. In: STOC, pp. 75–84 (2007)Google Scholar
  28. 28.
    Rastogi, V., Suciu, D., Hong, S.: The boundary between privacy and utility in data publishing. In: VLDB, pp. 531–542 (2007)Google Scholar
  29. 29.
    Roth, A., Roughgarden, T.: Interactive privacy via the median mechanism. In: STOC, pp. 765–774 (2010)Google Scholar
  30. 30.
    Xiao, X., Wang, G., Gehrke, J.: Differential privacy via wavelet transforms. In: ICDE, pp. 225–236 (2010)Google Scholar
  31. 31.
    Xiao, Y., Xiong, L., Yuan, C.: Differentially private data release through multidimensional partitioning. In: SDM, pp. 150–168 (2010)Google Scholar
  32. 32.
    Xu, J., Zhang, Z., Xiao, X., Yang, Y., Yu, G., Winslett, M.: Differentially private histogram publication. The VLDB Journal, 1–26 (2013)Google Scholar
  33. 33.
    Yaroslavtsev, G., Cormode, G., Procopiuc, C. M., Srivastava, D.: Accurate and efficient private release of datacubes and contingency tables. In: ICDE (2013)Google Scholar
  34. 34.
    Yuan, G., Zhang, Z., Winslett, M., Xiao, X., Yang, Y., Hao, Z.: Low-rank mechanism: Optimizing batch queries under differential privacy. PVLDB 5(11), 1136–1147 (2012)Google Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  1. 1.University of MassachusettsAmherstUSA

Personalised recommendations