Abowd, J.M., Vilhuber, L.: How protective are synthetic data? In: Domingo-Ferrer, J., Saygın, Y. (eds.) PSD 2008. LNCS, vol. 5262, pp. 239–246. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87471-3_20
CrossRef
Google Scholar
Benedetto, G., Stinson, M., Abowd, J.M.: The creation and use of the SIPP synthetic beta (2013)
Google Scholar
Bowen, C.M., et al.: A synthetic supplemental public use file of low-income information return data: methodology, utility, and privacy implications. In: Domingo-Ferrer, J. Muralidhar, K. (eds.) PSD 2020. LNCS, vol. 12276, pp. 257–270. Springer, Heidelberg (2020)
Google Scholar
Bowen, C.M., Snoke, J.: Comparative study of differentially private synthetic data algorithms from the NIST PSCR differential privacy synthetic data challenge. arXiv preprint arXiv:1911.12704 (2020)
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. CRC Press, Boca Raton (1984)
MATH
Google Scholar
Bryant, V.: General description booklet for the 2012 public use tax file (2017)
Google Scholar
Cilke, J.: The case of the missing strangers: what we know and don’t know about non-filers. In: 107th Annual Conference of the National Tax Association, Santa Fe, New Mexico (2014)
Google Scholar
Dinur, I., Nissim, K.: Revealing information while preserving privacy. In: Proceedings of the Twenty-Second ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 202–210 (2003)
Google Scholar
Drechsler, J., Reiter, J.P.: Sampling with synthesis: a new approach for releasing public use census microdata. J. Am. Stat. Assoc. 105(492), 1347–1357 (2010)
MathSciNet
CrossRef
Google Scholar
Elliot, M.: Final report on the disclosure risk associated with the synthetic data produced by the SYLLS team. Report 2015-2 (2015)
Google Scholar
Fuller, W.: Masking procedures for microdata disclosure. J. Off. Stat. 9(2), 383–406 (1993)
Google Scholar
Hu, J., Reiter, J.P., Wang, Q.: Disclosure risk evaluation for fully synthetic categorical data. In: Domingo-Ferrer, J. (ed.) PSD 2014. LNCS, vol. 8744, pp. 185–199. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11257-2_15
CrossRef
Google Scholar
IRS: 2012 supplemental public use file (2019)
Google Scholar
Machanavajjhala, A., Kifer, D., Gehrke, J., Venkitasubramaniam, M.: L-diversity: privacy beyond k-anonymity. ACM Trans. Knowl. Discov. Data (TKDD) 1(1), 3-es (2007)
CrossRef
Google Scholar
McClure, D., Reiter, J.P.: Differential privacy and statistical disclosure risk measures: an investigation with binary synthetic data. Trans. Data Priv. 5(3), 535–552 (2012)
MathSciNet
Google Scholar
Nowok, B., Raab, G.M., Snoke, J., Dibben, C., Nowok, M.B.: Package ‘synthpop’ (2019)
Google Scholar
Raab, G.M., Nowok, B., Dibben, C.: Practical data synthesis for large samples. J. Priv. Confid. 7(3), 67–97 (2016)
Google Scholar
Reiter, J.P.: Using cart to generate partially synthetic public use microdata. J. Off. Stat. 21(3), 441 (2005)
Google Scholar
Reiter, J.P., Wang, Q., Zhang, B.: Bayesian estimation of disclosure risks for multiply imputed, synthetic data. J. Priv. Confid. 6(1), 17–33 (2014)
Google Scholar
Snoke, J., Raab, G.M., Nowok, B., Dibben, C., Slavkovic, A.: General and specific utility measures for synthetic data. J. Roy. Stat. Soc. Ser. A (Stat. Soc.) 181(3), 663–688 (2018)
MathSciNet
CrossRef
Google Scholar
Taub, J., Elliot, M., Pampaka, M., Smith, D.: Differential correct attribution probability for synthetic data: an exploration. In: Domingo-Ferrer, J., Montes, F. (eds.) PSD 2018. LNCS, vol. 11126, pp. 122–137. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99771-1_9
CrossRef
Google Scholar
Winkler, W.E.: Examples of easy-to-implement, widely used methods of masking for which analytic properties are not justified. Statistics Research Division, US Bureau of the Census (2007)
Google Scholar
Woo, M.J., Reiter, J.P., Oganian, A., Karr, A.F.: Global measures of data utility for microdata masked for disclosure limitation. J. Priv. Confid. 1(1), 111–124 (2009)
Google Scholar