Abstract
Randomized response methods for quantitative sensitive data are treated in an unified approach which includes the use of auxiliary information at the estimation stage. A class of estimators for the mean of a sensitive variable is proposed under a generic randomization model and the optimum estimator is obtained. Some special models are discussed in detail. To evaluate the degree of respondents’ confidentiality in models using auxiliary variables, a new measure of privacy protection is introduced. Different models are then compared both from the perspective of efficiency and privacy protection.
Similar content being viewed by others
References
Agrawal D, Aggarwal CC (2001) On the design and quantification of privacy preserving data mining algorithms, Proceedings of the 20th symposium on principles of database system, Santa Barbara, California, USA
Allen J, Singh S (2001) Response techniques to analyse various transformation and selection probabilities, Interstat. Available at http://interstat.statjournals.net/YEAR/2001/abstracts/0111002.php
Bar-Lev SK, Bobovitch E, Boukai B (2004) A note on randomized response models for quantitative data. Metrika 60: 255–260
Chaudhuri A, Mukerjee R (1988) Randomized response: theory and techniques. Marcel Dekker, Inc., New York
Chaudhuri A, Roy D (1997) Model assisted survey sampling strategies with randomized response. J Stat Plan Inference 60: 61–68
Cochran WG (1977) Sampling techniques. Wiley, New York
Diana G, Perri PF (2007) Estimation of finite population mean using multi-auxiliary information. Metron LXV: 99–112
Diana G, Perri PF (2009) Estimating a sensitive proportion through randomized response procedures based on auxiliary information. Stat Pap 50: 661–672
Eichhorn B, Hayre LS (1983) Scrambled randomized response methods for obtaining sensitive quantitative data. J Stat Plan Inference 7: 307–316
Eriksson SA (1973) A new model for randomized response. Int Stat Rev 41: 101–113
Greenberg BG, Abul-Ela ALA, Simmons WR, Horvitz DG (1969) The unrelated question randomized response model: theoretical framework. J Am Stat Assoc 64: 520–539
Greenberg BG, Kuebler RR, Abernathy JR, Horvitz DG (1971) Application of the randomized response technique in obtaining quantitative data. J Am Stat Assoc 66: 243–250
Grewal IS, Bansal ML, Sidhu SS (2005–2006) Population mean corresponding to Horvitz-Thompson’s estimator for multi-characteristics using randomized response technique. Model Assist Stat Appl 1: 215–220
Guerriero M, Sandri MF (2007) A note on the comparison of some randomized response procedures. J Stat Plan Inference 137: 2184–2190
Gupta S, Gupta B, Singh S (2002) Estimation of sensitive level of personal interview survey questions. J Stat Plan Inference 100: 239–247
Hore B, Mehrotra S, Tsudik G (2004) A privacy-preserving index for range queries. Proceedings of the 30th VLDB conference, Toronto, Canada
Horvitz DG, Shah BV, Simmons WR (1967) The unrelated question randomized response model. Social statistics section proceedings of the American statistical association, pp 65–72
Mahajan PK (2005–2006) Optimum stratification for scrambled response with ratio and regression methods of estimation. Model Assist Stat Appl 1: 17–22
Mahajan PK, Singh R (2005) Optimum stratification for scrambled response in pps sampling. Metron LXIII: 103–114
Mahajan PK, Gupta JP, Singh R (1994) Determination of optimum strata boundaries for scrambled response. Statistica 54: 375–381
Mukerjee S, Duncan GT (1997) Disclosure limitation through additive noise data masking: analysis of skewed sensitive data. Proceedings of the 13th Annual Hawwaii international conference on system sciences
Pollock KH, Bek Y (1976) A comparison of three randomized response models for quantitative data. J Am Stat Assoc 71: 884–886
Poole WK (1974) Estimation of the distribution function of a continuous type random variable through randomized response. J Am Stat Assoc 69: 1002–1005
Ryu J-B, Kim J-M, Heo T-Y, Park CG (2005–2006) On stratified randomized response sampling. Model Assist Stat Appl 1: 31–36
Saha A (2007) A simple randomized response technique in complex surveys. Metron LXV: 59–66
Singh S, Joarder AH, Kinh ML (1996) Regression analysis using scrambled responses. Aust N Z J Stat 38: 201–211
Singh S, Tracy DS (1999) Ridge regression analysis using scrambled responses. Metron LVII: 147–157
Sukhatme PV, Sukhatme BV, Sukhatme S, Asok C (1984) Sampling theory of surveys with applications. Iowa State University Press, Ames, Iowa, USA
Warner SL (1965) Randomized response: a survey technique for eliminating evasive answer bias. J Am Stat Assoc 60: 63–69
Zaizai Y (2005–2006) Ratio method of estimation of population proportion using randomized response technique. Model Assist Stat Appl 1: 125–130
Zaizai Y, Jingyu W, Junfeng L (2009) An efficiency and protection degree-based comparison among the quantitative randomized response strategies. Commun Stat Theory Methods 38: 400–408
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Diana, G., Perri, P.F. A class of estimators for quantitative sensitive data. Stat Papers 52, 633–650 (2011). https://doi.org/10.1007/s00362-009-0273-1
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-009-0273-1