Abstract
Recall data of visits to recreational sites often contain reported numbers that appear to be rounded to nearby focal points (e.g., the closest 5 or 10). Failure to address this rounding has been shown to produce biased estimates of marginal effects and non-linear combinations of coefficients such as willingness to pay. We investigate the relative performance of three count data models used with data of the kind typically found in survey data. We create a dataset based on observed recreational trip counts and associated trip costs that exhibits substantial rounding. We then conduct a Monte Carlo simulation exercise to compare the estimated parameters, willingness to pay, and the average consumer surplus per trip for three alternative estimators: a standard Poisson model with no adjustment for rounding, a censored Poisson model, a grouped Poisson model, and a latent class Poisson model with rounding and non-rounding classes. The standard Poisson model with no adjustment for rounding exhibits significant and persistent bias, especially in estimates of non-linear effects. The grouped and latent class Poisson models, in contrast, show little to no bias in estimates.
Similar content being viewed by others
Notes
We use all three terms interchangeably.
Cameron and Trivedi (1998) also suggest using a truncated model, but we found no instances in which it offered any advantage over a censored model.
Evans and Herriges’ latent class model includes rounding and non-rounding classes while also allowing for the values of β to vary across classes. However, these are two separate empirical issues. Evidence suggests that rounded responses are a universal phenomenon and not specific to any type of respondent. Additionally, if class-varying parameters were also present in our simulations, it would be difficult to disentangle potential bias from rounded responses and determine which empirical strategies are best-suited to handle rounding. We therefore restrict our analysis to instances in which β is class-invariant.
Because the calibration sets half of all observations to the rounding class, up to 40% of all data can be rounded, as depicted by the gray line in Fig. 1.
The instability of the estimates generated by the censored Poisson arises from the censoring. When the share of censored observations is too large, the estimating sample contains too little variation to permit reliable estimation. As the censoring point is raised, the value of \(\beta_{0}\) at which censoring removes too much variation from the sample rises.
References
Allen CM, Griffith SD, Shiffman S, Heitjan DF (2017) Proximity and gravity: modeling heaped self-reports. Stat Med 36(20):3200–3215. https://doi.org/10.1002/sim.7327
Androkovich RA (2015) Recreational visits to the Adam’s river during the annual sockeye run: a travel cost analysis. Mar Resour Econ 30(1):35–49. https://doi.org/10.1086/679462
Baerenklau KA (2010) A latent class approach to modeling endogenous spatial sorting in zonal recreation demand models. Land Econ 86(4):800–816. https://doi.org/10.3368/le.86.4.800
Baltas G, Argouslidis PC, Skarmeas D (2010) The role of customer factors in multiple store patronage: a cost–benefit approach. J Retail 86(1):37–50. https://doi.org/10.1016/j.jretai.2010.01.005
Bar HY, Lillard DR (2012) Accounting for heaping in retrospectively reported event data—a mixture-model approach. Stat Med 31(27):3347–3365. https://doi.org/10.1002/sim.5419
Bishop RC, Boyle KJ (2019) Reliability and validity in nonmarket valuation. Environ Resour Econ 72(2):559–582. https://doi.org/10.1007/s10640-017-0215-7
Blair E, Burton S (1987) Cognitive processes used by survey respondents to answer behavioral frequency questions. J Consum Res 14(2):280. https://doi.org/10.1086/209112
Browning M, Crossley TF, Weber G (2003) Asking consumption questions in general purpose surveys. Econ J 113(491):F540–F567. https://doi.org/10.1046/j.0013-0133.2003.00168.x
Burton S, Blair E (1991) Task conditions, response formulation processes, and response accuracy for behavioral frequency questions in surveys. Public Opin Q 55(1):50. https://doi.org/10.1086/269241
Cameron AC, Trivedi PK (1998) Regression analysis of count data. Cambridge University Press, Cambridge
Carpio CE, Wohlgenant MK, Boonsaeng T (2008) The demand for agritourism in the United States. J Agric Resour Econ 33(2):254–269
Crawford FW, Weiss RE, Suchard MA (2015) Sex, lies and self-reported counts: Bayesian mixture models for heaping in longitudinal count data via birth–death processes. Ann Appl Stat 9(2):572–596. https://doi.org/10.1214/15-AOAS809
Dall T, Mann SE, Zhang Y, Martin J, Chen Y (2008) Economic costs of diabetes in the U.S. in 2007. Diabetes Care 31(3):596–615. https://doi.org/10.2337/dc08-9017
Dominitz J, Manski CF (1997) Perceptions of economic insecurity: evidence from the survey of economic expectations. Public Opin Q 61(2):261. https://doi.org/10.1086/297795
Drechsler J, Kiesl H (2016) Beat the heap: an imputation strategy for valid inferences from rounded income data. J Surv Stat Methodol 4(1):22–42. https://doi.org/10.1093/jssam/smv032
Evans K, Herriges J (2010) Rounding in recreation demand models: a latent class count model. Economic working papers (2002–2016). Retrieved from https://lib.dr.iastate.edu/econ_las_workingpapers/117
Ferrini S, Scarpa R (2007) Designs with a priori information for nonmarket valuation with choice experiments: a Monte Carlo study. J Environ Econ Manag 53(3):342–363. https://doi.org/10.1016/j.jeem.2006.10.007
Gustat J, O’Malley K, Luckett BG, Johnson CC (2015) Fresh produce consumption and the association between frequency of food shopping, car access, and distance to supermarkets. Prev Med Rep 2:47–52. https://doi.org/10.1016/j.pmedr.2014.12.009
Han B, Olfson M, Huang L, Mojtabai R (2017) National trends in specialty outpatient mental health care among adults. Health Aff 36(12):2062–2068. https://doi.org/10.1377/hlthaff.2017.0922
Heitjan DF (1989) Inference from grouped continuous data: a review. Stat Sci 4(2):164–179. https://doi.org/10.1214/ss/1177012601
Heitjan DF, Rubin DB (1991) Ignorability and coarse data. Ann Stat 19(4):2244–2253. https://doi.org/10.1214/aos/1176348396
Hindsley P, Landry CE, Gentner B (2011) Addressing onsite sampling in recreation site choice models. J Environ Econ Manag 62(1):95–110. https://doi.org/10.1016/j.jeem.2010.10.007
Hoderlein S, Winter J (2010) Structural measurement errors in nonseparable models. J Econom 157(2):432–440. https://doi.org/10.1016/j.jeconom.2010.03.044
Hynes S, Greene W (2013) A panel travel cost model accounting for endogenous stratification and truncation: a latent class approach. Land Econ 89(1):177–192. https://doi.org/10.3368/le.89.1.177
Kuminoff NV, Parmeter CF, Pope JC (2010) Which hedonic models can we trust to recover the marginal willingness to pay for environmental amenities? J Environ Econ Manag 60(3):145–160. https://doi.org/10.1016/j.jeem.2010.06.001
Landry CE, McConnell KE (2007) Hedonic onsight cost model of recreation demand. Land Econ 83(2):253–267. https://doi.org/10.3368/le.83.2.253
Landry CE, Lewis AR, Liu H, Vogelsong H (2016) Addressing onsite sampling in analysis of recreation demand: economic value and impact of visitation to Cape Hatteras National Seashore. Mar Resour Econ 31(3):301–322. https://doi.org/10.1086/686892
Luceri B, Latusi S (2012) The importance of consumer characteristics and market structure variables in driving multiple store patronage. J Retail Consum Serv 19(5):519–525. https://doi.org/10.1016/j.jretconser.2012.06.008
Manski CF, Molinari F (2010) Rounding probabilistic expectations in surveys. J Bus Econ Stat 28(2):219–231. https://doi.org/10.1198/jbes.2009.08098
Martínez-Espiñeira R, Amoako-Tuffour J (2008) Recreation demand analysis under truncation, overdispersion, and endogenous stratification: an application to Gros Morne National Park. J Environ Manag 88(4):1320–1332. https://doi.org/10.1016/j.jenvman.2007.07.006
Moeltner K (2003) Addressing aggregation bias in zonal recreation models. J Environ Econ Manag 45(1):128–144. https://doi.org/10.1016/S0095-0696(02)00014-1
Moffatt PG (1995) Grouped poisson regression models: theory and an application to public house visit frequency. Commun Stat Theory Methods 24(11):2779–2796. https://doi.org/10.1080/03610929508831649
Parsons GR, Chen Z, Hidrue MK, Standing N, Lilley J (2013) Valuing beach width for recreational use: combining revealed and stated preference data. Mar Resour Econ 28(3):221–241. https://doi.org/10.5950/0738-1360-28.3.221
Roberts JM, Brewer DD (2001) Measures and tests of heaping in discrete quantitative distributions. J Appl Stat 28(7):887–896. https://doi.org/10.1080/02664760120074960
Sen A, Harwood AR, Bateman IJ, Munday P, Crowe A, Brander L, Raychaudhuri J, Lovett AA, Foden J, Provins A (2014) Economic assessment of the recreational value of ecosystems: methodological development and national and local application. Environ Resour Econ 57(2):233–249. https://doi.org/10.1007/s10640-013-9666-7
Shi W, Huang J-C (2018) Correcting on-site sampling bias: a new method with application to recreation demand analysis. Land Econ 94(3):459–474. https://doi.org/10.3368/le.94.3.459
Smith JW, Seekamp E, McCreary A, Davenport M, Kanazawa M, Holmberg K, Wilson B, Nieber J (2016) Shifting demand for winter outdoor recreation along the North Shore of Lake Superior under variable rates of climate change: a finite-mixture modeling approach. Ecol Econ 123:1–13. https://doi.org/10.1016/j.ecolecon.2015.12.010
Wang H, Heitjan DF (2008) Modeling heaping in self-reported cigarette counts. Stat Med 27(19):3789–3804. https://doi.org/10.1002/sim.3281
Wen H, Hockenberry JM, Cummings JR (2015) The effect of medical marijuana laws on adolescent and adult use of marijuana, alcohol, and other substances. J Health Econ 42:64–80. https://doi.org/10.1016/j.jhealeco.2015.03.007
Whitehead JC, Phaneuf DJ, Dumas CF, Herstine J, Hill J, Buerger B (2010) Convergent validity of revealed and stated recreation behavior with quality change: a comparison of multiple and single site demands. Environ Resour Econ 45(1):91–112. https://doi.org/10.1007/s10640-009-9307-3
Wright DE, Bray I (2003) A mixture model for rounded data. J R Stat Soc Ser D (Stat) 52(1):3–13. https://doi.org/10.1111/1467-9884.00338
Zinn S, Würbach A (2016) A statistical approach to address the problem of heaping in self-reported income data. J Appl Stat 43(4):682–703. https://doi.org/10.1080/02664763.2015.1077372
Acknowledgements
The findings and conclusions in this article are those of the authors and should not be construed to represent any official USDA or U.S. Government determination or policy. This research was supported in part by the U.S. Department of Agriculture, Economic Research Service.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Page, I.B., Lichtenberg, E. & Saavoss, M. Estimating Willingness to Pay from Count Data When Survey Responses are Rounded. Environ Resource Econ 75, 657–675 (2020). https://doi.org/10.1007/s10640-020-00403-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10640-020-00403-6