Bayesian benchmarking with applications to small area estimation
- 281 Downloads
It is well-known that small area estimation needs explicit or at least implicit use of models (cf. Rao in Small Area Estimation, Wiley, New York, 2003). These model-based estimates can differ widely from the direct estimates, especially for areas with very low sample sizes. While model-based small area estimates are very useful, one potential difficulty with such estimates is that when aggregated, the overall estimate for a larger geographical area may be quite different from the corresponding direct estimate, the latter being usually believed to be quite reliable. This is because the original survey was designed to achieve specified inferential accuracy at this higher level of aggregation. The problem can be more severe in the event of model failure as often there is no real check for validity of the assumed model. Moreover, an overall agreement with the direct estimates at an aggregate level may sometimes be politically necessary to convince the legislators of the utility of small area estimates.
One way to avoid this problem is the so-called “benchmarking approach”, which amounts to modifying these model-based estimates so that we get the same aggregate estimate for the larger geographical area. Currently, the most popular approach is the so-called “raking” or ratio adjustment method, which involves multiplying all the small area estimates by a constant data-dependent factor so that the weighted total agrees with the direct estimate. There are alternate proposals, mostly from frequentist considerations, which meet also the aforementioned benchmarking criterion.
We propose in this paper a general class of constrained Bayes estimators which also achieve the necessary benchmarking. Many of the frequentist estimators, including some of the raked estimators, follow as special cases of our general result. Explicit Bayes estimators are derived which benchmark the weighted mean or both the weighted mean and weighted variability. We illustrate our methodology by developing poverty rates in school-aged children at the state level and then benchmarking these estimates to match at the national level. Unlike the existing frequentist benchmarking literature, which is primarily based on linear models, the proposed Bayesian approach can accommodate any arbitrary model, and the benchmarked Bayes estimators are based only on the posterior mean and the posterior variance-covariance matrix.
KeywordsArea-level Penalty parameter Two-stage Weighted mean Weighted variability
Mathematics Subject Classification (2000)62F15
Unable to display preview. Download preview PDF.
- Bell WR (1999). Accounting for uncertainty about variances in small area estimation. Bulletin of the International Statistical Institute, 52nd Session, Helsinki Google Scholar
- Isaki CT, Tsay JH, Fuller WA (2000) Estimation of census adjustment factors. Surv Methodol 26:31–42 Google Scholar
- Wang J, Fuller WA, Qu Y (2008). Small area estimation under a restriction. Surv Methodol 34:29–36 Google Scholar
- You Y, Rao JNK, Dick P (2004) Benchmarking hierarchical Bayes small area estimators in the Canadian census undercoverage estimation. Stat Transit 6:631–640 Google Scholar
- Zellner A (1986) Further results on Bayesian minimum expected loss (MELO) estimates and posterior distributions for structural coefficients. In: Slottje DJ (ed) Advances in Econometrics. JAI Press, London, pp 171–182 Google Scholar