A Hierarchical Bayes Unit-Level Small Area Estimation Model for Normal Mixture Populations

Goyal, Shuchi; Datta, Gauri Sankar; Mandal, Abhyuday

doi:10.1007/s13571-019-00216-8

A Hierarchical Bayes Unit-Level Small Area Estimation Model for Normal Mixture Populations

Published: 22 January 2020

Volume 83, pages 215–241, (2021)
Cite this article

Sankhya B Aims and scope Submit manuscript

Shuchi Goyal¹,
Gauri Sankar Datta² &
Abhyuday Mandal²

373 Accesses
2 Citations
Explore all metrics

Abstract

National statistical agencies are regularly required to produce estimates about various subpopulations, formed by demographic and/or geographic classifications, based on a limited number of samples. Traditional direct estimates computed using only sampled data from individual subpopulations are usually unreliable due to small sample sizes. Subpopulations with small samples are termed small areas or small domains. To improve on the less reliable direct estimates, model-based estimates, which borrow information from suitable auxiliary variables, have been extensively proposed in the literature. However, standard model-based estimates rely on the normality assumptions of the error terms. In this research we propose a hierarchical Bayesian (HB) method for the unit-level nested error regression model based on a normal mixture for the unit-level error distribution. Our method proposed here is applicable to model cases with unit-level error outliers as well as cases where each small area population is comprised of two subgroups, neither of which can be treated as an outlier. Our proposed method is more robust than the normality based standard HB method (Datta and Ghosh, Annals Stat. 19, 1748–1770, 1991) to handle outliers or multiple subgroups in the population. Our proposal assumes two subgroups and the two-component mixture model that has been recently proposed by Chakraborty et al. (Int. Stat. Rev. 87, 158–176, 2019) to address outliers. To implement our proposal we use a uniform prior for the regression parameters, random effects variance parameter, and the mixing proportion, and we use a partially proper non-informative prior distribution for the two unit-level error variance components in the mixture. We apply our method to two examples to predict summary characteristics of farm products at the small area level. One of the examples is prediction of twelve county-level crop areas cultivated for corn in some Iowa counties. The other example involves total cash associated in farm operations in twenty-seven farming regions in Australia. We compare predictions of small area characteristics based on the proposed method with those obtained by applying the Datta and Ghosh (Annals Stat. 19, 1748–1770, 1991) and the Chakraborty et al. (Int. Stat. Rev. 87, 158–176, 2019) HB methods. Our simulation study comparing these three Bayesian methods, when the unit-level error distribution is normal, or t, or two-component normal mixture, showed the superiority of our proposed method, measured by prediction mean squared error, coverage probabilities and lengths of credible intervals for the small area means.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

mind, A methodology for multivariate small area estimation with multiple random effects

Article 23 November 2023

Estimation of small area proportions under a bivariate logistic mixed model

Article Open access 16 September 2022

An Area-Level Gamma Mixed Model for Small Area Estimation

References

Battese, G. E., Harter, R. M. and Fuller, W. A. (1988). An error component model for prediction of county crop areas using survey and satellite data. J. Am. Stat. Assoc. 83, 28–36.
Article Google Scholar
Chakraborty, A., Datta, G. S. and Mandal, A. (2019). Robust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model. Int. Stat. Rev. 87, 158–176.
Article MathSciNet Google Scholar
Chambers, R. L. (1986). Outlier robust finite population estimation. J. Am. Stat. Assoc. 81, 1063–1069.
Article MathSciNet Google Scholar
Chambers, R. L., Chandra, H., Salvati, N. and Tzavidis, N. (2014). Outlier robust smallarea estimation. Journal of the Royal Statistical Society Series B 76, 47–69.
Article MathSciNet Google Scholar
Chambers, R. L., Chandra, H. and Tzavidis, N. (2011). On bias-robust mean squared error estimation for pseudo-linear small area estimators. Survey Methodology 37, 153–170.
Google Scholar
Datta, G. and Ghosh, M. (1991). Bayesian prediction in linear models: Applications to small area estimation. Annals Stat. 19, 1748–1770.
MATH MathSciNet Google Scholar
Datta, G. S. and Lahiri, P. (2000). A unified measure of uncertainty of estimated best linear unbiased predictors in small area estimation problems. Stat. Sin. 10, 613–627.
MATH MathSciNet Google Scholar
Efron, B. and Morris, C. (1973). Stein’s Estimation Rule and Its Competitors − An Empirical Bayes Approach. J. Am. Stat. Assoc. 68, 117–130.
MATH MathSciNet Google Scholar
Fay, R. E. and Herriot, R. A. (1979). Estimates of income for small places: an application of James-Stein procedures to census data. J. Am. Stat. Assoc. 74, 269–277.
Article MathSciNet Google Scholar
Prasad, N. G. N. and Rao, J. N. K. (1990). On the estimation of mean square error of small area predictors. J. Am. Stat. Assoc. 85, 163–171.
Article Google Scholar
Sinha, S. K. and Rao, J. N. K. (2009). Robust small area estimation. Can. J. Stat. 37, 381–399.
Article MathSciNet Google Scholar
Stein, C. (1955). Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution, Proccedings of the Third Berkeley Symposium. University of California Press 1, 197–206.
Google Scholar

Download references

Acknowledgment

The authors are thankful to Dr. Ray Chambers for providing the dataset used in Section 4.2. They also thank the editor for his supportive suggestions.

Author information

Authors and Affiliations

Department of Statistics, University of California at Los Angeles, Los Angeles, CA, 90095, USA
Shuchi Goyal
Department of Statistics, University of Georgia, Athens, GA, 30602, USA
Gauri Sankar Datta & Abhyuday Mandal

Authors

Shuchi Goyal
View author publications
You can also search for this author in PubMed Google Scholar
Gauri Sankar Datta
View author publications
You can also search for this author in PubMed Google Scholar
Abhyuday Mandal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abhyuday Mandal.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

(PDF 223 KB)

Appendix: A

1.1 A Integrability of Joint Posterior Probability Density Function

Chakraborty et al. (2019) showed that the joint posterior density function of $\boldsymbol {\beta }, {\sigma ^{2}_{1}}, {\sigma ^{2}_{2}}, p_{e},$ and ${\sigma _{v}^{2}}$ is proper. In particular, they showed that the function

$$ L(\boldsymbol{\beta}, {\sigma^{2}_{1}}, {\sigma^{2}_{2}}, p_{e},{\sigma_{v}^{2}})\frac{I_{\{{\sigma_{1}^{2}}<{\sigma_{2}^{2}}\}}}{({\sigma_{2}^{2}})^{2}} $$

(A.1)

is integrable with respect to $\boldsymbol {\beta }, {\sigma ^{2}_{1}}, {\sigma ^{2}_{2}}, p_{e},$ and ${\sigma _{v}^{2}}$, where $L(\boldsymbol {\beta }, {\sigma ^{2}_{1}}, {\sigma ^{2}_{2}}, p_{e},{\sigma _{v}^{2}})$ is the likelihood function based on the distribution y_ij,j = 1,…,n_i,i = 1,…,m obtained as the marginal distribution from (I)−(III) in Section 2. Similar arguments show that $L(\boldsymbol {\beta }, {\sigma ^{2}_{1}}, {\sigma ^{2}_{2}}, p_{e},{\sigma _{v}^{2}})\frac {I_{\{{\sigma _{1}^{2}}{\geq \sigma _{2}^{2}}\}}}{({\sigma _{1}^{2}})^{2}}$ is also integrable with respect to the same variables. Now we note that

$$ \begin{array}{@{}rcl@{}} \frac{I_{\{2^{-1}<p_{e}<1\}}}{({\sigma_{1}^{2}}+{\sigma_{2}^{2}})^{2}} & \leq& \frac{1}{({\sigma_{1}^{2}}+{\sigma_{2}^{2}})^{2}} = \frac{I_{\{{\sigma_{1}^{2}}<{\sigma_{2}^{2}}\}}+I_{\{{\sigma_{1}^{2}}{\geq\sigma_{2}^{2}}\}}}{({\sigma_{1}^{2}}+{\sigma_{2}^{2}})^{2}} \\ &=&\frac{1}{({\sigma_{2}^{2}})^{2}}\left( \frac{{\sigma_{2}^{2}}}{{\sigma_{1}^{2}}+{\sigma_{2}^{2}}}\right)^{2}I_{\{{\sigma_{1}^{2}}<{\sigma_{2}^{2}}\}}+\frac{1}{({\sigma_{1}^{2}})^{2}}\left( \frac{{\sigma_{1}^{2}}}{{\sigma_{1}^{2}}+{\sigma_{2}^{2}}}\right)^{2}I_{\{{\sigma_{1}^{2}}{\geq\sigma_{2}^{2}}\}}\\ &<& \frac{I_{\{{\sigma_{1}^{2}}<{\sigma_{2}^{2}}\}}}{({\sigma_{2}^{2}})^{2}}+\frac{I_{\{{\sigma_{1}^{2}}{\geq\sigma_{2}^{2}}\}}}{({\sigma_{1}^{2}})^{2}}. \end{array} $$

This implies,

$$ \begin{array}{@{}rcl@{}} L(\boldsymbol{\beta}, {\sigma^{2}_{1}}, {\sigma^{2}_{2}}, p_{e},{\sigma_{v}^{2}})\frac{I_{\{2^{-1}<p_{e}<1\}}}{({\sigma_{1}^{2}}+{\sigma_{2}^{2}})^{2}} < L(\boldsymbol{\beta}, {\sigma^{2}_{1}}, {\sigma^{2}_{2}}, p_{e},{\sigma_{v}^{2}})\left( \frac{I_{\{{\sigma_{1}^{2}}<{\sigma_{2}^{2}}\}}}{({\sigma_{2}^{2}})^{2}}+\frac{I_{({\sigma_{1}^{2}}{\geq\sigma_{2}^{2}})}}{({\sigma_{1}^{2}})^{2}}\right). \end{array} $$

(A.2)

The LHS of Eq. A.2 is bounded above by two integrable functions, hence it is also integrable.

1.2 A.2 Simulation Results with no Contamination of Unit-Level Errors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Goyal, S., Datta, G.S. & Mandal, A. A Hierarchical Bayes Unit-Level Small Area Estimation Model for Normal Mixture Populations. Sankhya B 83, 215–241 (2021). https://doi.org/10.1007/s13571-019-00216-8

Download citation

Received: 03 April 2019
Published: 22 January 2020
Issue Date: May 2021
DOI: https://doi.org/10.1007/s13571-019-00216-8

Keywords and phrases.

AMS (2000) subject classification.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Hierarchical Bayes Unit-Level Small Area Estimation Model for Normal Mixture Populations

Abstract

Access this article

Similar content being viewed by others

mind, A methodology for multivariate small area estimation with multiple random effects

Estimation of small area proportions under a bivariate logistic mixed model

An Area-Level Gamma Mixed Model for Small Area Estimation

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Electronic supplementary material

(PDF 223 KB)

Appendix: A

1.1 A Integrability of Joint Posterior Probability Density Function

1.2 A.2 Simulation Results with no Contamination of Unit-Level Errors

Rights and permissions

About this article

Cite this article

Keywords and phrases.

AMS (2000) subject classification.

Navigation

A Hierarchical Bayes Unit-Level Small Area Estimation Model for Normal Mixture Populations

Abstract

Access this article

Similar content being viewed by others

mind, A methodology for multivariate small area estimation with multiple random effects

Estimation of small area proportions under a bivariate logistic mixed model

An Area-Level Gamma Mixed Model for Small Area Estimation

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Electronic supplementary material

(PDF 223 KB)

Appendix: A

Appendix: A

1.1 A Integrability of Joint Posterior Probability Density Function

1.2 A.2 Simulation Results with no Contamination of Unit-Level Errors

Rights and permissions

About this article

Cite this article

Share this article

Keywords and phrases.

AMS (2000) subject classification.

Search

Navigation