Skip to main content

Advertisement

Log in

The impact of missing data in the estimation of concentration index: a potential source of bias

  • Original Paper
  • Published:
The European Journal of Health Economics Aims and scope Submit manuscript

Abstract

The purpose of this paper is to raise awareness of missing data when concentration indices are used to evaluate health-related inequality. Concentration indices are most commonly calculated using individual-level survey data. Incomplete data is a pervasive problem faced by most applied researchers who use survey data. The default analysis method in most statistical software packages is complete-case analysis. This excludes any cases where any variables are missing. If the missing variables in question are not completely random, the calculated concentration indices are likely to be biased, which may lead to inappropriate policy recommendations. In this paper, I use both a case study and a simulation study to show how complete-case analysis may lead to biases in the estimation of concentration indices. A possible solution to correct such biases is proposed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. The unweighted percentage is 2.62%. It becomes 2.59% after we apply population sampling weights. All the percentage numbers referred below in this section are population-weighted.

  2. The total household income in the top category is assumed at $100,000.

  3. We must acknowledge here that the statistical inference does not account for the fact that both concentration indices are calculated from the same sample. For a detailed discussion on potential problems and possible solutions, please refer to [15, 16].

  4. Population sampling weights capture differences between the original sample design and resulting response rates. They may also capture some of the patterns in the item-non-response that we are looking at. If we calculate the concentration indices without sampling weights, we can find that the differences between the concentration indices calculated from different methods are all slightly larger than those presented in the fourth column of Table 2. This finding may imply that the bias from using data with missing values would be less severe if we use population sampling weights.

  5. For more detailed discussions on this reranking effect in the context of the Gini coefficient, please refer to [22, 23]; for a discussion in the context of the concentration index, please refer to [21, 24].

References

  1. Wagstaff, A., Van Doorslaer, E., Paci, P.: Equity in the finance and delivery of health care: some tentative cross-country comparisons. Oxf. Rev. Econ. Policy 5, 89–112 (1989). doi:10.1093/oxrep/5.1.89

    Article  Google Scholar 

  2. Little, R.J., Rubin, D.B.: Statistical analysis with missing data. Wiley, New York (2000)

    Google Scholar 

  3. Ardington, C., Lam, D., Leibbrandt, M., Welch, M.: The sensitivity to key data imputations of recent estimates of income poverty and inequality in South Africa. Econ. Model. 23, 822–835 (2005). doi:10.1016/j.econmod.2005.10.009

    Article  Google Scholar 

  4. Briggs, A., Clark, T., Wolstenholme, J., Clarke, P.: Missing…. presumed at random: cost-analysis of incomplete data. Health Econ. 12, 377–392 (2003). doi:10.1002/hec.766

    Article  Google Scholar 

  5. Nicoletti, C., Peracchi, F.: The effects of income imputation on microanalyses: evidence from the European Community Household Panel. J. R. Stat. Soc. A 169, 625–646 (2006). doi:10.1111/j.1467-985X.2006.00421.x

    Article  Google Scholar 

  6. Kakwani, N.C., Wagstaff, A., van Doorslaer, E.: Socioeconomic inequalities in health: measurement, computation and statistical inference. J. Econom. 77, 87–103 (1997). doi:10.1016/S0304-4076(96)01807-6

    Article  Google Scholar 

  7. O’Donnell, O., van Doorslaer, E., Wagstaff, A., Lindelow, M.: Analyzing health equity using household survey data: a guide to techniques and their implementation. The World Bank, Washington (2008)

    Google Scholar 

  8. Rubin, D.B., Stern, H.S., Vehovar, V.: Handling ‘don’t know’ survey responses: the case of the Slovenian Plebicite. J. Am. Stat. Assoc. 90, 822–828 (1995). doi:10.2307/2291315

    Article  Google Scholar 

  9. Brick, J.M., Kalton, G.: Handling missing data in survey research. Stat. Methods Med. Res. 5, 215–238 (1996). doi:10.1177/096228029600500302

    Article  Google Scholar 

  10. Raghunathan, T.E., Lepkowski, J.M., van Hoewyk, J., Solenberger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. Methodol. 27, 85–95 (2001)

    Google Scholar 

  11. Schenker, N., Raghunathan, T.E., Chiu, P., Makuc, D.M., Zhang, G., Cohen, A.J.: Multiple imputation of missing income data in the National Health Interview Survey. J. Am. Stat. Assoc. 101, 924–933 (2006). doi:10.1198/016214505000001375

    Article  Google Scholar 

  12. Rubin, D.B.: Multiple imputation for nonresponse in surveys. Wiley, New York (1987)

    Book  Google Scholar 

  13. Meng, X.L.: Multiple-imputation inferences with uncongenial sources of input. Stat. Sci. 9, 538–573 (1994)

    Google Scholar 

  14. Rubin, D.B.: Multiple imputation after 18+ years. J. Am. Stat. Assoc. 91, 473–520 (1996). doi:10.2307/2291635

    Article  Google Scholar 

  15. Mills, J.A., Zandvakili, S.: Statistical inference via bootstrapping for measures of inequality. J. Appl. Econ. 12, 133–150 (1997). doi:10.1002/(SICI)1099-1255(199703)12:2<133::AID-JAE433>3.0.CO;2-H

    Article  Google Scholar 

  16. Zheng, B., Cushing, B.: Statistical inference for testing inequality indices with dependent samples. J. Econom. 101, 315–335 (2001). doi:10.1016/S0304-4076(00)00087-7

    Article  Google Scholar 

  17. Van Doorslaer, E., Koolman, X., Puffer, F.: Equity in the use of physician visits in OECD countries: has equal treatment for equal need been achieved. In: OCED (ed.) Measuring up: improving health systems performance in OECD countries, pp. 225–248. OECD, Paris (2002)

  18. Van Doorslaer, E., Masseria, C., Koolman, X., the OECD Health Equity Research Group: Inequalities in access to medical care by income in developed countries. Can. Med. Assoc. J. 174, 177–183 (2006). doi:10.1503/cmaj.050584

    Article  Google Scholar 

  19. Van Doorslaer, E., Wagstaff, A., et al.: Equity in the delivery of health care in Europe and the U.S. J. Health Econ. 19, 553–583 (2000). doi:10.1016/S0167-6296(00)00050-3

    Article  Google Scholar 

  20. Van Doorslaer, E., Wagstaff, A., Calonge, S., et al.: Equity in the delivery of health care: some international comparisons. J. Health Econ. 11, 389–411 (1992). doi:10.1016/0167-6296(92)90013-Q

    Article  Google Scholar 

  21. Clarke, P.M., Gerdtham, U., Connelly, L.B.: A note on the decomposition of the health concentration index. Health Econ. 12, 511–516 (2003). doi:10.1002/hec.767

    Article  Google Scholar 

  22. Aronson, J., Johnson, P., Lambert, P.: Redistributive effect and unequal tax treatment. Econ. J. 104, 262–270 (1994). doi:10.2307/2234747

    Article  Google Scholar 

  23. Lambert, P.J., Aronson, J.R.: Inequality decomposition analysis and the Gini coefficient revisited. Econ. J. 103, 1221–1227 (1993). doi:10.2307/2234247

    Article  Google Scholar 

  24. Jimenez-Rubio, D., Smith, P., Van Doorslaer, E.: Equity in health and health care in a decentralized context: evidence from Canada. Health Econ. 17, 377–392 (2008). doi:10.1002/hec.1272

    Article  Google Scholar 

  25. Riphahn, T., Serfling, O.: Item non-response on income and wealth questions. Empir. Econ. 30, 521–538 (2005). doi:10.1007/s00181-005-0247-7

    Article  Google Scholar 

Download references

Acknowledgments

I would like to thank Jerry Hurley and an anonymous referee for their helpful comments. I am solely responsible for any remaining errors and omissions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai Zhong.

Appendix

Appendix

See Tables 5 and 6.

Table 5 Variables included in imputation of individual level covariates
Table 6 Variables included in imputation of household income

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhong, H. The impact of missing data in the estimation of concentration index: a potential source of bias. Eur J Health Econ 11, 255–266 (2010). https://doi.org/10.1007/s10198-009-0170-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10198-009-0170-5

Keywords

JEL Classification

Navigation