The impact of missing data in the estimation of concentration index: a potential source of bias

Zhong, Hai

doi:10.1007/s10198-009-0170-5

Original Paper
Published: 15 July 2009

Volume 11, pages 255–266 (2010)
The European Journal of Health Economics

Hai Zhong¹

Abstract

The purpose of this paper is to raise awareness of the missing-data problem when concentration indices are used to evaluate health-related inequality. Concentration indices are most commonly calculated from individual-level survey data, and incomplete data are a pervasive problem for applied researchers who use such surveys. The default method in most statistical software packages is complete-case analysis, which excludes every case with a missing value on any variable in the analysis. If the data are not missing completely at random, the calculated concentration indices are likely to be biased, which may lead to inappropriate policy recommendations. In this paper, I use both a case study and a simulation study to show how complete-case analysis may bias the estimation of concentration indices, and I propose a possible solution to correct such biases.
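
The mechanism described in the abstract can be illustrated with a minimal sketch. This is not the paper's case study or simulation design: it assumes a hypothetical population of 1,000 people whose health measure rises linearly with income, and it uses the covariance form of the concentration index, C = 2 cov(h, r) / mean(h), where r is the fractional income rank. The function name, the population, and the two missingness patterns are all illustrative assumptions.

```python
def concentration_index(health, income):
    """Covariance form: C = 2 * cov(h, r) / mean(h), r = fractional income rank."""
    n = len(health)
    # Order the health values by income so list position gives the income rank.
    ordered = [h for _, h in sorted(zip(income, health))]
    ranks = [(i + 0.5) / n for i in range(n)]
    mean_h = sum(ordered) / n
    mean_r = sum(ranks) / n
    cov = sum((h - mean_h) * (r - mean_r) for h, r in zip(ordered, ranks)) / n
    return 2.0 * cov / mean_h


def complete_cases(health, income, keep):
    """Return the health and income values of the respondents listed in keep."""
    return [health[i] for i in keep], [income[i] for i in keep]


# Hypothetical population: income 1..n, health increasing linearly in income.
n = 1000
income = list(range(1, n + 1))
health = [1.0 + y / n for y in income]

c_full = concentration_index(health, income)

# Non-random missingness (assumption): the top 20% of earners do not report
# income, so complete-case analysis drops them.
cutoff = n * 4 // 5
keep_top_missing = [i for i in range(n) if income[i] <= cutoff]
c_top_missing = concentration_index(*complete_cases(health, income, keep_top_missing))

# Missingness unrelated to income (assumption): every fifth respondent is
# dropped, a deterministic stand-in for data missing completely at random.
keep_random_missing = [i for i in range(n) if income[i] % 5 != 0]
c_random_missing = concentration_index(*complete_cases(health, income, keep_random_missing))

print(f"full sample:             {c_full:.3f}")
print(f"top earners missing:     {c_top_missing:.3f}")
print(f"unrelated to income:     {c_random_missing:.3f}")
```

In this toy setup both complete-case samples discard 20% of the observations, but only the income-related pattern moves the index noticeably (from about 0.111 to about 0.095). The direction and size of the bias in real data depend on which variables are missing and why.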

Notes

The unweighted percentage is 2.62%. It becomes 2.59% after we apply population sampling weights. All percentages reported below in this section are population-weighted.
The total household income in the top category is assumed to be $100,000.
We must acknowledge here that the statistical inference does not account for the fact that both concentration indices are calculated from the same sample. For a detailed discussion on potential problems and possible solutions, please refer to [15, 16].
Population sampling weights capture differences between the original sample design and the resulting response rates. They may also capture some of the patterns in the item non-response that we are looking at. If we calculate the concentration indices without sampling weights, the differences between the indices calculated by the different methods are all slightly larger than those presented in the fourth column of Table 2. This finding may imply that the bias from using data with missing values is less severe when population sampling weights are used.
For more detailed discussions on this reranking effect in the context of the Gini coefficient, please refer to [22, 23]; for a discussion in the context of the concentration index, please refer to [21, 24].
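
The role of population sampling weights mentioned in the notes above can be made concrete with a short sketch. It assumes the commonly used weighted fractional rank, in which each person's rank is the cumulative weight of everyone poorer plus half of their own weight, divided by the total weight. The function name and the three-person data set are hypothetical and are not taken from the paper.

```python
def weighted_concentration_index(health, income, weights):
    """C = 2 * cov_w(h, r) / mean_w(h), with weighted fractional income ranks."""
    rows = sorted(zip(income, health, weights))  # order people by income
    total_w = sum(w for _, _, w in rows)

    # Weighted fractional rank: weight of everyone poorer plus half own weight.
    ranks = []
    cum_w = 0.0
    for _, _, w in rows:
        ranks.append((cum_w + 0.5 * w) / total_w)
        cum_w += w

    mean_h = sum(w * h for _, h, w in rows) / total_w
    mean_r = sum(w * r for (_, _, w), r in zip(rows, ranks)) / total_w
    cov = sum(
        w * (h - mean_h) * (r - mean_r) for (_, h, w), r in zip(rows, ranks)
    ) / total_w
    return 2.0 * cov / mean_h


# Hypothetical data: the middle respondent represents two people.
health = [1.0, 2.0, 4.0]
income = [10.0, 20.0, 30.0]
weights = [1.0, 2.0, 1.0]
c_weighted = weighted_concentration_index(health, income, weights)

# Sanity check: the same population written out person by person, unit weights.
c_expanded = weighted_concentration_index(
    [1.0, 2.0, 2.0, 4.0], [10.0, 20.0, 20.0, 30.0], [1.0] * 4
)

print(c_weighted, c_expanded)
```

A weight of two gives the same index as listing that respondent twice, which is why weights that partly reflect non-response patterns can change the estimated index.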

References

1. Wagstaff, A., Van Doorslaer, E., Paci, P.: Equity in the finance and delivery of health care: some tentative cross-country comparisons. Oxf. Rev. Econ. Policy 5, 89–112 (1989). doi:10.1093/oxrep/5.1.89
2. Little, R.J., Rubin, D.B.: Statistical analysis with missing data. Wiley, New York (2000)
3. Ardington, C., Lam, D., Leibbrandt, M., Welch, M.: The sensitivity to key data imputations of recent estimates of income poverty and inequality in South Africa. Econ. Model. 23, 822–835 (2005). doi:10.1016/j.econmod.2005.10.009
4. Briggs, A., Clark, T., Wolstenholme, J., Clarke, P.: Missing…. presumed at random: cost-analysis of incomplete data. Health Econ. 12, 377–392 (2003). doi:10.1002/hec.766
5. Nicoletti, C., Peracchi, F.: The effects of income imputation on microanalyses: evidence from the European Community Household Panel. J. R. Stat. Soc. A 169, 625–646 (2006). doi:10.1111/j.1467-985X.2006.00421.x
6. Kakwani, N.C., Wagstaff, A., van Doorslaer, E.: Socioeconomic inequalities in health: measurement, computation and statistical inference. J. Econom. 77, 87–103 (1997). doi:10.1016/S0304-4076(96)01807-6
7. O’Donnell, O., van Doorslaer, E., Wagstaff, A., Lindelow, M.: Analyzing health equity using household survey data: a guide to techniques and their implementation. The World Bank, Washington (2008)
8. Rubin, D.B., Stern, H.S., Vehovar, V.: Handling ‘don’t know’ survey responses: the case of the Slovenian Plebiscite. J. Am. Stat. Assoc. 90, 822–828 (1995). doi:10.2307/2291315
9. Brick, J.M., Kalton, G.: Handling missing data in survey research. Stat. Methods Med. Res. 5, 215–238 (1996). doi:10.1177/096228029600500302
10. Raghunathan, T.E., Lepkowski, J.M., van Hoewyk, J., Solenberger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. Methodol. 27, 85–95 (2001)
11. Schenker, N., Raghunathan, T.E., Chiu, P., Makuc, D.M., Zhang, G., Cohen, A.J.: Multiple imputation of missing income data in the National Health Interview Survey. J. Am. Stat. Assoc. 101, 924–933 (2006). doi:10.1198/016214505000001375
12. Rubin, D.B.: Multiple imputation for nonresponse in surveys. Wiley, New York (1987)
13. Meng, X.L.: Multiple-imputation inferences with uncongenial sources of input. Stat. Sci. 9, 538–573 (1994)
14. Rubin, D.B.: Multiple imputation after 18+ years. J. Am. Stat. Assoc. 91, 473–520 (1996). doi:10.2307/2291635
15. Mills, J.A., Zandvakili, S.: Statistical inference via bootstrapping for measures of inequality. J. Appl. Econ. 12, 133–150 (1997). doi:10.1002/(SICI)1099-1255(199703)12:2<133::AID-JAE433>3.0.CO;2-H
16. Zheng, B., Cushing, B.: Statistical inference for testing inequality indices with dependent samples. J. Econom. 101, 315–335 (2001). doi:10.1016/S0304-4076(00)00087-7
17. Van Doorslaer, E., Koolman, X., Puffer, F.: Equity in the use of physician visits in OECD countries: has equal treatment for equal need been achieved? In: OECD (ed.) Measuring up: improving health systems performance in OECD countries, pp. 225–248. OECD, Paris (2002)
18. Van Doorslaer, E., Masseria, C., Koolman, X., the OECD Health Equity Research Group: Inequalities in access to medical care by income in developed countries. Can. Med. Assoc. J. 174, 177–183 (2006). doi:10.1503/cmaj.050584
19. Van Doorslaer, E., Wagstaff, A., et al.: Equity in the delivery of health care in Europe and the U.S. J. Health Econ. 19, 553–583 (2000). doi:10.1016/S0167-6296(00)00050-3
20. Van Doorslaer, E., Wagstaff, A., Calonge, S., et al.: Equity in the delivery of health care: some international comparisons. J. Health Econ. 11, 389–411 (1992). doi:10.1016/0167-6296(92)90013-Q
21. Clarke, P.M., Gerdtham, U., Connelly, L.B.: A note on the decomposition of the health concentration index. Health Econ. 12, 511–516 (2003). doi:10.1002/hec.767
22. Aronson, J., Johnson, P., Lambert, P.: Redistributive effect and unequal tax treatment. Econ. J. 104, 262–270 (1994). doi:10.2307/2234747
23. Lambert, P.J., Aronson, J.R.: Inequality decomposition analysis and the Gini coefficient revisited. Econ. J. 103, 1221–1227 (1993). doi:10.2307/2234247
24. Jimenez-Rubio, D., Smith, P., Van Doorslaer, E.: Equity in health and health care in a decentralized context: evidence from Canada. Health Econ. 17, 377–392 (2008). doi:10.1002/hec.1272
25. Riphahn, T., Serfling, O.: Item non-response on income and wealth questions. Empir. Econ. 30, 521–538 (2005). doi:10.1007/s00181-005-0247-7

Acknowledgments

I would like to thank Jerry Hurley and an anonymous referee for their helpful comments. I am solely responsible for any remaining errors and omissions.

Author information

Authors and Affiliations

School of Public Finance and Public Policy, Central University of Finance and Economics, 39 South College Road, 100081, Beijing, China
Hai Zhong

Corresponding author

Correspondence to Hai Zhong.

Appendix

See Tables 5 and 6.

Table 5 Variables included in the imputation of individual-level covariates

Table 6 Variables included in imputation of household income

About this article

Cite this article

Zhong, H. The impact of missing data in the estimation of concentration index: a potential source of bias. Eur J Health Econ 11, 255–266 (2010). https://doi.org/10.1007/s10198-009-0170-5

Received: 20 March 2008
Accepted: 23 June 2009
Published: 15 July 2009
Issue Date: June 2010
DOI: https://doi.org/10.1007/s10198-009-0170-5
