Abstract
The purpose of this paper is to raise awareness of missing data when concentration indices are used to evaluate health-related inequality. Concentration indices are most commonly calculated using individual-level survey data. Incomplete data is a pervasive problem faced by most applied researchers who use survey data. The default analysis method in most statistical software packages is complete-case analysis. This excludes any cases where any variables are missing. If the missing variables in question are not completely random, the calculated concentration indices are likely to be biased, which may lead to inappropriate policy recommendations. In this paper, I use both a case study and a simulation study to show how complete-case analysis may lead to biases in the estimation of concentration indices. A possible solution to correct such biases is proposed.
Similar content being viewed by others
Notes
The unweighted percentage is 2.62%. It becomes 2.59% after we apply population sampling weights. All the percentage numbers referred below in this section are population-weighted.
The total household income in the top category is assumed at $100,000.
Population sampling weights capture differences between the original sample design and resulting response rates. They may also capture some of the patterns in the item-non-response that we are looking at. If we calculate the concentration indices without sampling weights, we can find that the differences between the concentration indices calculated from different methods are all slightly larger than those presented in the fourth column of Table 2. This finding may imply that the bias from using data with missing values would be less severe if we use population sampling weights.
References
Wagstaff, A., Van Doorslaer, E., Paci, P.: Equity in the finance and delivery of health care: some tentative cross-country comparisons. Oxf. Rev. Econ. Policy 5, 89–112 (1989). doi:10.1093/oxrep/5.1.89
Little, R.J., Rubin, D.B.: Statistical analysis with missing data. Wiley, New York (2000)
Ardington, C., Lam, D., Leibbrandt, M., Welch, M.: The sensitivity to key data imputations of recent estimates of income poverty and inequality in South Africa. Econ. Model. 23, 822–835 (2005). doi:10.1016/j.econmod.2005.10.009
Briggs, A., Clark, T., Wolstenholme, J., Clarke, P.: Missing…. presumed at random: cost-analysis of incomplete data. Health Econ. 12, 377–392 (2003). doi:10.1002/hec.766
Nicoletti, C., Peracchi, F.: The effects of income imputation on microanalyses: evidence from the European Community Household Panel. J. R. Stat. Soc. A 169, 625–646 (2006). doi:10.1111/j.1467-985X.2006.00421.x
Kakwani, N.C., Wagstaff, A., van Doorslaer, E.: Socioeconomic inequalities in health: measurement, computation and statistical inference. J. Econom. 77, 87–103 (1997). doi:10.1016/S0304-4076(96)01807-6
O’Donnell, O., van Doorslaer, E., Wagstaff, A., Lindelow, M.: Analyzing health equity using household survey data: a guide to techniques and their implementation. The World Bank, Washington (2008)
Rubin, D.B., Stern, H.S., Vehovar, V.: Handling ‘don’t know’ survey responses: the case of the Slovenian Plebicite. J. Am. Stat. Assoc. 90, 822–828 (1995). doi:10.2307/2291315
Brick, J.M., Kalton, G.: Handling missing data in survey research. Stat. Methods Med. Res. 5, 215–238 (1996). doi:10.1177/096228029600500302
Raghunathan, T.E., Lepkowski, J.M., van Hoewyk, J., Solenberger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. Methodol. 27, 85–95 (2001)
Schenker, N., Raghunathan, T.E., Chiu, P., Makuc, D.M., Zhang, G., Cohen, A.J.: Multiple imputation of missing income data in the National Health Interview Survey. J. Am. Stat. Assoc. 101, 924–933 (2006). doi:10.1198/016214505000001375
Rubin, D.B.: Multiple imputation for nonresponse in surveys. Wiley, New York (1987)
Meng, X.L.: Multiple-imputation inferences with uncongenial sources of input. Stat. Sci. 9, 538–573 (1994)
Rubin, D.B.: Multiple imputation after 18+ years. J. Am. Stat. Assoc. 91, 473–520 (1996). doi:10.2307/2291635
Mills, J.A., Zandvakili, S.: Statistical inference via bootstrapping for measures of inequality. J. Appl. Econ. 12, 133–150 (1997). doi:10.1002/(SICI)1099-1255(199703)12:2<133::AID-JAE433>3.0.CO;2-H
Zheng, B., Cushing, B.: Statistical inference for testing inequality indices with dependent samples. J. Econom. 101, 315–335 (2001). doi:10.1016/S0304-4076(00)00087-7
Van Doorslaer, E., Koolman, X., Puffer, F.: Equity in the use of physician visits in OECD countries: has equal treatment for equal need been achieved. In: OCED (ed.) Measuring up: improving health systems performance in OECD countries, pp. 225–248. OECD, Paris (2002)
Van Doorslaer, E., Masseria, C., Koolman, X., the OECD Health Equity Research Group: Inequalities in access to medical care by income in developed countries. Can. Med. Assoc. J. 174, 177–183 (2006). doi:10.1503/cmaj.050584
Van Doorslaer, E., Wagstaff, A., et al.: Equity in the delivery of health care in Europe and the U.S. J. Health Econ. 19, 553–583 (2000). doi:10.1016/S0167-6296(00)00050-3
Van Doorslaer, E., Wagstaff, A., Calonge, S., et al.: Equity in the delivery of health care: some international comparisons. J. Health Econ. 11, 389–411 (1992). doi:10.1016/0167-6296(92)90013-Q
Clarke, P.M., Gerdtham, U., Connelly, L.B.: A note on the decomposition of the health concentration index. Health Econ. 12, 511–516 (2003). doi:10.1002/hec.767
Aronson, J., Johnson, P., Lambert, P.: Redistributive effect and unequal tax treatment. Econ. J. 104, 262–270 (1994). doi:10.2307/2234747
Lambert, P.J., Aronson, J.R.: Inequality decomposition analysis and the Gini coefficient revisited. Econ. J. 103, 1221–1227 (1993). doi:10.2307/2234247
Jimenez-Rubio, D., Smith, P., Van Doorslaer, E.: Equity in health and health care in a decentralized context: evidence from Canada. Health Econ. 17, 377–392 (2008). doi:10.1002/hec.1272
Riphahn, T., Serfling, O.: Item non-response on income and wealth questions. Empir. Econ. 30, 521–538 (2005). doi:10.1007/s00181-005-0247-7
Acknowledgments
I would like to thank Jerry Hurley and an anonymous referee for their helpful comments. I am solely responsible for any remaining errors and omissions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhong, H. The impact of missing data in the estimation of concentration index: a potential source of bias. Eur J Health Econ 11, 255–266 (2010). https://doi.org/10.1007/s10198-009-0170-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10198-009-0170-5