Abstract
Composite indicators are widely used to determine the ranking of countries, organizations or individuals in terms of overall performance on multiple criteria. Their calculation requires standardization of the individual statistical criteria and aggregation of the standardized indicators. These operations introduce a potential propagation effect of extreme values on the calculation of the composite indicator of all entities. In this paper, we propose robust composite indicators for which this propagation effect is limited. The approach uses winsorization based on a robust estimate of the distribution of the sub-indicators. It is designed such that the winsorization affects only the composite indicator rank but has no effect on the entities ranking in each sub-indicator. The simulation study documents the benefits of distribution-based winsorization in the presence of outliers. It leads to a ranking that is closer to the clean data ranking when compared to the ranking obtained using either no winsorization or the traditional winsorization based on empirical quantiles. In the empirical application, we illustrate the use of winsorization for ranking countries based on the United Nations Industrial Development Organization’s Competitive Industrial Performance index. We show that even though the sub-indicator ranking does not change, the robust winsorization approach has a material impact on the ranking of the composite indicator for countries with large discrepancies in the scores of the sub-indicators.
Similar content being viewed by others
Notes
A related but different literature on composite indicator robustness studies the sensitivity of the composite indicator-based ranking to the choice of standardization, aggregation and imputation methods. Booysen (2002) provides an overview of dimensions to classify and evaluate the various implementations. Saisana and Saltelli (2011) use Monte Carlo techniques to quantify the sensitivity of the composite indicator rankings to the choice of standardization, aggregation and imputation method. Their technique is used in UNIDO (2013) to evaluate the robustness of the Composite Industrial Performance index. Davino and Romano (2014) describe how multivariate statistical techniques such as analysis of variance (ANOVA) and principal components analysis (PCA) can be used to analyze the variability due to the different composite indicator construction methods. Cherchye et al. (2008) propose a linear programming technique to obtain a robust ranking, which holds for a wide set of normalization and/or aggregation procedures. Permanyer (2011) studies the lack of ranking robustness by choosing specific weights scheme for the variables included in composite indicators.
It should be noted that other complex distributions could be considered to model the data, but the Weibull distribution is chosen because of its simple form and because it is applicable to the indicators used in the empirical application. Inspired by Alfons et al. (2013), we replace the extreme observations with quantiles of the fitted Weibull distribution of sub-indicators.
For instance, when we extend the dimension of sub-indicators to 8 or use data generated from a Weibull distribution with other parameters, we still reach similar conclusions.
References
Alfons, A., Templ, M., & Filzmoser, P. (2013). Robust estimation of economic indicators from survey samples based on Pareto tail modelling. Journal of the Royal Statistical Society: Series C (Applied Statistics), 62, 271–286.
Boudt, K., Caliskan, D., & Croux, C. (2011). Robust explicit estimators of Weibull parameters. Metrika, 73(2), 187–209.
Booysen, F. (2002). An overview and evaluation of composite indices of development. Social Indicators Research, 59(2), 115–151.
Cherchye, L., Moesen, W., Rogge, N., & Van Puyenbroeck, T. (2007). An introduction to “benefit of the doubt” composite indicators. Social Indicators Research, 82, 111–145.
Cherchye, L., Ooghe, E., & Van Puyenbroeck, T. (2008). Robust human development rankings. The Journal of Economic Inequality, 6, 287–321.
Correia, C. M. (2014). Measuring creativity in the EU member states. Investigaciones Regionales: Journal of Regional Research, 30, 7–26.
Davino, C., & Romano, R. (2014). Assessment of composite indicators using the ANOVA model combined with multivariate methods. Social Indicators Research, 119, 627–646.
Hubert, M., Rousseeuw, P. J., & Verdonck, T. (2012). A deterministic algorithm for robust location and scatter. Journal of Computational and Graphical Statistics, 21(3), 618–637.
Khan, J. A., Van Aelst, S., & Zamar, R. H. (2007). Robust linear model selection based on least angle regression. Journal of the American Statistical Association, 102(480), 1289–1299.
Maggino, F. (2017). Complexity in society: From indicators construction to their synthesis (Vol. 70). Berlin: Springer.
Moll de Alba, J., & Todorov, V. (2018). Green industrial performance: The GIP index. World Review of Science, Technology and Sustainable Development, 14(2–3), 266–293.
Moll de Alba, J., & Todorov, V. (2019). Measurement of green industrial performance: An enhanced GIP index. International Journal of Environment and Sustainable Development (forthcoming).
OECD. (2008). Glossary of statistical terms.
OECD. (2008). Handbook on constructing composite indicators: Methodology and user guide. Paris: OECD.
Parente, F. (2019). A multidimensional analysis of the EU regional inequalities. Social Indicators Research, 143(3), 1017–1044.
Permanyer, I. (2011). Uncertainty and robustness in composite indices rankings. Oxford Economic Papers, 64(1), 57–79.
Rousseeuw, P. J. (1984). Least median of squares regression. Journal of the American Statistical Association, 79(388), 871–880.
Rousseeuw, P. J. (1985). In W. Grossmann, G. Pflug, I. Vincze and W. Wertz (Eds.), Mathematical statistics and applications (Vol. B, pp. 283–297). Dordrecht: Reidel.
Rousseeuw, P. J., & Driessen, K. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41, 212–223.
Saisana, M. (2010). ELLI-index: A sound measure for lifelong learning in the EU European Commission. EUR 24529 EN.JRC, IPSC, Ispra, Italy (pp. 15–16).
Saisana, M., & Saltelli, A. (2008). Expert panel opinion and global sensitivity analysis for composite indicators. Computational methods in transport: Verification and validation (pp. 251–275). Dordrecht: Springer.
Saisana, M., & Saltelli, A. (2011). Rankings and ratings: Instructions for use. Hague Journal on the Rule of Law, 3(2), 247–268.
Saltelli, A. (2007). Composite indicators between analysis and advocacy. Social Indicators Research, 81, 65–77.
Tarabusi, E. C., & Guarini, G. (2013). An unbalance adjustment method for development indicators. Social Indicators Research, 112(1), 19–45.
Todorov, V., & Filzmoser, P. (2009). An object-oriented framework for robust multivariate analysis. Journal of Statistical Software, 32(3), 1–47.
UNIDO. (2002). Industrial development report 2002/2003: Competing through innovation and learning. Vienna: UNIDO.
UNIDO. (2013). Competitive industrial performance index 2012/2013: The industrial competitiveness of nations: Looking back, forging ahead. Vienna: UNIDO.
UNIDO. (2017). Competitive industrial performance report 2016 (Vols. 1–2). Vienna: UNIDO.
UNIDO. (2019). Competitive industrial performance report 2018. Vienna: UNIDO.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The views expressed herein are those of the author(s) and do not necessarily reflect the views of the United Nations Industrial Development Organization.
Rights and permissions
About this article
Cite this article
Boudt, K., Todorov, V. & Wang, W. Robust Distribution-Based Winsorization in Composite Indicators Construction. Soc Indic Res 149, 375–397 (2020). https://doi.org/10.1007/s11205-019-02259-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-019-02259-w