Abstract
The United Nations’ Human Development Index remains a widely used and accepted measure of human development. Although it has been revised over the years to address various critiques, a remaining concern is the way the three dimensions are aggregated into the single index. A deterioration in one dimension can be compensated for by an improvement in another. Since compensability is inextricably linked with trade-offs and intensity of preferences, a non-compensatory (i.e., Condorcet) approach to aggregation is employed in this paper. Although non-compensatory approaches have been employed previously, this paper adds to the literature by undertaking an application of the Condorcet approach to the entire HDI. This approach, which does not use intensities of preferences, ensures that the degree of compensability connected with the aggregation model is at the minimum possible level. To achieve this, country level rankings are then compared to those for the 2020 Human Development Index which aggregates dimensions using a geometric mean. The findings demonstrated substantial changes in rank-order between the HDI and Condorcet approach. This outcome provides empirical evidence which demonstrates that the non-compensatory Condorcet approach can mitigate issues of compensation present within the geometric aggregation technique currently employed by the HDI. These findings have potential implications in aiding the identification and employment of potential policy priorities—specifically, the notion that policy should emphasise the development of a country as opposed to economic growth alone.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Since its inception in 1990, the Human Development Index (HDI) of the United Nations Development Program (UNDP) has become one of the best known and widely used measures of human development. Enthusiasm for the HDI derives from the fact it provides a broader measure of a country’s development than the traditional measure of income per capita. It is also relatively straightforward to calculate and provides a transparent tool for comparing progress across a large number of countries. At the same time the HDI has been criticised on a number of grounds. These criticisms include (but are not limited to) the neglect of other important dimensions of human development (Dasgupta & Weale, 1992; Liang et al., 2019), the quality of data from which it is calculated (Herrero et al., 2012; Srinivasan, 1994), the subjective choices made during the construction process (Mariano et al., 2021), the index adding little to the value of the individual variables comprising it (McGillivray, 1991; Morse, 2014), the use of national averages and the masking of inter-country inequalities (Hicks, 1997; Klugman, Rodriguez & Choi, 2011; Sagar & Najam, 1998) and the use of arbitrary weightings (Desai, 1991; Liang et al., 2019). Yet the HDI has stood the test of time and is still widely used by policymakers, academic researchers and the media.
Over time, the methods used to calculate the HDI have been revised, with the most notable being undertaken for the 20th anniversary edition (i.e., the 2010 edition of the HDI). While retaining the same three-dimensional structure with equal weights, the 2010 version of the HDI adopted inter alia different variables to proxy its dimensions as well as a different aggregation procedure (Klugman et al., 2011). More specifically, the sub-indices of the HDI were aggregated using a geometric mean instead of an arithmetic mean. The aggregation approach was changed due to the issue of perfect substitutability between its dimensions (Herrero et al., 2012). This was a problematic assumption of the old HDI formula, because it implied that falls in the attainment of one of the HDI components could be perfectly offset by an equal improvement in the attainment of another. When adopting a geometric mean, perfect substitutability no longer applies although some degree of substitutability still exists.
This paper posits that the aggregation issue continues to be a weakness of the HDI. It does not criticise the components of the HDI such as equal weighting and the arbitrary parameters of the component indices which also impact the final rank order of nations. Instead, this paper focuses on the method in which the HDI is aggregated and how this contributes to a flawed ranking result of countries given that falls in one component can still be compensated for by improvements in another. Ravallion (2010) argued that these trade-offs, measured by the marginal rates of substitution across HDI dimensions, are troubling on ethical and other grounds. He demonstrates that the implicit monetary values attached to an extra year of life vary from as little as $0.51 per year for Zimbabwe to almost $9,000 per year for the richest countries. Such issues can be problematic when optimal resource allocation decision making occurs in the context of trade-offs.
Although non-compensatory approaches have been employed previously (Alaimo & Seri, 2023; Goerlich & Reig, 2021), this paper adds to the literature by undertaking an application of the Condorcet approach to the entire HDI. Under this approach, the weights of the HDI dimensions are interpreted as importance weights rather than marginal rates of substitution which is more in line with the original spirit of the HDI. Moreover, there is no compensation allowed for changes in the dimension values using this approach. Country rankings are then compared to those for the latest HDI. The results showed substantial changes in rank-order between the HDI and Condorcet approach.
The remainder of this paper is structured as follows. Section 2 describes the main changes to the HDI methodology. Section 3 discusses the weakness of the aggregation approach used in the HDI and outlines the Condorcet aggregation approach adopted by this paper. Section 4 provides the results of the changes to the rank-order of countries from the HDI geometric mean aggregation compared to the Condorcet aggregation technique. Finally, Sect. 5 concludes with some policy implications arising from the research.
2 Reviewing the Main Methodological Changes to the HDI
From 1990 until 2009, a long and healthy life, which comprises the first HDI dimension, was measured using life expectancy at birth. The second dimension: knowledge, was measured using the adult literacy rate (with a weighting of two-thirds); and the combined gross enrolment ratio for primary, secondary and tertiary schooling (with a weighting of one-third). The third, and final, dimension was a decent standard of living, which was measured using GDP per capita in US dollars adjusted for Purchasing Power Parity (PPP). The variable was logged to reflect the diminishing importance of income for human development at higher levels of GDP per capita. The variables for the three dimensions were normalised using a fixed minima and maxima to achieve the property of scale neutrality using the following formula:
To arrive at the final HDI score, an arithmetic mean using equal weights of one-third for the three-dimensional indices was applied.
In 2010, the HDR methodology was revised with the following changes: The aggregation of the HDI involved the geometric mean of the normalised indices (Deb, 2015); the proxy for knowledge combined the mean years of schooling for adults 25 years and older with the expected years of schooling of children commencing schooling; and the proxy for standard of living was replaced with the log of per capita Gross National Income (GNI) adjusted for PPP. Another significant revision was the use of a constant maxima for the normalization of the indices, instead of the observed maxima. The specified upper limits were: 85 years for life expectancy, 15 years for mean years of schooling, 18 years for expected years of schooling, and $75,000 for GNI per capita.Footnote 1 The scores for the three dimensional indices (I) were aggregated using the geometric mean rather than the arithmetic mean to provide the overall HDI score:
3 Index Aggregation: Additive Method, Geometric Mean and the Condorcet Approach
The Condorcet approach is also referred to as non-compensatory multi-criteria analysis aggregation technique.
As Goerlich and Reig (2021) point out, the key to distinguishing aggregation methods centres on whether they implement a compensatory or a non-compensatory logic. In this light, this section reviews the three main aggregation approaches: (i) additive method; (ii) geometric mean aggregation approach utilised by the HDI; and (iii) the Condorcet aggregation technique.
3.1 Additive Method
Up until 2009, the HDI employed an additive aggregation approach, via an arithmetic mean, to determine its value. The approach is based on ordinal information, where a country’s rank is summed for each of the indicators. As Munda and Nardo (2005a) state, this approach is simple to use, does not consider potential interactions across the dimensions and is insensitive to outliers. This has led to arguably its main criticism which is that shortfalls in one dimension can be compensated for by a strong outcome in another dimension (Sagar & Najam, 1998). Studies by McGillivray and White (1993), Dijkstra and Hanmer, (2000) and Cahill (2005) highlight how strong GDP outcomes can overshadow poor health or education outcomes potentially leading to policies that result in a sub-optimal allocation of resources. As Desai (1991) points out, the additive approach implied perfect substitutability between the three HDI dimensions reflects a reductionist viewpoint making it an inappropriate approach. Given these shortcomings, the HDI moved to a geometric aggregation approach from 2010. This approach is reviewed below.
3.2 HDI Geometric Mean Aggregation
Sagar and Najam (1998) put forward an aggregate calculation based on a multiplicative scheme where the HDI is calculated by the geometric mean of the component values. With the application of a geometric mean the UNDP, 2010, (p. 15) notes:
Poor performance in any dimension is now directly reflected in the HDI, and there is no longer perfect substitutability across dimensions. This method captures how well rounded a country’s performance is across the three dimensions. As a basis for comparisons of achievement, this method is also more respectful of the intrinsic differences in the dimensions than a simple average is. It recognizes that health, education and income are all important, but also that it is hard to compare these different dimensions of well-being and that we should not let changes in any of them go unnoticed.
As asserted by Sagar and Najam (1998) this property of the HDI computed using a geometric mean is consistent with the original purpose of HDI which was to control for the trade-offs between each dimension. By doing so, each dimension was treated as being vital and non-substitutable. However, under this multiplicative scheme, a compensatory effect still occurs where one dimension that fares poorly can be offset by a good result in another dimension. As Greco et al. (2019) assert, the use of a geometric mean is an improved approach compared to the additive aggregation via an arithmetic mean (i.e., linear approach). However, they add, that it is not ideal as it is akin to being between compensatory and non-compensatory techniques (Zhou et al., 2010). Since the objective is to assign weights based on their importance to the model, such a trade-off is theoretically inconsistent with this objective. This perspective is echoed by Goerlich and Rigg, 2021, (p. 3), who opined that, “… a geometric aggregation represents a middle-of-the-way approach, between full compensability and non-compensability (Blancas et al., 2013; Nardo et al., 2008; Van Puyenbroeck & Rogge, 2017; Zimmermann & Zysno, 1983)”. They also state that the limitation of this method (i.e., inferior compensability) is most apparent when indices have lower values. In such cases, a non-compensatory multi-criteria approach is preferred.
Although less compensatory than additive aggregation, the compensability nature of the multiplicative scheme does not, this paper insists, go far enough to align with the original goals of the UNDP, which was to mirror the real needs of the nations under analysis rather than to provide a distorted image through an uneven, or overly compensatory, dimension contribution. This is why an absence of conflict amongst the variables is preferred.
For aggregation purposes, the nature of the data is an important consideration. Since the HDI data is not on a ratio scale, the aggregation options are quite limited. According to Ebert and Welsch (2004), a weighted geometric mean of the un-normalised data can produce a purposeful index so long as all the variables are strictly positive with a natural origin.Footnote 3 The 2010 revised HDI employed the geometric mean as one part of its revisions outlined in Sect. 2 above. According to Klugman et al. (2011), this approach seems to mirror the alternative put forward by Herrero, Martinez and Villar (2005, 2010).
Although the notion of a purposeful index through the use of a geometric mean is welcome, its purposefulness could be further improved by addressing remaining aggregation issues. Specifically, empirical studies have showed that the geometric mean poses other issues such as not knowing the degree of compensability (Alaimo & Seri, 2023), penalising unbalanced or skewed attainment across the HDI dimensions (Mangaraj & Aparajits, 2020), imperfect substitutability (Mazziotta & Pareto, 2022). In addition, Pinar’s (2022) analysis of HDI demonstrated that similar HDI scores were received for most nations with both aggregation techniques–geometric and arithmetic mean. Furthermore, these similarities in scores also were displayed when there was great variations across the three dimensions. Consequently, from a conceptual and methodological perspective, the aggregative-compensative approach is seen as inappropriate (Alaimo & Seri, 2021; Alaimo et al., 2022).
Furthermore, prior studies such as Herrero et al. (2012), Klugman et al. (2011), Zambrano (2011), Ravallion (2010) and Morse (2014) also examined the impact of the HDI modifications to its methodology on country rankings. While all agree that the refinements to the HDI has led to improved measurement, there is also mention of inconsistencies and improvements that are still required. For example, Morse (2014) argued that the changes in the HDI methodology led to increased turbulence in country rankings. This increased turbulence can result in enhanced press reporting of the rank order changes which can be construed as a positive. However, it could also result in a country either rising rapidly without necessarily invoking anything tangible or declining greatly despite initiating a set of good policies designed to help with human development.
Ravallion (2010) claims that the relaxation of perfect substitutability between the three dimensions has led to ‘troubling trade-offs’ which an alternative aggregation technique that still allows for imperfect substitution could resolve. In addition, Herrero et al. (2012) argues that although the geometric mean is the right choice, improvements are required regarding how raw variables are numbered and the manner in which the income log variable is explained. Similar arguments have been made by Deb (2015) and Anand (2018). Unfortunately, change in the HDI aggregation approach reduces, but does not solve the compensability issue between the different dimensions as it does not penalise unbalanced achievements across dimensions strongly enough.Footnote 4 This supports the earlier assertion by Goerlich and Reig (2021) that the geometric aggregation scheme lies between full compensability and non-compensability.Footnote 5
Given that the principle of the symmetrical importance of variables needs to be retained, this paper argues the HDI needs to shift away from the use of a geometrical mean, since it does not solve the ‘trade-off’ issue (Greco et al., 2019; Zhou et al., 2010). Given this, a move towards adopting a non-compensatory multi-criteria approach is preferred. This approach avoids complete compensability and suggests a theoretical assurance that the HDI weights are interpreted as measures of importance rather than marginal rates of substitution (Bouyssou, 1986; Munda, 2005). From a compensatory logic perspective, the marginal rates of substitution is problematic when the importance of the weights for individual indicators are actually measuring substitutability (Paruolo et al., 2013).
3.3 Non-compensatory Multi-criteria analysis (Condorcet Approach)
Our approach tries to resolve the compensability issue by using a discrete multi-criteria approach that includes the lack of preference independence (Munda, 1995, 2012; Roy, 1996). A pair-wise comparison of selected countries across the HDI variables is performed and ranked from best to worst. This Condorcet-type ranking procedure occurs via a complete pre-order by a mathematical formulation (Natoli & Zuhair, 2011).
Under the Condorcet approach, the intensity of preferences are not used, thus minimising compensability with the HDI aggregation, with the pair-wise comparisons interpreted as a voting matrix (Munda & Nardo, 2005a; Nardo et al., 2005). As Munda (2005, 2012) points out, unlike linear or geometric aggregation, the Condorcet approach finds compromises between two or more legitimate goals assuring non-compensability which is also supported via the social choice literature (Arrow & Raynaud, 1986; Moulin, 1988; Munda & Nardo, 2005b).Footnote 6
The main drawback to this method, according to Munda (2005), is that the number of permutation calculations increases exponentially, when many countries are analysed. Despite this drawback, employing a non-compensability aggregation approach to the HDI is desirable due to its theoretical consistency and its ability to have no limitation on the measurement scale of variables (Munda & Nardo, 2005a). These benefits should reduce the uncertainty and imprecision in the HDI index. The Condorcet technique is detailed below. Note, that while the HDI weights should be viewed as trade-offs in compensatory aggregation methods, they are viewed as ‘importance coefficients’ in non-compensatory aggregation methods.
An axiomatic setting for the non-compensatory approach is outlined below. When comparing development across countries, it is common to find that one country outperforms another in one area but not in another. Overcoming these conflicting results in a non-compensatory manner enables this paper to appropriately rank the selected countries favouring a Condorcet approach.Footnote 7 However, one drawback involves the issue of cycles.Footnote 8
Regarding cycles, we can consider a voting situation with three voters, V1, V2 and V3 and three candidates C1, C2 and C3. The preferences are given in table 1 below where candidates are given in decreasing order of preference.
For instance, if C3 is the winner, It is possible to argue that C2 should have been the winner because V1 and V2 prefer C2 to C3 and only V3 prefer C3 to C2. It then follows that C1 is preferred to C2 and C3 is preferred to C1 with a two to one margin in each case. This is a case of cycling where C1 is preferred to C2 which is preferred over C3 which is preferred over C1.
Cycles can occur quite regularly with macroeconomic data and is dealt with via the Condorcet-Kemeny-Young-Levenglick (CKYL) ranking procedure.Footnote 9 However, as Munda (2005) and Munda and Nardo (2005a) point out, one still must accept that rank reversals may occur. A recent study by Su et al. (2023), examined the rank-reversal problem through employing a subgroup dominance-based benefit-of-doubt (SD-BoD) model. The model compared and analysed the HDI of 28 European nations and found that the severity of rank reversals was largely dictated by the employed common-weight BoD method. Although rank reversals are at variance with Arrow’s axiom of independence of irrelevant alternatives, Young (1988) states that the locally stable nature of the CKYL ranking procedure maintains the ranking of alternatives. As Munda (2005) states, the CKYL ranking procedure for composite indices is obtained by a simple, yet formal ranking algorithm that supports the maximum likelihood ranking of countries. To achieve this, the maximum number of individual indicators for each pair-wise comparison is summed over all pairs of countries involved as per the below ranking algorithm.
Given a set of individual indicators \(G = \{ g_{m} \} ,m = 1,2,...,M,\) and a defined set \(A = \{ a_{n} \} ,n = 1,2,...,N\) of countries; each country \(a_{n}\) is evaluated with respect to an individual indicator \(g_{m}\) from an assumed ordinal, interval or ratio measurement scale. Here, individual indicators with higher values are preferred to a lower one as outlined below:
where P and I specify a preference and an indifference relation accordingly, both fulfilling the transitive property.
From there, a set of individual indicator weights \(W = \{ w_{m} \} ,m = 1,2...,M,\) with \(\sum\limits_{m = 1}^{M} {w_{m} } = 1,\) are derived as importance coefficients. To rank in a complete pre-order, from best to worst with the available information, the following mathematical aggregation procedure is required: (i) pair-wise comparison of countries according to the whole set of individual indicators used; and (ii) ranking of countries in a complete pre-order. To perform a pair-wise comparison of countries for the whole set of HDI indicators, an axiomatic system is adapted from Arrow and Raynaud (1986, pp. 81–82) is required.
Axiom 1: Diversity
Each individual indicator is a total order on the defined set A of countries to be ranked, and there is no restriction on these variables; they can be any total order on A.Footnote 10
Axiom 2: Symmetry
The individual indicators are non-comparable scales that contain ordinal pair-wise preferences. This primarily means that intensity of preferences and compensability are sidestepped, while the weights are symmetrical importance coefficients. It also means that a normalisation step is not needed since this axiom (symmetry) reduces uncertainty and imprecision.
Axiom 3: Positive Responsiveness
For two countries \(a\) and \(b\) the level of performance between them depends on how many indicators and their associated weights that rank \(a\) before \(b\).Footnote 11 As a result, the equal treatment of all individual indicators is broken (Munda, 2005; Munda & Nardo, 2005a) and a trade-off occurs between anonymity and decisiveness (i.e., a ranking must be selected). Of the two, decisiveness is favoured.Footnote 12
The three axioms (diversity, symmetry and positive responsiveness) allow a N x N matrix,\(E\), called an outranking matrix to be established, which assumes to contain all available information (Arrow & Raynaud, 1986; Roy, 1996). Any generic element of \(E:e_{jk} ,j \ne k\) arises from the pair-wise comparison of all the \(M\) individual indicators between countries j and k. To arrive at a global pair-wise comparison, the equation below is employed:
where, \(w_{m} \left( {P_{jk} } \right)\) and \(w_{m} \left( {I_{jk} } \right)\) refer to the relation between preference \(P\) and indifference \(I\) based on the weights of individual variables. Hence, we state that:
The outranking matrix \(E\) comprise all the \(N\left( {N - 1} \right)\) pair-wise comparisons. The notation represents \(R\) the set of all \(N!\) possible complete rankings of alternatives, \(R = \left\{ {r_{s} } \right\},s = 1,2,...,N!.\) For each \(r_{s}\) the corresponding score \(\phi_{s}\) is calculated as the sum of \(e_{jk}\) over all the \(\left( {\begin{array}{*{20}c} N \\ 2 \\ \end{array} } \right)\) pairs \(j,k\) of alternatives, i.e. \(\phi_{s} = \sum {e_{jk} }\), where \(j \ne k,s = 1,2,...N!\) and \(e_{jk} \in r_{s}\).
The final ranking \(\left( {r*} \right)\) maximises the equation below:
The CKYL approach also contains other formal properties which include (Munda, 2005; Young, 1988; Young & Levenglick, 1978):
-
Neutrality: no country is favoured over another, thus all countries are treated equally.
-
Unanimity: if all the individual indicators favour country a to country b, then country a should be chosen.
-
Monotonicity: there are no defiers, hence if country a is preferred in any pair-wise comparison and only their variables are improved, then country a should continue to be the higher rank ordered country.
-
Reinforcement: if the set A of countries is ranked by two subsets \(G_{1}\) and \(G_{2}\) of the individual indicator set G, such that the ranking is the same for both \(G_{1}\) and \(G_{2}\), then \(G_{1} \cup G_{2} = G\) should still supply the same ranking. This notion of reinforcement is crucial when initially applying the variables to each single dimension and then aggregating them to the general model.
As Natoli and Zuhair (2011) point out, given the importance of reinforcement, the only Condorcet consistent rule that can uphold this property is the maximum likelihood ranking procedure. A property, which Arrow and Raynaud (1986) assert, whose definite ethical construct is critical to welfare economics.
It was found that if equal values are used for the weightings in determining the ranking using the Condorcet approach, the ranking may not be unique. With equal weightings, it is found that the non-uniqueness of the ranking occurs within a few ranking positions. Thus, as pointed out by Natoli and Zuhair (2011), when undertaking any aggregation technique, practical compromises need to take place. Hence, a small value, such as π/100, was used to adjust different weightings while still maintaining \(\sum_{m=1}^{M}{w}_{m}=1.\) For example, if M = 3, traditional weightings would be w1 = w2 = w3 = 1/3. In the present work, we use \({w}_{1}=\frac{1}{3}+ \frac{\uppi }{100}, {w}_{2}= \frac{1}{3}, {w}_{3}= \frac{1}{3}-\frac{\uppi }{100}\). With an interchanging of weightings, the ranking positions will change within the few non-unique positions of those using the equal weightings. This does not impact on the Condorcet method since, in reality, equal weighting for all the indicators is a subjective choice and the small difference in weighting means that some indicators are slightly more favoured than the others.
Although both anonymity and information on the preference intensity of variables is lost, the non-compensatory approach preserves the theoretical importance of the coefficients and reduces the main source of imprecision and uncertainty (Munda & Nardo, 2005a). As Natoli and Zuhair (2011) posit, since progress measures can serve as a foundation to improve decisions on resource allocations, the Condorcet approach is preferred since such policy decisions should occur without trade-offs between dimensions.
4 Results and Discussion
4.1 Empirical Results
To explore whether there are rank-order issues prevalent within the 2020 HDI report derived from the geometric mean aggregation approach relative to the Condorcet approach, an analysis of rank-orders is required.
Table 2 contains the 189 countries in the 2020 HDI report and displays both their HDI and Condorcet rank-order. The results are also presented via Fig. 1. Although rank-order change occurs throughout, as shown in Fig. 1, the most pronounced variation between the HDI and Condorcet rank-orders seems to be concentrated among the mid-range countries with variations at the high-range and low-range being less apparent. This finding is similar to those highlighted by Greco et al. (2019) in a study by Munda (2012) who applied a CKYL non-compensatory approach to the Environmental Sustainability Index (ESI). Munda (2012) identified noticeable difference in rankings between his CKYL approach and the linear approach used by the ESI, which was mostly evident in the middle positions and less evident among those ranked first and last.
A study by Mangaraj and Aparajita (2020) undertook a generalised HDI (GHDI) as well as a measure of relative GHDI (GHDIR) with a multi-index ranking model. They compared the top 30 countries as ranked via the HDI with the GHDI and the GHDIR. Their findings showed that the rank-order changes of the GHDI did not deviate much from the ranks of the HDI. This reinforces the notion of little deviation when the focus is on either the high-end or low-end of nations in the HDI table.
Table 3 provides a snapshot of the number of countries which experience a sizable change in rank order. As Table 3 demonstrates, of the 189 countries, just over half (104, or 55% of all countries in the HDI) experience a rank order change of at least six positions. In addition, just over one-quarter (27.5%, or 52 countries in the HDI) experienced a movement of at least 10 rank-order positions. At the highest level of rank-order changes, 11 countries (or 5.8% of all HDI countries) experience a rank-order change of at least 19 positions.
In keeping with the highest level of rank-order changes, Table 4 identifies the top ten countries to experience the highest level of variability in rank-order changes in terms of both improvement and deterioration. As identified earlier, most of the countries that experienced the highest level of rank-order changes are those grouped in the middle of the HDR report (HDI original rank from 58 to 95).
As Table 4 shows, Morocco [MAR], Maldives [MDV], Costa Rica [CRI] and Cuba [CUB] benefited the most regarding a positive rank-order change under the Condorcet approach. For rank deteriorations, the most notable occurred to Bahamas [BHS], Trinidad & Tobago [TTO], Equatorial Guinea [GNQ] and Turkmenistan [TKM].
The magnitude of the changes in rank-order suggests that, as Anand (2018, p. 6) points out, that the geometric mean is not suitable since it allows “… changes in any of them [variables] to go unnoticed”. Instead, due to the non-compensatory multi-criteria approach, the Condorcet aggregation technique can identify compromises between more than one legitimate goal to assure non-compensability.
Table 5 shows the overall number of rank changes from the HDI rank-order to the Condorcet rank-order for all countries. Here, tied ranks refers to no change in a country’s rank-order from the HDI and Condorcet approach, a positive rank refers to an improvement in a country’s rank-order from the HDI to the Condorcet (e.g. 5 in HDI to 4 under a Condorcet approach), and a negative rank refers to a deterioration in a country’s rank-order from HDI to Condorcet (e.g. 6 in HDI to 5 under a Condorcet approach). Of the 189 countries, there were only five occasions (2.6%) which resulted in a tied rank. Thus, the Condorcet aggregation approach results in almost all country rankings undergoing a change in rank-order.
4.2 Discussion
A case for the Condorcet aggregation method providing a better measure of country level human development can be made by examining the index scores across the three individual HDI components. For example, among countries with a ‘very high’ level of human development, Singapore is one of the countries that ranks much higher (3rd) under the Condorcet aggregation method than under the geometric mean approach (11th). This is because Singapore outranks many countries ranked higher in the HDI rankings with respect to both life expectancy and income. Singapore ranks less well for education. Under a geometric mean aggregation approach its overall HDI score and ranking are lower since the life expectancy and income index scores compensate for the lower education index score. Its higher rank under the Condorcet approach stems from the fact that Singapore ranks better on two of the three HDI components than nearly all other countries. We argue that unless there is a reason to believe that education is more important than income and health, Singapore should receive a higher overall ranking given its better performance in two of the three HDI components. In comparison, Switzerland which is ranked 3rd by the UNDP’s HDI, ranks first under the Condorcet aggregation method. This is because it outranks Norway, Ireland and all other countries on at least two of the three HDI component indices.
The Bahamas, another country with a ‘very high’ level of human development, drops 30 places in the rankings under the Condorcet method, mainly because its relatively high score for the income index can no longer compensate the relatively lower scores for life expectancy and education. A similar explanation applies to Equatorial Guinea, a country with a ‘medium’ level of human development which falls 24 places in the rankings (145th to 169th) when its relatively high income index score can no longer compensate its poorer life expectancy and education index scores. Conversely, Morocco, another country with a ‘medium’ level of human development ranks much better under the Condorcet approach (94th vs. 121st) because it has relatively higher life expectancy and income index scores than countries above it in the HDI index rankings.
There is relatively less change in rankings for countries with a ‘low’ level of human development. An exception is that of Eritrea which performs better when the Condorcet approach is applied since it performs relatively well according to life expectancy and income for countries at this level of development. Its rank improves from 180 to 164th when life expectancy and income are no longer compensate for lower education index scores bringing down its overall average HDI score. In sum, if we accept that the HDI components are equally important for human development and that shortfalls/achievements in one of the components cannot be compensated for (or penalised) by levels in another, our ranking according to the Condorcet method offers a more accurate depiction of the human development level of each country.
5 Conclusion and Policy Implications
This paper compares the country rankings of the UNDP’s HDI (2020) with those resulting from a modified Condorcet ranking, a non-compensatory approach. Country HDI rankings using the Condorcet aggregation approach were compared to those reported in the 2020 Human Development Report which allow for shortfalls in one index component to be compensated for by high values in another. The paper compared the rank-order of the human development for the year 2020 with the Condorcet rank-order.
Findings suggest that the approach to aggregating the HDI is very important with respect to country rankings. Although a monotonic association was identified, the changes in rank-order between the HDI and Condorcet approach were substantial. This outcome provides empirical evidence regarding the problematic, and inconsistent, nature of using a geometric mean to aggregate the HDI where ‘troubling trade-offs’ occur due to the relaxation of perfect substitutability between the three dimensions. The substantial changes derived from the Condorcet approach can be partly explained by the fact that it is able to address this issue via its non-compensatory multi-criteria analysis.
An issue with geometric mean, as discussed above and in the literature, is that weakness in one criterion can be compensated by the strength in another criteria which could confuse the interpretation of the results and unable to provide clear direction to the policy makers. By removing this compensatory effect, Condorcet ranking is able to provide clearer direction on policy formulation for development.
The HDI, with its focus on people and their capabilities, can be used to query the policy choices of nations. For instance, the change in HDI rankings based on the Condorcet approach can capture the attention of policy makers, media, non-governmental organisations, etc. and further stimulate the debate surrounding the purpose of government policy. Specifically, that policy should emphasise the development of a country as opposed to economic growth alone. It may also be used as one indicator amongst a suite-of-indicators to assess foreign aid or Official Development Assistance to low-income countries. This could result in alterations to the volume of aid received by some countries.
Notes
For more detail on the methodological changes in the HDI see Deb (2015).
Interestingly, Bohringer and Jochem (2007) assessed eleven sustainability indices that possessed ratio scale noncomparability scaled variables – of which the HDI was one. Of the eleven, only the Living Planet Index failed to use the geometric mean aggregation procedure set out in Ebert and Welsch’s (2004) article.
Although as Gasparatos, El-Haram and Horner (2008) opine, it would appear on a lower level than the arithmetic mean aggregation employed by the HDI.
Arguably, the geometric-mean aggregation approach is less of a concern than the functional form of the HDI’s components. For instance, regarding the functional form of the income variable. The UNDP’s choice of using a logarithmic transformation is based on marginal gains in income being more important at lower levels. However, Sagar and Najam (1998) question this approach since even at high levels, income is able to expand peoples’ choice sets. They further state that, “In fact, it could be argued that income has to reach a certain threshold before it becomes tradable for human development (Sagar and Hajam, 1998, pp. 253-54)”. Hence, if other functional forms (such as exponential or power functions) were applied to income this would result in changes to the HDI country rankings.
In fact, Arrow and Raynaud (1986, p. 77) state that, “… for aggregating an algorithm, the highest feasible multi-criterion ranking must be Condorcet.”.
In the original, Arrow and Raynaud (1986) discuss a finite set X of alternatives, with no restriction condition on the criteria that can be any total order on X.
In this instance, Arrow and Raynaud (1986) originally discussed the intensity of preferences between two alternatives xi and xj.
Furthermore, as Munda (2005) posits, to avoid the aggregation becoming lexicographic, no indicator should weigh more than 50 per cent.
References
Alaimo, L. S., & Seri, E. (2023). Measuring human development by means of composite indicators: Open issues and new methodological tools. Quality & Quantity. https://doi.org/10.1007/s11135-022-01597-1
Alaimo, L. S., Ivaldi, E., Landi, S., & Maggino, F. (2022). Measuring and evaluating socio-economic inequality in small areas: An application to the urban units of the municipality of Genoa. Socio-Economic Planning Sciences, 83, 101170.
Alaimo, L. and Seri, E. (2021). Monitoring the main aspects of social and economic life using composite indicators: A literature review. Research Group Economics, Policy Analysis and Language (REAL) series.
Anand, S. (2018). Recasting human development measures. UNDP Human Development Report Discussion Paper, March 2018.
Arrow, K. J., & Raynaud, H. (1986). Social choice and multicriterion decision-making. MIT Press.
Blancas, F. J., Contreras, I., & Ramírez-Hurtado, J. M. (2013). Constructing a composite indicator with multiplicative aggregation under the objective of ranking alternatives. Journal of the Operational Research Society, 64(5), 668–678.
Bohringer, C., & Jochem, P. E. P. (2007). Measuring the immeasurable: A survey of sustainability indices. Ecological Economics, 63(1), 1–8.
Bouyssou, D. (1986). Some remarks on the notion of compensation in MCDM. European Journal of Operational Research, 26(1), 150–160.
Cahill, M. B. (2005). Is the human development index redundant? Eastern Economic Journal, 31(1), 1–5.
Condorcet, Marquis de. (1785). Essai sur l’application de l’analyse a la probabilite des decisions rendues a la pluralite des voix. De L’imprimerie Royale, Paris.
Dasgupta, P., & Weale, M. (1992). On measuring the quality of life. World Development, 20(1), 119–131.
Deb, S. (2015). The human development index and its methodological refinements. Social Change, 45(1), 131–136.
Desai, M. (1991). Human development, concepts and measurement. European Economic Review, 35(2–3), 350–357.
Dijkstra, A. G., & Hanmer, L. C. (2000). Measuring socio-economic gender inequality: Toward an alternative to the UNDP gender-related development index. Feminist Economics, 6(2), 41–75.
Ebert, U., & Welsch, H. (2004). Meaningful environmental indices: A social choice approach. Journal of Environmental Economics and Management, 47(2), 270–283.
Gasparatos, A., El-Haram, M., & Horner, M. (2008). A critical review of reductionist approaches for assessing the progress towards sustainability. Environmental Impact Assessment Review, 28(4–5), 286–311.
Goerlich, F. J., & Reig, E. (2021). Quality of life ranking of Spanish cities: A non-compensatory approach. Cities, 109, 102979.
Greco, S., Ishizaka, A., Tasiou, M., & Torrisi, G. (2019). On the methodological framework of composite indices: A review of the issues of weighting, aggregation, and robustness. Social Indicators Research, 141, 61–94.
Herrero, C., Martinez, R., & Villar, A. (2012). A newer human development index. Journal of Human Development and Capabilities, 13(2), 247–268.
Herrero, C., Martínez, R., & Villar, A. (2005). A multiplicative human development index. Working Paper, Fundación BBVA / BBVA Foundation.
Herrero, C., Martínez, R., & Villar. A. (2010). Improving the measurement of human development. Human Development Research Paper 12. UNDP–HDRO, New York.
Hicks, D. A. (1997). The inequality-adjusted human development index: A constructive proposal. World Development, 25(8), 1283–1298.
Kemeny, J. (1959). Mathematics without numbers. Daedalus, 88, 571–591.
Klugman, J., Rodriguez, F., & and Choi, H-J. (2011). The HDI 2010: New controversies, old critiques. Human Development Research Paper 2011/01. Human Development Report Office, UNDP, New York.
Liang, M., Niu, S., Li, Z., & Qiang, W. (2019). International comparison of human development index corrected by greenness and fairness indicators and policy implications for China. Social Indicators Research, 142(1), 1–24.
Mangaraj, B. K., & Aparajita, U. (2020). Constructing a generalized model of the human development index. Socio-Economic Planning Sciences, 70, 100778.
Mariano, E. B., Ferraz, D., & de Oliveira Gobbo, S. C. (2021). The human development index with multiple data envelopment analysis approaches: A comparative evaluation using social network analysis. Social Indicators Research, 157, 443–500.
Mazziotta, M., & Pareto, A. (2022). Composite indices construction: The performance interval approach. Social Indicators Research, 161, 511–551.
McGillivray, M. (1991). The human development index: Yet another redundant composite development indicator? World Development, 19(10), 1461–1468.
Morse, S. (2014). Stirring the pot: Influence of changes in methodology of the human development index on reporting by the press. Ecological Indicators, 45(October), 245–254.
Moulin, H. (1988). Axioms of cooperative decision-making. Cambridge University Press.
Munda, G. (1995). Multicriteria evaluation in a fuzzy environment: Theory and applications in ecological economics. Physica-Verlag.
Munda, G. (2012). Choosing aggregation rules for composite indicators. Social Indicators Research, 109(3), 337–354.
Munda, G., & Nardo, M. (2005a). Non-compensatory composite indicators for ranking countries: A defensible setting. EUR Report, EUR 21833 EN, European Commission, Joint Research Centre, Ispra.
Munda, G., & Nardo, M. (2005b). Constructing consistent composite indicators: The isue of weights. EUR Report, EUR 21834 EN, European Commission, Joint Research Centre, Ispra.
Munda, G. (2005). Multiple criteria decision analysis and sustainable development in J. Figueira., S. Greco., & M. Ehrgott (eds) Multiple-criteria decision analysis: State of the art surveys. Springer, New York, 953–986.
Nardo, M., Saisana, M., Saltelli, A., Tarantola, S., Hoffman, A., & Giovannini, E. (2005). Handbook on constructing composite indicators: Methodology and user guide. OECD Statistics Working Papers STD/DOC (2005) 3, OECD, Paris.
Nardo, M., Saisana, M., Saltelli, A., Tarantola, S., Hoffmann, A., & Giovannini, E. (2008). Handbook on constructing composite indicators. Methodology and user guide. Paris: OECD; Ispra, Italy: Joint Research Centre.
Natoli, R., & Zuhair, S. (2011). Rediscovering the Condorcet approach as an aggregation technique for progress measures. Economic Papers, 30(3), 368–376.
Paruolo, P., Saisana, M., & Saltelli, A. (2013). Ratings and rankings: Voodoo or science? Journal of the Royal Statistical Society Series a: Statistics in Society, 176(3), 609–634.
Pinar, M. (2022). Choquet-integral aggregation method to aggregate social indicators to account for interactions: An application to the human development index. Social Indicators Research, 159(1), 1–53.
Ravallion, M. (2010). Troubling tradeoffs in the human development index. (November 1, 2010). World Bank Policy Research Working Paper No. 5484. Available at SSRN: https://ssrn.com/abstract=1713611
Roy, B. (1996). Multicriteria methodology for decision aiding. Kluwer Academic Publishers.
Sagar, A. D., & Najam, A. (1998). The human development index: A critical review. Ecological Economics, 25(3), 249–264.
Srinivasan, T. N. (1994). Human development: A new paradigm of reinvention of the wheel? American Economic Review, 84(2), 238–243.
Su, W., Chen, S., Zhang, C., & Li, K. W. (2023). A subgroup dominance-based benefit of the doubt method for addressing rank reversals: A case study of the human development index in Europe. European Journal of Operational Research, 307(3), 1299–1317.
United Nations Development Programme (UNDP). (2010). Human development report 2010. Palgrave Macmillan.
United Nations Development Programme (UNDP). (2020). Human development report 2020. New York.
Van Puyenbroeck, T., & Rogge, N. (2017). Geometric mean quantity index numbers with Benefit-of-the-Doubt weights. European Journal of Operational Research, 256(3), 1004–1014.
Young, H. P. (1988). Condorcet’s theory of voting. American Political Science Review, 82(4), 1231–1244.
Young, H. P., & Levenglick, A. (1978). A consistent extension of Condorcet’s election principle. SIAM Journal of Applied Mathematics, 35(2), 285–300.
Zambrano, E. (2011). Functioning, capabilities and the 2010 human development index, controversies, old critiques. Human Development Research Paper 2011/11, Human Development Report Office, UNDP, New York.
Zhou, P., Ang, B. W., & Zhou, D. Q. (2010). Weighting and aggregation in composite indicator construction: A multiplicative optimization approach. Social Indicators Research, 96, 169–181.
Zimmermann, H. J., & Zysno, P. (1983). Decisions and evaluations by hierarchical aggregation of information. Fuzzy Sets and Systems, 10(1–3), 243–260.
Acknowledgements
The authors are grateful to the editor and the anonymous reviewers for their guidance and constructive suggestions, which have improved the quality of the manuscript.
Funding
Open Access funding enabled and organized by CAUL and its Member Institutions.
Author information
Authors and Affiliations
Contributions
RN: Conceptualisation, Literature Review, Methodology, Data collection, Writing—Original draft preparation. SF: Conceptualisation, Writing—Reviewing and Editing. JL: Analysis. SZ: Writing—Editing, Supervision.
Corresponding author
Ethics declarations
Conflict of interest
None.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Natoli, R., Feeny, S., Li, J. et al. Aggregating the Human Development Index: A Non-compensatory Approach. Soc Indic Res 172, 499–515 (2024). https://doi.org/10.1007/s11205-024-03318-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-024-03318-7