1 Introduction

Since its inception in 1990, the Human Development Index (HDI) of the United Nations Development Program (UNDP) has become one of the best known and widely used measures of human development. Enthusiasm for the HDI derives from the fact it provides a broader measure of a country’s development than the traditional measure of income per capita. It is also relatively straightforward to calculate and provides a transparent tool for comparing progress across a large number of countries. At the same time the HDI has been criticised on a number of grounds. These criticisms include (but are not limited to) the neglect of other important dimensions of human development (Dasgupta & Weale, 1992; Liang et al., 2019), the quality of data from which it is calculated (Herrero et al., 2012; Srinivasan, 1994), the subjective choices made during the construction process (Mariano et al., 2021), the index adding little to the value of the individual variables comprising it (McGillivray, 1991; Morse, 2014), the use of national averages and the masking of inter-country inequalities (Hicks, 1997; Klugman, Rodriguez & Choi, 2011; Sagar & Najam, 1998) and the use of arbitrary weightings (Desai, 1991; Liang et al., 2019). Yet the HDI has stood the test of time and is still widely used by policymakers, academic researchers and the media.

Over time, the methods used to calculate the HDI have been revised, with the most notable being undertaken for the 20th anniversary edition (i.e., the 2010 edition of the HDI). While retaining the same three-dimensional structure with equal weights, the 2010 version of the HDI adopted inter alia different variables to proxy its dimensions as well as a different aggregation procedure (Klugman et al., 2011). More specifically, the sub-indices of the HDI were aggregated using a geometric mean instead of an arithmetic mean. The aggregation approach was changed due to the issue of perfect substitutability between its dimensions (Herrero et al., 2012). This was a problematic assumption of the old HDI formula, because it implied that falls in the attainment of one of the HDI components could be perfectly offset by an equal improvement in the attainment of another. When adopting a geometric mean, perfect substitutability no longer applies although some degree of substitutability still exists.

This paper posits that the aggregation issue continues to be a weakness of the HDI. It does not criticise the components of the HDI such as equal weighting and the arbitrary parameters of the component indices which also impact the final rank order of nations. Instead, this paper focuses on the method in which the HDI is aggregated and how this contributes to a flawed ranking result of countries given that falls in one component can still be compensated for by improvements in another. Ravallion (2010) argued that these trade-offs, measured by the marginal rates of substitution across HDI dimensions, are troubling on ethical and other grounds. He demonstrates that the implicit monetary values attached to an extra year of life vary from as little as $0.51 per year for Zimbabwe to almost $9,000 per year for the richest countries. Such issues can be problematic when optimal resource allocation decision making occurs in the context of trade-offs.

Although non-compensatory approaches have been employed previously (Alaimo & Seri, 2023; Goerlich & Reig, 2021), this paper adds to the literature by undertaking an application of the Condorcet approach to the entire HDI. Under this approach, the weights of the HDI dimensions are interpreted as importance weights rather than marginal rates of substitution which is more in line with the original spirit of the HDI. Moreover, there is no compensation allowed for changes in the dimension values using this approach. Country rankings are then compared to those for the latest HDI. The results showed substantial changes in rank-order between the HDI and Condorcet approach.

The remainder of this paper is structured as follows. Section 2 describes the main changes to the HDI methodology. Section 3 discusses the weakness of the aggregation approach used in the HDI and outlines the Condorcet aggregation approach adopted by this paper. Section 4 provides the results of the changes to the rank-order of countries from the HDI geometric mean aggregation compared to the Condorcet aggregation technique. Finally, Sect. 5 concludes with some policy implications arising from the research.

2 Reviewing the Main Methodological Changes to the HDI

From 1990 until 2009, a long and healthy life, which comprises the first HDI dimension, was measured using life expectancy at birth. The second dimension: knowledge, was measured using the adult literacy rate (with a weighting of two-thirds); and the combined gross enrolment ratio for primary, secondary and tertiary schooling (with a weighting of one-third). The third, and final, dimension was a decent standard of living, which was measured using GDP per capita in US dollars adjusted for Purchasing Power Parity (PPP). The variable was logged to reflect the diminishing importance of income for human development at higher levels of GDP per capita. The variables for the three dimensions were normalised using a fixed minima and maxima to achieve the property of scale neutrality using the following formula:

$${\text{Dimension}}\;{\text{index}}\, = \,\left( {Actual\;value{-}Minimum\;value} \right)/\left( {Maximum\;value{-}Minimum\;value} \right)$$

To arrive at the final HDI score, an arithmetic mean using equal weights of one-third for the three-dimensional indices was applied.

In 2010, the HDR methodology was revised with the following changes: The aggregation of the HDI involved the geometric mean of the normalised indices (Deb, 2015); the proxy for knowledge combined the mean years of schooling for adults 25 years and older with the expected years of schooling of children commencing schooling; and the proxy for standard of living was replaced with the log of per capita Gross National Income (GNI) adjusted for PPP. Another significant revision was the use of a constant maxima for the normalization of the indices, instead of the observed maxima. The specified upper limits were: 85 years for life expectancy, 15 years for mean years of schooling, 18 years for expected years of schooling, and $75,000 for GNI per capita.Footnote 1 The scores for the three dimensional indices (I) were aggregated using the geometric mean rather than the arithmetic mean to provide the overall HDI score:

$${\text{HDI }} = \, \left( {{\text{I}}_{{{\text{Health}}}} *{\text{ I}}_{{{\text{Education}}}} *{\text{ I}}_{{{\text{Income}}}} } \right)^{{{1}/{3}}}$$
(1)

3 Index Aggregation: Additive Method, Geometric Mean and the Condorcet Approach

The Condorcet approach is also referred to as non-compensatory multi-criteria analysis aggregation technique.

As Goerlich and Reig (2021) point out, the key to distinguishing aggregation methods centres on whether they implement a compensatory or a non-compensatory logic. In this light, this section reviews the three main aggregation approaches: (i) additive method; (ii) geometric mean aggregation approach utilised by the HDI; and (iii) the Condorcet aggregation technique.

3.1 Additive Method

Up until 2009, the HDI employed an additive aggregation approach, via an arithmetic mean, to determine its value. The approach is based on ordinal information, where a country’s rank is summed for each of the indicators. As Munda and Nardo (2005a) state, this approach is simple to use, does not consider potential interactions across the dimensions and is insensitive to outliers. This has led to arguably its main criticism which is that shortfalls in one dimension can be compensated for by a strong outcome in another dimension (Sagar & Najam, 1998). Studies by McGillivray and White (1993), Dijkstra and Hanmer, (2000) and Cahill (2005) highlight how strong GDP outcomes can overshadow poor health or education outcomes potentially leading to policies that result in a sub-optimal allocation of resources. As Desai (1991) points out, the additive approach implied perfect substitutability between the three HDI dimensions reflects a reductionist viewpoint making it an inappropriate approach. Given these shortcomings, the HDI moved to a geometric aggregation approach from 2010. This approach is reviewed below.

3.2 HDI Geometric Mean Aggregation

Sagar and Najam (1998) put forward an aggregate calculation based on a multiplicative scheme where the HDI is calculated by the geometric mean of the component values. With the application of a geometric mean the UNDP, 2010, (p. 15) notes:

Poor performance in any dimension is now directly reflected in the HDI, and there is no longer perfect substitutability across dimensions. This method captures how well rounded a country’s performance is across the three dimensions. As a basis for comparisons of achievement, this method is also more respectful of the intrinsic differences in the dimensions than a simple average is. It recognizes that health, education and income are all important, but also that it is hard to compare these different dimensions of well-being and that we should not let changes in any of them go unnoticed.

As asserted by Sagar and Najam (1998) this property of the HDI computed using a geometric mean is consistent with the original purpose of HDI which was to control for the trade-offs between each dimension. By doing so, each dimension was treated as being vital and non-substitutable. However, under this multiplicative scheme, a compensatory effect still occurs where one dimension that fares poorly can be offset by a good result in another dimension. As Greco et al. (2019) assert, the use of a geometric mean is an improved approach compared to the additive aggregation via an arithmetic mean (i.e., linear approach). However, they add, that it is not ideal as it is akin to being between compensatory and non-compensatory techniques (Zhou et al., 2010). Since the objective is to assign weights based on their importance to the model, such a trade-off is theoretically inconsistent with this objective. This perspective is echoed by Goerlich and Rigg, 2021, (p. 3), who opined that, “… a geometric aggregation represents a middle-of-the-way approach, between full compensability and non-compensability (Blancas et al., 2013; Nardo et al., 2008; Van Puyenbroeck & Rogge, 2017; Zimmermann & Zysno, 1983)”. They also state that the limitation of this method (i.e., inferior compensability) is most apparent when indices have lower values. In such cases, a non-compensatory multi-criteria approach is preferred.

Although less compensatory than additive aggregation, the compensability nature of the multiplicative scheme does not, this paper insists, go far enough to align with the original goals of the UNDP, which was to mirror the real needs of the nations under analysis rather than to provide a distorted image through an uneven, or overly compensatory, dimension contribution. This is why an absence of conflict amongst the variables is preferred.

For aggregation purposes, the nature of the data is an important consideration. Since the HDI data is not on a ratio scale, the aggregation options are quite limited. According to Ebert and Welsch (2004), a weighted geometric mean of the un-normalised data can produce a purposeful index so long as all the variables are strictly positive with a natural origin.Footnote 3 The 2010 revised HDI employed the geometric mean as one part of its revisions outlined in Sect. 2 above. According to Klugman et al. (2011), this approach seems to mirror the alternative put forward by Herrero, Martinez and Villar (2005, 2010).

Although the notion of a purposeful index through the use of a geometric mean is welcome, its purposefulness could be further improved by addressing remaining aggregation issues. Specifically, empirical studies have showed that the geometric mean poses other issues such as not knowing the degree of compensability (Alaimo & Seri, 2023), penalising unbalanced or skewed attainment across the HDI dimensions (Mangaraj & Aparajits, 2020), imperfect substitutability (Mazziotta & Pareto, 2022). In addition, Pinar’s (2022) analysis of HDI demonstrated that similar HDI scores were received for most nations with both aggregation techniques–geometric and arithmetic mean. Furthermore, these similarities in scores also were displayed when there was great variations across the three dimensions. Consequently, from a conceptual and methodological perspective, the aggregative-compensative approach is seen as inappropriate (Alaimo & Seri, 2021; Alaimo et al., 2022).

Furthermore, prior studies such as Herrero et al. (2012), Klugman et al. (2011), Zambrano (2011), Ravallion (2010) and Morse (2014) also examined the impact of the HDI modifications to its methodology on country rankings. While all agree that the refinements to the HDI has led to improved measurement, there is also mention of inconsistencies and improvements that are still required. For example, Morse (2014) argued that the changes in the HDI methodology led to increased turbulence in country rankings. This increased turbulence can result in enhanced press reporting of the rank order changes which can be construed as a positive. However, it could also result in a country either rising rapidly without necessarily invoking anything tangible or declining greatly despite initiating a set of good policies designed to help with human development.

Ravallion (2010) claims that the relaxation of perfect substitutability between the three dimensions has led to ‘troubling trade-offs’ which an alternative aggregation technique that still allows for imperfect substitution could resolve. In addition, Herrero et al. (2012) argues that although the geometric mean is the right choice, improvements are required regarding how raw variables are numbered and the manner in which the income log variable is explained. Similar arguments have been made by Deb (2015) and Anand (2018). Unfortunately, change in the HDI aggregation approach reduces, but does not solve the compensability issue between the different dimensions as it does not penalise unbalanced achievements across dimensions strongly enough.Footnote 4 This supports the earlier assertion by Goerlich and Reig (2021) that the geometric aggregation scheme lies between full compensability and non-compensability.Footnote 5

Given that the principle of the symmetrical importance of variables needs to be retained, this paper argues the HDI needs to shift away from the use of a geometrical mean, since it does not solve the ‘trade-off’ issue (Greco et al., 2019; Zhou et al., 2010). Given this, a move towards adopting a non-compensatory multi-criteria approach is preferred. This approach avoids complete compensability and suggests a theoretical assurance that the HDI weights are interpreted as measures of importance rather than marginal rates of substitution (Bouyssou, 1986; Munda, 2005). From a compensatory logic perspective, the marginal rates of substitution is problematic when the importance of the weights for individual indicators are actually measuring substitutability (Paruolo et al., 2013).

3.3 Non-compensatory Multi-criteria analysis (Condorcet Approach)

Our approach tries to resolve the compensability issue by using a discrete multi-criteria approach that includes the lack of preference independence (Munda, 1995, 2012; Roy, 1996). A pair-wise comparison of selected countries across the HDI variables is performed and ranked from best to worst. This Condorcet-type ranking procedure occurs via a complete pre-order by a mathematical formulation (Natoli & Zuhair, 2011).

Under the Condorcet approach, the intensity of preferences are not used, thus minimising compensability with the HDI aggregation, with the pair-wise comparisons interpreted as a voting matrix (Munda & Nardo, 2005a; Nardo et al., 2005). As Munda (2005, 2012) points out, unlike linear or geometric aggregation, the Condorcet approach finds compromises between two or more legitimate goals assuring non-compensability which is also supported via the social choice literature (Arrow & Raynaud, 1986; Moulin, 1988; Munda & Nardo, 2005b).Footnote 6

The main drawback to this method, according to Munda (2005), is that the number of permutation calculations increases exponentially, when many countries are analysed. Despite this drawback, employing a non-compensability aggregation approach to the HDI is desirable due to its theoretical consistency and its ability to have no limitation on the measurement scale of variables (Munda & Nardo, 2005a). These benefits should reduce the uncertainty and imprecision in the HDI index. The Condorcet technique is detailed below. Note, that while the HDI weights should be viewed as trade-offs in compensatory aggregation methods, they are viewed as ‘importance coefficients’ in non-compensatory aggregation methods.

An axiomatic setting for the non-compensatory approach is outlined below. When comparing development across countries, it is common to find that one country outperforms another in one area but not in another. Overcoming these conflicting results in a non-compensatory manner enables this paper to appropriately rank the selected countries favouring a Condorcet approach.Footnote 7 However, one drawback involves the issue of cycles.Footnote 8

Regarding cycles, we can consider a voting situation with three voters, V1, V2 and V3 and three candidates C1, C2 and C3. The preferences are given in table 1 below where candidates are given in decreasing order of preference.

Table 1 Cycles

For instance, if C3 is the winner, It is possible to argue that C2 should have been the winner because V1 and V2 prefer C2 to C3 and only V3 prefer C3 to C2. It then follows that C1 is preferred to C2 and C3 is preferred to C1 with a two to one margin in each case. This is a case of cycling where C1 is preferred to C2 which is preferred over C3 which is preferred over C1.

Cycles can occur quite regularly with macroeconomic data and is dealt with via the Condorcet-Kemeny-Young-Levenglick (CKYL) ranking procedure.Footnote 9 However, as Munda (2005) and Munda and Nardo (2005a) point out, one still must accept that rank reversals may occur. A recent study by Su et al. (2023), examined the rank-reversal problem through employing a subgroup dominance-based benefit-of-doubt (SD-BoD) model. The model compared and analysed the HDI of 28 European nations and found that the severity of rank reversals was largely dictated by the employed common-weight BoD method. Although rank reversals are at variance with Arrow’s axiom of independence of irrelevant alternatives, Young (1988) states that the locally stable nature of the CKYL ranking procedure maintains the ranking of alternatives. As Munda (2005) states, the CKYL ranking procedure for composite indices is obtained by a simple, yet formal ranking algorithm that supports the maximum likelihood ranking of countries. To achieve this, the maximum number of individual indicators for each pair-wise comparison is summed over all pairs of countries involved as per the below ranking algorithm.

Given a set of individual indicators \(G = \{ g_{m} \} ,m = 1,2,...,M,\) and a defined set \(A = \{ a_{n} \} ,n = 1,2,...,N\) of countries; each country \(a_{n}\) is evaluated with respect to an individual indicator \(g_{m}\) from an assumed ordinal, interval or ratio measurement scale. Here, individual indicators with higher values are preferred to a lower one as outlined below:

$$\left\{ {\begin{array}{*{20}c} {a_{j} Pa_{k} \Leftrightarrow g_{m} (a_{j} ) > g_{m} (a_{k} )} \\ {a_{j} Ia_{k} \Leftrightarrow g_{m} (a_{j} ) = g_{m} (a_{k} )} \\ \end{array} } \right.$$
(2)

where P and I specify a preference and an indifference relation accordingly, both fulfilling the transitive property.

From there, a set of individual indicator weights \(W = \{ w_{m} \} ,m = 1,2...,M,\) with \(\sum\limits_{m = 1}^{M} {w_{m} } = 1,\) are derived as importance coefficients. To rank in a complete pre-order, from best to worst with the available information, the following mathematical aggregation procedure is required: (i) pair-wise comparison of countries according to the whole set of individual indicators used; and (ii) ranking of countries in a complete pre-order. To perform a pair-wise comparison of countries for the whole set of HDI indicators, an axiomatic system is adapted from Arrow and Raynaud (1986, pp. 81–82) is required.

Axiom 1: Diversity

Each individual indicator is a total order on the defined set A of countries to be ranked, and there is no restriction on these variables; they can be any total order on A.Footnote 10

Axiom 2: Symmetry

The individual indicators are non-comparable scales that contain ordinal pair-wise preferences. This primarily means that intensity of preferences and compensability are sidestepped, while the weights are symmetrical importance coefficients. It also means that a normalisation step is not needed since this axiom (symmetry) reduces uncertainty and imprecision.

Axiom 3: Positive Responsiveness

For two countries \(a\) and \(b\) the level of performance between them depends on how many indicators and their associated weights that rank \(a\) before \(b\).Footnote 11 As a result, the equal treatment of all individual indicators is broken (Munda, 2005; Munda & Nardo, 2005a) and a trade-off occurs between anonymity and decisiveness (i.e., a ranking must be selected). Of the two, decisiveness is favoured.Footnote 12

The three axioms (diversity, symmetry and positive responsiveness) allow a N x N matrix,\(E\), called an outranking matrix to be established, which assumes to contain all available information (Arrow & Raynaud, 1986; Roy, 1996). Any generic element of \(E:e_{jk} ,j \ne k\) arises from the pair-wise comparison of all the \(M\) individual indicators between countries j and k. To arrive at a global pair-wise comparison, the equation below is employed:

$$e_{jk} = \sum\limits_{m = 1}^{M} {\left( {w_{m} \left( {P_{jk} } \right) + \frac{1}{2}w_{m} \left( {I_{jk} } \right)} \right)}$$
(3)

where, \(w_{m} \left( {P_{jk} } \right)\) and \(w_{m} \left( {I_{jk} } \right)\) refer to the relation between preference \(P\) and indifference \(I\) based on the weights of individual variables. Hence, we state that:

$$e_{jk} + e_{kj} = 1$$
(4)

The outranking matrix \(E\) comprise all the \(N\left( {N - 1} \right)\) pair-wise comparisons. The notation represents \(R\) the set of all \(N!\) possible complete rankings of alternatives, \(R = \left\{ {r_{s} } \right\},s = 1,2,...,N!.\) For each \(r_{s}\) the corresponding score \(\phi_{s}\) is calculated as the sum of \(e_{jk}\) over all the \(\left( {\begin{array}{*{20}c} N \\ 2 \\ \end{array} } \right)\) pairs \(j,k\) of alternatives, i.e. \(\phi_{s} = \sum {e_{jk} }\), where \(j \ne k,s = 1,2,...N!\) and \(e_{jk} \in r_{s}\).

The final ranking \(\left( {r*} \right)\) maximises the equation below:

$$\begin{array}{*{20}c} {r* \Leftrightarrow \phi_{*} = \max \sum {e_{jk} } } & {{\text{where}}} & {e_{jk} \in R} \\ \end{array}$$
(5)

The CKYL approach also contains other formal properties which include (Munda, 2005; Young, 1988; Young & Levenglick, 1978):

  • Neutrality: no country is favoured over another, thus all countries are treated equally.

  • Unanimity: if all the individual indicators favour country a to country b, then country a should be chosen.

  • Monotonicity: there are no defiers, hence if country a is preferred in any pair-wise comparison and only their variables are improved, then country a should continue to be the higher rank ordered country.

  • Reinforcement: if the set A of countries is ranked by two subsets \(G_{1}\) and \(G_{2}\) of the individual indicator set G, such that the ranking is the same for both \(G_{1}\) and \(G_{2}\), then \(G_{1} \cup G_{2} = G\) should still supply the same ranking. This notion of reinforcement is crucial when initially applying the variables to each single dimension and then aggregating them to the general model.

As Natoli and Zuhair (2011) point out, given the importance of reinforcement, the only Condorcet consistent rule that can uphold this property is the maximum likelihood ranking procedure. A property, which Arrow and Raynaud (1986) assert, whose definite ethical construct is critical to welfare economics.

It was found that if equal values are used for the weightings in determining the ranking using the Condorcet approach, the ranking may not be unique. With equal weightings, it is found that the non-uniqueness of the ranking occurs within a few ranking positions. Thus, as pointed out by Natoli and Zuhair (2011), when undertaking any aggregation technique, practical compromises need to take place. Hence, a small value, such as π/100, was used to adjust different weightings while still maintaining \(\sum_{m=1}^{M}{w}_{m}=1.\) For example, if M = 3, traditional weightings would be w1 = w2 = w3 = 1/3. In the present work, we use \({w}_{1}=\frac{1}{3}+ \frac{\uppi }{100}, {w}_{2}= \frac{1}{3}, {w}_{3}= \frac{1}{3}-\frac{\uppi }{100}\). With an interchanging of weightings, the ranking positions will change within the few non-unique positions of those using the equal weightings. This does not impact on the Condorcet method since, in reality, equal weighting for all the indicators is a subjective choice and the small difference in weighting means that some indicators are slightly more favoured than the others.

Although both anonymity and information on the preference intensity of variables is lost, the non-compensatory approach preserves the theoretical importance of the coefficients and reduces the main source of imprecision and uncertainty (Munda & Nardo, 2005a). As Natoli and Zuhair (2011) posit, since progress measures can serve as a foundation to improve decisions on resource allocations, the Condorcet approach is preferred since such policy decisions should occur without trade-offs between dimensions.

4 Results and Discussion

4.1 Empirical Results

To explore whether there are rank-order issues prevalent within the 2020 HDI report derived from the geometric mean aggregation approach relative to the Condorcet approach, an analysis of rank-orders is required.

Table 2 contains the 189 countries in the 2020 HDI report and displays both their HDI and Condorcet rank-order. The results are also presented via Fig. 1. Although rank-order change occurs throughout, as shown in Fig. 1, the most pronounced variation between the HDI and Condorcet rank-orders seems to be concentrated among the mid-range countries with variations at the high-range and low-range being less apparent. This finding is similar to those highlighted by Greco et al. (2019) in a study by Munda (2012) who applied a CKYL non-compensatory approach to the Environmental Sustainability Index (ESI). Munda (2012) identified noticeable difference in rankings between his CKYL approach and the linear approach used by the ESI, which was mostly evident in the middle positions and less evident among those ranked first and last.

Table 2 HDI and condorcet 2020 rank order
Fig. 1
figure 1

Rank order comparisons: HDI and condorcet

A study by Mangaraj and Aparajita (2020) undertook a generalised HDI (GHDI) as well as a measure of relative GHDI (GHDIR) with a multi-index ranking model. They compared the top 30 countries as ranked via the HDI with the GHDI and the GHDIR. Their findings showed that the rank-order changes of the GHDI did not deviate much from the ranks of the HDI. This reinforces the notion of little deviation when the focus is on either the high-end or low-end of nations in the HDI table.

Table 3 provides a snapshot of the number of countries which experience a sizable change in rank order. As Table 3 demonstrates, of the 189 countries, just over half (104, or 55% of all countries in the HDI) experience a rank order change of at least six positions. In addition, just over one-quarter (27.5%, or 52 countries in the HDI) experienced a movement of at least 10 rank-order positions. At the highest level of rank-order changes, 11 countries (or 5.8% of all HDI countries) experience a rank-order change of at least 19 positions.

Table 3 HDI, 2020 sizable rank-order changes

In keeping with the highest level of rank-order changes, Table 4 identifies the top ten countries to experience the highest level of variability in rank-order changes in terms of both improvement and deterioration. As identified earlier, most of the countries that experienced the highest level of rank-order changes are those grouped in the middle of the HDR report (HDI original rank from 58 to 95).

Table 4 HDI 2020 Top 10 rank-order changes

As Table 4 shows, Morocco [MAR], Maldives [MDV], Costa Rica [CRI] and Cuba [CUB] benefited the most regarding a positive rank-order change under the Condorcet approach. For rank deteriorations, the most notable occurred to Bahamas [BHS], Trinidad & Tobago [TTO], Equatorial Guinea [GNQ] and Turkmenistan [TKM].

The magnitude of the changes in rank-order suggests that, as Anand (2018, p. 6) points out, that the geometric mean is not suitable since it allows “… changes in any of them [variables] to go unnoticed”. Instead, due to the non-compensatory multi-criteria approach, the Condorcet aggregation technique can identify compromises between more than one legitimate goal to assure non-compensability.

Table 5 shows the overall number of rank changes from the HDI rank-order to the Condorcet rank-order for all countries. Here, tied ranks refers to no change in a country’s rank-order from the HDI and Condorcet approach, a positive rank refers to an improvement in a country’s rank-order from the HDI to the Condorcet (e.g. 5 in HDI to 4 under a Condorcet approach), and a negative rank refers to a deterioration in a country’s rank-order from HDI to Condorcet (e.g. 6 in HDI to 5 under a Condorcet approach). Of the 189 countries, there were only five occasions (2.6%) which resulted in a tied rank. Thus, the Condorcet aggregation approach results in almost all country rankings undergoing a change in rank-order.

Table 5 Overall rank changes

4.2 Discussion

A case for the Condorcet aggregation method providing a better measure of country level human development can be made by examining the index scores across the three individual HDI components. For example, among countries with a ‘very high’ level of human development, Singapore is one of the countries that ranks much higher (3rd) under the Condorcet aggregation method than under the geometric mean approach (11th). This is because Singapore outranks many countries ranked higher in the HDI rankings with respect to both life expectancy and income. Singapore ranks less well for education. Under a geometric mean aggregation approach its overall HDI score and ranking are lower since the life expectancy and income index scores compensate for the lower education index score. Its higher rank under the Condorcet approach stems from the fact that Singapore ranks better on two of the three HDI components than nearly all other countries. We argue that unless there is a reason to believe that education is more important than income and health, Singapore should receive a higher overall ranking given its better performance in two of the three HDI components. In comparison, Switzerland which is ranked 3rd by the UNDP’s HDI, ranks first under the Condorcet aggregation method. This is because it outranks Norway, Ireland and all other countries on at least two of the three HDI component indices.

The Bahamas, another country with a ‘very high’ level of human development, drops 30 places in the rankings under the Condorcet method, mainly because its relatively high score for the income index can no longer compensate the relatively lower scores for life expectancy and education. A similar explanation applies to Equatorial Guinea, a country with a ‘medium’ level of human development which falls 24 places in the rankings (145th to 169th) when its relatively high income index score can no longer compensate its poorer life expectancy and education index scores. Conversely, Morocco, another country with a ‘medium’ level of human development ranks much better under the Condorcet approach (94th vs. 121st) because it has relatively higher life expectancy and income index scores than countries above it in the HDI index rankings.

There is relatively less change in rankings for countries with a ‘low’ level of human development. An exception is that of Eritrea which performs better when the Condorcet approach is applied since it performs relatively well according to life expectancy and income for countries at this level of development. Its rank improves from 180 to 164th when life expectancy and income are no longer compensate for lower education index scores bringing down its overall average HDI score. In sum, if we accept that the HDI components are equally important for human development and that shortfalls/achievements in one of the components cannot be compensated for (or penalised) by levels in another, our ranking according to the Condorcet method offers a more accurate depiction of the human development level of each country.

5 Conclusion and Policy Implications

This paper compares the country rankings of the UNDP’s HDI (2020) with those resulting from a modified Condorcet ranking, a non-compensatory approach. Country HDI rankings using the Condorcet aggregation approach were compared to those reported in the 2020 Human Development Report which allow for shortfalls in one index component to be compensated for by high values in another. The paper compared the rank-order of the human development for the year 2020 with the Condorcet rank-order.

Findings suggest that the approach to aggregating the HDI is very important with respect to country rankings. Although a monotonic association was identified, the changes in rank-order between the HDI and Condorcet approach were substantial. This outcome provides empirical evidence regarding the problematic, and inconsistent, nature of using a geometric mean to aggregate the HDI where ‘troubling trade-offs’ occur due to the relaxation of perfect substitutability between the three dimensions. The substantial changes derived from the Condorcet approach can be partly explained by the fact that it is able to address this issue via its non-compensatory multi-criteria analysis.

An issue with geometric mean, as discussed above and in the literature, is that weakness in one criterion can be compensated by the strength in another criteria which could confuse the interpretation of the results and unable to provide clear direction to the policy makers. By removing this compensatory effect, Condorcet ranking is able to provide clearer direction on policy formulation for development.

The HDI, with its focus on people and their capabilities, can be used to query the policy choices of nations. For instance, the change in HDI rankings based on the Condorcet approach can capture the attention of policy makers, media, non-governmental organisations, etc. and further stimulate the debate surrounding the purpose of government policy. Specifically, that policy should emphasise the development of a country as opposed to economic growth alone. It may also be used as one indicator amongst a suite-of-indicators to assess foreign aid or Official Development Assistance to low-income countries. This could result in alterations to the volume of aid received by some countries.