Abstract
The adage “the rich are getting richer” refers to increasingly skewed and heavily-tailed income distributions. For such distributions, the mean is not the best measure of the center, but the classical indices of income inequality, including the celebrated Gini index, are mean based. In view of this, it has been proposed in the literature to incorporate the median into the definition of the Gini index. In the present paper we make a further step in this direction and, to acknowledge the possibility of differing viewpoints, investigate three median-based indices of inequality. These indices overcome past limitations, such as: (1) they do not rely on the mean as the center of, or a reference point for, income distributions, which are skewed, and are getting even more heavily skewed; (2) they are suitable for populations of any degree of tail heaviness, and income distributions are becoming increasingly such; and (3) they are unchanged by, and even discourage, transfers among the rich persons, but they encourage transfers from the rich to the poor, as well as among the poor to alleviate their hardship. We study these indices analytically and numerically using various income distribution models. Real-world applications are showcased using capital incomes from 2001 and 2018 surveys from fifteen European countries.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Measuring income inequality has been a challenging task, as each of the indices used for the purpose attempt to condense the complexities of populations into just one number. Among the many indices, we have the Atkinson, Bonferroni, Gini, Palma, Pietra, Theil, and Zenga indices, to name just a few associated with the names of their inventors. Treatises have been written on the topic, such as the handbook by Atkinson and Bourguignon (2000, 2015), which also contains many references to earlier studies. Illuminating monographs on the topic have been written by, for example, Gini (1912), Bonferroni (1930), Kakwani (1980), Nygård and Sandström (1981), Sen (1997, 1998), Champernowne and Cowell (1998), Amiel and Cowell (1999), Atkinson and Piketty (2007), Cowell (2011), Yitzhaki and Schechtman (2013), and Piketty (2014).
The indices are often the areas under certain income-equality curves, which are considerably more difficult to present and explain to the general audience, let alone to easily compare. For example, the Gini index of inequality is 1 minus twice the area under the Lorenz curve. (We shall give mathematical definitions later in this paper). The curves and thus the indices are based on comparing the mean income of the poor with other means, such as the mean income of the entire population, the mean income of the nonpoor, and the mean income of the rich, whatever the definitions of “poor” and “rich” might be. Hence, to be well defined, the curves and the indices inevitably assume that the mean of the underlying population is finite. With the rising income inequality, and thus with the distribution of incomes becoming more skewed and heavily tailed, researchers have therefore sought other ways for measuring inequality.
Gastwirth (2014) proposed to use the median instead of the mean when “normalizing” the absolute Gini mean difference, widely known as the GMD. The author noted, however, that the proposed index might fall outside the class of normalized indices because it compares the mean income of the poor with the median income of the entire population. There is a natural remedy to this normalization issue: compare the median income of the poor with the median of the population. Even more, we can compare the median income of the poor with the median of the “not poor” or, for example, with the median of the rich, whatever the latter might mean. This is the path – advocated also by Prendergast and Staudte (2016, 2018), Jokiel-Rokita and Pia̧tek (2023), and Pia̧tek (2023) – that we take in this paper.
In this regard we wish to mention the study of Bennett and Zitikis (2015) where it is shown that a number of classical indices of income inequality arise naturally from a Harsanyi-inspired model of choice under risk, with persons acting as reference-dependent expected-utility maximizers in the face of an income quantile lottery, thus giving rise to a reinterpretation of the classical indices as measures of the desirability of redistribution in society. This relativistic approach to constructing indices of income inequality was further explored by Greselin and Zitikis (2018), although more from the modeller’s perspective than from the philosophical one. The present paper, whose preliminary version appeared in the form of a technical report by Brazauskas et al. (2023), further advances this line of research by showing how naturally percentile-based indices arise in this relativistic context, and how they facilitate inequality measurement, especially in the recent-past and current socioeconomic situations. These indices overcome a number of past limitations, such as:
-
They do not rely on the mean as the center of, or a reference point for, income distributions, which are skewed, and are getting even more heavily skewed.
-
They are suitable for populations of any degree of tail heaviness, and income distributions are becoming increasingly such.
-
They are unchanged by, and even discourage, transfers among the rich persons, but they encourage transfers from the rich to the poor, as well as among the poor to alleviate their hardship.
In more technical terms, while analyzing capital incomes, Greselin et al. (2014) observed that the Hill estimator of the tail index yields a value in the interval (0.5, 1) for some of the samples. In such cases, the classical mean-based inequality indices are inappropriate, as the mean does not exist. Consequently, these empirical observations prompted us to seek new approaches, as we do in the present paper, for measuring economic/income inequality that are suitable for all distributions, irrespective of their tail heaviness.
The rest of the paper is organized as follows. In Sect. 2 we define the inequality indices, alongside the corresponding equality curves, preceded by several classical indices for comparison purposes. In Sect. 3 we illustrate the indices and their curves numerically, using several popular families of distributions. In Sect. 4, we use the indices to first analyze capital incomes of European countries using data from a 2001 survey, and then we compare the results with those obtained from a 2018 survey. In Sect. 5 we look at the indices from the perspective of income transfers. Sect. 6 concludes the paper with a brief recap. Proofs and other technicalities are in Appendix 1.
2 Inequality indices and their curves
We start with technical prerequisites. Let F be the cumulative distribution function of the population incomes X, a random variable. We assume that F is non-negatively supported, that is, \(F(x)=0\) for all real \(x<0\). Furthermore, let Q denote the (generalized) inverse of F, called the quantile function. That is, for each \(p\in (0,1)\), Q(p) is the smallest number x such that \(F(x)\ge p\). Hence, the population median income is
assumed throughout this paper to be strictly greater than 0. Generally, Q(p) is the \(p\times 100^{\textrm{th}}\) percentile. Furthermore, the median income of the poorest \(p\times 100\%\) persons is Q(p/2). Based on these quantities, we shall later describe three ways for measuring inequality, but first, we recall the definitions of a few classical indices that serve as benchmarks for our current study.
2.1 In the classical mean-based world
The index of Gini (1914) is the most widely-used measure of inequality. It can be expressed in a myriad of ways (e.g., Yitzhaki 1998; Yitzhaki and Schechtman 2013). For example, the Gini index can be written in terms of the Bonferroni curve
as follows:
where
is the mean of X, assumed in this section to be finite and strictly greater than 0.
Zenga (2007) argued that the mean income of those below the percentile Q(p) need to be compared not with the mean of all the incomes but with the mean income of those above the percentile Q(p). This point of view led the author to the index
Davydov and Greselin (2019, 2020) suggested to modify Zenga’s idea by comparing the mean income of those below the percentile Q(p) with the mean income of those above the percentile \(Q(1-p)\). This point of view led the authors to the index
Of course, 1/p in the numerator and denominator cancel out, but in this way written D facilitates an easier comparison with Z.
2.2 A transition into the heavily tailed modern world
Unlike the above three mean-based indices G, Z and D, the index of Gastwirth (2014) is a mean-median based index. Namely, given the well-known expression
of the Gini index G in terms of the Gini mean difference (GMD), which is often written as the expectation \(\mathbb {E}(|X_1-X_2|)\), where \(X_1\) and \(X_2\) are two independent copies of X, Gastwirth (2014) argued that comparing the GMD with twice the median would be better than comparing with twice the mean as in Eq. (2.2). This viewpoint has given rise to the index
Note that \(\mu /m\), which can be viewed as the benchmark replacing 1 in the previous indices, is the mean-median ratio that has been used as an easy to understand – and thus to convey to the general audience – indicator of wealth and income distribution (e.g., Garratt 2020). In the case of symmetric distributions, \(\mu /m\) is of course equal to 1.
2.3 In the skewed and heavily tailed modern world: quantile-based indices
The above discussion naturally leads to three strategies of defining purely median-based indices of income inequality and their corresponding curves of equality, all based on percentiles and thus well defined irrespective of whether the income variable X has a finite first or any other moment.
2.3.1 Strategy 1
Compare the median income of the poorest \(p\times 100\%\) persons with the median of the entire population (Fig. 1).
This leads to the equality curve
also independently introduced by Jokiel-Rokita and Pia̧tek (2023, Eq. (9)). Compare it also with \(L_1(F;p)\) of Prendergast and Staudte (2016, Definition 1). Averaging this curve over all p’s gives rise to the inequality index
Compare it with the left-most integral of Prendergast and Staudte (2016, Eq. (3)). Note the mathematical similarity between the Bonferroni curve b and the curve \(\psi _1\):
2.3.2 Strategy 2
Compare the median income of the poorest \(p\times 100\%\) persons with the median of the nonpoor (Fig. 2).
This leads to the equality curve
also independently introduced by Jokiel-Rokita and Pia̧tek (2023, Eq. (10)), which is well defined because we assume that the median income \(m=Q(1/2)\) is strictly positive, and thus \(Q(1/2+p/2)\), being not smaller than Q(1/2), is strictly positive for every \(0<p<1\). Averaging this curve over all p’s gives rise to the inequality index
which is also considered by Pia̧tek (2023). Note the mathematical similarity between the Zenga curve z and the curve \(\psi _2\):
2.3.3 Strategy 3
Compare the median income of the poorest \(p\times 100\%\) persons with the median of the richest \(p\times 100\%\) persons (Fig. 3).
This leads to the equality curve (Prendergast and Staudte 2018)
also considered by Jokiel-Rokita and Pia̧tek (2023, Eq. (11)), which is well defined because we assume that the median income \(m=Q(1/2)\) is strictly positive, and thus \(Q(1-p/2)\), being not smaller than Q(1/2), is strictly positive for every \(0<p<1\). Compare this curve also with \(L_2(F;p)\) of Prendergast and Staudte (2016, Definition 1). Averaging this curve over all p’s gives rise to the inequality index (Prendergast and Staudte 2018)
which is also considered by Pia̧tek (2023). Note the mathematical similarity between the Davydov-Greselin curve d and the curve \(\psi _3\):
2.3.4 A recap
Summarizing the above discussion, in view of Eqs. (2.4), (2.6), and (2.8), the three income-equality curves are connected to the corresponding income-inequality indices via the equation
Note that the three curves \(\psi _k\) take values only in the interval [0, 1], and so the three indices \(\Psi _k\) are always normalized, that is, \(\Psi _k \in [0,1]\). In this context it is useful to look at the following unrealistic cases:
-
If the income-equality curve \(\psi _k\) is equal to 1 everywhere on (0, 1), which implies quantile-based income equality, then the income-inequality index \(\Psi _k\) is equal to 0, which means lowest inequality.
-
If the income-equality curve \(\psi _k\) is equal to 0 everywhere on (0, 1), which implies extreme quantile-based income inequality, then the income-inequality index \(\Psi _k\) is equal to 1, which means maximal inequality.
For a cautionary and illuminating note concerning the meaning of quantile-based income equality and extreme quantile-based income inequality, we shall have Example 1 below.
Hence, these two extreme cases serve as benchmark curves: the one that is identically equal to 1 is the curve of perfect equality, and the one that is identically equal to 0 is the curve of extreme inequality. We can therefore say that the three indices \(\Psi _k\) measure the deviation of the actual curves \(\psi _k\) from the benchmark egalitarian curve \(\psi _e(p)=1\), \(0\le p \le 1\), by calculating the areas between them.
Example 1
Consider a society with \(n\ge 3\) subjects, one of which is the ruler. Each of the working \(n-1\) subjects earns $ 1, just to be taken away by the ruler. Hence, ultimately, each of the working subjects possesses $ 0 and the ruler has $ \((n-1)\). This is a textbook example of extreme inequality.
The classical income inequality indices are mean based, which is not – as argued by statisticians – an appropriate measure of the center in the case of skewed populations. When, on the other hand, the median is used to measure the center, the large values, such as the income of the ruler in the above society, do not influence the center. Indeed, the center of incomes in the above society is $ 0, which is a more appropriate description of the typical income than the mean value \(\$\,(n-1)/n\) would be. (The ruler does not work and accumulates wealth only by taking away $ 1 from each of the working subjects).
Hence, in summary, we can say that the mean-based society views the ruler as a member of the society, making \(\$\,(n-1)/n\) a typical value of the society, whereas the median-based society views the ruler as being above the society, that is, not in the society, and so the typical income in this case is \(\$\,0\).
To see what happens with the three quantile-based indices \(\Psi _k\) and their curves \(\psi _k\) in the above situation of “extreme inequality,” we first recall that we have assumed that the median must be above zero. To accommodate this condition, we assume that the ruler lets each of the \(n-1\) working subjects keep a small amount \(\$\,\varepsilon \in (0,1)\) of their earned \(\$\,1\). Hence, the ruler accumulates the wealth of \(\$\,(n-1)(1-\varepsilon )\), making sure – needless to say – that the ruler’s wealth is not smaller than that of any of the working subjects, that is, the inequality \((n-1)(1-\varepsilon )\ge \varepsilon\) holds. Note that this inequality is equivalent to \(\varepsilon \le 1-1/n\), thus implying – quite naturally – that the more subjects there are in the society, the larger the amount they can be allowed to retain for their own use without making them richer than the ruler.
The quantile function Q in this scenario is
and thus the median income is \(Q(1/2)=\varepsilon\). We have the following expressions:
Note that the inequality \((n-1)(1-\varepsilon )\ge \varepsilon\) ensures that the indices \(\Psi _2\) and \(\Psi _3\) are non-negative, just like the index \(\Psi _1=0\) is. In the case \((n-1)(1-\varepsilon )= \varepsilon\), we have quantile-based income equality, that is, \(\psi _2(p)=1\) and \(\psi _3(p)=1\) for all \(p\in (0,1)\), and thus \(\Psi _2=0\) and \(\Psi _3=0\).
Hence, the index \(\Psi _1\) indicates equality in the aforementioned society (recall that the ruler is above the society, not in it), whereas the indices \(\Psi _2\) and \(\Psi _3\), each indicating some degree of inequality when \((n-1)(1-\varepsilon )> \varepsilon\), show that the inequality ultimately vanishes when the society grows in size, that is, when \(n\rightarrow \infty\). This concludes Example 1.
3 The indices and curves: a parametric viewpoint
Modelling population incomes using parametric distributions and also fitting such distributions to income data are common approaches in the area (e.g., Kleiber and Kotz 2003). From this perspective, the inequality indices G, Z, D and \(G_2\) and their corresponding equality curves have been amply discussed and illustrated by their inventors and subsequent researchers. Hence, we devote this section to illustrating only the three indices \(\Psi _k\) and their corresponding curves \(\psi _k\).
We use nine parametric families of distributions, most of which are common in modeling incomes (e.g., Kleiber and Kotz 2003). They are right skewed and present a full spectrum of tail heaviness: some are lightly tailed (e.g., exponential), some are heavily tailed (e.g., Pareto distributions), and others have the right tails of intermediate heaviness (e.g., lognormal). For their specific parametrizations, Table 1 contains all the essential formulas.
We have computed the inequality indices \(\Psi _k\) for these distributions under various parameter choices, to be clarified and discussed in a moment. The results are in Table 2,
where we also report the rankings of the distributions based on the indices: rank 1 corresponds to the lowest inequality and rank 16 to the highest inequality. It is encouraging to see that while the magnitudes of the indices differ, the rankings induced by them are fairly similar.
In Table 2 we have four groups consisting of four distributions. The groups reflect the fact that in Figs. 4, 5, 6, the distributions are grouped into four rows each containing four panels. The figures depict the three income-equality curves \(\psi _k\) for the distributions specified in Table 2.
Since the curves are ratios of percentiles, the scale parameter of each distribution has no effect on the inequality indices. The same is true for the log-location parameter (\(e^{\mu }\)) of the lognormal and log-Cauchy distributions. However, the shape (\(\alpha\), \(\gamma\)) and the log-scale (\(e^{\sigma }\)) parameters are the primary drivers of the underlying inequality. To explore this effect, we choose a couple of values of each of these parameters for plotting. In particular, since the gamma and Weibull distributions are generalizations of the exponential distribution, it is of interest to see the effect of heavier (\(\alpha = \tau = 0.5\)) and lighter (\(\alpha = \tau = 2\)) than exponential tails. Likewise, for the lognormal and log-Cauchy distributions, \(\sigma = 1\) represents a “standard” case while \(\sigma = 2\) is a heavier-than-standard-tail case. And for the Pareto distributions, the tail heaviness is controlled by the shape parameter \(\alpha\): the model has infinite variance when \(\alpha \le 2\) and infinite mean when \(\alpha \le 1\). Therefore, it makes sense to choose \(\alpha\)’s around these important benchmarks. In the plots of Figs. 4, 5, 6, the uniform distribution serves as a benchmark for comparing the curves. In each plot, the dash-dotted line marks the curve \(\psi _k\) in the case of the uniform distribution. Numerical evaluations labeled ‘area’ represent the areas of the corresponding shaded regions above the curves \(\psi _k\), which are the values of the inequality indices.
From Table 2 and Figs. 4, 5, 6 we observe several facts, which follow immediately from the formulas of Table 1:
-
\(\psi _1\) for Pareto-III\((\sigma , \gamma = 2)\) and \(\psi _3\) for Pareto-II\((\sigma , \alpha = 1)\) coincide, being equal to \((\frac{p}{2-p})^2\), thus giving identical inequality indices 0.7736.
-
\(\psi _1\) for Pareto-II\((\sigma , \alpha = 1)\) and \(\psi _3\) for both Uniform\((0, \theta )\) and Pareto-III\((\sigma , \gamma = 0.5)\) coincide, being equal to \(\frac{p}{2-p}\), thus giving identical inequality indices 0.6147.
-
\(\psi _1\) for Lognormal\((\mu , \sigma = 2)\) and \(\psi _3\) for Lognormal\((\mu , \sigma = 1)\) coincide, being equal to \(e^{2 \Phi ^{-1}(p/2)}\), thus giving identical inequality indices 0.6648.
-
\(\psi _1\) for Log-Cauchy\((\mu , \sigma = 2)\) and \(\psi _3\) for Log-Cauchy\((\mu , \sigma = 1)\) coincide, being equal to \(e^{2 \tan (\pi (p-1)/2)}\), thus giving identical inequality indices 0.7470.
We conclude this section with the note that there are, of course, many other parametric distributions for modelling incomes (see, e.g., Kleiber and Kotz 2003).
4 A nonparametric viewpoint
We now consider nonparametric (also called empirical) ways for estimating all the aforementioned indices of inequality and their corresponding equality curves, with analyses of real data.
4.1 Empirical estimators of the quantile-based indices
Let \(X_{1},\dots , X_{n}\) denote incomes of randomly selected persons, with \(X_{1:n}\le \cdots \le X_{n:n}\) denoting the ordered incomes. We assume that the empirical median
is strictly positive, where, for every real \(x\ge 0\), \(\lceil x \rceil\) is the smallest integer that is not below x. The empirical counterparts of the three indices \(\Psi _k\) are (see their justifications in Appendix 1)
where, for every real \(x\ge 0\), \(\lfloor x \rfloor\) is the largest integer that does not exceed x. Note that all the three indices are well defined because we assume that the median \(X_{\lceil n/2 \rceil :n}\) is strictly positive. When it is desirable to emphasize the dependence of the indices on incomes, we do so by writing them as \(\Psi _{k,n}(\textbf{X})\), where \(\textbf{X}=(X_{1:n}, \dots , X_{n:n})\) is the vector of all the (ordered) incomes in the sample. Next are a few immediate consequences of definitions (4.1)–(4.3).
Property 1
For every real \(c\ge 0\), we have \(\Psi _{k,n}(c\textbf{X})=\Psi _{k,n}(\textbf{X})\).
This property implies, for example, that changing the currency with which the incomes are reported does not affect the values of the three inequality indices.
Property 2
We have the inequality \(\Psi _{k,n}(\textbf{X})\ge \Psi _{k,n}(\textbf{X}+c )\) for every real \(c\ge 0\). The inequality is strict under the following two conditions: first, \(c>0\), and second, there is at least one ratio inside the sum of the definition of \(\Psi _{k,n}\) that is not equal to 1. (Note that none of the ratios exceeds 1).
This property implies that adding the same amount of income to everybody does not increase inequality and, under a minor caveat specified in the property, the index even decreases. To see the necessity of the assumption, consider the case when all X’s are equal, which gives \(\Psi _{k,n}(\textbf{X})=0\) and also \(\Psi _{k,n}(\textbf{X}+c )=0\) irrespective of the value of c. For a proof of Property 2, as well as for proofs of other properties, see Appendix 1.
Property 3
When \(c\rightarrow \infty\), we have \(\Psi _{k,n}(\textbf{X}+c ) \rightarrow 0\).
Intuitively, this property says that if we keep adding the same positive amount of income to everyone, all else being equal, then we shall eventually eliminate the inequality.
4.2 Estimators of the mean-based indices
Next we report the definitions of the empirical estimators of Z, D, G and \(G_2\) obtained by replacing the population quantile function Q by the empirical quantile function \(Q_n\), which is given by the equation
for every \(p\in (0,1]\). Slightly modifying the obtained expression in an asymptotically equivalent way to make it intuitively and computationally more appealing, we arrive at the estimator
of Z, which appears in Greselin and Pasquazzi (2009). Likewise, we arrive at
which is an empirical estimator of D that appears in Davydov and Greselin (2020). (Of course, 1/i in the numerator and denominator cancel out). The same reasoning leads to the empirical Gini index
where the last equation follows from simple algebra, with \(\overline{X}\) denoting the mean, assumed to be strictly positive, of the incomes \(X_{1},\dots , X_{n}\). Note that the last expression for \(G_n\) is the one that places the empirical Gini index into the family of S-Gini indices introduced by Donaldson and Weymark (1980) and Weymark (1980/81); see also Zitikis and Gastwirth (2002) for further references and statistical inference.
Note 1
The asymptotically negligible term 1/n on the right-hand side of the first equation of \(G_n\) ensures that \(G_n\) makes sense for all sample sizes. Without this term we may get counterintuitive values. For example, when the ‘incomes’ are \(X_1=1\), \(X_2=2\) and \(X_3=3\), we have \(G_n=2/9\), whereas \(G_n\) without the added \(1/n=1/3\) would give the negative value \(-1/9\), which is incompatible with the meaning of the index.
Finally, using the same arguments as above but now with the right-most expression for \(G_{2}\) given in Sect. 2.2 as our starting point, we arrive at
as an empirical estimator of \(G_{2}\). As before, \(\overline{X}\) stands for the mean of \(X_{1},\dots , X_{n}\).
4.3 An analysis of capital incomes from the ECHP (2001) survey
Using the formulas for calculating the aforementioned indices from data, we now analyze capital incomes, which are income flows from financial assets actually received during the reference year, reported in the European Community Household Panel survey (ECHP 2001) that was conducted by Eurostat in 2001, the last of the eight waves of the survey. In this regard, it is instructive to recall the definition of capital incomes given by T. Piketty:
[C]apital is defined as the sum total of nonhuman assets that can be owned and exchanged on some market. Capital includes all forms of real property (including residential real estate) as well as financial and professional capital (plants, infrastructure, machinery, patents, and so on) used by firms and government agencies. (Piketty 2014, p. 46)
For the importance, especially in the context in Europe, of capital incomes and income transfers, which we later analyze in Sect. 5, we again refer to T. Piketty:
Ultimately, the decline in the capital/income ratio between 1913 and 1950 is the history of Europe’s suicide, and in particular of the euthanasia of European capitalists. (Piketty 2014, p. 149)
Modern redistribution, as exemplified by the social states constructed by the wealthy countries in the twentieth century, is based on a set of fundamental social rights: to education, health, and retirement. Whatever limitations and challenges these systems of taxation and social spending face today, they nevertheless marked an immense step forward in historical terms. Partisan conflict aside, a broad consensus has formed around these social systems, particularly in Europe, which remains deeply attached to what is seen as a “European social model.” (Piketty 2014, p. 481)
For the countries of Europe, the priority now should be to construct a continental political authority capable of reasserting control over patrimonial capitalism and private interests and of advancing the European social model in the twenty-first century. The minor disparities between national social models are of secondary importance in view of the challenges to the very survival of the common European model. (Piketty 2014, p. 561–562)
Specifically, the data come from 59 750 households with 121 122 persons from the fifteen European countries specified in Table 3 using the ISO 3166-1 alpha-2 (two-letter) codes. By looking at the means and medians in Table 3, we see how skewed to the right the distributions of the countries are. Figure 7 (with \(G_{2,n}\) excluded due to its large values)visualizes the index values calculated using formulas (4.1)–(4.3) and reported in Table 3. (The arrangement of the countries from left to right is totally arbitrary, and the lines connecting the index values of different countries is only for the purpose of visualization and easier comparison of the countries). For a more detailed description of the data and relevant references, we refer to Greselin et al. (2014, Section 1). Next are several observations based on Table 3 and Fig. 7.
Portugal has the lowest value of \(\Psi _{1,n}\), with the median income of the poorest \(p \times 100 \%\) persons equal, after averaging over all \(p \in (0,1)\), to \(84.7\%\) of the median income of the entire population.
The opposite happens in France, which provides the highest contrast among the countries when comparing the median income of the poorest \(p\times 100 \%\) persons with the overall median income: after averaging such ratios over all \(p \in (0,1)\), we obtain \(21.7\%\).
For France, we also observe the largest value of \(\Psi _{3,n}\). The median income of the poorest \(p \times 100 \%\) people is equal, after averaging over all \(p \in (0,1)\), to only \(15.5\%\) of the median income of the richest \(p \times 100 \%\) persons in the population.
When we are interested in comparing the median income of the poorest \(p\times 100 \%\) persons with the median income of the remaining \((1-p)\times 100\%\) part of the population, the index \(\Psi _{2,n}\) tells us that Finland is the country in which such a contrast, after averaging over all \(p \in (0,1)\), is the largest.
Figures 8, 9, 10depict the three income-equality curves \(\psi _{k,n}\) for the fifteen European countries specified in Table 3, with the shaded-in areas above them depicting the values of the indices \(\Psi _{k,n}\). The curves have been obtained via formulas (2.3)–(2.7) by replacing Q by \(Q_n\) given by Eq. (4.4) with \(n=n_P\), where \(n_P\) is the number of people in the sample who possess capital incomes, and \(n_T\) is the total sample size of the given country.
Comparing the plots of Figs. 8, 9, 10 derived from the actual data with the ones of Figs. 4, 9, 6 generated from the parametric distributions, for most of the countries we see that the distributions of capital incomes are right-skewed and similar to most of our illustrative choices (e.g., gamma, lognormal, Pareto, Weibull). To reach a more definitive answer, formal statistical analysis should be performed using the methods provided by Prendergast and Staudte (2016).
4.4 A comparison with capital incomes from the EU-SILC (2018) survey
To get an insight into more recent European situation, we further analyse data coming from the EU Statistics on Income and Living Conditions survey (EU-SILC 2018), which substituted the ECHP survey after its eighth wave in 2001.
We note at the outset that in the EU-SILC survey, the capital incomes are available only at the level of households, and sample sizes are approximately seven times larger if compared with the earlier ECHP survey. Hence, the EU-SILC data give rise to more accurate estimates. In our study we use the following variables:
- HY040G::
-
income from rental of a property or land.
- HY090G::
-
interests, dividends, profit from capital investments in unincorporated business.
- PY080::
-
pensions received from individual private plans.
Namely, for each household we sum up HY040G and HY090G, and then add all the pensions received by a component of the same household (variable PY080). This gives us the capital incomes for each household. Denote them by, say, \(Z_1,\dots , Z_n\). As the data refer to households, an equivalence scale needs to be employed to make meaningful comparisons of monetary incomes of social units with different numbers of inhabitants, and to also take into account the economies of scale (within each household) with regard to the consumption of certain goods. An equivalence scale acts as a weight, giving rise to an equivalence income that can be used for inequality, poverty and welfare analyses. We opt for the “square root” equivalence scale, adopted by the Organization for Economic Cooperation and Development (OECD) in their recent publications. Namely, each household income \(Z_i\) is divided by the square root of the household size \(w_i\), yielding \(X_i\), \(i=1,\ldots ,n\), that we use in our analysis.
We analyse the same fifteen European countries as in previous Sect. 4.3, and consider the 340 540 households surveyed by the EU-SILC in 2018. A summary is provided in Table 4.For a useful comparison of means and medians, we apply the official average national currency exchange rates (year 2018) for the three countries that have not adopted the Euro: Denmark, Great Britain, and Sweden, whose currencies are the Danish Krone, the British Pound, and the Swedish Krona, respectively. Hence, all the analyzed data are in Euro.
The differences between the means and medians in Table 4 facilitate the assessment of skewness of income distributions. The list of countries with lower inequality (having a two-digit rank in at least one of the indices) is comprised of Denmark, Benelux, France, Ireland, Spain, Finland and Sweden. To compare with the 2001 data, Ireland has joined the list while Germany, Luxembourg, Great Britain and Greece left it. Portugal, that was the country with the highest inequality in 2001, in 2018 was joined by Greece in the list for the primacy of the highest inequality, as seen from the rankings produced by the three indices. Fig. 11 (with \(G_{2,n}\) excluded due to its large values) visualizes the index values calculated using formulas (4.1)–(4.3) and reported in Table 4. As in the case of Fig. 7, the arrangement of the countries from left to right in Fig. 11 is arbitrary, although follows exactly that of Fig. 7, and the lines connecting the index values of different countries is only for the purpose of visual comparison of the countries. Figs. 12, 13, 14 depict the three income-equality curves \(\psi _{k,n}\) for the fifteen European countries specified in Table 4, with the shaded-in areas above them depicting the values of the indices \(\Psi _{k,n}\).
5 The effects of income transfers on the indices \(\Psi _{k,n}\)
We have already alluded to the importance of income redistribution in Sect. 4.3, with a number of quotes on the subject from Piketty (2014). In the current section, we present a mathematical treatment of income transfers in terms of the data-driven versions \(\Psi _{k,n}\) of the income inequality indices \(\Psi _k\), \(k=1,2,3\).
Consider n persons whose ordered incomes we denote by \(X_{1:n}< \cdots < X_{n:n}\). Choose any pair from these persons and call them L and H. The person \(L \in \{1,\dots , n-1\}\) possesses income \(X_{L:n}\) and the person \(H \in \{2,\dots , n\}\) possesses income \(X_{H:n}\). We assume \(L<H\). Hence, L has less income than H, that is, \(X_{L:n}< X_{H:n}\). Denote \(\textbf{X}=(X_{1:n}, \dots , X_{n:n})\).
Assume now that H transfers a positive amount \(c>0\) to L without changing the income ordering among the n persons. The transfer produces \(\textbf{X}'=(X_{1:n}', \dots , X_{n:n}')\) with the same ordering \(X_{1:n}'< \cdots < X_{n:n}'\) of the coordinates as in the case of \(\textbf{X}\). (See Appendix 1 for additional technical details). Succinctly, we denote the transfer by
and read it, e.g., “L receives amount c from H” or “H transfers amount c to L.” We are interested in how the three indices \(\Psi _{k,n}=\Psi _{k,n}(\textbf{X})\) react to such transfers, that is, when \(\textbf{X}\) turns into \(\textbf{X}'\).
In addition to L and H, we also involve the “median” person
whose income is \(X_{M:n}=Q_n(1/2)\) as per Eq. (4.4) with \(p=1/2\). Any person P with income above the median (i.e., when \(P>M\)) is called well-off, and any person P with income below the median (i.e., when \(P<M\)) is called struggling (see Fig. 15).In what follows, we shall be interested in the effects of transfer (5.1) on the three indices when both L and H are well-off, both are struggling, and when one of them (i.e., L) is struggling and the other one (i.e., H) is well-off.
Before going into details, we note that the classical Pigou-Dalton principle (PDP) – when it holds – says that \(\Psi _{k,n}(\textbf{X})\ge \Psi _{k,n}(\textbf{X}')\) in its weak form and \(\Psi _{k,n}(\textbf{X})> \Psi _{k,n}(\textbf{X}')\) in its strong form. As we shall soon see, the three indices will tell us a richer story. Based on it, we shall be able to choose a preferred index, or at least be prompted to think outside the box, which is necessary as Amiel and Cowell (1999) have convincingly argued.
5.1 Index \(\Psi _{1,n}\)
Property 4
In the case of struggling L and well-off H (i.e., \(L<M<H\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) decreases the value of the index \(\Psi _{1,n}\), that is, we have \(\Psi _{1,n}(\textbf{X})> \Psi _{1,n}(\textbf{X}')\).
Property 5
When both L and H are well-off (i.e., \(M<L<H\)), or when both are struggling (i.e., \(L<H< M\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) does not change the value of the index \(\Psi _{1,n}\), that is, we have \(\Psi _{1,n}(\textbf{X})= \Psi _{1,n}(\textbf{X}')\).
These two properties say that in order to decrease income inequality based on the index \(\Psi _{1,n}\), a well-off person needs to transfer some amount to a struggling person, whereas any transfer between two well-off persons or between two struggling ones does not make any difference.
5.2 Index \(\Psi _{2,n}\)
The index \(\Psi _{2,n}\) is more sensitive to transfers than the previous index. Specifically, we shall see from the following properties that \(\Psi _{2,n}\) decreases when \(L {\mathop {\longleftarrow }\limits ^{c}} H\), unless both H and L are well-off and H transfers to L only a small amount \(c>0\).
Property 6
In the case of struggling L and well-off H (i.e., \(L< M < H\)), or when both L and H are struggling (i.e., \(L<H< M\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) decreases the value of the index \(\Psi _{2,n}\), that is, \(\Psi _{2,n}(\textbf{X})> \Psi _{2,n}(\textbf{X}')\).
Property 7
When both L and H are well-off (i.e., \(M<L<H\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) implies \(\Psi _{2,n}(\textbf{X})> \Psi _{2,n}(\textbf{X}')\) when
Furthermore, we have \(\Psi _{2,n}(\textbf{X})= \Psi _{2,n}(\textbf{X}')\) in the “boundary” case \(c=c_2\), and \(\Psi _{2,n}(\textbf{X})< \Psi _{2,n}(\textbf{X}')\) when \(c<c_2\).
Hence, the index \(\Psi _{2,n}\) avoids giving the impression of inequality reduction when only a small amount is transferred among well-off persons. In other words, for the index to decrease in the case of two well-off persons, the richer one needs to transfer a sufficiently large amount in order to qualify for inequality reduction. Next is an example illustrating Properties 6 and 7.
Example 2
Consider a group of seven persons, among whom there are three struggling ones (denoted by S’s) and three well-off persons (denoted by W’s). The person M has the median income \(X_{M:7}\) among these seven persons, and thus a “7” in its notation. Let their incomes be
The index of inequality for this vector is \(\Psi _{2,n}=0.8472\). Hence, \(n=7\) and thus \(M=\lceil 3.5 \rceil =4\), which gives the median income \(X_{4:7}=7\). There are three struggling persons \(S_1\), \(S_2\), and \(S_3\) with incomes 1, 3, and 5, respectively, and three well-off persons \(W_1\), \(W_2\), and \(W_3\) with incomes 10, 20, and 24, respectively (see the top-left panel in Fig. 16 for a visualization). The horizontal dashed line in each panel of Fig. 16, noted as “egalitarian income” and plotted at the height 10, refers to the egalitarian redistribution of the above specified incomes (whose sum is equal to 70) among the seven participating persons. Various transfers of the incomes are visualized in the other panels of the figure, with red dots depicting the incomes of the person(s) whose incomes decreased due to transfers to less fortunate one(s), highlighted in green. We next discuss what we see in the three panels of Fig. 16 depicting various transfers.
Top-right panel.
The panel depicts the transfer \(W_1{\mathop {\longleftarrow }\limits ^{c}}W_2\) among two well-off persons of the insufficient for inequality decrease amount \(c=2\). Hence, the resulting distribution
retains the same value of the index \(\Psi _{2,n}=0.8472\) as distribution (5.3).
To see what amounts are sufficient and what insufficient, we note that condition (5.2) is equivalent to
For the ordering of incomes to remain the same after the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\), we need the restriction
Hence, to decrease income inequality according to the index \(\Psi _{2,n}\), the person H needs to transfer to L more than 2, but less than 5 to avoid swapping the position with L.
Bottom-left panel.
The panel depicts the transfer \(W_1{\mathop {\longleftarrow }\limits ^{c}}W_2\) of the sufficient for inequality decrease amount \(c=4\), in which case we have
with the value of the index \(\Psi _{2,n}=0.8442\).
Bottom-right panel.
We now consider a more complex situation when every well-off person commits to improving the incomes of the three struggling persons, with the final distribution of incomes becoming (4, 5, 6, 7, 9, 18, 21) . We can achieve this distribution in several steps, each reducing income inequality and maintaining the original ordering of the seven persons. Recall that we start from the vector (1, 3, 5, 7, 10, 20, 24) , whose inequality index is \(\Psi _{2,n}=0.8472\), and the steps could be these: The transfer \(S_3 {\mathop {\longleftarrow }\limits ^{1}} W_1\) results in the distribution
with the index \(\Psi _{2,n}=0.8296\). The transfer \(S_2{\mathop {\longleftarrow }\limits ^{2}} W_2\) results in
with the index \(\Psi _{2,n}=0.7870\). Finally, the transfer \(S_1{\mathop {\longleftarrow }\limits ^{3}} W_3\) results in the distribution
depicted in the bottom right panel of Fig. 16 and having the index \(\Psi _{2,n}=0.6640\). All these are inequality-reducing transfers from well-off persons to struggling ones. A continuation of this example is given in Appendix 1 with another instructive set of steps leading to distribution (5.8).
5.3 Index \(\Psi _{3,n}\)
Property 8
In the case of struggling L and well-off H (i.e., \(L<M<H\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) decreases the value of the index \(\Psi _{3,n}\), that is, \(\Psi _{3,n}(\textbf{X})> \Psi _{3,n}(\textbf{X}')\).
Property 9
When both L and H are well-off (i.e., \(M<L<H\)), or when both are struggling (i.e., \(L<H< M\)), the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) increases the value of the index \(\Psi _{3,n}\), that is, we have \(\Psi _{3,n}(\textbf{X})< \Psi _{3,n}(\textbf{X}')\).
Hence, when the goal is to decrease income inequality, these two properties say that well-off persons must transfer to struggling persons, and the index discourages transfers between two well-off persons, or between two struggling ones, as the index views such transfers manipulative with no real consequences. Whether we agree with this or not determines whether or not we want adopt the index \(\Psi _{3,n}\) for measuring income inequality.
5.4 A numerical example
Having by now discussed the three indices and their properties, we next have a numerical example that illustrates the performance of the three indices side-by-side. Namely, consider the six distributions of incomes specified in (5.3)–(5.8) and visualized in Fig. 16. Table 5 contains the numerical values of the three indices for the six income distributions. Note that the original (or initial) incomes are given by distribution (5.3), from which various transfers are executed, with post-transfer index values given in the columns to the right of (5.3). We next discuss these post-transfer index values with respect to the pre-transfer values, which are reported in column (5.3). For this, it is instructive to have the six distributions side-by-side, with the median (equal to 7) in bold separating the struggling (on the left) from the well-off (on the right) persons:
The index \(\Psi _{1,n}\) values remain unchanged after the transfers from (5.3) to (5.4), and also from (5.3) to (5.5), because the transfers are among the well-off persons (Property 5). The index values decrease more and more when the transfers from (5.3) are made to (5.6), (5.7), and (5.5), because the three transfers are from well-off persons to the struggling ones (Property 4), and more and more are being transferred to the struggling persons, as seen by comparing distributions (5.6)–(5.8).
In minute details, the performance of the index \(\Psi _{2,n}\) has been discussed in Example 2. Here is its summary: The value of \(\Psi _{2,n}\) does not change when moving from distribution (5.3) to (5.4) because the transfer amount is the boundary case (Property 7), meaning that it neither increases nor decreases the index. The transfer from (5.3) to (5.5) is, however, sufficiently large to decrease the index, even though the transfer occurs among the well-off persons (Property 7). The transfers from (5.3) to (5.6), (5.7), and (5.5) are from well-off persons to struggling ones, and since increasing amounts are being transferred, the index \(\Psi _{2,n}\) values decrease more and more (Property 6).
Contrary to what the classical Pigou-Dalton principle postulates, the index \(\Psi _{3,n}\) discourages transfers among the well-off (as well as among the struggling) persons (Property 9). This is reflected by the increased values of the index in the case of transfers from (5.3) to (5.4), and also from (5.3) to (5.5). The index, however, starts to decrease, and more so, for transfers from (5.3) to (5.6), (5.7), and (5.5), because the three transfers are from well-off persons to the struggling ones (Property 8).
6 Conclusion
In this paper we have explored three inequality indices that reflect three different views of measuring income inequality:
-
(1)
The median income of the poor is compared with the median income of the entire population. This is index \(\Psi _{1,n}\). It decreases when a well-off person transfers any amount to a struggling one, provided that the transfer does not change the ranking of the persons. However, the index does not change when the transfer happens between two well-off persons, or between two struggling ones, provided that the transfer does not change the ranking of the persons.
-
(2)
The median income of the poor is compared with the median income of those who are not poor. This is index \(\Psi _{2,n}\). It decreases when a well-off person transfers any amount to a struggling one, or when transfer occurs among two struggling persons, provided that the transfer does not change the ranking of the persons. However, only large transfers among well-off persons decrease the index, and increase when only small amounts are transferred, provided that the transfers do not change the ranking of the persons.
-
(3)
The median income of the poor is compared with the median of the same proportion of the richest. This is index \(\Psi _{3,n}\). It decreases when a well-off person transfers any amount to a struggling one, provided that the transfer does not change the ranking of the persons. The index, however, increases when two well-off persons transfer any amount among themselves, or when struggling persons transfer any amount among themselves, provided that the transfer does not change the ranking of the persons.
Hence, in view of how transfers affect the indices, we may decide which of the three indices to use (or not to use) in actual data analyses. For example, when it is of interest to see whether well-off persons help the poor ones, irrespective of what is happening inside the well-off group, or inside the struggling one, then the index \(\Psi _{1,n}\) should be preferred.
To facilitate practical implementation and analyses at the data and population levels, we have presented the three inequality indices and their equality curves in two ways: one that is suitable for modeling populations, and the other one that is suitable for direct data-focused computations. In particular, the indices and their curves have been illustrated using popular parametric models of income distributions, and also calculated and interpreted using real data. Such results facilitate the development of statistical inference, as seen from the contributions by Prendergast and Staudte (2016, 2018), Oancea and Pirjol (2019), Jokiel-Rokita and Pia̧tek (2023), and Pia̧tek (2023).
Important statistical work remains to be done in the area. For example, decomposition of the indices by subpopulations, income components, intra- and inter-groups inequalities are among the topics of immediate interest (e.g, Amate-Fortes et al. 2021; Qiu et al. 2021), and for a sample of methodological research in the case of the Gini, Zenga, and related indices we refer to Radaelli (2010), Porro and Zenga (2020), and Zenga and Jȩdrzejczak (2020).
The indices do not require any finite moment and therefore are suitable to analyze any population, including ultra heavily tailed, i.e., without any finite moment, unlike the Gini and many other classical indices whose definitions require a finite first moment. Developing statistical inference in such situations usually relies on Extreme Value Theory, and for a glimpse of related to our current study research, we refer to Greselin et al. (2014), where, based on empirical evidence, it is noted that some income distributions may not have finite first moments.
References
Amate-Fortes, I., Guarnido-Rueda, A., Martínez-Navarro, D., Oliver-Márquez, F.J.: Measuring inequality in income distribution between men and women: what causes gender inequality in Europe? Qual. Quant. Int. J. Methodol. 55, 395–418 (2021)
Amiel, Y., Cowell, F.: Thinking about inequality. Cambridge University Press, Cambridge (1999)
Atkinson, A.B., Bourguignon, B.: Handbook of income distribution, vol. 1. Elsevier, Amsterdam (2000)
Atkinson, A.B., Bourguignon, B.: Handbook of income distribution, vol. 2. Elsevier, Amsterdam (2015)
Atkinson, A.B., Piketty, T.: Top incomes over the twentieth century: a contrast between continental European and English-speaking countries. Oxford University Press, Oxford (2007)
Bennett, C.J., Zitikis, R.: Ignorance, lotteries, and measures of economic inequality. J. Econ. Inequal. 13, 309–316 (2015)
Bonferroni, C.E.: Elementi di Statistica Generale. Libreria Seeber, Firenze (1930)
Brazauskas, V., Greselin, F., Zitikis, R.: Measuring income inequality via percentile relativities. Technical report (2023) available at https://doi.org/10.48550/arXiv.2308.03708
Champernowne, D.G., Cowell, F.A.: Economic inequality and income distribution. Cambridge University Press, Cambridge (1998)
Cowell, F.A.: Measuring inequality, 3rd edn. Oxford University Press, Oxford (2011)
Davydov, Y., Greselin, F.: Inferential results for a new measure of inequality. Econom. J. 22, 153–172 (2019)
Davydov, Y., Greselin, F.: Comparisons between poorest and richest to measure inequality. Sociol. Methods Res. 49, 526–561 (2020)
Donaldson, D., Weymark, J.A.: A single-parameter generalization of the Gini indices of inequality. J. Econom. Theory 22, 67–86 (1980)
ECHP: European Community Household Panel. Eurostat, European Union. (2001). https://ec.europa.eu/eurostat/web/microdata/european-community-household-panel
EU-SILC: EU Statistics on Income and Living Conditions. Eurostat, European Union. (2018). https://ec.europa.eu/eurostat/web/microdata/european-union-statistics-on-income-and-living-conditions
Garratt, D.: Wealth and income inequality in Britain. The Sloman Economics News Site, Pearson Education. (2020). https://pearsonblog.campaignserver.co.uk/patterns-in-british-wealth-and-income-inequality/
Gastwirth, J.L.: Median-based measures of inequality: reassessing the increase in income inequality in the U.S. and Sweden. Stat. J. IAOS 30, 311–320 (2014)
Gini, C.: Sulla misura della concentrazione e della variabilità dei caratteri. In: Atti del Reale Istituto Veneto di Scienze, Lettere ed Arti. Anno Accademico 1913–1914, Tomo LXXII - parte seconda. Premiate Officine Grafiche C. Ferrari, Venezia, 1201–1248 (1914)
Gini, C.: Variabilità e Mutabilità: Contributo allo Studio delle Distribuzioni e delle Relazioni Statistiche. Tipografia di Paolo Cuppini, Bologna (1912)
Greselin, F., Zitikis, R.: From the classical Gini index of income inequality to a new Zenga-type relative measure of risk: a modeller’s perspective. Econometrics (Special issue on “Econometrics and Income Inequality” with Guest Editors Martin Biewen and Emmanuel Flachaire), 6, 1–20. (Article #4) (2018)
Greselin, F., Pasquazzi, L.: Asymptotic confidence intervals for a new inequality measure. Commun. Stat. Comput. Simul. 38(8), 17–42 (2009)
Greselin, F., Pasquazzi, L., Zitikis, R.: Heavy tailed capital incomes: Zenga index, statistical inference, and ECHP data analysis. Extrem. Stat. Theory Appl. Sci. Eng. Econ. 17, 127–155 (2014)
Jokiel-Rokita, A., Pia̧tek, S.: Nonparametric estimators of inequality curves and inequality measures. Technical report (2023) available at https://doi.org/10.48550/arXiv.2310.16200
Kakwani, N.C.: Income inequality and poverty: methods of estimation and policy applications. Oxford University Press, New York (1980)
Kleiber, C., Kotz, S.: Statistical size distributions in economics and actuarial sciences. Wiley, Hoboken (2003)
Nygård, F., Sandström, A.: Measuring income inequality. Almqvist & Wiksell, Stockholm (1981)
Oancea, B., Pirjol, D.: Extremal properties of the Theil and Gini measures of inequality. Qual. Quanti. Int. J. Methodol. 53, 859–869 (2019)
Piątek, S.: Parametric estimation of quantile versions of Zenga and \(D\) inequality curves: methodology and application to Weibull distribution. Technical report (2023) available at https://doi.org/10.48550/arXiv.2312.07350
Piketty, T.: Capital in the twenty-first century, English Harvard University Press, Cambridge (2014)
Porro, F., Zenga, M.: Decomposition by subpopulations of the Zenga-84 inequality curve and the related index \(\zeta\): an application to 2014 Bank of Italy survey. Stat. Methods Appl. 29, 187–207 (2020)
Prendergast, L.A., Staudte, R.G.: Quantile versions of the Lorenz curve. Electron. J. Stat. 10, 1896–1926 (2016)
Prendergast, L.A., Staudte, R.G.: A simple and effective inequality measure. Am. Stat. 72, 328–343 (2018)
Qiu, L.J., Zhong, S.B., Sun, B.W., Song, Y., Chen, X.H.: Is internet penetration narrowing the rural-urban income inequality? A cross-regional study of China. Qual. Quant. Int. J. Methodol. 55, 1795–1814 (2021)
Radaelli, P.: On the decomposition by subgroups of the Gini index and Zenga’s uniformity and inequality indexes. Int. Stat. Rev. 78, 81–101 (2010)
Sen, A.: On economic inequality. In: Foster, J.E., Sen, A. (eds.) Expanded edition with a substantial Annexe. Clarendon Press, Oxford (1997)
Sen, A.: Choice, welfare and measurement (second printing). Harvard University Press, Cambridge (1998)
Weymark, J.A.: Generalized Gini inequality indices. Math. Soc. Sci. 1, 409–430 (1980/81)
Yitzhaki, S.: More than a dozen alternative ways of spelling Gini. Res. Econ. Inequal. 8, 13–30 (1998)
Yitzhaki, S., Schechtman, E.: The Gini methodology. Springer, New York (2013)
Zenga, M.: Inequality curve and inequality index based on the ratios between lower and upper arithmetic means. Stat. Appl. 5, 3–27 (2007)
Zenga, M., Jȩdrzejczak, A.: Decomposition of the Zenga inequality index \(I (Y)\) Into the contributions of macro-regions and income components-an application to data from Poland and Italy. Argum. Oecon. 44, 101–125 (2020)
Zitikis, R., Gastwirth, J.L.: Asymptotic distribution of the S-Gini index. Aust. N. Z. J. Stat. 44, 439–446 (2002)
Funding
Open access funding provided by Università degli Studi di Milano - Bicocca within the CRUI-CARE Agreement. This research has been supported by the NSERC Alliance–MITACS Accelerate grant (ALLRP 580632-22) entitled “New Order of Risk Management: Theory and Applications in the Era of Systemic Risk” from the Natural Sciences and Engineering Research Council (NSERC) of Canada, and the national research organization Mathematics of Information Technology and Complex Systems (MITACS) of Canada, as well as by the individual NSERC Discovery Grant of R. Zitikis (RGPIN-2022-04426).
Author information
Authors and Affiliations
Contributions
All authors equally contributed to the study conception and design. Material preparation, data collection and analysis were performed by all authors, who also drafted the manuscript and worked on all its versions. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A Technicalities
A Technicalities
Proof
(Justification of definitions (4.1)–(4.3)) The three empirical indices arise from formulas (2.4)–(2.8) by first replacing the population quantile function Q by the empirical quantile function \(Q_n\) in all the formulas. (We have asymptotically insignificantly modified the obtained expressions to facilitate their intuitive appeal). In detail, with \(F_n\) denoting the empirical cumulative distribution function based on \(X_{1},\dots , X_{n}\), the empirical quantile function is given by Eq. (4.4). Thus, for example, \(Q_n(1/2)=X_{M:n}\) with \(M=\lceil n/2 \rceil\) is the empirical median used in the definition of \(\Psi _{1,n}\). Note also that \(\lfloor n/2 \rfloor + M = n\), and thus the definition of the index \(\Psi _{2,n}\) does not go beyond the random variables \(X_{1},\dots , X_{n}\). \(\square\)
Proof of Property 2
The inequality holds because \(a/b\le (a+c)/(b+c)\) for all (positive) \(a\le b\) and \(c\ge 0\), and to have the strict inequality, we note that \(a/b< (a+c)/(b+c)\) holds for all (positive) \(a< b\) and \(c> 0\).
Proof of Property 3
The property follows from \((a+c)/(b+c)\rightarrow 1\) when \(c\rightarrow \infty\) irrespective of the values of (positive) \(a\le b\).
Proof
(Details of definition (5.1))
where L and H are integers such that \(1\le L<H\le n\), and \(c>0\) is any positive real number (i.e., the amount transferred from H to L) such that the following ordering holds:
When inequalities (6.1) hold, we succinctly denote this transfer by \(L {\mathop {\longleftarrow }\limits ^{c}} H\).
Proof of Property 4
Since \(L< M < H\), the increase in L’s income affects the index
because
whereas the decrease in H’s income does not affect \(\Psi _{1,n}\) because H is not among the terms making up the definition of the index. Hence, \(\Psi _{1,n}(\textbf{X})> \Psi _{1,n}(\textbf{X}')\). \(\square\)
Proof of Property 5
When \(M<L<H\), the index \(\Psi _{1,n}\) is not affected by the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) because, mathematically speaking, L and H are outside the summation range due to \(\lfloor n/2 \rfloor \le M\) and, according to property (6.1), the transfer does not change the ordering of incomes. In other words, the median income and the incomes below it are not affected by the transfer, and we therefore have \(\Psi _{1,n}(\textbf{X})= \Psi _{1,n}(\textbf{X}')\).
When \(L<H< M\), both L and H are among the terms in the sum making up the definition of \(\Psi _{1,n}\). Since we have the equations
the value \(\Psi _{1,n}\) is not affected by the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\). This implies \(\Psi _{1,n}(\textbf{X})= \Psi _{1,n}(\textbf{X}')\) and establishes Property 5. \(\square\)
Proof of Property 6
Consider first the case when \(L< M < H\). Since \(L< M\), we have \(L\le \lfloor n/2 \rfloor\), and so the index
is affected by the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) because
which implies \(\Psi _{2,n}(\textbf{X})> \Psi _{2,n}(\textbf{X}')\).
When \(L<H< M\), we have \(L<H \le \lfloor n/2 \rfloor\), and so
with the inequality holding because \(X_{M +L:n}<X_{M +H:n}\). Hence, \(\Psi _{2,n}(\textbf{X})> \Psi _{2,n}(\textbf{X}')\), thus concluding the proof of Property 6.
Proof of Property 7
Since \(M<L<H\), the incomes of L and H are above the median \(X_{M:n}\), and so there are two k’s in the sum in the definition of \(\Psi _{2,n}\) that give \(M+k=L\) and \(M+k=H\), respectively. Consequently, \(\Psi _{2,n}(\textbf{X})> \Psi _{2,n}(\textbf{X}')\) holds if and only if the following inequality holds:
Simple algebra shows that the inequality is equivalent to \(c> c_2\), where \(c_2\) is defined by Eq. (5.2). This establishes Property 7. \(\square\)
Proof of Property 7
(Continuation of Example 2) Alternatively, without delving into the psychology of people participating in various transfers and thus the plausibility of such transfers, we can have the following steps, some of which involving two well-off persons and some involving both well-off and struggling persons, leading to the same end-result (4, 5, 6, 7, 9, 18, 21) as in the first part of Example 2:
-
(1)
\(W_1 {\mathop {\longleftarrow }\limits ^{3}} W_2\) results in (1, 3, 5, 7, 13, 17, 24) with \(\Psi _{2,n}=0.8461\)
-
(2)
\(W_2 {\mathop {\longleftarrow }\limits ^{3}} W_3\) results in (1, 3, 5, 7, 13, 20, 21) with \(\Psi _{2,n}=0.8450\)
-
(3)
\(S_3{\mathop {\longleftarrow }\limits ^{1}} W_2\) results in (1, 3, 6, 7, 13, 19, 21) with \(\Psi _{2,n}=0.8265\)
-
(4)
\(S_2{\mathop {\longleftarrow }\limits ^{1}} W_2\) results in (1, 4, 6, 7, 13, 18, 21) with \(\Psi _{2,n}=0.8050\)
-
(5)
\(S_2 {\mathop {\longleftarrow }\limits ^{1}} W_1\) results in (1, 5, 6, 7, 12, 18, 21) with \(\Psi _{2,n}=0.7844\)
-
(6)
\(S_1{\mathop {\longleftarrow }\limits ^{3}} W_1\) results in (4, 5, 6, 7, 9, 18, 21) with \(\Psi _{2,n}=0.6640\)
Step 1 is justified by our earlier argument at the beginning of this example saying that any transfer higher than 2 but less than 5 from \(W_2\) to \(W_1\) is legitimate, and we transfer \(c=3\). To justify Step 2, we note that we can only transfer less than \((24-17)/2=3.5\) but more than \((3\times 24^2 - 5\times 17^2) /(3\times 24+ 5\times 17)= 1.8025\), and so we transfer \(c=3\). All Steps 3–6 are from well-off persons to struggling ones, and so the only requirement on the transfers is that they should maintain the original ordering of incomes.
Proof of Property 8
The transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) affects the index
via both L and H because
which implies \(\Psi _{3,n}(\textbf{X})> \Psi _{3,n}(\textbf{X}')\) and establishes Property 8.
Proof of Property 9
Consider first the case of two well-off persons, that is, \(M<L<H\). Since \(\lfloor n/2 \rfloor + \lceil n/2 \rceil =n\) and \(M=\lceil n/2 \rceil\), we have \(n-k+1>M\) for every \(k\le \lfloor n/2 \rfloor\). Consequently, there are two k’s in the sum in the definition of \(\Psi _{3,n}\) that give \(n-k+1=L\) and \(n-k+1=H\), respectively, because \(M<L<H\). Hence, the inequality \(\Psi _{3,n}(\textbf{X})\ge \Psi _{3,n}(\textbf{X}')\) holds if and only if
which is equivalent \(c\ge c_3\), where
Recall now that the transfer \(L {\mathop {\longleftarrow }\limits ^{c}} H\) does not change the ordering of incomes, and thus we must have \(X_{L:n}+c < X_{H:n}-c\), which is equivalent to \(c<c_0\), where
Hence, to have \(L {\mathop {\longleftarrow }\limits ^{c}} H\) for some \(c>0\), we must have \(c_3<c_0\), which is equivalent to
which simplifies to
The latter inequality is impossible because \(L<H\) implies \(X_{H:n}>X_{L:n}\) and \(X_{n-H+1:n} < X_{n-L+1:n}\). Consequently, it is impossible to have \(c_3<c_0\) and so there is not a single \(c>0\) that satisfies \(c\ge c_3\) and \(c<c_0\) simultaneously. This shows that the only possibility that exists is \(\Psi _{3,n}(\textbf{X})< \Psi _{3,n}(\textbf{X}')\).
Consider now the case of two struggling persons, that is, \(L<H< M\). In this case we have \(L<H\le \lfloor n/2 \rfloor\) and so \(L {\mathop {\longleftarrow }\limits ^{c}} H\) affects \(\Psi _{3,n}\) because of the inequality
that holds due to \(X_{n-L+1:n} > X_{n-H+1:n}\). Hence, \(\Psi _{3,n}(\textbf{X})< \Psi _{3,n}(\textbf{X}')\), concluding the proof of Property 9.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Brazauskas, V., Greselin, F. & Zitikis, R. Measuring income inequality via percentile relativities. Qual Quant 58, 4859–4896 (2024). https://doi.org/10.1007/s11135-024-01881-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-024-01881-2