1 Introduction

Income distribution and its relation to economic activity and social welfare have become one of the main interests of numerous theoretical and empirical studies over the last three decades. Some of these papers are based on social welfare functions and assess the trade-off between efficiency and equity (Nicola 2013).

An empirical analysis of welfare and economic performance involves accounting for multiple socio-economic characteristics. Welfare depends on both the size and distribution of national income. With its capacity to account for multiple inputs and outputs, data envelopment analysis (DEA) has proved to be an appropriate tool in this domain. Prasada Rao and Coelli (1998, 2002) were the first to embed inequality indicator in the measuring of economic performance. Lábaj et al. (2014) later derived a measure of social welfare which simultaneously assesses economic, ecological, and social dimensions. Using DEA the overall change in productivity described by the Malmquist index can be decomposed to expose catch-up and frontier-shift effects (see e.g. Färe et al. 1994; Cooper et al. 2007).

As pointed out by Nicola (2013, p. vi), many indicators of welfare other than income, e.g. education, good health, or happiness show a positive correlation between per capita income and a satisfaction score. A new indicator for measuring economic performance beyond GDP which includes inequality was developed by Jones and Klenow (2016).

Statistically based studies that measure income distribution indicate a widening gap between the income of rich and poor individuals during the last three decades (e.g. Atkinson 2015; Atkinson et al. 2011). Following Feldstein (2005), “the emphasis should be on eliminating poverty and not on the overall distribution of income or the general extent of inequality. Like most economists, I accept the Pareto principle that an economy is better off if someone gains and no one loses. This is true even if the gainer has above-average income, causing a Gini coefficient measure of income distribution to shift to greater inequality”, (p. 12). Atkinson (2015) supports this view: “I share his concern with what is happening at the bottom of the income scale” (p. 23).

The simplest way to extract the poverty indicator from income distribution is to define an income value, ymin, and to declare as poor those people with an income that does not exceed it. However, this so-called head-count ratio does not take into account income distribution among individuals in this group. For the same Gini coefficient, the income share of the poorest group in relation to total income can be different. A deeper insight in the structure of the inequality measure is needed.

The remainder of the paper is organized as follows. The next section is devoted to the measurement of income distribution based on DEA methodology. As the evaluation of income distribution encompasses both descriptive and prescriptive issues (Sen 2000), some preferences need to be included in the prospective method. In the spirit of Rawls and following Feldstein (2005) we introduce decreasing weighting for income quantile shares, with the highest weight for the poorest group. A assurance region DEA model is employed to implement these weight restrictions. A method of assessing of productivity change over time using the Malmquist index is presented afterwards. In Sect. 3 empirical results are presented to demonstrate how the proposed measure performs compared to the Gini index, providing deeper insight into individual distributions in 29 European countries. The intertemporal analysis spans the period of 2007–2016. Section 4 concludes and outlines further research.

2 Measurement of inequalities

Over the years, a multitude of indices haves been developed to facilitate the designing and evaluating of policies with respect to social development. Most of them aim to capture aspects beyond GDP utilizing a range of indicators, famously articulated in Stiglitz et al. (2009). Leaving aside environmental issues, the most discussed aspect of well-being is social justice, particularly the distribution of wealth. There are a number of indicators capturing the distribution of wealth or income on an aggregate level, including the Gini or Theil indices as a class of statistically based indicators. These indices are closely linked to social welfare functions. Quantities characterizing health or education are closely correlated with income. This is why income distribution analyses dominate the domain of the social aspect of well-being research.

Measures like the head-count ratio for the poverty index or the widely used Gini coefficient do not provide deeper insights into income distribution among the different types of households. It is important to know not only the surface generated by the Lorenz curve for the estimation of the Gini coefficient but also the curvature of this curve. Adopting the definition of justice as representing “…the basic structure of society” (Rawls 1971) and Feldstein’s (2005) focus on eliminating poverty, we would value overall welfare with respect to the lowest income. We deviate from the strict max–min principle, maintaining however the spirit of lower incomes’—greater contribution to total welfare. Since detailed data on income brackets are hardly available for a study across European countries, we make use of income distribution data where income shares per population quantiles are accessible. The more income is received by the poorest, the higher “Rawlsian” social welfare would be generated.

2.1 Measurement of inequalities with a weight restriction in DEA

In order to provide deeper insights into income distribution among the different income groups, e.g. quantiles of households by level of income) and, thus, to reveal the potential for reducing income inequalities we propose a modified DEA model. DEA was originally developed for measuring efficiency and evaluating the performance of many different kinds of entities engaged in many different activities (Charnes et al. 1978). Applications of DEA include evaluations of the performance of cities, regions and countries with different kinds of inputs and outputs which can be expressed in different (monetary, physical) units. DEA does not require any assumption about the functional form of the transformation function and provides a single measure of efficiency without needing to assign prespecified weights to multiple inputs and outputs. It measures the efficiency of a decision making unit (DMU) relative to all other DMUs with the simple restriction that all other DMUs lie on or below the efficiency frontier. For each inefficient DMU DEA identifies the sources and level of inefficiency for each of the inputs and outputs.

The basic DEA model can easily be extended according to the research question or the field of applications (an extensive account is given, for example, in Cooper et al. 2007). One very useful modification is related to the situation where differing importance or priorities with respect to particular inputs or outputs are defined and should be taken into account. Accepting Rawls criterion and Feldstein’s argument for increasing social welfare—an increase in poorer household’s income generates higher social welfare than an increase in richer households’ income—we propose the idea of an “assurance region” DEA model.

Let us illustrate this idea using a very simple example with two outputs (the shares of income owned by two types of households, poor and rich, whose values add up to one) and three countries (DMUs). The data are given in the following table.

Country

Output 1

Output 2

Input

A

0.3

0.7

1

B

0.7

0.3

1

C

0.5

0.5

1

In the output oriented model aiming at maximizing the aggregated sum of the outputs, the linear programming constraints associated with countries A, B, and C have the following inequalities in common along with the non-negativity constraint for all variables:

$$ v - 0.3u_{1} - 0.7u_{2} \ge 0 $$
(A)
$$ v - 0.7u_{1} - 0.3u_{2} \ge 0 $$
(B)
$$ v - 0.5u_{1} - 0.5u_{2} \ge 0, $$
(C)

where v denotes the input multiplier and u1, u2 label output multipliers.

This is a very simple output oriented model where all three countries are efficient (none of the countries is dominated by another one or a combination of the other countries). In order to demonstrate the role of weight for identifying the Charnes–Cooper–Rhodes (CCR) efficiency of DMUs, we will graphically show the efficiency frontier for this example in the weight variables (multiplier) space. Because there are only two outputs and one unitized input, we can plot a two-dimensional graph.

Dividing the inequalities constraints by v > 0, the following expressions are obtained:

$$ 0.3\,\frac{{u_{1} }}{v}+ 0.7\,\frac{{u_{2} }}{v}\le 1 $$
(A)
$$ 0.7\,\frac{{u_{1} }}{v}+ 0.3\frac{{u_{2} }}{v}\le 1 $$
(B)
$$ 0.5\,\frac{{u_{1} }}{v}+ 0.5\frac{{u_{2} }}{v}\le 1 $$
(C)

Taking u1/v and u2/v as axes, the area P ≡ OP4P0P3 in Fig. 1 shows the set of feasible solutions for the above constraints. The line segments P3P0 and P0P4, where P0 is intersection of the three lines describe the efficiency frontier.

Fig. 1
figure 1

Feasible region P

All three countries A, B, and C are efficient. Any (u1, u2) on the line segment P3P0 expresses the optimal weights for A, segment P0P4 represents optimal combinations of weights for B, and P2P0 for country C. It is easy to see that the optimal weights for an efficient DMU need not to be unique.

In the spirit of Rawls we now postulate a higher weight for the income of the poorer households. For this purpose we introduce condition u2 ≤ 0.7u1 (ensuring that u1 > u2 and allowing for a graphically discernible representation). We thus require that the weight for the income of a poor household is at least 1.43 times higher than the weight for a rich household. In the empirical part of the paper, we will alter the lower bound for the weights ratio and analyse the effect of the change for the discrimination power of our measure.

To analyse the influence of the assurance region method on the efficiency frontier and to illustrate our inequality measure, we add the weight constraint u2 ≤ 0.7u1 in the (v, u) space in Fig. 1. Dividing the weight restriction inequality by v > 0 and rewriting we obtain

$$ \frac{{u_{2} }}{v} - 0.7\frac{{u_{1} }}{v}\le 0 $$

The segment line OR corresponding to the equation, obtained by rewriting the above inequality intersects the segment line P0P4 at point R in Fig. 2. Consequently, the feasible region in the (v, u) space is now represented by the triangle ORP4, and the line segments P3P0 and P0R are shut out from the efficiency frontier by the assurance region constraint. The efficiency frontier is now described by the line segment RP4 and the only efficient unit remains country B.

Fig. 2
figure 2

Assurance region

Taking a look at the data confirms that, this result is in accordance with economic reasoning. The income’s share of household 1 (output 1 with higher weight) in country B is higher than in the other two countries.

To see the impact of the weight restriction on the efficiency scores of the DMUs A and C and to provide an interpretation of our DEA inequality measure we move the line segments P3Po (which corresponds to country A) parallel to touch the efficiency frontier at point R with coordinates u1/v = 1.1 and u2/v = 0.77. The efficiency score of country A is then

$$ 0.3\,\frac{{u_{1} }}{v} + 0.7\,\frac{{u_{2} }}{v}= 0.869 $$

Thus the efficiency of DMU A drops to 0.869. Similarly, if we move the line segment P2P5 parallel to point R, the efficiency of DMU C is reduced to 0.934. The impact of the weight constraint for country C is smaller than for A, which is not surprising looking at the data in our example.

Summarizing, the inequality indicator obtained by an assurance region modification of the CCR model is smaller or equal to one and describes the amount of outputs that must be redistributed in order to be projected on the frontier. The greater the redistribution that is needed, the lower the value of the resulting score. Unit value of the score indicates that the DMU lies on the efficiency frontier defining the benchmark and that no redistribution is needed.

2.2 Assurance region programs

Extending the considerations from the previous section for more than two outputs and with more countries under assessment, the output oriented radial model, in the notation of Cooper et al. (2007) takes the following form.

$$ (R1)\quad \mathop {\hbox{min} }\limits_{{{\mathbf{u}},{\mathbf{v}}}} \quad {\mathbf{v}}^{\text{T}} {\mathbf{x}}_{0} $$
(1)
$$ {\text{s}} . {\text{t}} .\quad {\mathbf{u}}^{\text{T}} {\mathbf{y}}_{0} = 1 $$
(2)
$$ - {\mathbf{v}}^{\text{T}} {\mathbf{X + u}}^{\text{T}} {\mathbf{Y}}\le {\mathbf{0}} $$
(3)
$$ {\mathbf{u}}^{\text{T}} {\mathbf{Q}} \le {\mathbf{0}} $$
(4)
$$ {\mathbf{u}} \ge {\mathbf{0}},\,\,{\mathbf{v}} \ge {\mathbf{0}}, $$
(5)

where X and Y are input and output data matrices respectively (with n DMUs, m inputs and s outputs), u and v are multipliers, and while Q is a matrix of bounds for the output weights.

From the point of view of decision makers benchmarks providing theoretical support for redistributive politics are of the great interest. The proposed model projects observed data onto the efficiency frontier utilizing optimal solutions from the dual envelopment program

$$ \mathop {\hbox{max} }\limits_{{\varphi ,{\varvec{\uplambda}},{\varvec{\uptau}}}} \quad \varphi $$
(6)
$$ {\text{s}} . {\text{t}} .\quad {\mathbf{x}}_{0} \ge {\mathbf{X\lambda }} $$
(7)
$$ \varphi {\mathbf{y}}_{0} \le {\mathbf{Y\lambda }} + {\mathbf{Q\tau }} $$
(8)
$$ {\varvec{\uplambda}} \ge {\mathbf{0}},\,\,{\varvec{\uptau}} \ge {\mathbf{0}}, $$
(9)

where τ are dual variables associated with weight restrictions in (4). The output projections of interest are then calculated by means of the optimal values λ* as \( {\hat{\mathbf{y}}}_{0} = {\mathbf{Y\lambda }}^{*} \).

The RI index derived and illustrated in Sect. 2.1 assesses the level of inequality in a given period of time as the level of redistributions among households needed in order to be projected on the frontier.

This approach can be further utilized to analyse intertemporal changes in individual countries and/or the shift of the frontier. For this purpose the Malmquist productivity index (10) is employed. Based on the pioneering work of Caves et al. (1982), Färe and Grosskopf (1992) defined the index in terms of distance functions triggering a mass of studies employing the approach in a variety of applications up until now.

$$ M = C \times F = \frac{{d^{2} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{2} }}{{d^{1} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{1} }}\left[ {\frac{{d^{1} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{1} }}{{d^{2} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{1} }} \times \frac{{d^{1} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{2} }}{{d^{2} ({\mathbf{x}}_{0} ,{\mathbf{y}}_{0} )^{2} }}} \right]^{1/2} $$
(10)

Expression (10) shows that the overall change in performance, in the case of the fixed input driven effectively by changes in the outputs, can be decomposed into two factors—the catch-up effect (C) and the frontier-shift effect (F).

A DMU0 represented by the activities (x0, y0)t, (t = 1, 2) is assessed in two time periods with respect to two technology frontiers in periods 1 and 2 by distance functions dS, (S = 1, 2). Thus we compute the terms involved in (10) by solving linear programs of two types. Within scores are obtained by optimization

$$ \begin{aligned} & d^{S} \left( {{\mathbf{x}}_{0} ,{\mathbf{y}}_{0} } \right)^{S} = \mathop {\,\,\hbox{max} }\limits_{{\varphi ,{\varvec{\uplambda}}}} \varphi \\ & {\text{s}} . {\text{t}} .\quad {\mathbf{x}}_{0}^{S} \ge {\mathbf{X}}^{S} {\varvec{\uplambda}} \\ & \varphi \,{\mathbf{y}}_{0}^{S} \le {\mathbf{Y}}^{S} {\varvec{\uplambda}} + {\mathbf{Q}}\tau \\ & {\varvec{\uplambda}} \ge 0,\,\,{\varvec{\uptau}} \ge 0, \\ \end{aligned} $$
(11)

where \( {\mathbf{X}}^{S} = ({\mathbf{x}}_{1}^{S} , \ldots ,{\mathbf{x}}_{m}^{S} ) \) and \( {\mathbf{Y}}^{S} = ({\mathbf{y}}_{1}^{S} , \ldots ,{\mathbf{y}}_{s}^{S} ) \) are input and output data matrices respectively for the period S.

Intertemporal scores come from the program in the form

$$ \begin{aligned} & d^{S} \left( {{\mathbf{x}}_{0} ,{\mathbf{y}}_{0} } \right)^{t} = \mathop {\,\,\hbox{max} }\limits_{{\varphi ,{\varvec{\uplambda}}}} \varphi \\ & {\text{s}} . {\text{t}} .\quad {\mathbf{x}}_{0}^{t} \ge {\mathbf{X}}^{S} {\varvec{\uplambda}} \\ & \varphi \,{\mathbf{y}}_{0}^{t} \le {\mathbf{Y}}^{S} {\varvec{\uplambda}} + {\mathbf{Q}}\tau \\ & {\varvec{\uplambda}} \ge 0,\,\,{\varvec{\uptau}} \ge 0, \\ \end{aligned} $$
(12)

This program is solved for the pairs (t, S) = (1, 2) and (2, 1). We thus employ the “exclusive” scheme (Cooper et al. 2007, p. 333) allowing the value of d in (12) to take on values lower than the unit in the sense of super-efficiency (Thomson et al. 1986).

3 Empirical results

For an empirical demonstration of the RI index performance we consider 29 European countries acting as DMUs. The data were sourced from Eurostat and comprise the Gini index and income shares of disposable income of five quintile population groups based on the EU-SILC survey, the latter entering our model as five outputs. For the intertemporal analysis we collected data from 2 years—2007 and 2016. Concentrating on outputs we fix input to unit value. The input data matrix thus collapses to a unit vector rendering the model robust to the returns to scale assumption. To increase the discriminating power of the model, we can set the Rawlsian weights constraints in a stricter way, letting the ratio of the successive multipliers be bounded by a specified number l. Matrix Q from (4) will then take the form

$$ {\mathbf{Q}} = \left( {\begin{array}{*{20}c} {l_{1,2} } & {\quad 0} & {\quad 0} & {\quad 0} \\ { - 1} & {\quad l_{2,3} } & {\quad 0} & {\quad 0} \\ 0 & {\quad - 1} & {\quad l_{3,4} } & {\quad 0} \\ 0 & {\quad 0} & {\quad - 1} & {\quad l_{4,5} } \\ 0 & {\quad 0} & {\quad 0} & {\quad - 1} \\ \end{array} } \right), $$

where 0 < li,j < 1, i = 1, 2, 3, 4; j = 2, 3 4,5. In this way a sequentially constrained weights are embodied in the DEA model. We exemplify our approach by choosing the values for li,j from the matrix Q equal to 0.9. This implies the following decreasing sequence of weights

$$ u_{1} > u_{2} > u_{3} > u_{4} > u_{5} . $$

Solving the assurance model for each DMU we calculate the RI indicator for 2016 as well as optimal values for the weights and dual variables. In Table 1 the data and values of the RI index are displayed along with the corresponding value of the Gini index. The complete results for all countries can be viewed in Table 6 (“Appendix A”).

Table 1 Data, RI scores, and the Gini index for selected countries (2016, l = 0.9).

The model determines four efficient DMUs with the unit score—the Czech Republic, Slovenia, Slovakia, and Norway, which are the countries with the lowest Gini coefficients. Noticeably, the Czech Republic gained its RI-efficiency from the massive income share of the first quintile income group (the poorest 20%). Slovenia and Slovakia (both relative efficient according to the RI) differ slightly in their Gini index at—24.4 and 24.3 respectively, with Slovenia offsetting an advantage in Q1 (9.5 vs. 9.3) by a poorer value in Q2 (14.9 vs. 15.2). Germany and Ireland with the same value of Gini (29.5) are indistinguishable with the chosen ratio of weights. When compared to Hungary, Ireland’s share in Q2 and Q3 is lower (while Q1 are equal) which results in a lower RI score for Ireland.

A comparison of the results for the Czech Republic and Denmark for the year 2007 (see “Appendix A”, Table 7) reveals the difference between our RI indicator (see column d11) and the Gini coefficient (see column G07). The lower Gini coefficient for Denmark (25.2) indicates a lower general extent of inequality than in the Czech Republic (where the Gini coefficient is 25.3). However the RI index for the Czech Republic is higher than for Denmark, reflecting the higher income share of the first quantile in the Czech Republic (10.1)—with the highest weight—compared with the income share for this group in Denmark (9.2). See “Appendix A”, Table 5 for further details.

Having demonstrated the sensitivity of the proposed measure, we derive results which are potentially useful for policy making. The projections for outputs suggest a desirable income share adjustment in individual quintile groups needed to perform at the best practice level. In this way, the potential identified for reducing income inequality provides further important merit and difference of the RI indicator compared with the Gini coefficient. An example of selected countries is given in Table 2. Adjustments in the table are computed as differences between the projection and the data, so positive values indicate the need to increase the share of a particular income group.

Table 2 Income share adjustment (2016, l = 0.9).

Naturally, in the Rawlsian sense the most massive redistribution would be needed in the “richest” quantiles as indicated by the negative values of the suggested adjustments. Clearly, for efficient DMUs like the Czech Republic, no changes are needed and the adjustments are zero. Since projections add up to unity, adjustments add up to zero (proof in “Appendix B”). The extreme value for Bulgaria suggests a need for an extensive redistribution from Q5 (− 10.5). The complete results are given in Table 6 (“Appendix A”).

The intertemporal analysis was conducted by calculating the d-terms described in Sect. 2.2, labeled d11 and d22 (within the scores for 2007 and 2016) along with d21 and d12 (intertemporal scores). Then the catch-up (C) and frontier-shift (F) effects as well as the overall Malmquist productivity index (M) were computed. A selection of countries is given in Table 3, the complete results can be seen in Table 7 (Appendix A).

Table 3 The Malmquist index, its components and the change in the Gini index for selected countries (l = 0.9).

In Table 3, within the score for the period 2 (d22) is identical to the RI index (efficiency score) for 2016 analysed above. The new insights provided by the RI index and its discriminating capacity in comparison to the Gini coefficient can be seen in the example of Slovakia versus the Czech Republic. Slovakia’s Gini index changed from 24.5 to 24.3 between 2007 and 2016, i.e. to lower inequality. However, the RI approach indicates a deterioration in the Rawlsian performance by the M value of 0.995 < 1, i.e. in the opposite direction. The Gini coefficient is an absolute measure of inequality while the DEA based RI index is a relative one. Inequality in Slovakia did decrease but the improvement was lower than the improvement in the reference country of the Czech Republic. The data for the Czech Republic and Slovakia reveals that, the share of the first quantile with the highest weight was at the same level in 2007 and 2016 in the Czech Republic while in Slovakia it decreased from 10.0 to 9.3. The Gini coefficient dropped in both countries though the RI Malmquist index for Slovakia decreased too. This exemplifies the difference between the Gini and RI measures. On average, the productivity index M which is slightly below 1 (see Table 3) suggests less egalitarian distributions across Europe in 2016 compared to 2007 The efficiency frontier-shift effect defined by the most efficient countries reveals a deteriorating in the best practice itself. Most individual catch–up is observed in Portugal, Romania, Poland, and Ireland contributing heavily to the general improvement (M > 1). Less impressive progress has been made by Germany, Latvia, Netherlands, the United Kingdom, and Switzerland (see Table 7, “Appendix A”).

In order to increase the discrimination power of our measure we sharpened the differences between the weights by reducing the value of li,j of to 0.7 (see Table 4 and for more details Table 8, “Appendix A”).

Table 4 The Malmquist index, its components and Gini change for selected countries (l = 0.7).

The results show the decreasing values of the inequality indicators RI and increasing discrimination in the country’s performance. Looking at the DEA results with respect to the peer group shows the improving performance of the Czech Republic (the country with the highest income share for the first quantile) due to the increasing weight of the first group. The reference set for 2016 for all countries (excluding Norway) consists the Czech Republic only. This illustrates the dependence of results on the change of the coefficient l, describing the decreasing sequence of weights.

4 Conclusions

We developed a new measure of income inequality providing a deeper insight into the distribution of income than aggregated Gini coefficient. The performance income distribution indicator has been demonstrated to favour more Rawlsian distributions. Projections computed from the model provide policy recommendations related to redistributional adjustments in particular income groups. The intertemporal analysis revealed a slight deterioration in income distribution towards a less egalitarian structure. This finding is, on average, confirmed by the increasing value of the Gini coefficient. The results show that a more clear-cut discrimination could be achieved by selecting the more restrictive set of weights. In general, the proposed relative indicator with its merits of providing deeper insights into income distribution among individuals, is meant to supplement and extend statistically based aggregated indicators like the Gini or Theil indices with information applicable for policy making. The restricted multipliers approach presents a promising avenue for examining poverty indicators in a similar fashion.