Heavy-tailedness and income distribution
Numerous studies in economics and finance have pointed out that many key variables in these fields, including financial returns, foreign exchange rates, income and wealth, have distributions that exhibit heavy tails as in the case of commonly observed Pareto-type power laws (see, for instance, Embrechts et al. 1997, Ch. 7 in McNeil et al. 2005; Atkinson 2008; Gabaix 2008, 2009; Ibragimov 2009; Milanovic 2005, 2011; Ibragimov et al. 2013, 2015; Ibragimov and Prokhorov 2017, and references therein). Heavy-tailed random variables (r.v.’s) \(X>0\) with distribution that has power tails satisfy
$$\begin{aligned} P\left( {X>x} \right) \sim Cx^{-\zeta }, \end{aligned}$$
(1)
as \(x\rightarrow \infty \), with the tail index \(\zeta >0\) (here and throughout the paper, \(f(x)\sim g(x)\)) means that \(f\left( x \right) =g\left( x \right) (1+o\left( 1 \right) )\)) as \(x\rightarrow \infty \)).
Important examples of power laws (1) are given by Pareto distributions with parameters (tail indices) \(\zeta \) where (1) holds exactly for all values x greater than a certain threshold \(x_m :\)
$$\begin{aligned} P\left( {X>x} \right) \sim Cx^{-\zeta }, x\ge x_m,,C=x_m^{\zeta } , \end{aligned}$$
(2)
and their modifications such as double Pareto families, and Singh–Maddala distributions that were used in a number of works for income distribution modeling (see Cowell and Flachaire 2007; Davidson and Flachaire 2007; Toda 2012, and references therein).
The tail index \(\zeta \) characterizes the heaviness (the rate of decay) of the tails of power law distribution (1). An important property of r.v.’s X satisfying a power law with the tail index \(\zeta \) is that the moments of X are finite if and only if their order is less than \(\zeta : EX^{p}<\infty \) if and only if \(p<\zeta \). Heavy-tailedness (the tail index \(\zeta \)) of the variable X (e.g., income, wealth, risk, financial return or foreign exchange rate) governs the likelihood of observing outliers and large observations and fluctuations in the variable. The smaller values of the tail index \(\zeta \) correspond to a higher degree of heavy-tailedness in X and, thus, to a larger likelihood of observing outliers, extreme observations and large fluctuations in realizations of this variable. In particular, conclusions of tail index parameters being similar for a variable of interest in two or more economies are important because they point out to similarity in likelihoods of occurrence of outliers, extreme observations and large fluctuations in the variable considered. As discussed in the next section, in the case of income distributions, similarity of tail indices for different economies further implies similarity of their implications concerning inequality in the upper tails of distributions of these variables.
Empirical studies of income and wealth indicate that distributions of these variables in developed economies typically satisfy power laws (1) with the tail index \(\zeta \) that varies between 1.5 and 3 for income and is rather stable, perhaps around 1.5, for wealth (see, among others, Atkinson 2004; Atkinson and Piketty 2007; Gabaix 2008, 2009; Atkinson et al. 2011, and references therein). This implies, in particular, that the mean is finite for income and wealth distributions (since \(\zeta >1\)). However, the variance is infinite for wealth (since \(\zeta \approx 1.5<2\)) and may be infinite for income (if \(\zeta \le 2\)). In addition, since their tail indices are smaller than 3, income and wealth distributions have infinite third and higher moments. The problem of infinite variance in income and wealth distributions is important because it may invalidate or make problematic direct applicability of standard inference approaches, including regression analysis, (auto-)correlation based approaches and least squares methods (see the discussion in Ibragimov et al. 2015; Ibragimov and Prokhorov 2017). In a similar fashion, infinite fourth moments for these variables need to be taken into account in regression and other models involving their volatilities or variances, e.g., in the analysis of permanent and transitory components of income variability and their cross-country comparisons (see Gorodnichenko et al. 2010) and the study of autocorrelation properties of financial time series and volatility clustering (see the discussion in Cont 2001; Mikosch and Stariča 2000, and references therein). Finiteness of first moments is important because it points out to optimality of diversification and robustness of a number of economic models for the variables considered (see Ibragimov 2009; Ibragimov et al. 2013, 2015; Ibragimov and Prokhorov 2017). Several recent works in the literature have emphasized robustness as an important aspect in the choice of inequality measures and estimation and inference methods for them (see, among others, Cowell and Flachaire 2007; Davidson and Flachaire 2007; Ibragimov et al. 2017 and references therein). The interest in robust inequality assessment is motivated, in part, by sensitivity of many income inequality measures to changes in different parts of the underlying income distribution, including sensitivity to extremes and outliers.
Heavy tails and upper-tail inequality
The analysis of heavy-tailedness of income distributions is closely related to the study of inequality in their tails (among the rich and super-rich) and top incomes’ dynamics and distribution. The recent years have witnessed a revival of interest in the analysis of the volume and distributions of top incomes and their inequality that are of key importance for a number of reasons, including the enormous gaps between the incomes of the rich and the rest of population, or “the haves and the have-nots,” as well as between the rich and very rich, or “the haves and have-mores,” e.g., between the top 1 percent and the top 0.1 percent of income and wealth distributions (see, among others, the contributions in Davies 2008, and the discussion in Milanovic 2011); the huge magnitude of the total income distributed among a few (super-)rich individuals; implications for overall fairness of distribution of economic resources across individuals in a society and important economic impacts such as those on overall growth and resources, taxation and other redistributive policies as well as on overall inequality (see Atkinson et al. 2011, and references therein). For instance, as discussed in Milanovic (2011), in 2004, the top 0.1% annual after-tax income per capita in UK was 400,000 GBP, the top 1% was 81,000 GBP, and the average British after-tax per capita income was 11,600 GBP per annum. As indicated in Atkinson et al. (2011), the top 1% of the US income distribution in 2007 captures more than a fifth of total income in the country. Similarly, in the world income distribution, the top 10% receive more than two-thirds of the total world income, while the top 5% receive 45% of the income. As discussed in Milanovic (2011), for the World income distribution, the ratio of the incomes of the top 10% and the bottom 10% incomes (the so-called decile ratio) is about eighty to one, and the ratio for the top 5% and the bottom 5% is almost two hundred to one (see also the World Bank’s World Development Indicators database for income shares of different deciles of income distributions in the countries of the World, http://databank.worldbank.org). A related fact on the extreme nature of the World income distribution is that the income of the top 1.75% matches the income of the poorest 77%. Using the data on the top 10% and 1% incomes, the (upper) inequality around the world and its changes over time are also emphasized in the recent book by Piketty (2014).Footnote 1
Guriev and Rachinsky (2009) discuss the dynamics of wealth distribution and inequality in the former USSR and Central and Eastern European countries during the transition period. Guriev and Rachinsky (2009), in particular, point out that the combined wealth of the 26 Russian billionaires in 2004 amounted to 19% of Russian GDP (for comparison, as indicated in the paper, the combined wealth of the 262 US billionaires was only 7% of the US GDP). Using the dataset on incomes of Moscow taxpayers in 2004 that includes the super-rich, Guriev and Rachinsky (2006) find the Gini coefficient of about 63% for resulting income distribution, which is contrasted with the official value of 41% for the Gini coefficient in Russia in 2004 (see also Table 1 for the official values of the Gini coefficient of income inequality in Russia and the discussion in Guriev and Rachinsky 2009). Guriev and Rachinsky (2006) further indicate that the top 10% of individuals get 50% of the total income in the dataset. These and other facts further indicate the extreme nature of income and wealth distributions and inequality in Russia, especially those in the distributions’ upper tails (see also the results and discussion in Sects. 4 and 5).
To illustrate the connection between heavy-tailedness and upper-tail inequality, consider the class of power law Pareto distributions (2), where (1) holds for all values \(x\ge x_m\). The Gini coefficient G of inequality over the whole income or wealth distribution (2) is equal to
$$\begin{aligned} G=G\left( \zeta \right) =1/\left( {2\zeta -1} \right) . \end{aligned}$$
(3)
Power law distribution (1) is approximately Pareto (2) for (large) income or wealth levels x in the upper tails. Thus, taking into account relation (3), the value of the tail index \(\zeta \) in (1) may be regarded as a measure of upper tail inequality (that is, among the rich, with smaller values of the tail index corresponding to larger inequality in the upper tails) that seems to be a useful complement to the Gini coefficient for the whole income or wealth distribution (see also the discussion in Atkinson 2008).Footnote 2 As expected, a higher degree of heavy-tailedness in the underlying (power law (1)) income distribution and the implied greater likelihood of occurrence of outliers and large fluctuations in it (that is, a smaller value of the tail index \(\zeta \)) translates into greater inequality in the tails.
Research objectives and an overview of results
The discussion in the previous sections illustrates the importance of reliable inference on income distributions and their heavy-tailedness properties, in particular, in the context of making conclusions on upper-tail inequality.
Emerging economic markets are likely to be more volatile than their developed counter-parts and subject to more extreme external and internal shocks (see, among others, Gorodnichenko et al. 2010, for the discussion of extreme macroeconomic disturbances accompanying Russia’s transition to a market economy following the collapse of the Soviet Union). The higher degree of volatility suffered by emerging countries leads to the expectation that heavy-tailedness properties and distributions of their key economic and financial variables and indicators may be different from those in developed economies. The recent analysis in Ibragimov et al. (2013) (see also Ibragimov et al. 2015) provides support for the hypothesis that emerging country exchange rates are more pronouncedly heavy-tailed compared to developed country exchange rates. In particular, according to the analysis, in contrast to developed country exchange rates, variances may be infinite for several emerging country exchange rates considered in Ibragimov et al. (2013).
This paper focuses on the robust analysis of heavy-tailedness in income distribution in Russia, with a particular focus on the question whether, similar to the case of emerging country exchange rates, its heavy-tailedness characteristics differ from those of income distributions in developed economies considered in the previous literature. Among other results, using recently proposed robust tail index inference methods, we provide estimates of heavy-tailedness parameters \(\zeta \) for income distribution in Russia and their comparisons with the benchmark values \(\zeta \in (1.5, 3)\) that are well-established for developed economies (see Sect. 4).
Our results point out to important and somewhat surprising similarity between heavy-tailedness properties of income distribution in Russia and those in developed markets. The estimates of the tail index \(\zeta \) for Russian income distribution obtained in this paper are largely in agreement with the benchmark interval \(\zeta \in (1.5, 3)\) for income distributions in developed economies considered in the previous literature. This indicates that the likelihood of observing very large income values and outliers in income distribution in Russia is similar to that in developed countries. It further points out that inequality in the right tail of Russian income distribution (among the rich), as measured by tail indices, appears to be similar to that in developed economies. The above conclusions are in contrast to the case of foreign exchange rates, where, according to the analysis in Ibragimov et al. (2013) discussed above, heavy-tailedness properties in emerging markets, including Russia, are more pronounced compared to developed economies. Furthermore, the estimates obtained in the paper indicate that the tail index of Russian income distribution is greater than 2 and, thus, the distribution has finite variance. The conclusion on the finite variance is especially important since it justifies applicability and validity of the standard OLS regression methods in the analysis of models involving the data on incomes.Footnote 3
The rest of the paper is organized as follows. Sect. 2 discusses inference methods used in the analysis. Sect. 3 reviews the datasets used in the study. Sect. 4 presents and discusses the estimation results on heavy-tailedness of Russian income distribution obtained in the paper. Sect. 5 makes some concluding remarks and reviews suggestions for the further research. Appendix contains the tables and diagrams on the empirical results obtained or used in the paper.