Keywords

1 Introduction

Inequality increased during the last decade of the twentieth century in both developing and developed countries (World Bank 2005a). In the People’s Republic of China, the Russian Federation, India, Mexico, South Africa, and Viet Nam, as in many other developing and transition countries, this increase in inequality was accompanied by spatial and regional disparities in economic activities, and income and social indicators (Kanbur and Venables 2005 ) . This rising trend in overall inequality was also evident in South Asia which had long been characterized by relatively low and stable levels of inequality. Recent evidence shows that there was a sizeable increase in inequality in Bangladesh, India, Nepal , and Sri Lanka in the late 1980s and 1990s (World Bank 2005a). For example, inequality in consumption expenditure in Sri Lanka rose from a Gini coefficient of 0.32 in 1990 to 0.40 in 2009/2010, possibly the sharpest increase in inequality in its recent history, making its distribution more unequal than many of its East Asian neighbors, and on par with Bangladesh and Nepal.Footnote 1

Rising inequality has two components. The first is within the fast-growing modern industrial sector and region. This is to a large extent vertical inequality, driven by asset and skill differences.Footnote 2 The second is between the fast-growing modern industrial sectors and regions, on the one hand, and the traditional agricultural sectors and regions, on the other. What is striking about the latter component of inequality is that two individuals with identical productive characteristics (schooling, skills, training, experience) could face differential returns to their endowments, depending on where they live. These differentials encourage individuals in the low-returns sector or region to move or migrate into the high-returns sector, eventually equalizing returns in both sectors. Cheaper labor in the low-returns regions may also attract capital (firms and entrepreneurs) to move to these areas. But if these differentials persist over time, it suggests that there are barriers (failures) that prevent the traditional sector or lagging region from being absorbed into the modern sector. This argument is well presented in the World Development Report 2009: Reshaping Economic Geography (World Bank 2008) which argues that higher densities, shorter distances, and lower divisions are essential for the development of an economy. Economic geography suggests that this growth will be initially unbalanced and lead to widening disparities. However, spatial transformations that reduce distance and divisions, and calibrate densities of economic growth with densities of poverty, may lead to inclusive growth and lower inequality. Hence, policies to reduce inequality need to identify the sources of spatial disparities—if there are gaps in characteristics, it is important to improve endowments of the households; if barriers exist between lagging and leading regions, these barriers need to be removed.

Although Sri Lanka recorded moderate economic growth during the last decade, the regions that are far away from its economic capital, Colombo, tend to be significantly poorer than the areas closer to Colombo. Though a number of regional development programs have been implemented by successive governments, living standards of the people in the remote areas continue to fall significantly behind their urban counterparts. This suggests that a better understanding of the causes of inequality is essential for effective policy formulation.

Programmes and policies in Sri Lanka’s current development strategy may be divided into three categories. The first includes the improvement of facilities in schools, establishment of new universities in poor areas, and the spread of information technology facilities (provision of an IT center to each district secretariat), all of which pertain to improving household and community human capital endowments. The second category includes the improvement of roads (primarily rural roads), irrigation systems (small-scale tanks), and promotion of rural industries (primarily via small-, large-, and medium-scale enterprises). These could be considered as efforts to improve location-specific returns , but keep the focus at the community or regional level. The last set of programs which is expected to have an impact on reducing disparities in location-specific returns focuses primarily on transport infrastructure, such as the Colombo–Kandy expressway and the Southern Highway, which improves access to and mobility between regions, and is likely to reduce regional gaps in location-specific returns.

This paper uses a standard methodology and its recent expansion to examine whether well-being gaps between urban (richer regions) and rural (poorer regions) areas are the result of (i) regional/spatial differences in household characteristics or (ii) differences in location-specific returns to these characteristics. The purpose of the paper is to examine the role of location-specific returns to households’ productive characteristics, and the extent to which they contribute to the gaps in consumption across the distribution of consumption.

2 Analytical Framework

This study employs a conventional decomposition method of measuring discrimination and its quantile regression extension to analyze spatial inequality in Sri Lanka. In this section, we (1) explain the decomposition as it applies to average gaps between locations, (2) describe quantile regression, and (3) present the extension of the decomposition to quantile regression estimates.

Wan (2007) discusses that inequality between urban and rural areas cannot be totally explained by the geographical division between the urban and rural areas of a particular economy as assumed by the conventional inequality decomposition techniques. Many other factors such as differences in human capital and differences in demographic factors also affect the determination of urban and rural inequalities. But the traditional inequality decomposition techniques do not include control variables for these factors. The decomposition of inequality using regression provides a neat solution for this (Wan 2007). The regression-based decomposition allows for the inclusion of control variables as well as other socioeconomic determinants of inequality rather than the geographical location (Gunatilaka and Chotikapanich 2009). Furthermore, regression-based decomposition analysis enables identification and quantification of the determinants of inequality (Wan 2002) which are important and of interest to economists and policy makers.

The seminal works of Blinder (1973) and Oaxaca (1973) provide the basic roots for regression-based decomposition techniques. The Blinder–Oaxaca decomposition is extensively applied in decomposition analysis, and several extensions have been developed recently (Fortin et al. 2011 ) . The intuition behind the conventional method of measuring discrimination, developed independently by Blinder (1973) and Oaxaca (1973), is that in the absence of discrimination, the estimated effects of individuals’ observed characteristics on their wages are identical for groups of individuals. Similarly, in the absence of location-specific returns , the estimated effects of a household’s observed characteristics on some measure of household well-being (such as income or consumption) are identical for each location. The estimated income gap can be decomposed as follows:

$$ {\text{In}}y^{{*{\text{urban}}}} - {\text{In}}y^{{*{\text{rural}}}} = X^{{*{\text{rural}}}} (\beta^{\text{urban}} - \beta^{\text{rural}} ) + (X^{{*{\text{urban}}}} - X^{{*{\text{rural}}}} )\beta^{\text{urban}} $$
(9.1)

where lny is a measure of household income or consumption, X is a vector of income-generating characteristics for the ith household, and β is a vector of coefficients. The asterisks denote mean or average. The first term on the right-hand side is the portion due to differences in coefficients\( (\beta^{\text{urban}} - \beta^{\text{rural}} ) \), evaluated at the same set of average income-generating characteristics \( X^{{ * {\text{rural}}}} \), in this case the rural (poorer region). The second term is the portion of the gap attributed to differences in average earnings-generating characteristics \( (X^{{ * {\text{urban}}}} - X^{{ * {\text{rural}}}} ) \), weighted by the urban (richer region) returns structure.Footnote 3

If there were no location-specific effects, \( \beta^{\text{urban}} =\upbeta^{\text{rural}} \), i.e., endowments in both locations yield similar returns, the first term would be zero, and any regional disparities would be completely explained by differences in characteristics of households in the two locations, \( X^{{ * {\text{urban}}}} - X^{{ * {\text{rural}}}} \). With no disparity in returns, migration would be low or zero.

The Blinder–Oaxaca decomposition is based on ordinary least squares (OLS) regression which assumes that the effect of the regressors does not vary along the conditional distribution of the dependent variable. For example, the effect of schooling on household welfare is assumed to be the same at the bottom of the welfare distribution as it is at the top. If, however, these effects do vary along the distribution of household welfare, quantile regressions , which yield models for different percentiles of the distribution, provide a parsimonious way of describing the whole distribution (Martins and Pereira 2004). The θth quantile of \( y_{i} \) conditional on \( \varvec{X}_{i} \) is given by

$$ Q_{\theta } (y_{i} |\varvec{X}_{\varvec{i}} ) = \varvec{X}_{\varvec{i}} \beta_{\theta } ,\theta \in (0,1)\quad \left( {i = 1, \ldots ,I} \right) $$
(9.2)

where the coefficient \( \beta_{\theta } \) is the slope of the quantile line, giving the effect of changes in X on the θth conditional quantile of y. As shown by Koenker and Basset (1978 ) , the quantile regression estimator of \( \beta_{\theta } \) solves the following minimization problem.

$$ \beta_{\theta } = \,\arg \hbox{min} \left[ {\sum\limits_{{i:y_{i} \ge X_{i} \beta }} {\theta |y_{i} } - X_{i} \beta |+ \sum\limits_{{i:y_{i} \ge X_{i} \beta }} {1 - \theta |y_{i} - } X_{i} \beta |} \right] $$
(9.3)

It is easily seen that for the median [θ = 0.5], the quantile regression minimizes the sum of absolute deviations.

Machado and Mata (2005) combine quantile regression with a bootstrap approach and derive the following decomposition, which is analogous to the Blinder–Oaxaca decomposition in Eq. 9.1:

$$ \begin{aligned} Q_{\theta } \left( {y^{\text{urban}} } \right) - Q_{\theta } \left( {y^{\text{rural}} } \right) & = \left[ {Q_{\theta } \left( {X_{i}^{\text{rural}} \beta^{\text{urban}} } \right) - Q_{\theta } \left( {X_{i}^{\text{rural}} \beta^{\text{rural}} } \right)} \right] \\ & \quad + [Q_{\theta } (X_{i}^{{u{\text{rban}}}} \beta^{\text{urban}} - Q_{\theta } (X_{i}^{\text{rural}} \beta^{\text{urban}} )] + {\text{residual}} \\ \end{aligned} $$
(9.4)

The first term on the right-hand side is the contribution of the coefficients (returns effect), and the second term is the contribution of the covariates (covariate effect) to the difference between the θth quantile of the urban (rich region) distribution of consumption and the θth quantile of the rural (poor region) consumption distribution. The residual term comprises the simulation errors which disappear with more simulations, the sampling errors which disappear with more observations, and the specification error induced by estimating linear quantile regression (Melly 2005). It is assumed that the linear quantile model is correctly specified.

The Machado-Mata (2005) decomposition is interpreted similar to the Blinder-Oaxaca decomposition. Since the decomposition can be conducted at any percentile of the consumption distribution, it reveals whether the relative importance of covariates and coefficients varies along the distribution.

3 Data and Variables

The data used in this study are from the Household Income and Expenditure Surveys (HIES) for 2006/2007 and 2009/2010. These national surveys, however, do not cover the Northern and Eastern Provinces of Sri Lanka .Footnote 4 The primary sampling unit is the household, and the sample size ranges from 17,037 households in 2006/2007 to 19,958 households in 2009/2010.Footnote 5 Both surveys comprise 12 monthly rounds that capture seasonal variations. The sample design allows for subgroup analysis at the province and district levels. In the urban–rural analysis, the estate sector was subsumed under the rural sector for the purpose of decomposition. These data sets have been used in poverty analysis and exhibit no major problems in terms of inconsistency and inaccuracy (World Bank 2005b, 2007a, b).

The primary measure of well-being in this paper is real household consumption expenditure per capita. Consumption data are used in preference to income data for several reasons. Consumption is a direct measure of achieving or fulfilling basic needs and a better measure of current welfare, incorporating consumption smoothing by households within a given period of time and over the life cycle (Duclos and Araar 2006; Deaton 1997). Consumption data are more easily observable than income data. The latter are vulnerable to underreporting due to the innate features of income reporting, that is, fewer formal income receivers, seasonal and unrecorded income sources, and the diversified nature of earnings (Heltberg 2003 ) . For these reasons, consumption is typically used in the analysis of poverty and inequality.

The measure of consumption expenditure includes over 400 items of household consumption. Food consumption is reported in calendar style, for a week, while non-food consumption is reported for the past month, 6 or 12 months. Consumption on all items is converted to monthly consumption. Reported values are of the amount consumed, which includes goods and services purchased as well as home-produced goods and services. Although the latter comprises a substantial proportion of household consumption, and problems of using imputed values are well known (Deaton 1997), the values used are consistent over time, unlike the problems raised by the use of different (and possibly inconsistent) values in transition countries (Benjamin et al . 2005; Ravallion 2005). The rental value of owned housing is also imputed in the data set. The household is defined as ‘one or more persons living together and having common arrangements for food and other essentials of living’ (Department of Census and Statistics 1987).

Spatial and temporal price indices are computed using district-level nominal poverty lines published by the DCSFootnote 6 and are constructed at the district level. These are constructed for each data set, allowing spatial prices to vary (as they do) over time, and are later adjusted with regard to temporal variations.

Two categorizations of spatial location are used in the decompositions derived in this paper. The first is the conventional urban and rural (the estate sector is subsumed into the rural) distinction. The second categorization classifies the economically better-off Western Province (WP), which has all the characteristics of a leading region, as the richer region, and includes all other regions in the poorer region category. While this classification ignores the variation among other regions, it is adopted because the methodology requires a binary classification. The gap in consumption between the WP and the other regions is sufficiently large to justify such a classification.

Regressors used in the model (Table 9.1) include (1) factors that influence the household’s earning ability, such as the number of employed members in the household, the sector of employment (whether any household members were engaged in agricultural work), the household head’s age, and household human capital (the highest level of education attained by any member of the household)Footnote 7; (2) demographic features of the household that influence the level of consumption per capita such as the number of dependents (number of household members aged below 15 years and above 65 years); (3) other demographic features such as the household head’s gender and ethnicity; and (4) location variables to control for regional variation within urban and rural sectors (only in the urban and rural specification).

Table 9.1 List of variables

A clear difference in consumption expenditure between urban and rural sectors as well as between the WP and the rest of Sri Lanka (OP) is indicated by the mean calculations of consumption expenditure per capita as shown in Tables 9.2 and 9.3. Rural consumption was around 70 % of the urban consumption in both the survey years. Figure 9.1 illustrates the difference in log real expenditure per capita (LREPC) distributions with kernel density curves for urban, rural, Western, and other provinces for 2006/2007 and 2009/2010. Urban and WP consumption densities lie to the right of the rural and other province densities (respectively), indicating higher consumption for urban and WP populations throughout the distribution. The urban–rural consumption gap is greater in 2006/2007 relative to 2009/2010, suggesting that recent regional development programs of the country are delivering the expected results. The consumption expenditure gap is wider in the upper consumption percentiles. This result is comparable to that found in Viet Nam using a similar analysis (Nguyen et al . 2006).

Table 9.2 Descriptive statistics, 2009/2010
Table 9.3 Descriptive statistics, 2006/2007
Fig. 9.1
figure 1

Kernel density curves

Descriptive statistics (Tables 9.2 and 9.3) give an indication of the differences in endowments between urban and rural areas and the WP and other provinces. Urban areas and the WP have more educated households, greater ethnic diversity, as is to be expected, and a much smaller proportion of households employed in agricultural work.

Table 9.3 indicates that educational endowments beyond junior secondary level have increased in the population as a whole between 2006/2007 and 2009/2010.

The percentage of those employed in the agricultural sector in rural and urban areas continued to decline over the period, while changes (decline) in the number of dependents and household size were marginal. The age and gender of household heads did not change significantly over the period.

4 Discussion of Results and Conclusions

In this section, we first present the OLS and quantile regression results on which the decompositions are based. We then present the results of decomposing the consumption gap between urban and rural Sri Lanka , and between the WP and other provinces in two segments: the consumption gap (difference) due to the differences in household endowments (characteristics) and the difference due to location (area of residence).Footnote 8 These decompositions are derived by applying the methodology described in Sect. 9.2.

4.1 Regression Results

A detailed discussion of OLS and quantile regression estimates on which the decompositions were based is presented in Kumara (2009, 2012 ) . Footnote 9 A brief summary is given here (and in table form in appendix Tables 9.5, 9.6, 9.7, and 9.8). OLS and QR estimates suggest that household consumption increases monotonically with the level of education. An additional finding of the QR analysis is that the impact of education on consumption is significantly higher in the upper consumption quantiles for all education levels but the junior secondary level. Impact of education on consumption expenditure is relatively high for the lower consumption groups.

An additional employed member in the family increases the household consumption level to a greater extent in the urban sector (WP) than in the rural (OP) sector, while engagement in agricultural work reduces household consumption irrespective of the area of living. This is consistent with previous findings (World Bank 2004, 2005b).Footnote 10 A male-headed household consumes more compared to a female-headed household, and the result is stronger in the urban sector and in the WP.

The negative relationship between the number of household members engaged in agricultural work and household consumption persists in the QR analysis, and the impact is higher in the upper expenditure quantiles. QR analysis indicates that additional employment in the family generates more positive impacts on consumption in the upper expenditure quantiles, whereas the effect is weaker in the lower quantiles. Agricultural employment reduces consumption of upper quantile households more than the lower quantiles irrespective of the area of living. This finding holds for both the survey years.

The consumption advantage of male-headed households becomes weaker in the upper expenditure quantiles and in the urban sector in both the periods.Footnote 11 This suggests that the gender of the household head is irrelevant for the upper consumption groups and for the urban sector. Most formal jobs that pay equal wages for both males and females are concentrated in urban areas, whereas most informal jobs that discriminate against females are concentrated in rural areas. This may explain the equal benefits in urban areas and the differences in rural areas. An additional member in the household generates negative results for all the quantiles, and the impact is significantly higher in upper expenditure groups and urban areas.

As suggested by the OLS estimation, Muslim and other ethnic groups have a higher consumption expenditure compared to their Sinhala counterparts in all the consumption quantiles. Furthermore, the percentage increase is higher in the upper consumption groups. Analogous to the OLS results, QR estimates also show that there is a negative correlation between the household consumption expenditure and households headed by persons of Tamil ethnic origin, as compared to households headed by Sinhala counterparts.

4.2 Inequality Decomposition

In this section, mean regression (Oaxaca 1973; Blinder 1973) and quantile regression (Machado and Mata 2005) methods are applied in decomposing consumption inequalities in Sri Lanka into two components: (1) a component that is due to the differences in the distribution of household endowments (covariate effect) between urban and rural sectors in Sri Lanka and (2) another component that is due to the location-specific returns (returns effect) to these covariates.

The mean regression decomposition method considers the means of two distributions. Adding value to the analysis, quantile regression decomposes the gap according to the differences at each quantile. In decomposing urban–rural (Western–other province) inequality, it compares rural household LREPC with a simulated (counterfactual) LREPC derived from rural characteristics (endowments) and urban (coefficients) returns. This estimates the difference in consumption between an urban household and a rural household that are identically endowed, where the only difference between them is the location. In other words, if the level of average urban (WP) household endowments was suddenly replaced by the level of endowments of average rural (other provinces) households, how large would the spatial consumption gap (i.e., the returns effect) be? If rural (OP) households were to move instantaneously (and without cost) to urban (WP) areas, what would the consumption gap between them and identical households in rural (OP) areas be? This component captures location-specific effects on consumption . With markets working perfectly, and no barriers to mobility (and a well-specified model), this gap would in the long term be zero. The entirety of the consumption gap would then be due to differences in characteristics between urban and rural (Western and other) households, captured by the covariates effect.

Both mean regression and quantile regression decomposition results are presented in Table 9.4 and Fig. 9.2. The second column of the table represents the decomposition results based on the mean regression analysis, and the 3rd to 7th columns represent the 5th, 25th, 50th, 75th, and 95th quantiles of the quantile regression analysis. The first panel illustrates the total consumption gap between the urban and rural sectors in Sri Lanka, and the second panel shows the decomposition results.

Table 9.4 Total gap and unexplained gap (returns effect) 2009/2010 and 2006/2007
Fig. 9.2
figure 2

Inequality decomposition: returns effect and endowment effect

Mean regression analysis suggests that consumption inequality in Sri Lanka slightly decreased during the study period (second column in Table 9.4). This may be due to the recent regional development projects carried in Sri Lanka, discussed in Sect. 9.1.Footnote 12

The urban–rural and WP–OP gaps that remain after controlling for household characteristics (evaluating urban consumption using rural household endowments) are positive. This signifies that even after adjusting urban consumption for rural endowment (characteristics), urban consumption is higher than rural consumption. This component is defined as the location-specific returns to household endowments. The major portion of the mean urban–rural consumption gap in Sri Lanka is explained by the returns effect, and its dominance is the same in both the surveys.

The dominance of the returns effect is observed in the quantile regression decomposition for all quantiles. For any quantile, for both survey years, the urban–rural (WO) consumption gap was recorded as positive, implying that the urban–rural (WO) gap favors the urban sector (WP) even after evaluating urban (WP) consumption with rural (OP) characteristics. This suggests that even though the consumption inequalities declined between the two survey years in Sri Lanka, the returns effect is still dominant.

Three major policy conclusions can be drawn from the quantile regression decomposition analysis. First, the urban–rural (total or unadjusted) consumption gap is smaller in the lower consumption quantiles and significantly higher in the upper quantiles. The difference in consumption expenditure gaps between the 95th percentile and 5th percentile is around 30 % in both the survey periods. This implies that the urban (WP) rich are much better off than the rural (other province) rich in terms of consumption expenditure. Second, the returns effect dominates throughout the distribution of consumption expenditure. The unexplained percentage of consumption gap is always more than 70 % of the total gap for all quantiles. Third, adding to the second conclusion, the returns effect dominates throughout the expenditure distribution, but tends to decline as it moves toward the upper consumption quantiles. This means that location-specific effects account for more than 80 % of the urban–rural total gap at the 5th quantile compared to less than 70 % of the urban–rural total gap in the 95th quantile. On the other hand, the importance of the covariate effect in explaining the urban–rural gap increases at the upper end of the expenditure quantiles. These findings are different from those of Nguyen et al. (2006) using the same analytical method for Viet Nam. They found that the covariate effect dominates in the lower quantiles, whereas the returns effect dominates in the upper quantiles.

A temporal analysis of the urban–rural inequality decomposition based on quantile regression analysis also finds that urban–rural inequality decreased over the study period, and inequality between the lower end and the upper end of the consumption distribution reduced relatively in 2009/2010. Furthermore, the dominance of the returns effect also declined in urban–rural analysis, while there was not much change in the WP and other process analysis.

The dominance of the returns effect in the lower quantiles of the consumption expenditure distribution suggests that returns to household endowments matter more than household characteristics to poor people in Sri Lanka . Sri Lanka is recognized as a country that has achieved extraordinary success in health and education indicators despite a lower level of income per capita (World Bank 2005b). Sri Lanka also records a relatively high rank in the UNDP ‘Human Development Index.’ Gender seems to matter less in determining household characteristics (e.g., education and health) in Sri Lanka compared to many other developing countries. This suggests that the distribution of household characteristics is relatively better in Sri Lanka as compared to Viet Nam. The lower rewards to rural households can be linked to the poor rural markets. Most formal employment is concentrated in urban areas, and rural areas are separated from the urban centers due to the poor road network, leading to poorer market access for the rural population. The communication barriers between urban (WP) and rural (OP) areas may also contribute to lower rewards to the characteristics in the rural sector. In addition, if characteristics for which controls were not included due to lack of data, such as the quality of education, are correlated with location (for instance, richer areas have better quality education), then these would add to the location-specific effects.

The foregoing analysis suggests that the existence of barriers such as remoteness and poor access to markets prevents lagging regions from being absorbed into the modern sector or growing region. These barriers play a larger role in perpetuating spatial inequality , especially for the poor, than do disparities in household endowments between regions and sectors.

Policies that are indicated for further reduction in the urban–rural gap and higher poverty levels in the rural areas include (1) connecting lagging regions to markets in the growing regions, i.e., improvement of roads and transportation, electricity and communication infrastructure , and improving the investment climate in the rural areas; (2) improving the quality of schooling via better training and resources, especially in remote areas; and (3) removing barriers to labor mobility such as regulations in labor markets and land markets.