Introduction

A topic of long-standing interest among social scientists is the degree to which an individual’s status in society is determined by the position of one’s parents. In egalitarian societies, where you are in the social and economic status ladder should not be principally determined by your parent’s income, educational level or occupation (Roemer 1998; Bowles and Gintis 2002). Social mobility research has been concerned with intergenerational occupational mobility, but much of this research has concentrated on advanced market economies (Erikson and Goldhorpe 1992; Miles1999). However, this question is of crucial importance in the developing world, and especially in emerging economies which have undergone modernisation as they have opened up to the world economy in recent decades. Emerging literature has investigated intergenerational occupational mobility in developing countries (Wu and Treiman 2007; Krishna 2013a, b; Azam 2015). Much of this literature has documented the patterns of social mobility for the general population as well as for certain social groups (Motiram and Singh 2012; Reddy 2015)Footnote 1. Less understood are what Chetty et al. (2014) call “the geography of intergenerational mobility”—do we see spatial variations in intergenerational occupational mobility, and if so, what are the spatial determinants of social mobility in the developing country context?

We examine the spatial determinants of intergenerational occupational mobility across villages in India. The federal nature of India’s polity, where local governments have an important role to play in the provision of local public goods, provides an ideal setting to examine the role of village-level factors in explaining social mobility. This is also made possible by the systematic differences in agro-ecological conditions, physical infrastructure, human capital, social structure, economic endowments and the remoteness of villages from centres of economic activity. We use a nationally representative data set—the Indian Human Development Survey II of 2011–2012, which provides detailed information on parents and adult age children as well as range of individual and village characteristics of the location of the adult child. Looking at father–son occupational mobility, we find clear effects of village-level factors in explaining intergenerational occupational mobility. The remoteness of the village, its agro-ecological conditions, infrastructural public goods and its economic prosperity serve as important predictors of whether the son is less likely to follow his parent’s occupation in the village.

In the next section, we provide a brief review of the literature. We then discuss measures of social mobility and describe how we have categorised the occupations in measuring occupational mobility. We then present the empirical strategy, followed by the descriptives and the results.

Related literature

A long-standing literature has looked at the spatial determinants of rural poverty in India. In an important contribution to the literature, Datt and Ravallion (1998) link the changes in rural poverty (measured alternately by the head-count ratio, the poverty gap and squared poverty gap) in a particular state to initial conditions that existed in the state in 1960 (the year their analysis began), growth in farm output per acre, growth in per capita non-farm output, changes in the cost of living for agricultural labourers and per capita state development expenditure. They show that cross-state variations in trend rate of growth of average farm yields were important in explaining cross-state variations in the trend rates of rural poverty reduction.

Palmer-Jones and Sen (2003) have highlighted the role of agro-ecological factors in mediating the relationship between agricultural growth and rural poverty; they argued that some Indian states had relatively homogeneous agro-ecological properties that were favourable to agricultural growth. These states, mostly in north-west India, had appropriate supportive policies (in terms of large-scale irrigation and access to the new Borlaug seed-fertiliser technology introduced in the mid-1960s), which largely accounts for their better performance in both agricultural growth and poverty reduction, and hence the associations found in state-level analyses. Other states had diverse agro-ecological conditions, which resulted in poorer average performance, despite impressive performance in some of their regions. Thus, while states which experienced rapid agricultural growth consequently achieved poverty reduction, Palmer-Jones and Sen (2003) show that the success of these states is due to the presence of a high proportion of favourable agro-ecological conditions; and where these conditions are not present, neither poverty reduction nor agricultural growth has been achieved to the same degree.

Krishna and Bajpai (2011) show that there has been a clear spatial pattern in income growth in rural India in 1993–2005. While inflation adjusted per capita incomes grew between 1993 and 2004 in villages located within 5 kms of the nearest town, it decreased in villages situated more than 5 kms of the nearest town (and where more than 80% of India’s rural population reside). Iversen et al. (2014) find that Scheduled Caste households in villages where they are the majority have higher incomes and faster poverty reduction than Scheduled Castes living in upper caste dominated villages.

Together, these studies show the importance of agro-ecology, remoteness of location and social structure in explaining spatial patterns in India’s poverty reduction and income mobility. No such study thus far has examined the spatial determinants of intergenerational occupational mobility. Our study aims to fill this gap.

Measuring social mobility

Mobility is the lack of association between some characteristics of the parent with an equivalent characteristic of the child. This could be income, occupation or education. Economists have generally focused on income in the developed country context. However, such a practice of using income as the relevant characteristic of the parent and child in understanding mobility has come under some criticism due to the lack of stability in parent–child associations (Chetty et al. 2014). Income as a measure of mobility is particularly problematic in the developing country context, where income data are sparse and it is difficult to accurately determine the permanent income of the parent or child. The use of occupations as a measure of mobility is common in the sociology literature. In the context of a low-income country setting, where occupation is a strong predictor of income, occupation-based measures of social mobility are more appropriate. We focus on the association between the occupation of the father and son to measure social mobility.

The two common measures of social mobility are the intergenerational regression coefficient (IGRC) and the intergenerational coefficient (IGC). To obtain the IGRC, intergenerational mobility in occupation, one should estimate the following equation:

$${Y}_{1}={\beta }_{0}+{\beta }_{1}{Y}_{0}+{u}_{i}$$
(1)

where β1 is IGRC, Y0 is occupational rank of the father, and Y1 is the corresponding category for offspring.

The IGC is given by:

$$\rho ={\beta }_{1}\left(\frac{{\sigma }_{1}}{{\sigma }_{0}}\right)$$
(2)

where σ1 and σ0 are standard deviations (dispersions) of occupational categories in the child and parent generation, respectively. The social mobility measures in (1 and 2) overlap if the intra-generational dispersions of occupations are identical for the parent and offspring generation, which is unlikely. Further, a cross-sectional rise in occupational inequality from the parent to the child generation will translate into lower social mobility while a relatively more compressed distribution of occupations in the child generation results in a higher social mobility estimate (Iversen et al. 2017).

Data

The data obtained from the Indian Human Development Survey (2011–2012) are a large-scale household survey that covers most of the territory of India. It was conducted by NCAER on behalf of the University of Maryland. A unique feature of the survey was that the household-level questionnaire featured questions about the parent’s occupation and education of the heads of households. Another unique feature of the data was that a village-level questionnaire was administered to 1420 villages in 384 districts, which asked detailed questions about the physical infrastructure, agro-ecological properties, economic conditions and social structure of the village. One limitation of the survey is that village-level GPS coordinates are not released to researchers as it may compromise the confidentiality of information about the village. This implies that we know in which district the village is located, but not its exact location in the village. Furthermore, as the survey was not meant to obtain representative data at the district or state level, aggregating the data to district or state level to obtain spatial maps of social mobility and village characteristics is not possible.

Categorising occupations

To estimate the IGRC and IGC, we need a method of categorising occupations such that they can be rank ordered. The IHDS surveys ask heads of households about the main occupation of their fathers (or fathers of husbands, if the head of household is female). Since we focus on father–son occupational mobility, we restrict our sample to male-headed households, with the head aged 20 years and above and who is not retired or unfit for workFootnote 2. The occupational codes are provided at the two-digit level (as detailed in Appendix). In their analysis of occupational mobility based on IHDS I data, Motiram and Singh (2012) use the Indian National Classification of Occupations (NCO 2004), which draws on ILO’s occupational classifications (ISCO88 and its antecedents) with adjustments considered appropriate for the Indian context (ibid.). A key feature of ISCO88 is the use of skill requirements as the main principle guiding occupational rank (e.g. Ganzeboom and Treiman 1996). In the Indian context, caste makes the translation of skill requirements into occupational status more intricate, for example, regardless of the skill requirements of their traditional, caste-based occupations, Shudra or Scheduled Caste individuals are likely to have low social status. Following Iversen et al. (2017), we attempt to adjust for this and other relevant empirical facts when converting the IHDS categories into an occupational ranking (Table 1). In our analysis, we use the following six occupational categories (with IHDS occupation codes in brackets).

Table 1 Occupational categories and codes

Higher values of the occupational category variable are associated with higher standing on the social status and plausibly on the earnings ladder. While categories 6 and 5 are quite straightforward, the placement of farmers in category 4 is less clear-cut given the substantive heterogeneity among India’s cultivatorsFootnote 3. For the main questions that we address and the tables that follow, this simplification is not a major concern.

To address caste and occupational status, we distinguish between categories 2 and 3, capturing low and higher status for a spectrum of vocational and other skills, as done by Iversen et al. (2017). The idea here is to distinguish between occupations that are require skill but have low status because of caste connotation and those that are not; new, modern jobs and vocations form a subset of the latter. Examples of low status vocational occupations (category 2) are blacksmiths and shoemakers, higher status occupations include tailors while modern vocational occupations include machinery/electrical fitters, broadcasting station operators and plumbers. Finally, given our focus on sharp ascents, we examine whether the lower end layer within category 6, which we interpret to include nurses (occ code 8) and teachers (occ code 15), accounts for a substantive fraction of the entrants into this topmost categoryFootnote 4.

To obtain the occupation codes of sons, we first ascertain whether the head of household is a farmer. For those individuals who are not farmers, we use the occupational codes provided in WS4 (for those engaged in wage or salaried work) and in NFIB (for those who have a primary non-farm business). Clearly, any occupational classification includes an element of choice and arbitrariness. Employing a different classification schema did not, however, produce any major changes in the results reported in the following sections.

Empirical strategy

We follow two approaches to assess the spatial correlates of occupational mobility.

The first approach starts off with the workhorse of empirical analysis of intergenerational occupational mobility, as in Eq. (1) earlier:

$${Y}_{1}={\beta }_{0}+{\beta }_{1}{Y}_{0}+{u}_{i}$$
(3)

where β1 is the intergenerational regression coefficient (IGRC): Y0 is occupational rank of the father and Y1 is the corresponding category for offspring. To see the role of village characteristics on explaining variations in the IGRC spatially, we augment Eq. (3) with the following specification:

$${Y}_{1}={\beta }_{0}+{\beta }_{1}{Y}_{0}+ {\beta }_{2}{Y}_{0}*X+{\beta }_{3}Z+ {u}_{i}$$
(4)

where X is a vector of village covariates, X1Xn, and Z is a vector of controls.

For a given village characteristic Xi, we can assess whether Xi positively or negatively affects occupational mobility by looking at the sign of β2. We estimate Eqs. (1 and 2) by employing linear probability methods (ordered probit estimates were broadly similar).

Our second approach is to use a multinomial logit to capture individual’s constrained choice of one occupation over other occupations, which is the standard approach to modelling occupational choice in the labour economics literature (Abowd and Killingsworth 1984, Constant and Zimmerman 2003). Let us suppose s is the occupational choice (occupational type of the son in our case) variable, which takes values of 0, 1, …, J for J + 1 outcomes. The model for determination of s is specified as

$$ P_{ij} = \Pr [s_{i} = j] = \frac{{\exp (X_{i} \beta_{j} )}}{{1 + \sum\limits_{j = 1}^{J} {\exp (X_{i} \beta_{j} )} }} $$
(5)

where i indexes the individual and j indexes the choice or outcome.

We are specifically interested in upward ascents, and we will examine the role of village covariates in explaining probability of the son’s mobility to other occupations, where the father is an agricultural or other labourer (the lowest ranked occupation). The advantage of the second approach is that we do not need to make an assumption of rank ordering of occupations, which is the case with the first approach.

Our set of village covariates is derived from the previous literature on social mobility as well as the literature on the spatial determinants of rural poverty and income mobility in India. The literature on social mobility has identified the role of segregation in reducing social mobility by reducing exposure to successful peers and role models, decreasing funding for local public goods such as schools and constraining access to nearby jobs (Wilson 1987, Massey and Denton 1993, Cutler and Glaeser 1997). Our segmentation variable that is appropriate in the Indian context is whether social groups in the village live in separate hamlets. The presence of good quality schools in the area has also been found to be an important correlate of social mobility (Chetty et al. 2014). We do not have data on the quality of schools, and in the Indian context, where many villages lack middle and secondary schools, whether a village has a middle or secondary school is an important marker for the possibility of human capital formation necessary for social mobility in the village. The presence of public goods in the village such as access to electricity and a permanent all-weather road is an important factor in allowing for income diversification possibilities in the village (Binswanger et al. 1989). Agro-ecological and climactic factors such as the vulnerability of the village to droughts or flooding limit the possibilities of rural income growth and limits the variety of jobs in the service sector that may be supported by the village economy (Palmer-Jones and Sen 2004). Among other geographical factors, the remoteness of the village from major towns can hinder the possibility of social mobility in the village as villagers living closer to the towns and cities have opportunities to engage in higher value activities that serve the needs of the residents of towns and cities, which their counterparts in more remote villages are not able to do (Krishna and Bajpai 2011). Finally, the economic prosperity of the village is an important predictor of the range of occupations that a village economy may support. More prosperous villages, whether the prosperity is due to more productive farming or a flourishing non-agricultural sector, have the incomes to support higher-order occupations in the service and manufacturing sectors.

Our set of controls are drawn from the existing literature on social mobility in India. Caste is an important marker of the possibility of social mobility in rural India, with individuals in socially disadvantaged social groups such as the Scheduled Castes (SC), Scheduled Tribes (ST), Other Backward Classes (OBC) exhibiting lower rates of social mobility than forward castes (Iversen et al. 2016). More educated fathers are expected to have sons who are more likely to move up the occupational ladder. Finally, we also add state fixed effects to control for unobserved state-level factors that may be correlated with social mobility.

Descriptives

Occupational mobility matrices

We begin by providing occupational mobility matrices, first for the combined rural and urban sample, then separately for the rural and urban samples (Table 2). We note that the diagonal terms dominate the off-diagonal terms, which suggests that sons in India are likely to follow the occupations of their fathers. For example, for the all-India sample, 58.6% of the sons of agricultural and other labourers were also in the same occupational category. Such dominance is not a verdict on India per se, since it tells us little about how India compares with other countries. Greater upward occupational mobility would be evident if the off-diagonal elements on the right-hand side of the diagonals dominate the off-diagonal elements on the left-hand side. We do not find evidence of such mobility, independent of rural/urban location and social group.

Table 2 Occupational mobility matrices, all India, rural and urban samples (percentages), IHDS II

The matrices point to higher occupational mobility among forward castes than among Other Backward Classes (OBC) and Scheduled Castes (SC) and Scheduled Tribes (STs) (Table 3). Among upper castes, we observe sharp ascents for 24.7% of the sons of agricultural and other labourers who enter the highest two occupational categories–clerical and other workers and professionals. In contrast, in the case of OBC, SC and ST individuals, the respective percentages are 15.7, 10.6 and 9.3%. Also, striking and consistent with Motiram and Singh (2012), we find a much higher prevalence of sharp descents among SC and ST sons.

Table 3 Occupational mobility matrices, by social group (percentages), IHDS II

Intergenerational regression coefficient and intergenerational coefficient by district

We now provide estimates of the IGRC and IGC, where we aggregate the village-level observations of the districtFootnote 5. It is important to note here that the villages in the IHDS are not representative of the district. So, the estimates provided here are not “true” measures of intergenerational occupational mobility at the district level. Notwithstanding this, we find clear spatial differences in social mobility across India, as captured by the IGRC and IGC (Fig. 1). For example, Narmada district has an IGRC of 0.001, suggesting that the son’s occupation is very weakly correlated with the father’s occupation. In contrast, we have the district of Hissar where the IGRC is 0.84, suggesting very high persistence of parent–son occupation pairs.

Fig. 1
figure 1

Source IHDS 2011–2012

Estimates of IGRC and IGC across districts in India.

Village covariates

We next provide a sense of how different villages are in their physical, economic and social characteristics in our sample. In Fig. 2, we plot the proportion of villages where jatis live separately, where middle and secondary schools are present, and where there is a pucca (permanent all weather) road. In 63% of villages, jatis live separately; in 73% of villages, a middle school is present, in 33% of villages, a secondary school is present, and in 86% of villages, a pucca road is present. Figure 3 shows that while in 22% of the villages, 100% of households receive electricity, in around one-third of villages, the proportion of households receiving electricity is 70% or less. Figure 4 shows distance of the village from the district headquarters. While 40% of the villages are located within 30 kms of the district headquarters, one-third of the villages are located 51 kms or more from the district headquarters.

Fig. 2
figure 2

Source IHDS 2011–2012

Social and human capital characteristics of villages (%).

Fig. 3
figure 3

Source IHDS 2011–2012

Village-wise distribution of percentage of households with electricity.

Fig. 4
figure 4

Source IHDS 2011–2012

Distance from district headquarters (kilometre ranges). Note Ranges: 1: ≤ 5 km, 2: 6–10 km, 3: 11–20 km, 4: 21–30 km, 5: > 31–50 km, 6: 51–75 km, 7: 76–100 km, 8: > 100 km.

Agro-ecological conditions also vary significantly across villages. While 53% of villages had no drought for the last 7 years, 15% of villages experienced droughts for at least 2 out of the last 7 years (Fig. 5). Flooding is less prevalent in the villages in our sample, with 74% of villages not experiencing flooding in the last 7 years (Fig. 6). Economic prosperity also varies significantly across the sample villages, as is evident from Figs. 7 and 8 in the distribution of mean real agricultural and non-agricultural wages (nominal agricultural and non-agricultural wages divided by the state poverty line).

Fig. 5
figure 5

Source IHDS 2011–2012

Number of years when the village experienced a drought during 2006–2012.

Fig. 6
figure 6

Source IHDS 2011–2012

Number of years when the village experienced a flood during 2006–2012.

Fig. 7
figure 7

Source IHDS 2011–2012

Village-wise distribution of real agricultural wages.

Fig. 8
figure 8

Source IHDS 2011–2012

Village-wise distribution of real non-agricultural wages.

Overall, we see striking differences in village infrastructure, schooling endowments, social segmentation, agro-ecological/climatic conditions and economic prosperity. We now investigate whether these differences manifest themselves in differential rates of intergenerational occupational mobility.

Results

We first present estimates of the IGRC augmented with individual and village-level covariates in Table 4, followed by the multinomial logit estimates of son’s occupational structure in the case of parents who were agricultural labourers in Table 5. In Col (1) of Table 5, we present estimates of the IGRC without interacting the father’s occupational rank with village covariates. In Col. (2), we interact the father’s occupational rank with all village characteristics. In Col. (3), we add individual level controls–these are the caste status of the son (SC, ST, OBC), father’s education (in years of education) and the age of the son. Finally in Col. (4), we include state-fixed effects.

Table 4 Regression results, augmented IGRC estimates
Table 5 Multinomial logit regressions, relative risk ratios, where the father is a labourer. Base category: agricultural and other labourers

We find that remoteness of the village (as captured by the distance from the district headquarters), agro-ecological conditions of the village—whether the village has been repeatedly affected by drought and floods, access of the village to the outside world (captured by the availability of a permanent (pucca) road), and economic conditions of the village (captured by real agricultural and non-agricultural wages) are important in explaining intergenerational occupational mobility in all the estimates—the coefficients for these variables are significant at 10% or less in Cols (1–4). The greater the remoteness of the village and the higher the number of years that a particular village has been affected by droughts or floods, the lower the mobility of sons relative to fathers (or lessens the persistence of occupations from fathers to sons). On the other hand, access to a permanent road and higher real agricultural and non-agricultural wages increases the mobility of sons relative to fathers in the village. This may reflect the possibility of sons accessing higher-income opportunities in these villages and consequently moving out of occupations at the lower end of the scale.

Village characteristics that are significant in some specifications but not in all estimates are whether the village has a secondary school and the percentage of households in the village with electricity. However, these characteristics are significant in Col. (4), where we include all individual-level controls and state-fixed effects. Whether a village has access to a secondary school or whether households in the village have access to electricity has a positive effect on intergenerational mobility. On the other hand, whether households live in separate clusters (jatis) or whether the village has a middle school does not seem to affect social mobility.

With respect to the individual-level controls, we obtain the expected results—SC, ST and OBC male heads are likely to have lower-ranked occupations, and father’s education and the son’s age has a positive effect on social mobility (the latter capturing life-cycle effects).

Finally, in Table 5, we present multinomial logit estimates when the father’s occupation is an agricultural or other labourer. We want to see to what extent village characteristics explain the likelihood that the son will move on to higher order occupations, and in particular, occupations at the top of the social ladder such as clerical workers and professionals. The relative risk ratios presented in the table capture the effects of village characteristics on the likelihood of individuals being in other occupational categories relative to the base category—agricultural/other labourer. We find that social segmentation in the village (that is, whether jatis stay in separate hamlets) decreases the likelihood that the son will move into other higher order occupations (except the move to being a farmer). Remoteness of the village also decreases the likelihood that the son will move into other higher-order occupations (again, except the move to being a farmer). In fact, the nearness of the village to the district headquarters is the only village characteristic that increases the likelihood of a sharp ascent—that is, whether an agricultural labourer’s son becomes a professional. Whether the village has a pucca road decreases the likelihood of an agricultural labourer’s son being a farmer relative to remaining an agricultural labourer, while the presence of a middle school increases this likelihood. Higher real non-agricultural wages have a very strong positive effect on the likelihood of transition to other occupations, especially more valued occupations such as clerical workers. In villages with a flourishing non-agricultural sector, a wider set of occupational possibilities would be available to the poorer households in the village, including well-paid service sector jobs.

Our results with the multinomial logit models differ somewhat from the IGRC estimates, and this may be due to the limitations of the IGRC in capturing mobility patterns among the households at the bottom of the occupational ladder. While the augmented IGRC estimates have an intuitive simplicity about them, the multinomial logit approach may be the way forward to capture more complex mobility patterns (where mobility may be both upward or downward), and not to make the restrictive assumption of rank ordering of occupations.

Concluding remarks

In this study, we examined the spatial determinants of intergenerational occupational mobility in rural India. In the literature on the spatial determinants of rural poverty and income mobility, we have found clear evidence of the importance of geographical factors such as agro-ecology and remoteness in explaining social mobility patterns in Indian villages. Physical infrastructure such as access to roads and electricity is also an important correlate of occupational mobility. Economic prosperity of the village has a very strong positive association with social mobility. For sons whose parents are agricultural labourers, social segmentation also plays an important role.

Several key policy implications follow from the findings of the study. Firstly, it is important for policymakers to address the geographical and infrastructural disadvantages that several households face in rural India and possibly target investment schemes to remotely located villages. Secondly, policies that promote rural prosperity can also indirectly promote social mobility; hence, growth enhancing policies may have a double benefit of reducing intergenerational inequality. Finally, there needs to be a clear focus on changing social norms that lead to segmentation in Indian villages, where the poorer households are particularly disadvantaged.