The Importance of Being Earliest: Birth Order and Educational Outcomes along the Socioeconomic Ladder in Mexico

In this paper we study birth order effects on educational outcomes in Mexico using 2 million observations from the extended questionnaire section of the 2010 Census. We find that birth order effects are negative, and a variety of endogeneity and robustness checks suggest a causal interpretation of this finding. We then examine whether these effects vary across households’ economic status, and we find significant heterogeneity across absolute as well as relative standards of living – operationalized as household wealth and relative deprivation. Finally we find that first-borns’ advantage is amplified when they are male, and in particular when other siblings are female.


Introduction
The existence of a systematic heterogeneity among humans depending on their birth orders has long been investigated across an array of disciplines. Beginning with Galton's (1874) study on the preponderance of firstborns among eminent persons in society, research on the role of birth order † We are grateful for suggestions and encouragement to Horacio Almanza, Shaun Hargreaves Heap, Rosanne Tromp, Sarah Tustin, Arjan Verschoor and seminar audience at the University of Kent. We are also indebted to four anonymous Referees for a wealth of empirical and conceptual suggestions, which have substantially improved the quality of our work. All remaining errors are ours. continues to attract considerable attention across a number of fields, including the study of intelligence (Rodgers 2014), educational attainment (Monfardini and See 2016), earnings (Bertoni and Brunello 2016), psychological traits (Salmon, Cuthbertson and Figueredo 2016), malnutrition (Jayachandran and Pande 2017), health outcomes (Black, Devereux and Salvanes 2016), addictions (Barclay et al. 2016), and consumer behavior (Saad, Gill and Nataraajan 2005). Such a sustained and multidisciplinary interest in birth order effects is perhaps not surprising if one considers that the alleged source of inequalities between siblings roots within the very institution which should protect children from any sort of discrimination -the family.
In this paper we examine the relationship between birth order and education in Mexico. Our data consist of over 2 million observations from the extended questionnaire of the 2010 Mexican census, a module administered to 10% of the population that grants statistical representativeness at municipal level. As a first contribution of our paper, we provide evidence of negative birth order effects in Mexico and show that this result is robust to a number of sensitivity analyses and estimation strategies designed to address specific endogeneity concerns. As a second contribution of our paper, we focus on the interplay between households' economic status and birth order. For the first time, the notion of economic status in the study of birth order effects is extended to include not only households' absolute standards of living, but also relative deprivation -i.e. a measure indicating the economic disadvantage relative to other households in the same municipality. We find that both facets of economic status are significant predictors of educational outcomes, and that they significantly interact with the birth order variable.
Absolute standards of living affect investments in education via material pathways related to credit constraints, affordability of education and physical inputs (Basu and Van 1998). By contrast, relative standards of living capture psychosocial pathways whereby relative standing in society affects human capital investments by shaping aspirations and carving social identities (Mayer 2001;Oyserman 2009 and. It is therefore important to disentangle the potential role of each facet of economic status in shaping the effects of birth order on educational outcomes. Mexico is an interesting context for studying birth order effects on educational outcomes and for exploring their heterogeneity across economic status. Mexico is an upper middle-income country and OECD member, and yet it displays a range of features more typical of low-income countries. Around 46% of the population lives in absolute poverty according to official national statistics (CONEVAL 2012) and the need to contribute economically to the household is a major determinant of child labor (Alcaraz, Chiquiar and Salcedo 2012). Economic inequality is particularly high, placing Mexico at the very top of OECD countries for a range of national and sub-national indicators of inequality (OECD 2016). This significant economic disparity, where substantial parts of the population live in absolute poverty at the bottom while others are considerably well-off at the top, suggests that different dynamics might exist for different parts of the population so far as birth order effects are concerned.
Understanding whether there exist economic gradients in birth order effects is not only interesting in its own right, but can also offer insights into the mechanisms behind these effects. While the overall picture of a disadvantage for later-borns we observe for Mexico is consistent with the general evidence for economically developed countries (De Haan, Plug and Rosero 2014), we find that, ceteris paribus, birth order effects are more pronounced at higher absolute as well as relative standards of living. This is in line with evolutionary models of biased parental investment, which postulate that parental attitudes towards risk might strengthen the advantage of earlier-borns in wealthier households (Gibson and Sear 2010;Gibson and Lawson 2011;Hedges et al. 2016). Because higher levels of standards of living imply greater reliability of returns to parental investment and lower vulnerability to risks over which parents have little control (e.g. neighborhood violence), it becomes relatively safer to put 'more eggs in one basket' -and this benefits earlier-borns over later-borns. At the other end of the spectrum, the evidence that earlier-borns' advantage is smaller in poorer households is also consistent with the existence of child labor. When child labour becomes an option for increasing family earnings, older children are more likely to be sent to work as they are able to command higher wages and this may erode or even overturn the advantage they may have had towards younger siblings (Emerson and Souza 2008). We also find that while the above patterns are consistent across genders, the advantage of the first-born is greater when this is a boy, and especially so when all other siblings are female.
The paper develops as follows. In section 2 we present a brief discussion of the literature on birth order effects (sub-section 2.1) and discuss how these may be expected to vary at different levels of the two facets of economic status we take into examination (sub-section 2.2). Section 3 describes our data and outlines our empirical approach. Section 4 presents our findings together with several sensitivity checks.
Section 5 concludes, and discusses the implications and limitations of our findings.

A pecking order within the family?
The seminal contribution of Belmont and Marolla (1973) found strong evidence of lower intellectual performance for later-borns. Following this work, an array of models focusing on family interactions and household environment were developed to investigate systematic heterogeneities in intellectual and educational achievements across siblings of different birth orders. These heterogeneities may arise due to the declining intellectual age in the family subsequent to additional siblings (see Zajonc and Markus's 1975 confluence model), resource constraints (see Blake's 1981 resource dilution model), parents' fertility decisions (see the optimal stoppage model by Ejrnaes and Pörtner 2004) and parents' agerelated 'reproductive quality' (see the biological model of Malaspina et al 2005). Additional factors studied as possible determinants of birth order effects are parental time allocation (Price 2008, Pavan 2015, Monfardini and See 2016, preferential parenting (Behrman and Taubman 1986, Kessler 1991, Hotz and Pantano 2015, Mechoulan and Wolff 2015, Herd et al 2016 and personality or behavioral traits (Sulloway 1996, Baer et al. 2005, Black, Grönqvist and Öckert 2017).
An important source of birth order effects on which the literature has focused is household resources.
In line with Becker and Tomes' (1976) idea of a trade-off between quality and quantity of children, Blake's (1981) resource dilution model ascribes earlier-borns' greater intellectual development to the lower number of siblings with whom household resources have to be shared (see also Hanushek 1992).
In a similar fashion, de Haan (2010) argues that negative birth order effects may arise due to the financial resource dilution which accompanies the birth of new siblings. Other mechanisms arising from barriers to human capital investments due to resource constraints have instead been argued to lead to positive birth order effects. Parish and Willis (1993) find that earlier-borns acquire less education and explain this result through credit constraints mechanisms; older siblings would be disadvantaged because they are born when their parents are in the earlier stages of their careers, earn less, and are less able to borrow. An alternative mechanism leading to positive birth order effects is suggested by Emerson and Souza (2008). They explain lower educational outcomes amongst earlier-borns by arguing that when child labor is necessary to boost family incomes, it is older siblings who go to work because they can command higher wages than their siblings.
As noted by de Haan, Plug and Rosero (2014), the literature seems to suggest a pattern related to the country's level of economic development. Earlier-borns are typically found to have better educational outcomes in high income countries (e.g. Behrman and Taubman 1986, Modin 2002, Black, Devereux and Salvanes 2005, Kantarevic and Mechoulan 2006, Booth and Kee 2009, de Haan 2010, Härkönen 2014, Barclay 2015, Hotz and Pantano 2015, Mechoulan and Wolff 2015 while later-borns do so in developing countries (e.g. Emerson and Souza 2008, Rammohan and Dancer 2008, Adli, Louichi and Tamouh 2010, de Haan, Plug and Rosero 2014, Lafortune and Lee 2014and Seid and Gurmu 2015. Interestingly, an analogous within-country socioeconomic gradient is found by Haan, Plug andRosero (2014) in Ecuador, andTenikue andVerheyden (2010) show that earlier-borns have better (worse) educational outcomes in richer (poorer) households in a sample of twelve Sub-Saharan countries. At the same time, there is also evidence deviating from the general pattern related to the country's level of economic development -e.g. Dayioğlu, Kirdar and Tansel (2009) find negative birth order effects for Turkey and Cho (2011) finds positive ones for South Korea.
Evidence from Mexico is scarce and offers conflicting conclusions. Using data from nine schools, Binder (1998) finds an advantage for earlier-borns in terms of years of schooling for girls but not for boys. In contrast, Binder and Woodruff's (2002) study of intergenerational mobility in schooling that tracks families across four cohorts finds some evidence of greater educational achievements for laterborns'. In particular, birth order does not influence years of schooling for the first cohort, but increases it in the fourth cohort and with a stronger effect for females. This gender result for the Mexican context is at odds with the findings of Ejrnaes and Pörtner (2004), de Haan, Plug and Rosero 2014and Härkönen (2014, who instead find weaker birth order effects for females in, respectively, the Philippines, Ecuador and Germany. Gender could also matter through the likelihood to engage in child labor -e.g. Dammert (2010) finds that in Nicaragua and Guatemala older boys are more likely to engage in paid work. Gutmann (2006) argues that in Mexico parents are often more concerned with the education of the firstborns when they are males, as a result of a macho culture characterized by son-preference (Filmer, Friedman and Schady 2008) and where having a first-born male can be seen as a demonstration of manliness (Rojas and Figueroa 2005).

Absolute standard of living
The strength of the mechanisms discussed above may differ for households of different socioeconomic status. With regard to the absolute level of standard of living, the main mechanisms discussed in the literature refer to child labor and resource dilution. Livelihood strategies arising in response to economic hardship, such as the involvement of earlier-borns in child labor, can be expected to be more prevalent among poorer households. Material resource dilution is also likely to be less of a restraint for richer households, although, if the level of material comfort enjoyed by the household and the economic resources invested in children are commensurate with family earnings, some dilution issues with the arrival of further offspring may in fact arise for most households. 1 This suggests contrasting influences for these two mechanisms, with child labor (favoring later-borns' schooling) and resource dilution (favoring earlier-borns' schooling) both being more relevant amongst the less wealthy.
Largely neglected in the economics literature, evolutionary models of biased parental investment suggest that wealthier households might invest more in earlier-borns. According to these models, the reason why parents may favor earlier-borns is that these have higher reproductive value (both because older offspring are closer to reproductive maturity and because the rate of juvenile mortality tends to 1 An interesting perspective on this is offered by the (at first sight counterintuitive) findings of a study based on a sample of 13,176 mothers, where wealthy mothers expressed greater concerns about the economic cost of having large families compared to poorer mothers (Lawson and Mace 2009). decrease with age) while the reason for favoring later-borns is that the effect of a unit of additional investment on offspring survival is on average higher for younger offspring as they are weaker (Clutton-Brock 1991). These competing drives are typically resolved in favor of earlier-borns, as formalized in the model developed by Jeon (2008). A number of recent contributions argue that this preferential pattern in favor of earlier-borns is stronger in wealthier households (Gibson and Sear 2010;Gibson and Lawson 2011;Hedges et al. 2016). This is because at higher levels of standards of living, there is greater reliability of returns to parental investment and lower vulnerability to the risks over which parents have little control (e.g. susceptibility to disease, neighborhood violence, etc.), so that it becomes relatively safer to put 'more eggs in one basket'. 2 By contrast for poorer households, higher risks reduce the level of effort per child beyond which success is due to chance (Quinlan 2007). This leads to a less discriminative attitude and to a sort of 'bet-hedging' investment strategy in offspring (Liddell, Barrett and Henzi 2003).

Relative standard of living
Despite the increasing recognition of the role played by relative income in shaping social outcomes (Clark, Frijters and Shields 2008;Clark and D'Ambrosio 2015), to the best of our knowledge the relationship between relative standard of living (independently from its absolute counterpart) and heterogeneity of outcomes across birth orders has been neglected in the literature. To see how birth order may interact with relative deprivation in shaping educational outcomes it is important to consider that, unlike absolute standards of living (which determine the affordability of education and possibly child labor), a lower position on the economic ladder depresses educational outcomes mainly via aspirations, perceptions and expectations (Mayer 2001). 3 This is in line with the theory of identity-based motivation, according to which, since people pursue goals that they believe to be congruent with their 2 When parents are not averse to inequality in children's outcomes and one child is chosen as the main human capital investment, first-borns would be more likely to be picked for a number of reasons -for example, because they are the closest to being of an economically productive age and they have the highest reproductive potential (being the closest to reproductive maturity). 3 Quoting Mayer's (2001) influential sociological work, "If children feel relatively deprived, they may be less inclined to study or stay in school. Relative deprivation can also make parents feel stressed and alienated, lowering their expectations for their children or reducing the quality of their parenting" (p.4). socially constructed identities, lower standing in the economic hierarchy leads to underinvestment in education Oyserman 2009, 2010). In addition, relative deprivation exerts negative effects on aspects of behavior which affect the ability to sustain investment in human capital formation. These include impulsivity, self-control, susceptibility to boredom, alcohol consumption (Balsa, French and Regan 2014;Mishra and Novakowski 2016), engagement in criminal activities and passive as well as active bullying at school (Odgers et al. 2015;Napoletano et al. 2016;Vogel and South 2016). For a more in-depth discussion of the link between relative deprivation and educational attainments, see Esposito and Villaseñor (2019).
It is difficult to predict the direction of the interaction between relative deprivation and birth order in shaping educational outcomes. One may expect that relative deprivation is more detrimental for laterborns compared to earlier-borns, because the negative influence of relative deprivation on behavioral aspects may be amplified in later-borns if they are more rebellious and lacking in discipline (see Sulloway 1996). On the other hand, the motivational/aspirational pathway may suggest an equalizing effect of relative deprivation across siblings: if aspirations are suppressed, monetary as well as time investment in education may be reduced to the bare minimum for all siblings, so that educational attainments, and consequently birth order effects, would be 'leveled down'. In light of the evidence that lower subjective socioeconomic status negatively influences adults' self-assessed intelligence (Kudrna et al., 2010), it is also possible that parents in relatively deprived households invest more in their earlierborns to try to create more capable carers for younger offspring.

Data and variables
We use data from the extended-questionnaire section of the 2010 Mexican census. This module is administered to 10% of the population following a stratified clustered sampling design which covers around 2.9 million households and grants statistical representativeness at municipal level. Since we are comparing birth order outcomes amongst children within the same household, we focus on children aged 6-17 years, and restrict our estimations to households who have at least two siblings. We also exclude the children of mothers whose age at first birth was lower than 13 or greater than 40, since their children might have poorer outcomes on account of physiological factors arising from maternal age.
Our final dataset consists of 2,026 million observations from 2,452 municipalities.
Our outcome variable is on-track grade enrollment -i.e. whether the child is enrolled in the expected grade given her age (+/-1 year). This is an important educational indicator because being 'off track' (due to late entry and/or grade repetition) has been shown to lead to higher school dropout rates, lower educational attainments, poorer labour outcomes, greater behavioural problems and negative externalities on other students -with evidence spanning both high-as well as low-income countries (Byrd, Weitzman and Auinger 1997, Jimerson 1999, Glick and Sahn 2010, Manacorda 2012, García-Pérez, Hidalgo-Hidalgo and Robles-Zurita 2014, Jaekel et al 2015. Children in Mexico are expected to start primary school at age six. Primary school consists of six grades, followed by three grades of secondary and three grades of "preparatoria" or high school. Currently both "basic" schooling (primary + secondary) and high school are mandatory, but enforcement mechanisms are weak. The problem of being 'off track' is particularly severe in Mexico, where at the time of our data grade repetition rates at any schooling level were well above the OECD average despite high levels of school enrollment (UNESCO 2007, OECD 2011, and where detrimental effects of grade repetition were found to be important across the educational spectrum including primary school (Gibbs and Heaton 2014).
[ Table 1 about here] In our main results, we derive the birth order variable by including any child who has a mother or father residing in the same household and inferring the birth order from their age. Since in a small number of cases there are multiple families within the same household, this approach to assigning birth ranks allows for some children to have the same birth order if they have different parents and as a robustness check we also estimate birth order effects for such households separately. Robustness checks are carried out with two alternative ways of assigning birth order ranks within the household -which differ with regard to the children included in the ranking, in particular in the case of multiple-family households (more on this below). With regard to the econometric operationalization of the birth order variable, we estimate i) specifications employing raw birth order variables (i.e. birth order dummies), as well as ii) specifications using the relative definition of birth order used by Ejrnaes and Pörtner (2004) and Dayioglu et al. (2009) -viz. (r-1)/(N-1), where r is the raw birth order and N is the total number of children in the household. While this variable is less granular than birth order dummies, it lessens the problem of larger families potentially driving the birth order result due to the positive correlation between raw birth order and household size. Indeed, adopting this relative definition of birth order substantially reduces the correlation between birth order and household size in our data -from 0.50 in the case of birth order dummies to 0.03 in the case of relative birth order.
We conceptualize socioeconomic status in terms of absolute wealth and relative wealth (relative deprivation). Given the lack of income data at household level, we construct an asset index to measure absolute wealth. The extended questionnaire of the Mexican census contains a rich set of variables about the quality of the dwelling, durable goods ownership and access to basic utilities that allows us to compute a household asset index through principal component analysis (typically found to be a useful indicator for the study of educational outcomes, e.g. Filmer and Pritchett 2001;McKenzie 2005;Mazzonna 2014). Given the discrete nature of the data, we follow polychoric principal component analysis developed by Kolenikov and Angeles (2009). The correlation between our municipality mean asset index and the official municipality mean income estimated by CONEVAL (2012) is high (0.81 for linear correlation and 0.91 for rank correlation), giving some validation to our wealth measure.
In our main models, relative deprivation is quantified by applying the widely used Yitzhaki (1979) measure to our asset index, which quantifies relative deprivation as the gap between the economic resources of a household and those of richer households in the reference group. The reference group is defined according to a geographic proximity criterion (i.e. people living in the same municipality), in conformity with Deaton's (2001) view that "people almost certainly compare themselves to their immediate geographical neighbours" (p. 21). The linear functional form in the Yitzhaki (1979) index implies that relative deprivation in the comparison with richer individuals is assumed to increase with constant slope. To relax this assumption, we carry out robustness checks with the concave index proposed by Esposito (2010), which accounts for the idea that sensitivity to relative deprivation progressively levels off in the comparison with richer individuals. As an additional robustness check, we also use the (own-generated) mean municipal wealth as well as the (data-provided) mean income in the municipality -these indices based on reference-group means have often been employed in the literature as an alternative metric for relative deprivation (e.g. Ferrer-i-Carbonell 2005; Luttmer 2005).
We control for a rich set of child and household characteristics in order to separate birth order effects from the influence of other correlated characteristics which are also likely to shape educational outcomes. We control for the child's gender and age gap from the previous-born sibling, and dummies for whether the child has a disability, has a blood relationship with the household head, and has both parents living in the household. In particular, the inclusion of age gap from the previous sibling is in order to control for the potential effects of sibling spacing separately from those of birth order. At household-level, we control for the household head's gender and age, average years of education amongst adults, the number of families residing in the household, household size and the number of siblings, whether the household has an indigenous background and is a social program recipient, mother's age at first birth, and the households' economic status in terms absolute and relative wealth (see below for details). Finally, we control for three municipality characteristics which might influence educational outcomes directly or indirectly: the number of schools per child (as a proxy for supply of education), municipality population (to control for size effects), and an index of migration to the USA (capturing a potentially important form of group-level aspirations which might influence expectations from schooling and the perceived returns to education more broadly). 4 Descriptive statistics are provided in Table 1. Boys and girls are in almost equal proportions, just over a fifth have an indigenous background and 1.8% suffer from a disability, and almost three quarters of children are on track.

Empirical strategy
Our aim is to understand the relationship between birth order and on-track grade enrollment-i.e.
whether the child is enrolled in the expected grade given her age (+/-1 year). The binary character of our dependent variable and the high proportion of positive outcomes suggest that a non-linear modelling approach is best suited (Long 1997;Doi et al. 2013: 245). This also lends additional potential benefits in terms of consistency (Horrace and Oaxaca 2007) and less susceptibility to problems when the dependent variable is misclassified -i.e. measured with error (Hausman et al 1998).
Consequently, we employ logit models with and without fixed effects. The fixed-effects logit (also known as 'conditional logit') has the added benefit that it is considers only those households that display some within-household variation in outcomes, dropping those households where all or no children are on-track -for which no within-household variation across birth order can be observed. Formally, for the i th child, the logit model estimates where x is a vector of covariates which includes birth order and  is a vector of regression coefficients estimated using maximum likelihood. Throughout, standard errors are clustered at household level to allow for within-household correlation.
The fixed-effects logit models these probabilities conditional on the proportion of positive outcomes within the household. Formally, if the i th household has Ni children, ki of whom are are on-track, and the j th child's on-track outcome is denoted yij, then the vector yi of outcomes for the household is yi = (yi1, yi2, … yik) and Where the set Si contains all possible permutations of outcomes within the household such that the number of positive outcomes remains ki, and dij can take the value 0 or 1. As mentioned above, this estimator only exploits information from the households where there is some variation in the outcome variable.
Because birth order regressions focus on within-family variation in outcomes, De Haan, Plug and Rosero (2014) note that this leads to the correlation between the child's age and birth order becoming amplified. We follow the strategy commonly used to address this problem, and include dummies for the child's age in all regressions. Finally, as Kantarevic and Mechoulan (2006) have argued, we would expect birth order effects to function at least in part via the age of the mother and her age at first birth.
That is, even though this age might not cause birth order effects, we would expect it to be associated with the factors which would drive this relationship, such as intelligence, physiological condition, own employment, and ambition for children -not all of which are observed or can be controlled for. In light of this crucial relationship, we control for the mother's age at first birth and square thereof. 5 Table 2 presents estimates for the relationship between birth order and on-track school enrollment.

Birth order effects
Models 1L-6L and 1FEL-6FEL refer to logit and fixed-effects logit estimations, respectively. We report results for the full sample, households that consist of single families, and households with multiple families. The key variables of interest are relative birth order in models 1-3 and birth order dummies in models 4-6. Models 1L-6L include the full list of covariates, while in models 1 FEL -6FEL householdlevel covariates are washed out by the fixed-effects estimation approach.
The birth order variables display negative and highly significant coefficients (p<0.001) across all specifications. This indicates that the probability of being on-track is lower for later-borns, regardless of which subsample or model is adopted and whether we operationalize birth order through a relative single variable or through birth order dummies. 6 As mentioned above, whereas logit models involve both within and between-household comparison, fixed-effect logit focuses exclusively on within- [ Table 2 about here] [ Figure 1 about here] Our economic status variables have the expected sign (positive for absolute wealth and negative for relative deprivation) and are highly significant in all regressions (p<0.001). The results for absolute wealth and relative deprivation should be seen as disentangling the roles of absolute and relative standards of living rather than simply as providing 'mirror-image' pieces of information. This is because, while typically higher wealth corresponds to lower relative deprivation, this correspondence holds strongly only within a reference group -i.e. a household may have both lower absolute wealth and lower relative deprivation than another if they live in a poorer municipality. This can be seen also from correlation statistics. When calculated within individual municipalities, the correlation between absolute wealth and relative deprivation is very high (ranging between -.76 and -.99); however, when calculated over our whole sample, correlation is only moderate (-.48).
Results for our other covariates are also intuitive and in line with the educational literature. Girls are more likely to be in school, in line with the gender reversal of education in Latin America (World Bank 2012). Negative predictors are having a disability, being in a female-headed household (which in Mexico this is often equivalent to being a single-mother family), sibship size (due to increasing 7 Yet it is important to bear in mind that although the conditional logit strategy minimizes potential endogeneity problems arising from the correlation between household size and birth order, the characteristics of children might still endogenously determine subsequent fertility and thus birth order and household size. See Ejrnaes and Pörtner (2004) for a theoretical model with endogenous fertility. 8 In non-linear models regression coefficients can be interpreted directly only in terms of their sign but not their magnitude, since the marginal effect of each covariate depends on the levels of all the other covariates. 9 We present these probabilities in terms of the gap in on-track enrollment (%) relative to the first-born's outcome.
This seemed the clearest presentational strategy since the fixed effect that determines the household proportion of positive outcomes in a fixed-effects conditional logit model cannot be identified -and therefore absolute predicted probabilities provide little information while differences in predicted probabilities do. competition for resources), household and municipality size and migration intensity -which is likely to decrease the investment in home education since this is poorly remunerated in the USA (Massey and Espinosa 1997). Being indigenous (proxied by speaking an indigenous language) also has a negative effect and this is in line with the findings of Patrinos and Psacharopoulos (1997). Positive predictors are adult education, being a beneficiary of a social program, being in a nuclear family, age difference from the previous sibling, school availability and household's head age -in line with Debowicz and Golan (2014). 10

Endogeneity concerns and subsample analyses
Is the negative association between birth order and on-track enrollment causal? Besides the general indication from the fixed-effects logit estimates that household-level unobservables are unlikely to be driving the results, we estimate a number of models which address different challenges to a causal interpretation. Table 3 addresses the concern for a specific form of endogeneity which could arise due to unobservables (such as fertility decisions) driving both the number of siblings as well as birth order outcomes. Since we would expect that these unobservables remain fixed in subsamples defined by the number of siblings, we run fixed-effects logit specifications 7FEL-9FEL (relative birth order) and 10FEL-12FEL (birth order dummies) using subsamples with 2, 3, and 4 siblings, respectively. In all cases, birth order coefficients are negative and highly significant (p<0.001), suggesting that results are unlikely to be driven by unobservables related to sibship size. Here, and in the tables that follow, we display only the main explanatory variables of interest -results for the other covariates mirror those discussed above relative to Table 2 and are available upon request. The corresponding predicted probabilities (based on models 10FEL-12FEL) are summarized in Figure 1b, which shows that slopes are negative and that the outcome gap between first and lastborn children rises with the number of siblings.
[ Table 3 about here] 10 Results are qualitatively unchanged if we control for education supply by including separate controls for the availability of primary and secondary schools.
A different concern arises from potential sample censoring of two kinds. On the one hand, future plans for as-yet unborn children might influence the educational outcomes of current children; on the other, older children may have left the household and thus might not be observed, leading to biased estimates of birth order effects. While we cannot address these issues entirely due to the nature of our data, we address these concerns by censoring our sample in two alternative ways. First, by restricting the sample to mothers who are less than 32 years of age at the time of the survey: their children are in general quite young, and thus it is less likely that older siblings have left the household. Second, by restricting the sample to mothers who are over 44 years of age at the time of the survey: the rationale for this is that older mothers can be assumed to have completed their fertility (Kantarevic and Mechoulan, 2006;Emerson and Souza, 2008;Dayioğlu, Kirdar and Tansel, 2009), and thus it is unlikely that there are any as-yet unborn siblings, plans for whom might influence the educational outcomes of current children.
We provide graphical summary results for regressions based on subsamples identified by these mother's age censoring regressions in Figure 2, which displays also summary results for fixed-effects logit models estimated on subsamples defined according to an array of socioeconomic and demographic characteristics. For each of these regressions, birth order effects are negative and highly significant [ Figure 2 about here]

Birth order along the economic ladder
Our finding of a negative relationship between birth order and educational outcomes echoes the evidence typically emerging for high income countries (e.g. Black, Devereux and Salvanes, 2005;Conley and Glauber, 2006;Kantarevic and Mechoulan, 2006;Booth and Kee, 2009;De Haan, 2010).
If we consider that Mexico is an OECD member and is classified as an upper-middle income country, our finding can certainly be seen to fit this picture. At the same time, however, moving away from In Table 4 we present fixed-effects logit models including interaction terms between birth order and our absolute and relative economic status indicators -for the full sample, single and multiple families subsamples, and subsamples defined by sibship size. Interaction terms are highly significant (in all cases, p<0.001) and consistently indicate through models 13FEL-18FEL that birth order effects are more strongly negative at higher levels of absolute wealth and less strongly negative at higher levels of relative deprivation. 11 The signs of these interaction terms suggest a common direction of influence of absolute and relative economic status: the detrimental role of birth order for later-borns is more pronounced for wealthier as well as less relatively deprived households. This result is in line with the economic gradient of birth order effects observed in the literature, whereby the advantage of earlierborns is less evident or even reversed at the bottom of the economic spectrum (e.g. De Haan, Plug and Rosero 2014). At the same time, our work adds the evidence that this economic gradient may stem partly from a material pathway related to economic hardship (e.g. child labour mechanisms) and partly from a psychosocial pathway based on the relative standing on the economic ladder -i.e. indicating that the macro-hierarchy the household finds itself in may affect micro-hierarchies within the household. Figure 3 presents the gap in on-track enrollment between last and first-borns at different levels of absolute wealth (left) and relative deprivation (right) in terms of predicted probabilities calculated from 13FEL. The gaps are as expected negative, and they are consistent with the signs of the interaction terms: the educational achievement gaps in favor of earlier-borns increase at higher levels of absolute wealth and decrease at higher levels of relative deprivation. The gap in on-track grade enrollment narrows from 8.5% to 3.5% between the highest and lowest levels of absolute wealth while the corresponding changes by levels of relative deprivation are 1.5% to 6.5%.

Gender and further robustness checks
In light of the evidence of possible gender differences in birth order effects discussed in Section 2, including a preference for first-born males, we explore the existence of systematic gender patterns. A first insight can be gained from Figure 2, showing that birth order effects are significant and negative, as well as of similar magnitude, across the subsamples of all female, all male and mixed gender siblings.
To test for systematic heterogeneities in the role of birth order across gender we first run a series of regressions where birth order variables (whether entered as dummies or in the relative operationalization) are interacted with the gender dummy. In all cases the interaction terms are statistically insignificant, suggesting that, overall, average birth order effects tend to be similar for girls and boys. We next carry out a more fine-grained analysis with a specific focus on the gender of the first born. In particular, we divided households into six subsamples according to the gender of the first-born (male or female) and the gender profile of the second and later-borns (all male, all female, or mixed) and estimated our baseline fixed effects model (4FEL) in each of these subsamples. 12 [ Table 5 about here] [ Figure 4 about here] The results are presented in Table 5, which shows that significant negative birth order effects are observed in all cases except the subsample where the first born is females and the other siblings are 12 The strategy of dividing households into these subsamples is similar to that of De Haan et al. (2014) males. Predicted probabilities plotted in Figure 4 also show that the drop in on-track enrollment is strongest when the first-born is male, across all three gender profiles of the remaining siblings but particularly so when the other siblings are female. This suggests that parents might invest more in the first-born when this child is a son. Taken together, these results indicate that while, overall, average birth order effects are negative and of similar magnitude for girls and boys, there is some evidence of an advantage for first-born sons in line with the literature on Mexican first-born cultural preferences (Rojas and Figueroa 2005;Gutmann 2006;Filmer, Friedman and Schady 2008).
Finally, we provide some additional robustness checks in appendices B and C. First, as we anticipated above, we run robustness checks regarding the definition of birth order -results are shown in appendix B, where we explore two alternative ways of ranking births. The first ranks the sons and daughters of the identified household head according to their ages. Unlike our main definition, this version focuses on the children of the household head and ignores other children living in the household who are not the head's offspring. The second alternative also ranks sons and daughters of the household head, but now includes her or his grandchildren as part of the same ranking, assuming that families in these settings still bow to the grandparents' decisions. Birth order are consistently negative and statistically significant (p<0.001) under these alternative definitions.
Second, since one of the novelties of our paper is the incorporation of relative standard of living as a dimension of economic status in the study of birth order effects, we investigate the interaction between birth order and alternative metrics of relative deprivation. As we mentioned above, we use indices which differ in the functional form used (i.e. the concave index proposed by Esposito 2010), the (owngenerated) mean municipal wealth as well as the official mean income in the municipality taken from the National Statistical Office (INEGI, Consejo Nacional de Población CONAPO). These results, provided in appendix C, indicate that the pattern of interaction effects remains consistent across all three alternatives -that is, birth order effects are strengthened at higher levels of not only absolute wealth but also of relative wealth (i.e. lower levels of relative deprivation) also in the case of these alternative formulations of relative deprivation.

Conclusion
We have examined birth order effects on on-track age-for-grade school enrollment using over 2 million observations from the 2010 extended questionnaire of the Mexican census. We have found that birth order effects are consistently negative through a number of robustness checks and that this result likely warrants a causal interpretation, as it holds across models designed to address specific endogeneity concern. Beyond customary logit models, we have estimated fixed-effects models focusing on withinhousehold outcome variation, models based on a variety of subsamples defined by socioeconomic and demographic lines including the number of siblings and maternal age, models employing a relative birth order metric as well as models using birth order dummies, and models adopting alternative approaches for attributing birth ranks. Thinking of Mexico as an upper middle-income country and an OECD member, our evidence tallies with the literature wherein economically developed countries tend to have negative birth order effects -while the opposite is generally found for developing countries. It should also be considered that the prevalence of child labor has been lessened by the major conditional cash transfer program Progresa/Oportunidades introduced in 1997 (Behrman, Parker and Todd 2011) -hence possibly decreasing the strength of this factor driving positive birth order effects. At the same time, however, striking socioeconomic inequalities, the high incidence of poverty, and the fact that child labor still exists (Alcaraz, Chiquiar and Salcedo 2012), all suggest that there might exist certain mechanisms which shape birth order effects in ways that are similar to developing countries.
We investigate the heterogeneity of birth order effects at different levels of economic status, which, for the first time in the study of birth orders, we conceptualize in terms of not only absolute wealth but also relative deprivation. We estimate additional models featuring interaction terms between birth order and our two dimensions of economic status, with the aim of shedding light on the economic gradient of birth order effects and saying something more about potential mechanisms at play. These models provide consistent results across the full sample as well as subsamples by sibship size, suggesting that fertility decisions are likely to be exogenous to these interaction effects. We find that, ceteris paribus, higher absolute wealth is associated with stronger, more negative birth order effects. The pattern is suggested by the sign of the interaction term and is borne out by graphical analysis, and is consistent with evolutionary models of biased parental investment (which predict that earlier-borns' advantage would be magnified in richer households), as well as the existence of child labor (which is more detrimental for earlier-borns and is arguably more prevalent amongst poor households).
Reinforcing this effect, higher relative standards of living (i.e. lower relative deprivation) is also associated with more negative birth order effects. We have suggested alternative views on the possible role of relative deprivation, but a clear-cut interpretation of our finding in this case is more complicated given the lack of literature on the heterogeneity of birth order effects at different levels of relative standards of living (absolute standards of living being controlled for). The negative role of relative deprivation in models where absolute wealth is controlled for suggests that the economic gradient in education results not only from a material pathway, related to credit constraints, affordability of education and physical inputs (Basu and Van 1998), but also from a psychosocial pathway whereby relative standing in society affects human capital investments through curbing aspirations and shaping social identities (Mayer 2001;Oyserman 2009 and. The positive sign of the interaction term between birth order and relative deprivation in our regressions suggests therefore that birth order differences in educational outcomes may stem from the place occupied by the household in the economic hierarchy and from the way socioeconomic status affects how parents and children see their present and future roles in society.
Taken together, our findings suggest key policy implications. In common with studies from other contexts, our results highlight the importance of focusing on later-born children in the Mexican context in order to improve educational outcomes, given that birth order effects can explain around 6% of the variation in on-track grade enrollment. Second, our results emphasize the importance of recognizing socioeconomic status as a multidimensional phenomenon, characterized by both absolute and relative domains. These are likely to exert independent as well as joint influence over educational outcomes in Mexico, suggesting that the striking socioeconomic disparities in the country may have more complex and detrimental repercussions on human capital accumulation than previously thought. In addition, while we find, overall, similar birth order effects for girls and boys, we have found evidence of an advantage for first-born sons. This is in line with the literature on first-born-as well as son-preferences in the country, which are aspects of a patriarchal culture Mexican policymakers should address.
Finally, while the size and richness of our data allow us to conduct our analysis with significant precision and implement several robustness checks, important limitations should be acknowledged. The first and perhaps most obvious one resides in the cross-sectional nature of our data, which deprives our analysis of a potentially useful longitudinal component. This would have been useful in ascertaining the exact mechanisms at play behind absolute and relative wealth. We have employed fixed-effects models to strengthen the claim around causality of the observed negative birth order effects, but these models are limited in their ability to delve into the role of household-level characteristics because by construction they are washed out. Therefore, there remains the possibility that some components of socioeconomic status might be endogenous to birth order. For instance, a common set of unobservables might determine relative deprivation through households' choice of where to live, as well as birth order via fertility decisions. In addition, while our focus is on birth order mechanisms related to economic status, it is likely that other mechanisms related to physiological, cultural and intra-household factors also operate simultaneously -and possibly in interplay with socioeconomic domains. A key avenue for future research is to understand how socioeconomic domains interconnect with other factors to codetermine birth order effects.

Compliance with Ethical Standards
The authors declare that they have no conflict of interest.    Notes: This table provides regression estimates for logit and fixed-effects logit models. The dependent variable is on-track enrollment status. Standard errors in parentheses are clustered at household level for the logit models. * p < 0.05, ** p < 0.01, *** p < 0.001.    Notes: This table provides regression estimates for fixed-effects conditional logit models. The dependent variable is on-track enrollment status. All models include the child-level covariates shown in table 1. * p < 0.05, ** p < 0.01, *** p < 0.001.    The numbers shown are the difference in probability of being on-track between respective birth order levels (2, 3, 4, 5 and above) and that of the first-born.

TABLES
Appendix A: Fixed-effects logit models for subsamples by demographic and economic status (displayed in Figure 2)      Notes: This table provides regression estimates for fixed-effects logit models. The dependent variable is on-track enrollment status. The two alternative ways of ranking birth order are as follows. The first ranks the sons and daughters of the identified household head according to their ages and ignores other children living in the household who are not the head's offspring. The second alternative also ranks sons and daughters of the household head but also includes her or his grandchildren as part of the same ranking. All models include the other childlevel covariates shown in table 1. * p < 0.05, ** p < 0.01, *** p < 0.001. Notes: This table provides regression estimates for fixed-effects logit models. The dependent variable is on-track enrollment status. All models include the other child-level covariates shown in table 1. * p < 0.05, ** p < 0.01, *** p < 0.001.