Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

This chapter studies the impact of age at entry on school performance using public administrative data for Chile.Footnote 1 In contrast to previous literature, the authors are able to track this impact on a selected group of outcomes by following a cohort of students over 11 years of their school life. This not only allows the authors to understand the evolution of the impact, but also sheds light on alternative channels that explain the pattern over time.

Since Deming and Dynarski (2008) pointed out a trend in the United States (US) towards delayed school entry, an increasing number of studies have explored the short- and long-term effects of age at entry.Footnote 2 ,Footnote 3 Despite the observed positive correlation between age at entry and academic achievements (Stipek 2002), a series of recent studies reveal mixed results for these and other long-term outcomes once the endogeneity of age at entry is addressed. Among children in primary school, age at entry has been negatively associated with grade retention and positively linked with scores in standardized tests (McEwan and Shapiro 2006). The evidence for older students shows that age at entry is associated with lower IQ and increased rates of teen pregnancy and mental health problems (Black et al. 2011). Earlier age at school entry is also associated with a negative effect on completed years of education (Angrist and Kruger 1991), lower earnings in young adulthood (20s and early 30s) (Angrist and Kruger 1991; Black et al. 2011), and an insignificant effect on the Armed Forces Qualifying Test (Cascio and Lewis 2006). Therefore, although a positive effect occurs in early school outcomes, age at entry seems to have an ambiguous long-term effect.

In this chapter, the analysis is focused on the impact of age at school entry on children of school age. This work is closely related to that of McEwan and Shapiro (2006), who also study the benefits of delaying enrollment on educational outcomes in Chile. They show that an increase of 1 year in the age of enrollment is associated with a reduction in grade retention, an increase in grade point average (GPA) during the first years, and an increase in higher education participation. Also, like McEwan and Shapiro (2006), the authors address the endogeneity of age at entry by using the quasi-random assignment stemming from the discontinuity in age at entry produced by the minimum age requirements. In the present study, however, the authors use the complete population of students.Footnote 4 This is important given the recognized sorting of better students into schools with better teachers (Tincani 2014). Second, the authors follow a cohort of students for over 11 years of their school life. By following this cohort, there is no need to restrict the analysis to a specific grade, which is a function of the number of grades repeated; rather, the analysis considers the number of years elapsed since a child started primary school. This is particularly relevant for Chile, where over 30% of the students in a particular cohort have repeated a grade at least once during the course of their school life. Third, by following a cohort of students for 11 years, the authors investigate the evolution of the impact of age at entry and its reported decline over the student’s school life. In Chile, in contrast to other countries, laws on mandatory education are linked to years of education rather than to a student’s age. In fact, for the period under analysis, the completion of secondary education is mandatory. This institutional feature enables study of the long-term return of delaying school entry over a child’s school life, even into secondary education, without concern that the impact of dropping out is captured simultaneously. Finally, by using other outcome variables going beyond school achievements, the authors are able to study the channels by which age at entry affects educational performance, and to assess the evolution of the impact associated with a delay in the age at entry. Specifically, the authors study the impact on school type, the type of academic track followed by the students and whether or not delaying school entry is associated with being in a school where higher cream skimming can be observed.Footnote 5

The findings confirm that delaying school entry has a positive effect on GPA, attendance, and the likelihood of passing a grade, but also that this impact tends to wear off over time.Footnote 6 Nevertheless, in contrast to previous studies, the findings reveal that this impact may still be observed 11 years after a child has started school. Moreover, evidence on the effect of age at entry on school type provides a potential explanation for the decline in the impact of age entry on academic achievement over a child’s school life. Specifically, a higher age at entry decreases the likelihood that a child is enrolled in a municipal school, such schools being characterized by less active selection of students and lower-quality teachers. Consistent with these differences in academic selection, children who are older when they start school have a higher probability of being enrolled in schools where children coming from other schools have a GPA that is higher than the mean in the school where the student was enrolled for first time. There is also evidence that age at entry has a positive effect on the likelihood that a child follows an academic track in high school. Finally, the authors provide evidence that age at entry is associated with an increase in the probability that a child is enrolled in a school which is actively engaged in cream skimming, which also explains the drop in the impact of age at entry on school achievements.

The chapter is organized as follows. Section 2 describes the authors’ empirical strategy. Section 3 briefly sketches Chile’s educational system, presents the dataset used in the analysis, and defines the sample and the selected outcomes in the analysis. In Sect. 4 the results are presented, and Sect. 5 concludes.

2 Empirical Specification

The specification of interest in this analysis can be expressed as follows:

$$\displaystyle \begin{aligned} \begin{array}{rcl} y_{it} &\displaystyle = &\displaystyle \alpha_{t}+\gamma^{t}Aentry_{i}+\beta^{t}*X_{i}+g^{n}(x_{i}^{n})+v_{it} {} \end{array} \end{aligned} $$
(1)

with y it as one of the educational outcomes observed for a student, i, t years after she/he started school. Aentry i corresponds to the age at entry and X i to other predetermined variables. The parameter of interest is γ t, which corresponds to the impact of age at entry on a selected outcome. The time superscript highlights the fact that this impact is allowed to change over time. As extensively reported in the literature, it is suspected that estimating Eq. (1) by ordinary least squares (OLS) will produce inconsistent estimates of γ.t Families who decide to delay school entry are more likely to have relatively higher (lower) gains (costs) associated with this delay.Footnote 7 Unobserved variables correlated with these gains and costs, such as parents’ education, parents’ motivation, and so on, can also have a direct effect on a student’s achievement; the OLS estimates are likely to pick up the impact of these unobserved factors as well as the impact of age at entry.

To overcome the problem of this omitted variable and to estimate the impact of age at entry, γ t, the minimum age at entry rules is used as a source of variation in the age at enrollment in first grade of primary school. These rules establish that children, in order to be enrolled in first grade at primary school, must have turned 6 before a given date in the academic year. Children whose birthday occurs before this cutoff date are entitled to start school the year they turn 6. Those whose birthday is after this cutoff must wait until the next academic year to start school. This discontinuity in the age at entry, together with the assumption that parents cannot fully control the date of birth, provides a potential quasi-experimental variation in the age at entry that constitutes the core of our “fuzzy” regression discontinuity (RD) strategy.Footnote 8

Chile’s official enrollment cutoff used to be 1 April but, since 1992, the Ministry of Education has provided some degree of flexibility to schools, allowing them to set other cutoffs between 1 April and 1 July. In fact McEwan and Shapiro (2006) show that in practice four cutoff dates are used in Chile: 1 April, 1 May, 1 June, and 1 July.

Hahn et al. (2001) show that the estimation of causal effects in this regression discontinuity framework is numerically equivalent to an instrumental variable (IV) approach within a small interval around the discontinuity.Footnote 9 In particular, following the equivalence with an IV approach, the instruments in the analysis correspond to four dummy variables taking a value of 1 for those children whose birthday is just after one of the cutoffs, and 0 otherwise.

Finally, in Eq. (1), we include \(g^{n}(x_{i}^{n})\), which is a flexible polynomial specification in the date of birth for a student whose birthday is around the n cutoff.Footnote 10 By including \(g^{n}(x_{i}^{n})\) in the previous equation the authors recognize that students born at different times of year might differ in a systematic manner. In fact, Buckles and Hungerman (2013), for the US, show that season of birth is correlated with some mothers’ characteristics. Specifically, they show that the mothers of children born in winter are more likely to have lower levels of education, to be teen mothers, and to be Afro-American. The fact that these mothers’ characteristics are correlated simultaneously with birthday and child’s educational outcomes does not invalidate the RD approach taken here. The fact that these observed and unobserved factors do not change discontinuously in the mentioned cutoffs is the basis for this identification.

3 Data and Variables

Since a major educational reform in the early 1980s,Footnote 11 Chile’s primary and secondary educational system has been characterized by its decentralization and by a significant participation of the private sector. By 2012, the population of students was approximately 3.5 million, distributed in three types of school: public or municipal (41% of total enrollment), non-fee-charging private (51% of total enrollment), and fee-charging private (7% of total enrollment).Footnote 12 Municipal schools, as the name indicates, are managed by municipalities, while the other two types of schools are controlled by the private sector. Though both municipal and non-fee-charging private schools receive state funding through a voucher scheme, only the latter are usually called voucher schools.Footnote 13

Primary education consists of 8 years of education while secondary education depends on the academic track chosen by a student. A “Scientific-Humanist” track lasts 4 years and prepares students for a college education. A “Technical-Professional” track in some cases lasts 5 years, with a vocational orientation aiming to help transition into the workforce after secondary education. Until 2003, compulsory education consisted of 8 years of primary education; however, a constitutional reform established free and compulsory secondary education for all Chilean inhabitants up to the age of 18. Despite mixed evidence on the impact of a series of reforms introduced in the early 1980s on the quality of education,Footnote 14 Chile’s primary and secondary education systems are comparable, in terms of coverage, to any system in any developed country.

The data used in this analysis come primarily from public administrative records on educational achievement provided by the Ministry of Education of Chile for the period 2002–2012. These records contain individual information for the whole population of students during the years that a student stays in the system. Moreover, an individual’s unique identification allows each student to be tracked over her/his whole school life. This dataset is essential to this study for four reasons. First, for every student in the system, there are several measures of school performance. Second, for the cohort of children born in 1996 or later, one can observe the age at which students were enrolled in first grade of elementary school. In fact, we define “age at entry” as the age at which a student is observed at the beginning of the school year when she/he was enrolled in first grade. The analysis is focused on the oldest cohort that was eligible to start school the first year for which there are data, that is, children born in 1996. These children, in complying with school’s minimum age at entry rule, should have started school either in 2002 (those eligible to start school the year they turned 6) or in 2003 (in the case of those children whose entry into school was delayed until the year they turned 7). Third, these administrative records provide an opportunity to follow every child over her/his complete school life, and to analyze the impact of age at entry beyond the first year of enrollment. Depending on the age at entry in primary school, these children are observed either 11 or 12 times (years) in the records. Given this last constraint, the analysis is focused on the impact of age at entry over the first 11 years of a student’s school life. Finally, because the whole population of students is observed, and the student’s school can be identified at every year, it is possible to determine not only some characteristics of the school, but also whether or not a student is enrolled in a school engaged in cream skimming.

By using students’ records two sets of outcomes are constructed. The first group of variables attempts to characterize the impact of age at entry on school performance. The first variable, attendance, corresponds to the percentage of days, out of the total, that a child has attended school during a given school year. Attendance, however, might well be capturing a school’s effort, since, for those institutions receiving funding through the voucher system, funding is a function of students’ attendance. To account for the fact that attendance is able to capture this and other school effects, we define a dummy variable “attendance below the median,” which takes a value of 1 for students with attendance below the median in the class, and 0 otherwise. The next variable is the annual average GPA over all subjects. As well as the variable attendance, GPA could reflect a school’s characteristics rather than a student’s own achievements.Footnote 15 As with attendance, a dummy variable is defined as indicating whether or not the GPA for a student in a particular year is above the class median. Finally, for this group of outcomes, the variable “Pass” is defined as a dummy variable taking a value of 1 when a student passes to the next grade, and 0 otherwise.

The second group of variables is composed of variables describing the movement of students between schools and variables related to the school’s characteristics. First, a dummy variable describes the movement between schools in a given year and takes a value of 1 when a child is observed in two (or more) different schools for two consecutive years. The rest of the outcomes in this second group characterize a school in three dimensions. The first dummy variable, “Public school,” takes a value of 1 if a child is enrolled in a public school, and 0 otherwise. A second variable, “Scientific-Humanist,” takes a value of 1 if a child who is attending secondary education is enrolled in a school following an academic track to prepare students for college, and 0 otherwise. Finally, since the classmates in the cohort may be observed over the 11 years followed in this study, the rest of the outcomes are useful for measuring the degree of cream skimming observed in the school, that is, the extent to which schools are able to select better students and remove students at the bottom of the distribution. First, a variable is created with the fraction of students among those coming from other schools who had grades above the median in the previous school. Since a standardized examination comparable across schools is not used here, this variable also helps to assess something about the quality of classmates. Specifically, this variable will increase when the rotation of students into the school is lowered (smaller denominator), or when more of the students who move come from the upper part of the distribution in their previous school (larger numerator). Both of these movements could be related to a higher quality of school. The next dummy variable takes a value of 1 if the GPA in first grade is higher than the median of the students that still remain from the first grade. For a child with a GPA above the median in first grade, this variable will take a value of 1. However, if the school is actively cream skimming, this variable will be less likely to take a value of 1 later on. Two dummy variables are also defined that indicate whether or not a particular student has, first, a GPA higher than that of the median of students who have ever moved and, second, a GPA higher than that of the students just moving into the school.Footnote 16 Finally, the last outcomes correspond to the average GPA of the rest of the classmates, not counting in this average the student’s own GPA.

The descriptive statistics are presented in Table 1. The sample is composed of approximately 250,000 students observed for approximately 10.2 years; 13% of the students attend schools in rural areas and 91% of the children starting primary school do so in a school in their own municipality. The average age at entry is 6.23 years (2282 days), with 40% of the students starting school the year they turn 6. In terms of the selected outcomes, the average attendance in a given year is over 91%. The average GPA is 5.6,Footnote 17 with approximately 93% of the students being promoted to the next class in every period. In a given year over the period under analysis, approximately 20% of students change school. In relation to the school type, approximately 47% of the students in a given year are enrolled in a public school. Although approximately 16% of the children are following an academic track, this fraction is driven down for the periods in primary school. When the sample is restricted to students in secondary school, this fraction increases to approximately 60%.Footnote 18

Table 1 Descriptive statistics

4 Results

4.1 Discontinuity in the Age at Entry

The RD design as a source of randomization of the treatment not only should ensure that other observed and unobserved factors are uncorrelated with the treatment, but also, and equally importantly, provides a significant variation of the treatment defined as the age at entry to the educational system.Footnote 19 Figure 1 presents the source of variation associated with the minimum age at entry using two definitions of the variable of interest: age at entry (in days) and a dummy variable indicating whether or not a child started school in the year she/he turned 6. Firstly, it is observed that those children born at the end of the calendar year are older when starting school and are also less likely to start school the year that they turn 6 (they are more likely to start school the year that they turn 7 or even later). However, conditional on being eligible to start school the year they turn 6 (being born before 1 April or being subject to the same eligibility rule as those born after 1 April but before 1 July), those students born later but to the left of a particular cutoff are the younger ones in their class. Secondly, a distinguishable jump in the age of starting school is observed for children born around 1 April, 1 May, 1 June, and 1 July. However for each of these thresholds a discontinuity in the treatment can be observed; the largest jump, for those children born around the threshold of 1 July, is noteworthy. This large jump around 1 July is explained by the perfect compliance associated with the rule of turning 6 before 1 July. In fact, the fraction of students starting school the year that they turn 6 (as opposed to the alternative of turning 7 or more) drops to practically zero for those children born after 1 July.

Fig. 1
figure 1

Age at entry and minimum age entry rule

Table 2 presents the estimated discontinuities for the “age at entry” (in days) at start of primary school. The results confirm the graphical analysis. Being born after the cutoff date causes entry to be delayed for some students, that is, increases the average age at entry. This impact is significant for each of the cutoffs, but the largest discontinuity is observed for those individuals born after 1 July, who experience an average increase of approximately half a year in the age at entry. The average increase for the rest of the cutoffs is between 15 and 45 days approximately. The results are robust to the selection of the bandwidths, the degree of the polynomial, and the inclusion of other covariates in the specification. Finally, following the equivalence with an IV approach, the value of the F-statistic for the null of the relevance of the excluded instruments is large enough to disregard any concern about weak instruments in all the specifications and selected bandwidths.

Table 2 Impact of minimum age entry rules on age of entry (days)

4.2 Impact of Age at Entry on Selected Outcomes

The impact of age at entry on the two selected groups of outcomes is reported in Table 3. The overall picture from Table 3 regarding the impact of increasing age at entry on the different school achievement outcomes is not only that it is positive but also that the impact is still present 11 years after starting primary school. The only exception regarding this positive impact of increasing age at entry is observed for some of the outcomes (attendance) for the ninth year after school entry, which would correspond to the first year of secondary school for those children who had not repeated a grade.

Table 3 Impact of age entry on selected outcomes

Specifically, first, for the variable “attendance” it is observed that children with a higher age at entry increase their attendance by between 1 and 2 percentage points during the first 2 years after starting primary school and the second year of secondary school. This impact in terms of school days, given an average attendance of 91% in the population, means that increasing age at entry by 1 year is associated with approximately 2–5 more days of classes during a specific school year. The exception in the size of the impact of age at entry is found for the first year of secondary school, when attendance falls by an average of 3 percentage points. However, even in the ninth year after starting school, by looking at the impact on the probability of having an attendance rate higher than the median in the class, one can discern a positive effect associated with age at entry. This finding suggests that the negative effect on attendance rate captured during the first year of secondary school reflects some school effects rather than an individual reduction in attendance.

Second, a higher age at entry is associated with a higher annual GPA for all years for which this cohort of students is followed. However, this impact tends to diminish with time. Specifically, the impact on average GPA falls from approximately 0.5 points for a child during her/his first year in the school system to 0.15 points 11 years after entry into the system (which would be the third year of secondary school for those children who have not repeated a grade). These magnitudes are not only statistically significant but also economically important. The magnitude of the increase in GPA (0.5 or 0.15 points) is equivalent to the difference between a student on the median of the GPA distribution and one on the 75th (60th) percentile of this distribution. Can this observed impact on GPA be driven by differences in grade inflation between schools, or differences in the requirements established in schools? The results for the outcomes indicating the place on the distribution of GPA in relation to the students within the same school cohort do not support this hypothesis. A higher age at entry increases by almost 31 percentage points the likelihood that a child’s GPA is above the median in their first year of school, and increases it by approximately 7 percentage points 11 years after school entry.Footnote 20 Consistent with this positive impact on GPA, it is found that a higher age at entry is associated with an increase in the probability of passing a grade of between 2 and 5 percentage points.

In this way, and in contrast to other studies, these results show an impact of age at entry that is still observed 11 years after the start of primary school. Tincani (2014) shows that, in Chile, private schools (privately and voucher funded) not only specialize in higher-ability students, but are also able to attract higher-quality teachers from public schools and from other sectors, by offering them better salaries. In relation to the current problem, it has been shown that a higher age at entry provides some advantage in terms of school achievement. Thus, in light of Ticani’s findings, these better students will be more likely (at some point in their school life) to sort into better schools. Although the absolute impact of this sorting process on average GPA is ambiguous,Footnote 21 the relative advantage associated with delayed entry should decrease with average quality of classmates. The rest of the analysis seeks to measure the effect of age at entry on schools’ characteristics and movement between schools.

Mixed results are observed for the outcomes characterizing movement between schools. Although children whose entry to school is delayed are less likely to change schools in the fourth year after the start of primary school, a year later they are more likely to move to other schools. In fourth grade, children take a National Examination (SIMCE). This examination aims to measure the quality of the education provided by schools, and the results might later be used by parents to choose schools. Moreover, there is anecdotal evidence indicating that schools have some leeway in selecting which students take this examination. Along these lines, this finding that a higher age at entry is associated not only with better grades but also with being less likely to change school in the year when this national examination takes place suggests that schools might have some power to retain these better students, at least in the short run (the year that this examination takes place). Also, during the first year of secondary school it is observed that delayed entry is associated with an increase in the chance of changing schools. In fact, an additional year in child’s age at school entry is associated with an increase of approximately 6 percentage points in the likelihood of changing school between the ninth (first year of secondary school) and tenth years after starting primary school. The start of secondary school in Chile is characterized by approximately 50% of the students in the educational system switching schools. This amount of friction might induce some students to actively search for a new school. Where this search effort is positively correlated with early educational outcomes, it would explain why higher age at entry results in an increased search effort in the periods with higher friction in the system. In fact, this greater search effort is consistent with a reduction in the probability of switching schools between the tenth and eleventh years. Also consistent with this increase in the fraction of students switching school at the beginning of secondary school is the already reported drop in school attendance during the first year of secondary school.

Regarding school type, it is first observed that a higher age at entry is associated with a decrease in the probability that a child attends a public school for almost all years under analysis. Second, it is observed that delaying school entry increases the likelihood that a child will follow an academic track by approximately 13 percentage points. In terms of the sample, this means that this last impact corresponds to an approximately 25% increase in the fraction of students in the educational track that aims to prepare students for college.

For the outcomes characterizing the school measured by the classmates choosing the school (who have moved there), it is observed that increasing the age at entry is associated with a rise in the fraction of students arriving from the upper part of the GPA distribution in their previous school, but specifically in secondary school. Therefore, the results reveal not only that higher age at entry is associated with a lower probability of being enrolled in schools linked to lower-quality teachers, as observed in public schools (Tincani 2014), but also that these students have better-quality classmates in secondary school, where they are more likely to follow an academic track. In fact, it can be assumed that all of these factors contribute to the drop in the GPA in secondary school, due to a tougher academic track and the lower likelihood of grade inflation reported in better schools. It is worth noting that the timing of these impacts is consistent with the drop in the GPA observed when starting secondary school. The last four outcomes explore the impact of age at entry on the probability of being enrolled in a school that is actively cream skimming. It has been shown that children with a higher age at entry have a greater likelihood of having a GPA higher than the mean of their classmates. Where some of these schools were actively engaged in cream skimming, however, the probability of having a GPA higher than the median of the students moving into the school should not only be lower but also fall over time. This is what is observed from the impact of age at entry on the outcomes that measure the probability of having a GPA higher than the median of students moving into the school in a given year, or of those who have moved to the school at some point in the past. That is, it is observed that age at entry increases the probability of having a GPA higher than the median of students moving into the school, but this increase in the probability is lower than the increase in the probability of having a GPA higher than the median of all students in the class (fourth column in Table 3) for almost all the years. Secondly, over the school life this probability decreases and by the time secondary school is reached it is not statistically significant. Also consistent with the hypothesis that age at entry increases the likelihood of being enrolled in a school actively cream skimming, analysis of the GPA in first grade over the years, but with the mean of the classmates arriving from first year of primary school, shows that this probability decreases over time. Finally, it is observed for some years, and with the exception of the first year of secondary school, that age at entry increases the probability of being enrolled in a school where classmates have, on average, a higher GPA.

5 Conclusions

The findings of this study confirm not only that delaying school entry has a positive effect on GPA, attendance and the likelihood of passing a grade but also that this impact tends to wear off across time. Nevertheless, in contrast to previous studies, the findings reveal that this impact can still be observed 11 years after a child has started school. Moreover, evidence of the effect of age at entry on school type provides a potential explanation for the fading of the impact on academic achievement throughout the school life. Specifically, a higher age at entry decreases the likelihood that a child will be enrolled in municipal (public) schools, which are characterized by a less active selection of students and lower-quality teachers. Consistent with these differences in academic selection (competition), children with a higher age at entry have a higher probability of being enrolled in schools where the children arriving from other schools were in the upper GPA distribution in their previous school. Evidence is also found that age at entry has a positive effect on the likelihood that a child follows an academic track in high school. Finally, evidence is found that age at entry is associated with an increase in the probability that a child will be enrolled in a school actively engaged in cream skimming, which also explains the drop in the impact of age at entry on school achievements.