1 Introduction

Using individual administrative records from Chile, we study the impact of the size of the school choice set, measured by the number of available slots per student in the municipality at the time of enrollment in first grade of primary education, on individual educational outcomes. We do this by relating the literature on school choice with the one on School Starting Age (SSA).

The recent literature does not take a clear position in favor or against school choice (Epple et al. 2017). The ambiguity resides in the heterogeneity of the programs implemented and the individuals affected by these interventions. While small-scale programs provide credible experimental or quasi-experimental variation in school choice, (see, for example Angrist et al. (2002; 2006)) the expected benefits are positively related to the size of the intervention.Footnote 1 However, in large-scale interventions, the non-random sorting of students across schools makes it difficult to find a credible control group and speaks for the offsetting effects for some individuals in the population.Footnote 2 In this paper, we study the impact of more/fewer school choices in a context of a country (Chile) with a national voucher system for more than 30 years which in theory allows families making use of school choices to search, use and profit from better school quality.Footnote 3 Even though most of the evidence for the Chilean voucher system points to the effect on “cream-skimming” and mixed results on test scores (see, for example, Hsieh and Urquiola 2006), here we study if a change in the set of schools available to students has an impact on the educational outcomes of these students.Footnote 4

To study the impact of the size of the school choice set, we connect with the literature on School Starting Age (SSA).Footnote 5 Specifically, we rely on a “quasi-experimental” variation coming from the institutional setting that until recently governed minimum age requirements in Chile.Footnote 6 Families/children in Chile faced i) multiple birthday cutoffs defining the minimum age when a student can start the first grade of elementary school and ii) a different composition of schools according to these cutoffs across municipalities. These two features lead students born around several cutoffs, who “endogenously” decided to start early but whose birthdays differ by a few days, to face different school choice sets. Moreover, these changes in the school choice set around cutoffs vary also across municipalities. While minimum age requirements have been extensively used to estimate the impact of SSA, here we use these particular features to estimate the contribution of the size of the school choice set.Footnote 7

Different from a setting with a unique cutoff, by comparing students on either side of the cutoffs across different cutoffs and municipalities, we can address the endogeneity of early start and also learn the contribution of the school choice set.Footnote 8 Specifically, at any cutoff-municipality combination, families face a potentially different subset of the school choice set, whose size changes differently at each cutoff across municipalities. That is, this source of variation not only allows us to control for municipality fixed factors (such as systematic differences in educational markets) but also to account for systematic differences in schools using different deadlines. In other words, our quasi-experimental variation is given by the fact that families with children born just after one of these cutoffs (except for the last one) are not forced to wait for the next academic year to send their children to school, but they face a subset of schools in the case in which they want them to start in the current year. That is, the treatment (slots per student) is different if the student’s birthday is at each side of a given cutoff (seven in total), with its intensity varying also by municipality. Moreover, our population under study are all Chilean children born around the first day of the month, from January to July. Therefore, we are studying the impact of the school choice set on a wider population of students, rather than on a more homogeneous one in terms of family income or previous educational outcomes. We are also able to capture the potential heterogeneity across parents’ education. In addition, our quasi-experimental variation allows us to study families’ behavior and its consequent effects independently of schools’ strategic behavior.Footnote 9

Our results reveal, first, that a larger set of available schools causes modest increases in standardized test scores. However, we find a sizable reduction in the likelihood of dropping out (8% in terms of sample mean) and in the probability that a student switches schools over her/his school life (10% in terms of sample mean). Second, for a subsample of students who have completed high school, we observe a positive effect on the probability of taking the national college admission test the year of high school graduation. Specifically, an increase in a third of slots per student is associated with an increase of approximately 2 percentage points in the probability of taking the college admission test (or 3% in terms of sample mean). Moreover, for families with less-educated parents, we also observe a positive effect on the probability of enrolling in college. In fact, for most of these outcomes, we show that the impact is greater among students whose parents have lower levels of education.

This paper provides evidence on the interaction of SSA with school choice. Therefore, we shed light on one institutional feature setting limits in the capability of families to profit from school choice in voucher schemes like the one in Chile. By providing evidence about the potential heterogeneity in the impact of SSA according to the school choice set, we learn about school choice contribution and also about the families who profit the most from it. Furthermore, this evidence about the positive contribution of school choice in a country with a national voucher scheme enables us to learn about the role of school choice in other dimensions other than the non-random sorting of students across schools, but on long-term educational achievements. Finally, this paper stresses the importance of school choice early in students’ academic life. We find evidence about the impact of school choice at the time of first enrollment in elementary school rather than at high school or college.Footnote 10 That is, we contribute to the recent literature studying the impact of early interventions on children’s outcomes.Footnote 11

The paper is organized as follows. Section 2 briefly sketches Chile’s educational system, presents the data set and defines the sample used in the analysis. In Sect. 3 we describe our empirical strategy and the selected outcomes in the analysis. In Sect. 4 we discuss the validity of our source of variation. In Sect. 5 we present our results, and Sect. 6 concludes.

2 Institutional background and data

2.1 Chilean education system

Since a major educational reform in the early 1980s,Footnote 12 Chile’s primary and secondary educational system has been characterized by its decentralization and by a significant participation of the private sector. By 2012, the population of students was approximately three and half millions, distributed throughout three types of schools: public or municipal (41% of total enrollment), non-fee-charging private (51% of total enrollment), and fee charging private schools (7% of total enrollment).Footnote 13 Though both municipal and private schools get state funding through a voucher scheme, the latter are usually called voucher schools.Footnote 14Between 1994 and 2015Footnote 15, public and subsidized private schools were allowed to charge a co-payment (“financiamiento compartido”). By 2015, almost all public schools were free of charge and around 70% of students in voucher schools were paying a co-payment.

Primary education consists of eight years of education while secondary education depends on the academic track followed by a student. A “Scientific-Humanist” track consists of four years and it prepares students for a college education. A “Technical-Professional” track has a duration in some cases of five years with a vocational orientation aiming to help the transition into the workforce after secondary education.Footnote 16 Despite mixed evidence on the impact of a series of reforms introduced as of the early 1980s on the quality of educationFootnote 17, Chile’s primary and secondary education systems are comparable in terms of coverage to any system we can observe in any developed country.

Fig. 1
figure 1

Fraction of students attending school in their same municipality

In Chile’s public school system, families are not restricted to enrolling their children in the municipality of residence (while around 90% of the families do so) and the norm is to enroll a student the year she/he becomes six or seven years of age. Figure 1 shows the distribution of schools according to the fraction of students living in the same municipality. Schools are divided into four groups according their type and whether they receive or not co-payment: “Public” (do not require co-payment),Footnote 18 “Vouchers - Free of charge ,” “Vouchers - Co-payment,” and “Unsubsidized private schools.” As shown in the figure, only unsubsidized primate schools appear to have a relevant fraction of students from other municipalities (around 30% in average).

2.2 Minimum age requirement rules

Minimum age requirement rules have been used extensively to address the impact of age of entry (Cascio and Lewis 2006; Black et al. 2011). These rules establish that children, to be enrolled in first grade at primary school in a given school, must have turned six before a given date during the academic year.Footnote 19 A common element in almost all of these studies is the feature that, in a given educational market, children face a unique cutoff date. That is, all children whose birthday takes place before this cutoff date are entitled to start at any eligible school the year they turn six. Those whose birthday is after this specific date must wait until the next academic year to start in any of the same schools. In Chile’s public school system, the norm is to enroll a student the year she/he becomes six or seven years of age.Footnote 20 Although the official enrollment cutoff was originally set on April 1,  since 1992 until recently the Ministry of Education has provided schools with some degree of flexibility for setting other cutoffs between January 1 and July 1.Footnote 21,Footnote 22

Different from a setting with a unique date, in this case, children whose birthday takes place after a given cutoff date are not forced to wait until the following year but are entitled to start school in the current year in any school with later cutoffs. Therefore, those whose birthday occurs after any specific cutoff (except for the last one) may either wait until the next academic year to start school (looking at the complete set of schools) or restrict the search to those schools with a deadline later in the year.

Figure 2 shows the distribution of schools around cutoffs during the period under analysis. As shown in the figure, most schools with a cutoff date away from April 1,  had sorted around thresholds later in the year. Notice also that while the vast majority (around 80%) of fee-charging voucher schools have their cutoff date on July 1,  public and free-of-charge voucher schools (and also unsubsidized private schools, around 7% of total enrollment) are distributed across several cutoffs dates.

Fig. 2
figure 2

School distribution around cutoffs by type of school

2.3 Data

We use three different sources of data in this paper. The first one comes from students’ records from the Ministry of Education, available from 2002 to 2014.Footnote 23 For each student and year (from 1st grade elementary to 4th year of high school), we know, together with their exact date of birth, which grade and school a student attended, her/his GPA, and whether or not a student passed, failed or left the class or school. These records allow us to track each student over her/his whole school life. We analyze the cohorts born between 1996 and 2001, keeping those students for whom we have information since the beginning of elementary education.

The second source of data comes from the national standardized test (SIMCE)Footnote 24, usually given in 4th, 8th, and 10th grades (the 2nd year of high school).Footnote 25 The SIMCE database also has a supplementary parents’ survey, which allows us to know the parents’ education.

Finally, the third source of data comes from the Centralized College Admission System, from 2014 to 2017 (individual records of students born between 1996 and 1999).Footnote 26These files contain the national standardized college admission test (PSU, Prueba de Selección Universitaria) and the admission records of the 33 best Chilean universities.Footnote 27

3 Empirical specification

As a consequence of the non-random nature of school choice and researcher’s limited information, the problem of evaluation of the impact of school choice is non-trivial. To circumvent this problem, we make use of the minimum age requirement rule and specifically, Chile’s institutional feature of multiple “soft” cutoffs, as a source of a quasi-experimental variation. To identify the impact, we use differences in the size of the school choice set faced by families at the time of enrollment in primary school around those soft cutoffs and across municipalities. Specifically, the source of variation used in our analysis comes from the combination of three elements: i) the minimum age of entry rules; ii) the fact that there are in practice seven deadlines that schools can choose when defining the minimum age at which a child can start school; and finally iii) the difference in the composition of schools according to the deadlines across municipalities.

Figure 3 shows the ratio of potentially available vacancies around the seven cutoffs when a child is first eligible to start primary education (classifying schools according to the age of the youngest student) over the population of six- and seven-year olds (from the Population Census). Notice first that children whose sixth birthday is before January 1 or after June 30 face the complete school set. While students who become six years old around December 31 can start elementary school the following March (the academic year is March–December), those born on or after July 1 have to wait until the next year (the closest March to their seventh birthday).Footnote 28 Nevertheless, children born between January 1 and June 30 face, in the case in which they start school the year they become six, a reduction in the universe of schools potentially available. Notice also in Fig. 3 that the largest decline in the set of schools occurs around the fourth or later cutoffs since most schools use the June 1 and the July 1 cutoffs.

Fig. 3
figure 3

Vacancies over population in the municipality by birthday

Fig. 4
figure 4

Fraction of students who start school the year becoming six and average age of entry

Figure 4 presents the fraction of students who start elementary education the academic year closest to their sixth birthday for every birthday (“early entry”). First, we observe that those children born after July 1 are “forced” to postpone enrollment until the following year. Second, except for February and March, we observe a distinguishable jump in the fraction of students entering early for the rest of the cutoffs (January and April–July) with a drop in a fraction starting early at the right side of each of these cutoffs. Notice also the larger jump for those children born around July 1, which is also explained by the large fraction of schools using this last cutoff. Together with the assumption that parents cannot fully manipulate the date of birth, this discontinuity in the age of entry has been used in the literature as quasi-experimental variation in the age of entry at the core of a “fuzzy”Footnote 29 Regression Discontinuity (RD) strategy in recent papers (McEwan and Shapiro 2006; Cascio and Lewis 2006). However, as we have just shown, around each of these cutoffs, we observe a combined treatment: school set and age of entry.

To sort out the effect of age of entry from the one of facing the larger school set, we not only use the fact that some children are not eligible to start in some schools but also that the distribution of schools/cutoff differs across municipalities and that we observe the school set available for a student with a given birthday. The idea is simple: being born after a given cutoff implies a tradeoff between starting elementary school the current year, choosing a school from the reduced set still available, or starting the following year facing the full set of schools. Moreover, this variation in school choice set not only takes place across cutoffs within a given municipality but also differs between municipalities. Since the composition of schools around a cutoff is not the same across municipalities (or across cutoffs within a municipality), children born around the same date but residing in different municipalities (or in later cutoffs for students residing in the same municipality) face a different reduction in the relevant school set available, and therefore a different cost in the case of not delaying entry.Footnote 30 That is, in case school choice would matter, the cost of an early start should be heterogeneous across different cutoffs in a given municipality and across cutoffs-municipalities.

Using administrative data for the total population of students, we can construct the full set of schools available in the municipality at each birthday for students in the case in which they start early. That is, the identification comes from the fact that the treated population (children starting school at the age of 6 or 7) were exposed to different “doses” of a second treatment (reduction in the school set) if they do not delay school entry. Therefore, the discontinuity in age eligibility, at least for some schools, generates a change in the school choice set that varies across municipalities and cutoffs. This change in the school choice set provides us with a quasi-experimental variation that addresses the endogeneity of age of entry and also the impact of school choice set in the context of a “fuzzy” regression discontinuity (RD) strategy.

Consider a scenario where students are indexed by i, birthdays by b. The specification we use to estimate the impact of the school choice can be expressed as follows,

$$\begin{aligned} {y_{icnt}}= & {} \eta _{cs}+\rho _{t}+\varsigma _{n}+X_{i}^{'}\mathbf {\psi }+{{\alpha }}_{1}EarlyEntry{}_{i}+{{\alpha }}_{2}EarlyEntry_{i}*R_{cb}+{{\alpha }}_{3}R_{cb}\nonumber \\&+g(b_{i})+\epsilon _{icnt} \end{aligned}$$
(1)

with \({y_{icnt}}\) representing one of the outcomes at year t for student i, born the day b of year s, in the neighborhood around of \(n^{th}\) cutoff, and living in municipality c.

Table 1 Descriptive Statistics

The variable \(EarlyEntry_{i}\) is a dummy variable taking a value of one in the case in which the child started primary school the academic year closest to her/his sixth birthday and zero, otherwise. The variable \(R_{cb}\), reflects a student choice set in the case of starting their elementary education early, corresponding to the ratio of available vacancies for a student with a birthday b in municipality c, over the population of 6- and 7- year olds in the municipality, the first year the child is eligible to start primary school.Footnote 31 Moreover, \(\eta _{cs},\,\rho _{t}\), \(\varsigma _{n}\) and \(X_{i}\) represent municipality-year of birth, calendar year, cutoff neighborhood fixed effects and a vector of individual covariates, respectively.Footnote 32\(^{,}\)Footnote 33Finally, \(g(b_{i})\) is a flexible polynomial specification in the day of birth for a student, taking the form:

$$\begin{aligned} g\left( b_{i}\right) =\sum _{n}\left( \sum _{k=1}^{K}\left[ \beta _{nk}\left( b_{i}-C^{n}\right) ^{k}+\rho _{nk}\left( b_{i}-C^{n}\right) ^{k}\times 1\{b_{i}-C^{n}>0\}\right] \right) , \end{aligned}$$

where k is the degree of the polynomial, \(1\{*\}\) is an indicator operator and \(C_{n}\) one of the n cutoffs values. That is, \(1\{b_{i}-C^{n}>0\}\) defines whether or not an individual has a birthday after a given cutoff n. Notice that the source of variation allows us to control for municipality fixed effects (systematic differences in educational markets) and cutoff neighborhood fixed effects (systematic differences between schools using different deadlines).Footnote 34

Table 2 Optimal bandwidth by selected outcomes and cutoffs

Hahn et al. (2001) show that the estimation of causal effects in this regression discontinuity framework is numerically equivalent to an instrumental variable (IV) approach within a small interval around the discontinuity. By focusing on observations around these seven discontinuities, we concentrate on those observations where both treatments (school set and age of entry) can be considered as randomly assigned. This randomization of the treatment ensures that all other factors (observed and unobserved) determining a given outcome must be balanced at each side of these discontinuities. Specifically, to select an optimal bandwidth, we applied the method developed by Calonico et al. (2014) at each of the cutoffs, obtaining values that on average go from 4 to 12 days around the discontinuities (see Table 2).Footnote 35

In equation (1), the cost (or benefit) of age at entry is captured jointly by the parameters \(\alpha _{1}\) and \(\alpha _{2}\). Specifically, \(\alpha _{2}\) is the parameter of interest in our analysis and represents the change or difference in the effect of early entry associated with an additional slot per student in the municipality. A positive and significant estimate for \(\alpha _{2}\) not only tells us that the impact of early entry is not independent of the school choice set faced by families but also that it is decreasing on the size of the school choice set. Moreover, in equation (1) we allow that there is a direct effect of the vacancies per student in the municipality, \(R_{cb}\).Footnote 36 Note that after including \(EarlyEntry_{i},\) \(\eta _{cs}\) and \(\varsigma _{n}\), the parameter \(\alpha _{2}\) has a similar interpretation of a difference-in-difference estimator (DD). That is, we learn the impact of school choice by comparing the additional cost faced by early entrants (on top of the one captured by \(\alpha _{1}\)) when we compare the difference in outcomes of students around several cutoffs within the municipality, to similar differences in outcomes in other municipalities that experience a different change in their school choice set between these cutoffs. However, different from the usual DD, the treated/control group status (early entrant status) is endogenous, and therefore, we instrumentalize it with minimum age requirements.

As stated above, the variables indicating whether or not a child starts school early (\(EarlyEntry_{i}\)) and its interaction with the vacancies per student (\(EarlyEntry_{i}*R_{cb}\)) are two non-random variables. To address this endogeneity, we make use of the seven discontinuities determining age requirements (the same across municipalities) and the differences in the available set of schools (across cutoffs and municipalities). For each of the two endogenous variables, \(endog_{i}\), in equation (1) we estimate the following first-stage regression to formally inspect for a relevant variation:

$$\begin{aligned} endog_{i}= & {} \eta _{cs}+\rho _{t}+\varsigma _{n}+X_{i}^{'}\psi +\theta R_{cb}+\delta _{1}\times 1\{{\widetilde{b}}_{i}>0\}+\gamma _{1}R_{cb}\nonumber \\&\times 1\{{\widetilde{b}}_{i}>0\}+\delta _{2}\times 1\{b_{i}-C^{7}>0\}\nonumber \\&+\gamma _{2}R_{cb}\times 1\{b_{i}-C^{7}>0\}+g(b_{i})+v_{i} \end{aligned}$$
(2)

where \({\widetilde{b}}_{i}=b_{i}-C^{n}\), is a centered running variable around the first six cutoffs (\(n<7\)). The parameters of interest in equation (2) are \(\delta _{s}\) and \(\gamma _{s}\), which capture the discontinuity in the endogenous variable around the cutoffs. Specifically, \(\left( \delta _{1}+\gamma _{1}R_{cb}\right) \) represents the discontinuity in the endogenous variable in the first six cutoffs and (\(\delta _{1}+\delta _{2}+(\gamma _{1}+\gamma _{2})*R_{cb}\)) for the last one. Here, \(\gamma _{1}\) is the contribution associated with the school choice set in the first six cutoffs (\((\gamma _{1}+\gamma _{2}),\) in the last cutoff). We allow a different effect between the first and last cutoffs because birthdays just after the first six deadlines do not stop students from looking for a school that enables them to start early, so its effect is expected to differ from the one in the last cutoff.Footnote 37 As previous studies regarding the impact of SSA have shown,Footnote 38 we expect \(\delta _{1}(\delta _{1}+\delta _{2})<0\) for the variable \(EarlyEntry_{i}\). That is, children whose birthday is just over one of the deadlines are less likely to start school the year they become six. However, we also expect that the higher the number of slots available for a given population (that is, the larger \(R_{cb}\)) is, the greater the chance that a student will start school early

By adding to the specification a fully flexible polynomial specification on the day of birth, \(g(b_{i})\),Footnote 39 we deal with the possibility that students born at different dates differ systematically.Footnote 40 Nevertheless, even if the mother’s characteristics were correlated simultaneously with the birthday and the child’s educational outcomes, it does not invalidate our approach. What is required is that the effect of these observed and unobserved factors do not change discontinuously in the mentioned cutoffs.Footnote 41\(^{,}\)Footnote 42

3.1 Outcomes

We construct two groups of outcomes from several data sources. By using data from public administrative records on educational achievement provided by the Ministry of Education of Chile for the period 2002–2014, the first group of outcomes attempts to characterize the school where a student is enrolled in the first grade of elementary school. The first variable takes a value of one in the case of a student enrolled in a subsidized private (voucher) school and zero otherwise. The following two variables are proxies of school inputs: class size in first grade and the average education of the other parents in the school. The last outcome in this group is the average school SIMCE, measured the year prior to effective enrollment.

The second group of outcomes come from the administrative records and the SIMCE records and attempt to characterize the impact of school choice on students’ performance. By using students’ records we are able to build six variables that describe the students’ progression over their school life. The first two outcomes in this group correspond to the standardized math and language test scores from the SIMCE examination. The third variable, Dropout, is a dummy variable taking a value of one in the case in which a student in the last year is observed (except for 2014) has not completed secondary education or in the year 2014 has not completed the academic year in any school and zero, otherwise. The following variable is a dummy variable, Move school, for students in elementary school that takes a value of one in the case where a child is observed in two (or more) different schools for two consecutive years. The idea of this variable is to capture simultaneously two factors. First, the impact of a poor match school-student when choosing the first school and families later on searching for better schools. That is, in case fewer school options imply attending a school that might not be the first option for the family, parents might be prone to look for another school later on. Second, because a worse match might adversely impact students’ achievements, students might be expelled from a given school. We define this variable only for students in elementary school since more than 50% of the students change school when they move from elementary to high school.Footnote 43 The two final variables correspond to outcomes related to college application. These variables are constructed for the cohort of students born between the years 1996 and 1999 since they are the ones who should have completed their secondary education by 2017 (both those who delayed school entry and those who did not). Took the PSU indicates whether or not the student takes the national college examination (PSU)Footnote 44 the year of high school completion and zero otherwise. Finally, Ever Enrolled College, is a dummy variable indicating whether or not a student has been enrolled in one of the colleges in the system over the period under analysis.Footnote 45

Descriptive statistics are presented in Table 1. The first column reports the statistics for the complete population of students, while the second column shows the statistics for the students in the sample under analysis. Noticeable is the similarity in the sample mean between the students in our sample and the complete population. Specifically, each cohort of students is composed of approximately 250,000 students; half of them are boys; 91 percent of the children starting primary school do so in a school in their own municipality. The average age of entry is 6.14 years of age, with approximately 78% of these students not delaying school entry. Figure 5 shows the distribution of SSA by parents’ education label. Notice that the dashed histogram (for children with parents with more than 12 years of education) shows a higher fraction of students starting school closer to their seventh birthday. Conversely, the blue solid histogram (for children with parents with 12 years of education or less) shows a larger fraction of children starting before 6 and a half years of age.

Fig. 5
figure 5

Distribution of SSA by parent’s educational label

In terms of the selected outcomes, approximately 24% of the students have failed a grade at least once. Moreover, around 60% percent of the children in secondary education follow an academic track with approximately 14% changing school during their elementary education. Regarding the type of school where the children are enrolled for the first time, approximately 8% and 50% of the students in a given year are enrolled in a private or voucher school, respectively. The average class size is approximately 28 students and the average education of the parents of the classmates is approximately 11 years.

The graphic analysis presenting the relationship between each of the outcomes and the student’s birthday is reported in online appendix.Footnote 46

4 Validity of the source of variation

4.1 Continuity in predetermined variables

Our analysis builds on the fact that changes in school eligibility and changes in the available set of schools around the cutoffs can be seen as good as a randomized assignment for those students with a birthday near these dates. Then, as in any random assignment, those pre-determined characteristics at the time of the randomization should be similar between treated (students with a birthday just after one of the seven cutoffs) and the control group (students with a birthday just before one of the cutoffs). Evidence of a systematic difference in these pre-determined characteristics around these dates would compromise the underlying assumption that individuals cannot precisely manipulate the running variable (Lee and Lemieux 2010).

Fig. 6
figure 6

Balancing covariates

Figure 6 inspects graphically the existence of a potential discontinuity among four baseline characteristics available in the data set: gender (fraction of male), mother’s and father’s education, and the highest education of the students’ parents. The graphical representation does not show any sizable discontinuity or outliers for these selected variables with the exception of the days around January 1 , May 1 and September 18. These three dates correspond to three major holidaysFootnote 47 and can be explained by a selected drop in the number of births during holidays. That is why in our preferred specification we control for whether or not a particular student was born on a legal holiday. We formally test for discontinuities in these baseline characteristics using different polynomial specifications and seven bandwidths (3, 5, 7, 10, and 15 days, plus two optimal data-driven bandwidthsFootnote 48). The p-values are reported in the first two columns of Table 3.Footnote 49 For only two out of 36 specifications the null is rejected at a significance level of 5%. Nevertheless, to avoid the results being affected by the January cutoff, our analysis excludes students around this first threshold. However, as a robustness check, we present the results including all the cutoffs in AppendixFootnote 50.

Table 3 Differences in predetermined variables between children born before and after the seven cutoffs. p-values reported

4.2 Manipulation of the running variable

In addition to the continuity of the predetermined variables, the randomization of treatment in the neighborhood of the discontinuities rests on the assumption that families cannot precisely select their children’s day of birth. That is, the validity of an RD design can be compromised in cases in which individuals can precisely manipulate the running variable, day of birth, (Lee and Lemieux 2010). Since minimum-age entry rules are of public knowledge, we could expect that benefits/costs associated with a delay in the age of entry might induce some families to choose the season of birth.Footnote 51 Moreover, it is worth noting that Chile is (with Turkey and Mexico), one of the countries with the highest rate of c-sections in the world.Footnote 52 These two facts suggest some power to select the running variable.

Nevertheless, the fact that families can sort themselves over the calendar year does not invalidate this quasi-experimental design. The critical identification assumption, however, is that individuals lack the power to precisely sort themselves around these discontinuities. Under precise manipulation, we would find observations stacking up around the discontinuities or in other words, we would observe a discontinuous distribution for the day of birth (the running variable).

Fig. 7
figure 7

Raw and conditional histogram for day of birth

Panel (a) of Figure 7 presents the raw histogram for the day of birth for all individuals born from December 15 to July 6. Despite the high volatility, the figure hides a quite uniform distribution of birth during the calendar year but with a high dispersion across weekdays and years of birth. Specifically, we observe an average of 650 births within a range of 500 to 800 births per day. However, when we control for the day of the week, holidays, and year of birth fixed effects, as Panel (b) of Fig. 7 reveals, there is a uniform distribution of births across the calendar year and no discontinuities in the distribution of the running variable over the cutoffs. That is, in a given municipality and year of birth, we observe approximately 3 births per day.Footnote 53 Finally, following McCrary (2008), we formally test for a discontinuity in the distribution of the running variable by running a regression that has as the dependent variable the frequency of birthdays over the calendar year, as we did with the previous three pre-determined variables. The p-values are reported in the third column of Table 3. Only for two specifications (cubic and quadratic polynomial) and two bandwidths (five and seven days), do we reject the null at a significance level of 5%. That is, parents might be able to select approximately the week or month of the birth but they cannot choose precisely the day of birth.

As we have already mentioned, part of the variation in the available school set comes from differences across municipalities in the composition of schools according to the date when the student must turn six to be age-eligible. This variation could be potentially problematic in the case that families would choose the municipality of residence as a way to improve their school set. We believe this concern is not relevant in the context of our analysis. First, Chile runs a national voucher system where the school choice is not restricted to the municipality of residence. Second, the evidence does not support that families change municipality when selecting schools in Chile.

Fig. 8
figure 8

Comparative Evolution of Municipality vs School change by grade

Using administrative records, Fig. 8 presents the evolution of three variables over school life: the fraction of students changing schools, the fraction of students changing municipality of residence, and the fraction of students moving to a school in a different municipality. First, the fraction of students switching municipality of residence is on average less than 5 percent per year, and it falls over school life. On the other hand, the fraction of students switching school over school life is on average 20 percent, and it increases over time. We observe a jump in grade 9 (first year of secondary school), with more than 50 percent of the students switching schools. Despite this jump in the fraction of students switching schools when moving to secondary school, there is not a jump in the fraction of students moving to another municipality. The data suggest that this switching of schools is accompanied by a change in the fraction of students looking for a school outside the municipality rather than moving to another municipality.Footnote 54

Fig. 9
figure 9

Ratio \(1^{st}\)grade enrollment/Births in the municipality, by day of birth

Figure 9 shows the ratio between the first-grade enrollment and births in the municipality by day of birth, which is quite stable across cutoffs (ranging from 0.95 to 1.05). Again, if being born on the “wrong” side of a given cutoff had caused migration between municipalities, we should see discontinuities in his ratio around cutoffs. We formally test for evidence of migration between municipalities using data from birth records. Specifically, for every cell defined by day of birth and municipality, we construct the ratio of enrollment to birth and we test for a discontinuity in this variable as we did for the rest of the variables. The results are presented in the last column of Table 3. For none of the specifications or bandwidths are we able to reject the null at a significance level of 5%.

4.3 Differences in schools across municipalities

The estimate of the contribution of school choice in equation (1) comes from the changes in a given outcome to the variation of the “doses of the treatment” (changes in school choice) that treated individuals (early entrants) were exposed to. While the exogeneity of the treated/control status is ensured by the use of the discontinuities in the probability of being an early entrant, the differences in the “doses” come from the relative reduction of the number of slots (over municipal population) at a given cutoff across municipalities. Even though schools choosing a specific cutoff can systematically differ across municipalities, municipality and cutoff fixed effects should account for this heterogeneity. However, the identification assumption behind our specification is that these relative reductions in the school set do not capture other school features of the schools in a given cutoff-municipality. Therefore, our results would raise concern in the case this source of variation was correlated with school characteristics choosing a given cutoff.

We explore this concern by estimating the following specification at the school level:

$$\begin{aligned} y_{scnt}= & {} \eta _{c}+\rho _{t}+\alpha _{n}+\omega |\Delta R_{cn}|+X_{i}^{'}\mathbf {\psi +\epsilon _{scnt}} \end{aligned}$$

with \({y_{scnt}}\) representing one of the outcomes at year t for school s, with a minimum age requirements in the \(n^{th}\) cutoff, and located in municipality c. Like in the previous specifications, \(\eta _{c},\,\rho _{t}\), \(\alpha _{n}\) and \(X_{i}\) represent municipality, calendar year, school minimum age cutoff fixed effects, and a vector of individual covariates, respectively.Footnote 55 Finally, the variable \(|\Delta R_{cn}|\) represents the absolute change in the measure of school choice at a given cutoff and municipality. The parameter of interest in this specification is \(\omega \), that tell us whether or not municipalities with a larger change in a given cutoff differ in a given observed school characteristic.

Table 4 School characteristics and change of school choice set

Table 4 presents the estimates of \(\omega \) for the probability of being a public or voucher school, average SIMCE score, average parents’ education, and class size. For none of these characteristics, we find significant estimates.

5 Results

5.1 First stage

The evidence in Sect. 4 supports the first requirement in our empirical strategy as a source of randomization: other observed and unobserved factors seem uncorrelated with the treatments. Specifically, we found that the induced variation resulting from the different cutoffs is not only uncorrelated with other pre-determined variables but also there is no sign that families can to either manipulate precisely the day of birth or to move strategically between municipalities. Nevertheless, we also need the minimum age requirements to produce a relevant variation in our potentially endogenous variables, early entry and its interaction with the potential slots per student. In Figs. 3 and 4 we have already shown that the age cutoffs are associated with an evident jump not only in the fraction of students entering school as soon they are eligible but also in the school choice set for those who are not delaying. We formally test for a relevant variation in the two endogenous variables in the analysis by estimating equation (2). For clarity in the exposition, we focus on the estimates of \(\delta \)s (the impact of holding a birthday over any of the first six cutoffs (\(\delta _{1}\)) or the last cutoff (\(\delta _{1}+\delta _{2}\))) and \(\gamma \)s (the difference in the impact of early entry associated with a higher school choice set) in Tables 5 to 7, that is, the impact of the excluded instruments. Moreover, as a matter of completeness, we also report the direct impact of the potential number of slots per student (\(R_{cb}\)).Footnote 56

Table 5 First stage estimates
Table 6 First stage estimates: Impact of age eligibility requirements and relative capacity. Sensitivity to bandwidth selection and degree of \(g(x_{i}^{n})\)
Table 7 First stage estimates. Heterogeneity by parents’ educational level

Table 5 presents the estimates for a local linear specification using a three (first four columns) and a six (last four columns) day bandwidth. For each of these two bandwidths, we show the estimates with and without other covariates (on top of the potential number of slots per student). The first element to be noticed is the qualitative robustness of the estimates when including additional variables in the model. This robustness to the inclusion of other covariates is consistent with the evidence presented in the previous section about the lack of correlation between observed predetermined variables and the treatments around the discontinuities. The second element to be noticed is the robustness of the estimates to the bandwidth used.

Regarding the estimated coefficients for the equations for EarlyEntry in the models with controls (columns 3 and 7), we observe first that, keeping slots per student constant, while being born after the first six cutoffs reduces the probability of delaying school entry in approximately four to five percentage points, this probability falls to approximately 38/40 percentage points for individuals around the last cutoff. That is, while students who lose eligibility for schools with an early cutoff are still able to continue searching for a school that grants them admission the academic year closest to their sixth birthday, students crossing the last cutoff (July 1or later) are forced to start school the following year.Footnote 57 Second, notice that the impact of the available school set also has the expected positive sign on the probability of early entry. Specifically, an increase in one slot per student (\(R_{cb}\)), that is, doubling the average number of slots in the municipality, increases the probability of starting early in approximately 32 percentage points for students with a birthday before any of the seven datelines (\(\theta \) in equation 2). Moreover, for students with a birthday after the first six datelines, but before July, the impact is approximately one to two percentage points higher; and for individuals with a birthday just after June 30, the impact falls approximately 20 percentage points, which is consistent with the fact that these last students cannot start school before the calendar year of their seven birthday.

The results for the equations of the interaction of EarlyEntry with the potential slot per student (\(R_{cb}\)), which can be interpreted as the potential school choice set for early starters, reveal that it is positively correlated with the potential ratio (\(R_{cb}\)), and negatively correlated with having a birthday after the first six cutoffs. Notice that, not surprisingly, this negative correlation is decreasing in the school choice set.Footnote 58

Following the equivalence with an IV approach, the value of the F-statistic for the null about the joint relevance of the excluded instruments (reported at the bottom of the table) suggests disregarding any concern about weak instruments for both variables. Specifically, we report the F-statistic for the null of all instruments being insignificant (F all instruments), just the one associated with the age cutoffs (F just cutoffs), and the one associated with the interaction of the cutoffs with the slots per student (F others). Although it is not a formal test for under-identification, the F-statistics suggest that the identification of early entry comes from the minimum age eligibility rules (cutoffs) and its interaction with potential slots, while the interaction of early entry with potential slots, is mainly from the interaction between the cutoffs and potential slots.Footnote 59

Fig. 10
figure 10

Average school’s SIMCE score and ratio vacancies/population, by age of entry

Table 6 reports the previous estimates using a quadratic and cubic (for the 6 day bandwidth) specification for \(g(b_{i})\). The estimates are pretty robust to the one observed with a linear specification. Finally, Table 7 reports the estimates for a 6-day bandwidth, dividing the sample according to the highest educational level of the parents. From this analysis, we not only confirm a relevant variation for these two subsamples, but also the robustness of our estimates across families with different educational levels.

5.2 School characteristics

Our results on outcomes related to the first schools’ characteristics suggest that, even though children who start school early do so in “worse” schools (in terms of their average SIMCE score and parents’ education) than children who delay, this impact is a function of the size of the school set. Therefore, a larger school choice set increases the probability that children who start early would end up in voucher schools, in schools with a higher average SIMCE score and with more educated parents. Figure 10 helps us summarize the main findings. The left panel presents the correlation between the average school’s SIMCE score and the ratio of vacancies to population for students who started school the first year that were eligible. The right panel depicts the same relation for children who delayed entry, that is, did not face any restriction in the set of schools. Moreover, to facilitate the comparison, we fit a linear model. Notice in the figure the positive association between the school choice set (measured by the ratio) and the “quality” of the school (measured by the average SIMCE score) for those children who start early, compared to those who delay entrance.Footnote 60

Table 8 Impact of early entry and school choice set. School characteristics

The estimates of equation (1) on school characteristics are shown in Tables 8 (complete sample) and 9 (by parents’ level of education). Specifically, Panel A of Table 8, shows the OLS estimates for the impact of early entry, and the potential slots per student.Footnote 61 Panel B and C show the RD estimates using the point estimate optimal and the inference optimal bandwidths, respectively. Finally, Panel D presents the under-identification and weak instruments test associated with the results presented in Panel B. From this last panel, we not only confirm an independent source of variation for each of the endogenous variables but also that the source variation induced by minimum age eligibility requirements are such that they allow us to rule out a concern about weak instruments. First, for the complete sample, and different from what is observed from the OLS, a larger school choice is associated with an increase in the probability that children would be enrolled in voucher schools of around 5 percentage points. Taking the difference between January and June in slots per student (around 0.3 slots), this implies an impact of an approximately 1.5 percentage point increase in the probability that a student would be enrolled in a voucher school (2.9% with respect to the sample mean). In terms of the penalty associated with SSA, closing the average gap in school set between students born in June and those born in January would imply a reduction of approximately 35% in the cost of SSA in terms of the reduction of the probability to start in a voucher school.Footnote 62 Focusing on the estimates for the rest of the outcomes, we observe that an increase in the number of slots available per student is associated not only with an increase in the school’s average SIMCE scores but also in the average parents’ education of children enrolled in the school. Specifically, an increase in a third of the slots per student is associated with an approximately 0.02\(\sigma \) increase in school’s SIMCE scores and approximately 0.09 years more in average parent’s education (0.8% in terms of the sample mean). These magnitudes in terms of the penalty of SSA would imply a 25% and 20% reduction in this toll in terms of school SIMCE and average parents education. The only result that may seem counterintuitive is in the case of class size, where we observe a positive impact of slots per student in class size. This finding is jointly consistent with parents/schools prioritizing other school inputs over class size and the larger class size observed in voucher schools (Epple et al. 2017).

Table 9 Impact of early entry and school choice set. School characteristics by parent’s education

Table 9 shows the heterogeneity of the previous results by parents’ education level. The upper two panels present the results using a MSE-optimal bandwidth while the bottom two show those for an inference valid bandwidth. For a given bandwidth type, the upper panel presents the estimates for parents with 12 years of education or less, while the lower panel shows those families with more than 12 years of education.Footnote 63 For this group of outcomes, the results are similar for both groups of parents.Footnote 64

5.3 Student’s individual performance

As in the case of school characteristics, Figure 11, which shows the correlation between the school choice set and standardized (math) scores by age of entry, summarizes our results on individual performance. Notice that, even though children who delay entrance tend to have better scores on average, scores for children who start early depend positively on the ratio vacancies/population.Footnote 65 Our results below also suggest that a larger school choice set decreases the probability of dropping out and the likelihood of switching schools during elementary school and has positive effects on standardized test scores (SIMCE). Moreover, students with more choices are more likely to take the college examination exam (PSU) immediately after high school graduation and, only for children of less educated parents, to enroll in a selective college.

Fig. 11
figure 11

Individual SIMCE score (Math) vs. ratio vacancies/population

Table 10 presents the estimates of equation (1) for the second group of outcomes and Table 11 shows the results by parents’ education.

Table 10 Impact of early entry and school choice set. School performance
Table 11 Impact of early entry and school choice. School performance by parent’s education

Panel A of Table 10 shows that OLS estimates suggest a negative effect associated with starting school early (\(\alpha _{1}+\alpha _{2}R_{cb}<0\)) but also that this impact is decreasing in the school choice set (\(\alpha _{2}\ne 0\)), for all the selected outcomes. Looking at the point estimates (Panels B and C), notice that, with the exception of the outcome of “dropout” and the one associated with the probability of taking the PSU examination the year of high school graduation, the point estimates tend to be lower than those from OLS.Footnote 66 Notice also that, while the school performance outcomes are robust to the bandwidth selection, the two outcomes related to college admission are only significant when using the MSE-optimal bandwidth.

Specifically, we observe a significant effect on dropout, with one more slot per student being associated with a decrease of approximately 4 percentage points. If we consider a change of around a third of a vacancy in the municipality (January–June), this implies an 8% reduction in terms of the sample mean or a reduction of approximately 60% in the penalty of an early start when closing the average gap between students born in June respect to the ones born in January.

Our estimates also suggest a reduction in the probability of switching schools later in elementary school. Specifically, an increase in one additional slot per student reduces by approximately 1.5 percentage points the probability of moving later on to another elementary school (or a little more than 10% reduction in terms of the sample mean). In terms of the penalty associated with SSA, the estimates suggest a reduction of 20% when closing the average gap in school choice set between students born in June with respect to those born the start of the year. This result suggests that a larger school choice set increases the likelihood of a better early match which reduces the future search for other schools and therefore reduces the future potential penalty of switching schools.

Regarding test scores, we also observe that a larger school choice set when being enrolled in primary school is associated with an increase of approximately 0.07\(\sigma \) in the math SIMCE, approximately 4% reduction in terms of SSA penalty.

Finally, the last two outcomes reveal that a larger school choice set is not only associated with an increase in the likelihood that a student would take the PSU, but we also observe an impact on the probability of being enrolled in one of the 33 best Chilean universities. Specifically, an increase in slots per student by one-third is associated with an increase of approximately 2 percentage points in the probability of taking the PSU (or 3% in terms of sample mean) and one percentage point in the probability of being enrolled in one of these colleges.

Table 11 presents the heterogeneity by parents’ educational level. The picture is clear. The previously reported findings are mainly driven by families with lower levels of education. Specifically, we observe that for SIMCE scores an increase of one slot per student is associated with an increase of approximately a 0.1\(\sigma \), a reduction of 5 percentage points in the likelihood of dropping out (30% with respect to the sample mean), and a decrease of 2 percentage points (14%) in the probability of switching school later on. We also observe, for this group of students, that an increase in the number of slots per student is associated with an increase in the probability of being enrolled in college that is robust to the bandwidth selected. Specifically, an increase by one-third in the number of slots per student is associated with an increase of approximately 1.6 percentage points in the probability of being enrolled in college, which in terms of the sample mean is approximately 8%.Footnote 67 This larger effect among families with less educated parents is in line with recent evidence reported for Denmark where SSA has an impact on the family as a whole. Specifically, (Landersø et al. 2020) show that SSA has an effect on mother labor force participation. Moreover, Gelbach (2002) shows for the USA that mothers with a child five years old age-eligible to attend public school increase their attachment in the labor market.

6 Conclusions

Using individual administrative records from Chile, we study the impact of school choice measured by the number of available slots per student in the municipality at the time of enrollment in first grade of primary education on a series of individual educational outcomes. By using a quasi-experimental source of variation in school choice, we uncover positive effects associated with a larger school choice.

We show first that the set of available schools induces a relevant shift in the opportunity to start in a better school measured by a non-high-stake examination. Moreover, this quasi-experimental variation reveals an important reduction in the likelihood of dropping out and a reduction in the probability that a child would switch schools over her/his school life. Second, for a subsample of students who have completed high school, we observe that a larger school choice at the start of primary school increases students’ chance of taking the national examination required for higher education and the likelihood of being enrolled in a selective college.