Introduction

The Italian school system is essentially public and highly centralized as concerns both financial and regulatory aspects. The central government covers teacher and staff salaries and their recruitment, while local governments support the costs for school buildings and their maintenance (provinces when it comes to higher secondary schools and municipalities in the case of primary and lower secondary schools). Regions plan the school network and calendar.

Despite the fact that the allocation of resources to schools is based on uniform parameters across the country, in recent years international and national assessments have highlighted significant regional differences in student performance in areas such as reading, math, science, and problem solving and a remarkable variability of results between individual schools in the same region. Even in lower branches of education, where there is a single track, between-school variance is particularly large: up to 45 percent of the total variance for both reading and math scores in the 2009–2010 INVALSI (Italian national institute for the evaluation of the school system) national standard assessment for grade 8. This means that, beyond the typical North–South divide, attending one school or another can result in significantly different outcomes (Bratti et al. 2007; Ministero dell’economia e delle finanze, Ministero della Pubblica Istruzione 2007). The difference is particularly pronounced in the South, where the share of between-school variance on the total is equal to 60.6 % for mathematics and 55.8 % for reading, compared to well lower values for the North and Central regions, respectively 18.3 and 40.1 % for math and 22.6 and 38.7 % for reading (INVALSI 2010).

The high variability of results between schools suggests that there is scope for improving of efficiency at the school level. Although public schools do not intervene directly in the recruitment and basic remuneration of staff, since the late nineties they have taken over a certain degree of autonomy.Footnote 1 Indeed, schools are directly responsible for running ordinary services and they may actively promote educational initiatives, by carrying out specific projects and/or modifying the annual teaching schedule up to 20 percent of the standard curriculum. Principals represent the school in labor relations with trade unions, concerning for example additional pay. They recruit and contract short-term substitute teachers, handle allowances to members of state exam committees, and provide for other teacher subsidiary payments. Schools also exert some influence on other matters that could have an impact in educational terms such as class size, teaching materials, and the availability of extra-curricular activities.

The central governments’ annual contribution to schools is roughly 41 billion euro for salariesFootnote 2 plus 3 billion for running costs, with reference to the year 2010. The latter are transferred to schools on the basis of parameters and criteria established at the ministerial level. School budgets are mainly funded by the central government (which accounts for approximately 69 percent of the overall school revenues), followed by individuals and families (18 percent), and municipalities, provinces and regions (about eight percent). The residual share of revenues comes from multiple and more volatile sources, among which a noteworthy contribution has come from the European Union and other international bodies, through grants for cultural exchange or regional development programs (and in this case they have been directed especially to schools in the Southern regions).

In 2010, there were about 10.5 thousand public schools in Italy scattered across 42 thousand service delivery points. Over the years, with an aim to trim down costs and improve efficiency of education expenditure, mergers between schools of the same educational cycle (horizontally) or even of different cycles (vertically) have been sought. Traditionally, primary (grade 1–5) and lower secondary (grade 6–8) education were delivered in separate schools. Today schools “specialized” in primary or in lower secondary education exist alongside the so called “comprehensive” schools, which cover all grades from pre-school to lower secondary. The idea driving this integration is that student learning can benefit from a more continuous educational framework and scale economies can be realized as well. Merging schools across different grades was originally promoted under Law 97/1994 and subsequent provisions. Although there is little evidence on whether comprehensive schools yield better results in terms of performance and efficiency, legislation (Decree Law 98/2011) attempts in making them compulsory for schools under a certain size (i.e., under 1,000 students or 500 in special cases, such as schools in small islands, in mountain communities or in geographic areas characterized by cultural minorities).

This work aims to contribute to the debate by exploring efficiency of primary and lower secondary schools in a productivity perspective. Input factors are directly related to the allocation of financial and human resources in schools and to the socio-economic and cultural background of students, while output measures are related to student performance. Given a strong between-school variance in school performance, we investigate whether some schools manage outputs better than others with the same inputs and then identify factors that explain these differences.

Unlike the Italian case, for which empirical evidence on the subject is still sporadic, the international literature on this subject is growing and various approaches are used to analyze the efficiency of schools using both non-parametric techniques, such as data envelopment analysis (DEA) and free disposal hull (FDH), and parametric techniques based on stochastic frontier analysis (SFA).

The literature features a wide diversity with regard to the choice of both inputs and outputs. As for inputs, a first strand of work is based mainly on human resources, i.e. the number of teachers and technical and administrative staff employed by schools (Essid et al. 2009; Oliveira and Santos 2005; Maragos and Despotis 2004; Barbetta and Turati 2003). Other work focuses on the provision of financial resources and equipment (Agasisti 2009; Aaltonen et al. 2006), while a third line also includes variables relating to socio-economic and cultural background of students or the school (Agasisti et al. 2012; Tyagi et al. 2008; Portela and Camanho 2007; Mancebon-Torrubia et al. 2010; Ruggiero and Vitaliano 1999).

Outputs typically try to capture school outcomes in terms of student performance, but only a limited number of studies refer to direct measurements of skills or competence levels (Agasisti 2009; Mancebon-Torrubia et al. 2010). Most contributions use indirect measures such as the share of students which passed the exams or average grades achieved by students in specific disciplines or the share of students who continued their studies.

The unit of analysis does not always coincide with the school but may refer to the level of government responsible for the organization of the education service such as municipalities or school districts (Ruggiero 1996; Aaltonen et al. 2006). Most contributions focus on a specific school cycle (the primary or secondary) and usually refer to a limited sample of schools. These are often schools of a same region or specific area (Barbetta and Turati 2003; Tyagi et al. 2008; Maragos and Despotis 2004; Ruggiero 1996; Ruggiero and Vitaliano 1999) or small national samples (usually \(<\)500 schools).

Depending on the topic of specific interest, some researchers explore whether environmental, geographical and socio-economic factors play a role in school efficiency (Oliveira and Santos 2005; Tyagi et al. 2008; Aaltonen et al. 2006; Portela and Camanho 2007); others study the effects of increasing competition between schools or promoting greater managerial autonomy (Bradley et al. 2001, Bradley and Taylor 2008) and, still others, investigate the role of structural features (typically school size) or ownership (private/public/nonprofit). The competition theme is particularly developed in the Anglo-Saxon literature and, in particular, in the debate around introducing quasi-markets in the school system.

Previous analysis on the Italian case are limited. Barbetta and Turati (2003) deal with the efficiency of lower secondary schools in Piedmont and verify whether the ownership structure of schools (public, private for-profit and nonprofit) accounts for its efficiency. They suggest that non-profit private schools are generally more efficient than public ones but note a possible skimming effect, whereby students with greater performance concentrate in the most efficient schools. For example, in their data, public schools have a higher number of disabled students compared to private schools. On the same line, Agasisti (2009) concludes competition has a positive impact on the efficiency of Italian high schools, using the average OECD PISA-2006 score in science and mathematics as the output variable.

While estimating the efficiency of primary and lower secondary schools with a DEA technique on samples of over 1,000 schools, this paper introduces some novelty. First, it considers among the output variables the results in math and reading according to the INVALSI standard national assessment. Secondly, it uses detailed information from school budgets allowing an insight not only on the amount of expenditure managed, but also on possible financial difficulties schools face. Third, it focuses on the identification of determinants of efficiency related to the territorial dimension (with particular reference to the geographical location of schools and degree of urbanization), the type of school organization (comprehensive schools or “specialized” only in primary or secondary education), the management capacity, and the composition of school spending and funding sources.

The paper is organized as follows: the next section presents the data and methods used to measure the relative efficiency of each school and the variables used in estimating the empirical model; the third section presents the main results, and the last section provides some policy indications, including discussion on how efficiency measurement can be used to promote benchmarking exercises between schools and push the system towards better outcomes with the same amount of resources.

Data and methods

Estimates of efficiency of production units are generally based on two approaches: data envelopment analysis (DEA) and stochastic frontier analysis (SFA). The first is a non-parametric method; it does not require a specific functional hypothesis on how inputs are transformed into outputs and can be applied in a setting of multiple outputs. The second is a parametric method based on a production function estimated through the model. In both cases, the production unit is an organizational structure and autonomous decision-making unit (DMU). DMUs should be uniform (all use the same types of resources to produce the same types of results), independent (one unit must not, in any way, affect the performance of another) and autonomous (each unit must have the ability to decide how to use its resources to produce results).Footnote 3

DEA is a deterministic method; it is sensitive to sampling variation and to measurement errors and outliers. It also requires a second stage procedure to analyze efficiency determinants.Footnote 4 The statistical foundation of the second stage approach is subject to intensive debate due to the bias and serial correlation of the DEA efficiency scores. To address this problem, several authors propose the use of a bootstrap method to correct for the small sample bias and serial correlation of the DEA efficiency estimates (Simar and Wilson 1998, 2000).

Despite its drawbacks, given the multiple outputs and inputs in education and the lack of specific assumptions on the function relating outputs to inputs, DEA is better suited for the estimation of school efficiency than its parametric counterparts.Footnote 5 Moreover, the efficiency frontier drawn by DEA is composed by the most efficient observed units and their linear combinations, allowing for the identification of best practices to provide guidance to less efficient schools. Indeed, benchmarking exercises can be carried out easily by analyzing distances of each school from the frontier. The idea is that if a school can produce a certain level of output utilizing specific input levels, another school of equal scale should be capable of doing the same. The most efficient schools form a virtual combination of schools, allowing the computation of an efficient solution for every level of input or output. The analysis performed in this article is based on two large samples of public schools: one for primary education and one for lower secondary education (both containing specialized as well as comprehensive schools), with reference to the school year 2009–2010. The samples are those built by INVALSI for the national student assessment and the student cognitive tests are administered by external observers in order to avoid cheating or student skimming.Footnote 6 These are statistical samples, with regional significance, drawn according to a two-stage procedure: first the schools and then one or two entire classes of students in each school.

Average INVALSI test scores for each school were combined with other sources of information. Namely, the Ministry of Education provided administrative data on personnel and school service characteristics, detailed data on revenue and expenditure come from school budgets, and statistical variables related the school’s geographical location are from the national statistical institute (ISTAT). We managed to merge data from all sources of interest for 1,098 schools in primary education (against 1,385 from the original sample) and 1,061 schools in lower secondary education (against 1,309 from the original sample).Footnote 7

In addition to school spending (net of salaries) per student, the number of teachers per student and the number of other staff per student, our input variables include a characterization of the school’s catchment area in terms of average socio-economic and cultural background of its students. Indeed, results in terms of student learning outcomes are highly related to cognitive abilities and skills gained by students in previous years and the socio-economic conditions of their families and peers (e.g., Brunello and Checchi 2006; Schütz et al. 2005; Woessmann 2003; Hanushek 2002). Since Italian public schools cannot select their students, the choice of including the social and economic background among inputs in the model is questionable. However, given its relevance, it has been considered to account for its major effect on the outputs, as done by several authors (Oliveira and Santos 2005; Tyagi et al. 2008, Aaltonen et al. 2006; Portela and Camanho 2007). The use of an all-in-one approach DEA estimation—including both discretionary and non-discretionary variables for DMUs—is customary when the latter are continuous and internal to the production process. By using an output-oriented model as in our case (see paragraph 3), all inputs are implicitly treated as non-discretionary anyway (Fried et al. 2008).

Among the inputs of our model, data on spending per student represents the total payments of the school budget on the total number of students served: roughly this corresponds to the schools’ running costs per capita and does not include staff salaries. Schools spend directly for ordinary bills and consumption of goods and services, extra-curricular educational projects, short-term substitute teachers, extra subsidiary payments for their staff, while salaries are paid by the central government and for this reason they are not included in the school budget.

Data on students and teaching staff refer either to the primary or to the lower secondary grades. Even in the case of comprehensive schools, the number of teachersFootnote 8 per pupil is calculated-separately for each educational level (i.e., only for the primary grades or only for lower secondary grades, depending on which sample is being analyzed). On the other hand, administrative, technical and auxiliary staff is associated with the school in its entirety, regardless of the grades, and it is compared to the total number of pupils.

The average socio-economic status and cultural (ESCS) index is calculated by INVALSI on the basis of a student questionnaire.Footnote 9 This variable depends on the context and is largely influenced by location (on which the school can hardly, at least in the medium term, act). The index summarizes the answers given by students regarding the employment status of parents and their educational level, and the presence of some material goods at home: for example, the availability of a quiet place to study, a desk for homework, books, encyclopedias on paper or in digital form, an Internet connection, a single bedroom, the number of bathrooms, a car, etc. At the national level, the index is normalized to a scale with mean equal to 0 and standard deviation equal to 1 (Campodifiori et al. 2010).

Input values are shown in Table 1, for the whole sample (ITALY) and for macro-regional breakdowns.Footnote 10 The average level of expenditure is approximately 740 euros per capita in primary schools and about 640 euros in lower secondary. Average per capita expenditure tends to be greater in the South and the Centre than in the North, especially in primary education. In some areas of the South, for the period under consideration, this is the effect of a relevant additional contribution to school budgets from projects financed by the European Union structural funds.Footnote 11 In primary education there are about 10 teachers per 100 students, with slightly greater values in the North and lower values in the South; in lower secondary education the average teacher to student ratio amounted to 10.8, with greater values in the South, where variability is also larger. Finally, the other staff to student ratios presents even larger variations within each macro-area: the average is just over 3 personnel units for 100 students, but the range in our sample displays from a minimum of about 1.5 to a maximum of over 10.

Table 1 Input variables

Values of the average ESCS index are in line with expectations, with Southern regions presenting a higher concentration of disadvantaged backgrounds. It is worth to note that the coefficient of variation is large in all areas but it presents a considerably greater variability in the North. There is no sign of significant correlation between input variables and, in particular, between school expenditure (net of salaries) per student and average ESCS, even within separate macro-regions.Footnote 12

We consider two output variables: the INVALSI average school performance scores in math and Italian (reading). The average score provides the level of performance of respondents accounting for the difficulty of each test item by applying Rasch methods. By convention, the score is normalized to a scale with mean 500 and standard deviation 100.

In primary classes, the national assessment is conducted in grade 2 and 5. In this case, in order to have a summary indicator for the primary level, we built two weighted indicators, one for math scores and one for reading scores, with weights equal to the actual number of participants to the tests in each grade. For lower secondary classes, the national assessment is conducted in grade 6 and we consider the scores for both the math and the reading performance separately. Average student performance typically suffers the traditional Italian North–South divide, whereby schools in the North display higher scores than those in Central and Southern regions (Table 2).

Table 2 Output variables

The provincial-level maps in Figs. 1 and 2 show an even more complex situation. School scores above the national average are also achieved in several areas of the South, particularly as regards math scores in primary grades. There is a considerable variability of outputs both as concerns reading and math, and within-area variability is more pronounced than between-area variability. In fact, in all macro-areas, the difference between the worse and best performing province is roughly 100 score points, considered to be equivalent to approximately two years of education. In lower secondary grades, however, the score of the worse performing province in the South is way lower than its equivalent in the North (383 against 484 as concerns reading and 410 against 471 as concerns math). Heterogeneity characterizes mostly the South, which displays a standard deviation generally \(>\)20 for both math and reading scores—a difference close to half a year of education. Average performance in math is usually more variable than performance in reading (see “Appendix”, Tables 8, 9).

Fig. 1
figure 1

Outputs for primary education: INVALSI scores (a) reading (b) math

Fig. 2
figure 2

Outputs for lower secondary education: INVALSI scores (a) reading (b) math

Within-area variability of inputs tends to be remarkable as well, and even greater than the within-area variability of output variables. Again the differences are stronger among Southern provinces than among provinces in the other macro-areas. The provincial distribution of teachers per student tends to be more concentrated towards lower values, with a longer tail to reach the maximum value and a greater range in the South. The same holds for the ratio between other staff and students, although maximum values are less distant from the third quartile. School expenditure (net of salaries per student) is generally skewed towards lower values and differences between the last 25 per cent of provinces are greater than between the first 25 per cent, especially in the Center and South. Finally, within-area variability is greatest when it comes to the ESCS index. Despite the fact that in this case Northern provinces display a wider range (from \(-\)0.74 to 0.30 in primary education and from \(-\)0.35 to 0.56 in secondary education), the Southern provinces still dominate the others in terms of variability.

Results and discussion

As mentioned, in our specification the output variables are student performance measures based on school average INVALSI math and reading scores. The inputs are resources used by each school, proxied by four variables: teacher–student ratio, other school personnel-student ratio, school expenditure per student and the average socio-economic status of students (average ESCS). The analysis was carried out separately for the primary and lower secondary education segments.Footnote 13

The DEA models can be estimated by adopting two different approaches, both based on the concept of technical efficiency, i.e. the ability of the DMU, given the current technology, to produce the maximum output level from a given combination of input (output-oriented model), or alternatively to use the least possible amount of input to obtain a given output (input-oriented model). Our models are output oriented, given that schools have a higher control on output than on input. Indeed, human and financial resource allocation is determined by the central government decisions and the socio-economic and cultural background of students is often a consequence of the school’s location. Nevertheless, schools have indirect control over these variables. First of all, through decisions on class making, affecting size (and therefore the number of teachers per pupil) but also on the way students are grouped (possibly to take into account different levels of social or educational difficulties). Schools also decide whether to implement special education projects, schedules, and all matters affecting the management of day-to-day service. Although principals do not choose teachers and only marginally decide their salary levels, school coordination activities can influence the degree of cooperation among personnel and with families, and can support the choice of the “best” educational projects.

Primary and lower secondary school efficiency and potential improvements

The results shown in this paper refer to bias-corrected DEA scores obtained after running 200 bootstrap replicates. For the sake of simplicity, scores are normalized to a scale between 0 (minimum efficiency) and 1 (maximum efficiency).Footnote 14 School efficiency scores were calculated by assuming variable returns to scale of the production factors (VRS). From an interpretation perspective, given that quality of inputs is not directly taken into account in the model, variable returns to scale seem preferable. To support this intuition, we also carried out the Bogetoft and Lars statistical test (Bogetoft and Otto 2011).Footnote 15

Strong variability of the main input variables raises the question of whether the production context is sufficiently homogenous to consider a unique efficiency frontier for all Italian schools. Descriptive statistics in Table 1 suggest the presence of three distinct groups along the North, Central, South axis. There also are some differences in the institutional context: European structural funds are a major additional revenue opportunity only for schools in the South. Therefore, DEA estimates were calculated for separate subsets of schools belonging to the same macro-region.

The resulting VRS bias-corrected efficiency scores are provided in Table 3 and Figs. 3, 4. They are on average high both in primary and lower secondary education and in all macro-regions. However, there are significant differences in the degree of efficiency at individual school level.

Fig. 3
figure 3

Primary education: histogram of the bias-corrected efficiency scores Variable Returns to scale (VRS) for North, Centre and South Italy

Fig. 4
figure 4

Lower secondary education: histogram of the bias-corrected efficiency scores Variable Returns to scale (VRS) for North, Centre and South Italy

Table 3 DEA bias corrected efficiency scores for North, Centre and South

North and Center regions present a greater concentration of more efficient schools and smaller variability, while Southern schools display a lower average efficiency. In fact, average efficiency scores for this macro-area are similar to those of the least efficient schools in the North and Centre and the whole distribution has a longer left-hand tail.

In order to verify the intuition that there are different frontiers, we checked whether the three groups of territorially defined schools come from a common efficiency distribution or not. Basically, the efficiency scores from the three DEA models were compared to the results of a DEA model on the overall national sample with the Wilcoxon-Mann-Whitney signed-rank test. The test dismissed the null hypothesis of single macro-regions and the overall sample belonging to the same population, with the sole exception of the efficiency scores related to the Centre lower secondary schools (Table 4).

Table 4 Wilcoxon-Mann-Whitney signed-rank test between national and macro-region efficiency distributions

This result suggests that it may be more appropriate to carry out benchmarking exercises for schools within their own geographical area.

In order to measure the potential improvements, outputs of inefficient DMUs were expanded proportionally by multiplying the efficiency score times the output.Footnote 16 By summing these “slack” values we obtain a measure of the average potential improvement, given the observed inputs. Data in Table 5 indicate there are, in general, greater (absolute and relative) margins of improvement in primary schools than in lower secondary schools across all macro-areas. However, the potential rate of improvement in the South is more than twice the one in the other areas: given the current level of available resources, if all schools would perform like the most efficient ones, the gain in efficiency would be around 36 percent for primary schools and 28 percent for lower secondary schools (against values of at most 18 and 11 percent in the Centre and North).

Table 5 Potential improvement of school productivity on the output side

Finally, to check the robustness of our reference model (based on four inputs), we estimated a second model based on three inputs, i.e. excluding the average ESCS. There are no dramatic changes in the efficiency patterns. Kendall rank correlation between the efficiency scores of the reference model and second model is over 0.86 in both primary and lower secondary education. This result hints to the fact that, despite ESCS being a strong indicator for the context around the school, efficiency may be influenced by other contextual factors that are not considered in our reference model and should be identified in the second-stage analysis.

Determinants of school efficiency

Having ranked schools according to their efficiency scores, we are interested in investigating whether there are factors related to their organizational structure or to the external context, which play a role in making one school more efficient than another. In particular, we consider the following aspects:

  • features related to the management capacity of schools in terms of revenue sources, the composition of expenditure and financial balance;

  • organizational characteristics, such as “specialized” versus “comprehensive” schools, the number of service delivery points (e.g., the same school can have different buildings in different areas of the same town or, sometimes, in different municipalities when they serve smaller and remote areas), but also the presence of tenured staff, of full-time students (staying for programs in the afternoon after compulsory classes are held), of teachers assisting students with special educational needs (SEN) and class size;

  • territorial characteristics that may influence the ability to exploit economies of scale, such as the remoteness and degree of urbanization;

  • features reflecting the economic status and social cohesion of the area in which the school is located.

Descriptive statistics for the explanatory variables are reported in “Appendix” (Tables 10, 11).

A major financial issue characterizing Italian public schools is the large amount of revenue arrears claimed from the central government. Revenue arrears have accumulated over time and often date several years ago. At the same time, a significant share of schools is in financial trouble because their end-of-year accounts display an amount of cash which is lower than their obligations. In 2010 the percentage of schools with liquidity unable to cover annual commitments was about 33 per cent in the Centre, 21 per cent in the North and 38 per cent in the South. In other words, although there is no sign of the arrears being refundable, schools might be inflating the budget on which they plan their annual activities, even when they end up not being able to pay for them.

In order to explore these issues more in depth, a second stage analysis was conducted to identify determinants of efficiency for primary and lower secondary education by adopting a Tobit model,Footnote 17 with standard error corrected for heteroscedasticity using the White procedure, as follows:

$$\begin{aligned} {\text {EFF}}_{{\text {i}}} = {\text {f}}({\text {Z}}_{{\text {i}}} ,\beta ) + {\text {u}}_{{\text {i}}} \end{aligned}$$
(1)

The observable dependent variable (EFF\(_{s})\) is given by the bias-corrected efficiency scores, assuming variable returns to scale.Footnote 18 The possible determinants of efficiency (\(Z)\) include context variables related to school location,Footnote 19 financial variables concerning the school budget,Footnote 20 variables concerning the structural and organization characteristics of schools.Footnote 21

Results are presented in Tables 6 and 7, separately for primary and lower secondary education, and for the three different macro-regions. For both levels of education, our findings show distinct efficiency determinants for each macro-area. The main common factor refers to critical financial issues, which do not seem to carry any explanation for efficiency: in all three macro-regions and for both levels of education, the level or persistence of revenue arrears and the ability to meet spending commitments with available liquidity do not have a significant impact on the degree of efficiency.

Table 6 Tobit estimates of efficiency factors for North, Centre and South Italy-primary education
Table 7 Tobit estimates of efficiency factors for North, Centre and South Italy-lower secondary education

Starting with primary education, one can note that among all variables taken in the model, factors such as the family and private sector financial contributions to the schools, the percentage of revenue arrears from previous years and the number of service delivery points of the school do not seem to have any relation with the degree of efficiency.

In Northern regions, the percentage of tenured teachers and of capital investment are both positively related to efficiency. Since mobility of tenured teachers is reduced respect to annually-contracted ones, this result hints to the fact that it may be more efficient for a school to keep the same teachers from one year to the other, reducing the need for integration and coordination. The greater experience and stability of the staff and greater familiarity between teachers, students, and their families may also provide benefits. Moreover, a investment is a typical proxy of the “quality” of expenditure, so its positive effect is not surprising. Conversely, being all other variables equal, the percentage of immigrants has a negative effect on efficiency. This raises the issue of the importance of caring for the increasing share of non-nationals in the student body in the more economically developed part of the country.

When it comes to primary schools in the Centre, the larger the number of students, the more efficient the schools are, representing the benefits of scale economies. Female employment also stands out as a major (positive) element for efficiency. Its role could be related to the more favorable economic context characterizing areas with more female employment, but also to the greater motivation working mothers might have for better educational performance of their children. Although they might have less time to check on homework or school activities, they might have greater esteem for teacher’s instructions (often working women like them) and more awareness of the fact that early age education counts for future job opportunities. This factor is partially counterbalanced by the negative effect of the percentage of full time pupils (which is usually positively related to female employment, since working mothers keep their children in school in the afternoon). Longer school hours for these students apparently result into higher costs which are not offset by greater student performance, thus yielding a negative effect on efficiency. Finally, the share of teachers assisting students with special needs (SEN) is marginally significant, with a negative impact on efficiency.

Focusing on schools in the South, we identify only two statistically significant variables and of opposite sign. Once again, female employment has a positive effect on school efficiency. But, as concerns management issues, comprehensive schools (i.e., schools where both primary and lower secondary education are delivered) present an unexpected negative impact. In other words, in Southern Italy, comprehensive schools seem to be less efficient than schools that specialize in one or the other education segment. This may be worth of special attention in the view of recent Italian legislation which requires the fusion of all primary and lower secondary schools into comprehensive schools. The underlying idea is to improve learning conditions in the transition between the two education levels and to rationalize expenditure by exploiting scale economies as school size increases. This action is more recent in Southern regions than in Northern ones.

One could suppose comprehensive schools to be characterized by a greater number of service delivery points than other schools— thereby affecting school costs. It turns out that this variable is not necessarily associated to a more fragmented organization, especially in the South (see “Appendix” Tables 12 and 13).Footnote 22 Therefore, comprehensive schools carry some other relevant characteristic, possibly more intrinsic, which affects efficiency. They need to adjust to the presence of teachers trained for different education purposes and to tackle with younger and older students featuring different needs. They are usually directed by a principal carrying a specific background experience in only one of the two education segments. Being this factor significant only in Southern schools, it might depend on the more recent presence of comprehensive schools in this area and could be a short-term effect.

The efficiency in lower secondary education does not seem affected by the share of teachers assisting students with special educational needs (SEN) and the number of service delivery points of the school, just like in primary education. However, family and private sector financial contributions play a role in the North and Center and the negative effect of comprehensive schools in the South disappears here. Finally, in all macro-areas, the economic and social benefits of the context related to greater shares of female employment is always positive and significant on school efficiency.

The share of students with an immigrant background continues to play a negative role on efficiency in the North and, again, the percentage of tenured teachers is positively related to efficiency. Other factors associated to greater efficiency are the volume of family and private sector financial contributions (pointing out to the potential benefits of direct participation and involvement of the local environment to the school’s mission) and class size (with its scale economies).

These variables are all significant and in the same direction also when examining the schools in Central Italy. The only exception concerns private contributions, which present a negative coefficient here, although of low magnitude and only slightly significant. Furthermore, school efficiency in the Centre is affected by the percentage of full-time pupils like in primary education. However, somehow in contradiction with the previous case, this variable presents a positive relation with the efficiency score. One possible interpretation is that the long hours are too demanding for smaller children in primary education, while they are more valuable for the older ones, in lower secondary education.

In Southern Italy, beyond the contextual positive effect of female employment, the only other significant variable is the percentage of tenured teachers, and it has, as expected, a positive effect.

Conclusion

Despite a highly centralized system, Italian schools are characterized by two different dynamics that are particularly persistent in time: the variability of resources (expenditure per pupil and the number of teachers per student) and the variability of the results in terms of learning outcomes in areas such as reading and math. A significant variability between different schools in the same area suggests it may be useful to study school efficiency and its determinants. The issue is recent in the Italian debate and the relevant literature is limited to the exploration of a restricted number of schools in a specific province or region.

In this paper, efficiency analysis has been addressed by using an INVALSI national sample of public primary and lower secondary schools, matched with information on the characteristics of the schools, their budgets and on the geographical context. Descriptive statistics and institutional features suggest the presence of three distinct groups related with the Italian geographical macro-areas (North, Centre, and South), further confirmed by specific statistical tests. The reference model DEA estimations and second-stage analysis were, therefore, conducted separately for each macro-area.

Relative efficiency scores obtained by using output-oriented VRS DEA models and bootstrap procedures show that, despite a rather high average efficiency in both primary and lower secondary education, there are significant differences in the degree of efficiency at individual school level. Keeping inputs constant, there are greater margins of improvement in primary schools respect to lower secondary schools. Moreover, in both levels of education, potential productivity gains are especially large in the South where given the same level of available resources, if all schools performed like the most efficient ones, outputs would increase about 36 percent in primary education and 28 percent in lower secondary education.

These efficiency scores can be used for benchmarking purposes, pushing schools towards better outcomes with the same amount of resources. However, the different efficiency frontiers underlying the three macro-regions suggest that such exercises make sense within the same geographical area, but not by comparing Southern schools with Northern ones. INVALSI regularly provides schools with information on the distribution of student performance in math and reading at school, province, regional, and national level. Putting these results into context by providing each school with a measure of its potential improvement—while holding inputs equal—may provide an indication on the effort that should made (and be a basis for incentive schemes). Furthermore, in some cases this could induce principals to examine other schools— with a range of inputs similar to theirs but much better results—and understand what kind of organizational and managerial features are making them more efficient.

The second stage analysis shows that determinants of efficiency vary according to education level and macro-area, offering several interesting interpretations. Results of the second-stage Tobit estimation models point out to the fact that, contrarily to general expectations, critical financial issues which could be affecting schools (such as the persistence of a great amount of revenue arrears in their budgets and the inability to meet spending commitments with available cash) have no role as concerns their efficiency. Moreover, the number of service delivery points does not seem to count.

In general, for all macro-areas, some organizational variable and some context variable presents a statistically significant effect on school efficiency, but their sign and what they measure may suggest that different strategies are necessary to support efficiency improvements in different areas of the country.

A greater share of tenured teachers (i.e., more stability and greater experience of the staff) tends to characterize more efficient schools in various macro-areas, especially in the case of lower secondary education. Other factors, such as the share of capital investments (i.e., quality of the expenditure) or of family and private financial contributions (i.e., direct participation and involvement in the school’s mission) and class size (scale economies), may also play a role. Holding all other variables equal, the share of full-time pupils seems to characterize school efficiency in the Centre of Italy albeit with a different sign depending on the level of education (negative for primary education and positive for lower secondary education), maybe hinting to the different burden that longer hours of school produce on students depending on their age. In the South and for primary education, comprehensive schools yield an unexpected negative effect, which could point out to the difficulties in implementing legislative provisions for the rationalization of schools in that area of the country.

Finally, the analysis shows that contextual features weigh on efficiency despite the fact that the school’s average social and economic background (ESCS) was specified in the model as an input. Indeed, the share of immigrants and female employment—which determine, respectively, a negative and a positive impact—have a robust effect in most areas and levels of education. These variables are notably far from the school’s control. Therefore, in addition to actions directed to the school itself, policies aimed at increasing school efficiency might also have to consider taking action on the development and involvement of the surrounding local environment.