Introduction

Universalization of Elementary Education (UEE) is a constitutional provision and a national commitment in India. Universalization implies educating all children up to the age of 14 which is equivalent to completion of upper primary level of education. Over the years, efforts by the government towards achieving the goal of UEE were focussed more on the upper primary level of education. With the expansion of enrolment at the upper primary level of education, the pressure for expansion of the upper primary level of education has increased. The index on educational development and highlight the interstate disparity in development of elementary school education (Raju et al. 2008). The present study was an effort to analyze implications for provision of upper primary education facilities to ensure that this level of education is provided to all eligible children. This implies creation of easy geographical access conditions to enroll all the eligible children in upper primary schools, provision reasonable levels of physical and infrastructural facilities in upper primary schools and teaching learning materials in the classrooms to facilitate meaningful curriculum transaction. Mere assurance of physical access to education cannot guarantee quality education (Sengupta et al. 2004). Universalization of upper primary education in India is normally discussed in terms of enrolling and retaining all children belonging to the age group 11–14. According to Census of India 2011, literacy in India is 74.04 %. In other words, nearly a quarter of parents are still illiterate. This seems more to be a desirable goal to be achieved in the long run than a realisable target at the present levels of development of upper primary education. Enrollment is a function of the relevant age group at the upper primary level of education. However, enrolment in upper primary schools is more a function of upper primary education completion rates than a function of the relevant age group. It is logical to argue that all children in the relevant age group (11–14) cannot be enrolled in upper primary classes unless they complete upper primary level of education. In other words, all relevant age group children can be provided upper primary education only when all children of the upper primary school going age group are enrolled, retained and successfully complete the upper primary stage of education. Since upper primary education is not yet universalised, this implies that universalization of upper primary education means providing upper primary education for all children who have successfully completed upper primary level of education. The present study has adopted this as the operational definition of universalisation of upper primary education in India. However, once universalisation of upper primary education is attained, then there cannot be any difference between providing upper primary level of education to all age group children and those who complete upper primary stage of education. The effort at present needs to be to improve the inter stage transition ratios from primary to upper primary levels of education. DISE data set to generate a host of indicators directly related with educational performance Again (Sengupta and Pal 2012).

Materials

An effort has been made by the National University of Educational Planning and Administration (NUEPA) and the Government of India (MHRD, Department of School Education and Literacy, 2004) to compute an Educational Development Index (EDI) (Table 1), separately for Upper primary and Upper primary levels of education and also a composite index for the entire Elementary education which is exclusively based on the DISE data (DISE Flash Statistics, 2005–2006, 2006–2007, 2007–2008, 2008–2009, 2009–2010, 2010–2011, 2011–2012, 2012–2013, 2013–2014) for the nine consecutive academic sessions periods from 2005–06 to 2013–14. The first such attempt was made in 2005–06. A set of 24 indicators have been used in computing EDI which are regrouped into the four sub-groups, namely Access, Infrastructure, Teachers and Outcome indicators. The Indicators used for constructing EDI were pre-determined by a Working Group on EDI constituted by the MHRD during 2005–06 of which NUEPA was also one of the institutional members. There is no change in indicators this year as the same set of indicators used in 2012–13 has also been used in 2013–14. From last year, some indicators are newly introduced, some are deleted or modified. Percentage of habitations not served, percentage of schools with less than 2 teachers (in schools with more than 15 students) (upper primary schools only), percentage of schools with <3 teachers (upper primary schools/sections), repetition rate and ratio of exit class over Class VI enrolment (upper primary stage) have been removed. Newly introduced indicators are density of schools per 10 km2, percentage of classroom-teacher ratio 1:1, percentage of schools with kitchen-shed (Government and Aided School), percentage of single-teacher schools, average number of instructional days, average working hours for teachers, percentage change in enrolment in Government schools over the previous year and participation of Muslim children. Percentage of schools with ramp has been modified to percentage of schools required and have ramp. NUEPA has been pursuing the goal of creating a reliable system of statistics on school education during the last two decades through the District Information System for Education (DISE) which provides the basis for assessing the progress under SSA and on status of implementation of the Right to Education Act. The importance of this has further increased with efforts to extend the policy of universal education to cover secondary education stage of schooling also. Keeping this in view NUEPA is making a concerted effort to provide a unified system of school education statistics for all levels of schooling from elementary to higher secondary education. The present publication is outcome of this effort through which information from about 1.52 million schools have been collected. It is, indeed, a great pleasure for me to present a summary data base on elementary education in India through present publication. Including this publication, NUEPA brings out on annual basis seven publications exclusively based on DISE data under the following titles (DISE data, 2005–06 to 2013–14): (a) DISE flash statistics: elementary education in India, progress towards UEE: (b) elementary education in India: where do we stand; district and state report cards; (c) analytical tables; and (d) elementary education in rural and Urban India.

Table 1 Educational development index (EDI) normalised value of upper-primary level in India from 2005–06 to 2013–14 (Flash statistics, NUEPA)

Methodology

Cluster-outlier analysis

Cluster-outlier analysis was used to estimate spatial cluster among the states of India at upper–upper upper primary level educational development based on inverse distance weight (IDW) technique in ArcGIS environment (Mitchell 2005; Scott and Warmerdam 2005). The cluster-outlier field has been recognized with a statistically significant level (i.e., 0.05 level or so on); those are fell into cluster of high values (HH), cluster of low values (LL), outlier in which a higher values is surround by lower ones (HL), and outlier in which a lower values is surrounded by higher values (LH). The z scores and p values were then computed to measure of statistical significance. A high positive z score would prefer the surrounding features that have similar values (either high values or low values) and low negative z score (critical value as \( \pm 1.96 \) or 2.58) indicating a statistically significant (0.05 level or 0.10 level) spatial outlier consistence.

Hotspot analysis

Hotspot and cold-spot analysis is performed to delineate the spatial cluster of upper primary education level in India based on Getis-Ord Gi* statistic using fixed distance band in ArcGIS software (Mitchell 2005). The resultant Z score identified the states having the high or low values of cluster spatially. For statistically significant The positive and larger Z scores indicated the more intense the clustering of high values (hot spot) and negative and the smaller the Z score signified the more intense the clustering of low values (cold spot). A z score near zero indicates no apparent spatial clustering. The hot spot analysis tool calculates the Getis-Ord Gi* statistic for each feature in a dataset. The resultant Z score tells you where features with either high or low values cluster spatially. This tool works by looking at each feature within the context of neighbouring features.

The Getis-Ord local statistic is given as:

$$ G_{i}^{ * } = \frac{{\sum\nolimits_{j = 1}^{n} {w_{i,j} x_{j} - \overline{X} } \sum\nolimits_{j = 1}^{n} {w_{i,j} } }}{{\sqrt[S]{{\frac{{\left[ {n\sum\nolimits_{j = 1}^{n} {w_{i,j}^{2} - \left( {\sum\nolimits_{j = 1}^{n} {w_{i,j} } } \right)^{2} } } \right]}}{n - 1}}}}} $$
(1)

where \( x_{j} \) is the attribute value for feature \( j \), \( w_{i,j} \) is the spatial weight between feature \( i \) and \( j \),\( n \) is equal to the total number of features and:

$$ \overline{X} = \frac{{\sum\nolimits_{j = 1}^{n} {x_{j} } }}{n} $$
(2)
$$ S = \sqrt {\frac{{\sum\nolimits_{j = 1}^{n} {x_{j}^{2} } }}{n} - \left( {\overline{X} } \right)^{2} } $$
(3)

The \( G_{i}^{ * } \) statistic is a \( z \)-score so no further calculations are required.

Results and discussion

Aggregate of states with lower or higher values are easily detected in the local cluster analysis. Significant small clusters of educational development index were spread around the entire district (high–high); however, results also showed no significant spatial auto-correlation or a pattern of clustering of low values (low–low) in India during the study period shows in Fig. 1. There are other identified clusters, all associated with p-values greater than 0.05, which are not presented in the results shows in Table 2. The presence of extreme p values only in this current study (less than 0.01) show clearly that statistically significant clusters are identified. The result of our analysis showed Tamilnadu, Andhra Pradesh and Kerala facilitated by highly developed stage of educational condition; whereas, Bihar, Jharkhand, Odisha, West Bengal and Arunachal Pradesh showed very backward stage of educational development during 2005 and 2006. In between 2006 and 2007, Kerala and Tamilnadu portrayed highly developed stage of educational condition. Consequently, Bihar, Jharkhand, Odisha, West Bengal, Arunachal Pradesh and Meghalaya depicted backward stage of educational development. In 2007–2008, entire part of north-east India illustrated backward stage of educational development. During the period between 2011 and 2012, Andhra Pradesh, Tamilnadu and Maharashtra portrayed highly developed stage of educational development. However, in 2013–2014, Bihar, Jharkhand and West Bengal illustrated low level of educational development. There were some outstanding spatial clusters of educational development indices covering specific locations. Each hotspot analysis of educational development indices showed statistically significant hotspots and cold spots (P < 0.01) shows in Fig. 2. The result of the analysis showed clear spatial patterns of highly educational development condition that were mostly spread south and south-west part of India since 2005–2010. Some hotspots were also portrayed in the southern part of India from 2012 to 2014. However, most of the cold spots were found in the north east part of India. Some of the cold spots were also portrayed on the northern part of India during the period between 2007–2008 and 2010–2011. The map for 2005–2007 shows hotspot in the southern and in small pocket of western part; however, the cold spots were found in the north-east part of India. The map for 2007–2008 showed most of the hotspots were found in the west and south-west part of India, however, the similar backward stage of educational development indices cold spots were observed as in earlier period and also extend towards the northern direction. In 2009–2010, there was a slight decrease of hotspot of educational development condition states in southern part. In 2011–2012, there were increased of hot spot in the south and western part of India. In between 2012 and 2014, there were dramatic decreases of hotspot and cold spot areas in the southern and north-east part of India respectively.

Fig. 1
figure 1figure 1figure 1

Spatial cluster among the states of India at upper–upper upper primary level educational development based on inverse distance weightage of the academic year 2005–06 to 2013–14

Table 2 Analysis of high/low clustering for different years using Getis-Ord Gi* statistics
Fig. 2
figure 2figure 2figure 2

Maps showing the hotspot and cold spot over different years (2007–2011) in India

Conclusion

The result of this study show that developmental measures can be used to make out high and low risk areas more efficiently which aids in enriching developmental plans and policies in upper primary educational level. The present study provides planners with a more sophisticated tool to differentiate risk patterns of educational development using spatial statistical approaches rather than relying on annual cumulative incidence alone, so that high-risk areas can be expansively recognized untimely in the low educational development based on their integrated spatial–temporal profiles. Some limitation of this study should also be highlighted, viz. despite the relatively high coverage of geo coding of educational development cases; it is likely to have been worse in peripheral states of the country. Second, whole analysis has been performed based on the state wise data. Despite these limitations, the study showed GIS and GIS based spatial statistical techniques may provide an opportunity to clarify and quantify the development situation of development on upper primary level within re-develop the low developed areas, and lay a foundation to pursue future investigations into the socio-economic factors responsible for the educational development index (EDI). Spatio-temporal diffusion patterns and hotspot detection may offer useful information to sustain low EDI to control and predict development on upper primary level spread over critical hotspot areas only rather than for a whole region. Further, the methodology is based on notions on general principles of spatial statistics may employ the model to plan a strategy to control development on upper primary level by the information received on distribution and hotspots for various annually.