1 Introduction

Assessment of groundwater quality is critical for resource planning and environmental management (Mohamed et al. 2019). Since water quality status can be described by a number of physical, chemical, and biological parameters, large amounts of data make assessment and comparison significantly difficult (Reisenhofer et al. 1998). The most important benefit of water quality indices (WQI) is to provide a simplified approach for the assessment of water quality by greatly minimizing the data volume and simplifying the expression of water quality status (Al-Omran et al. 2018). Thus, the use of these indices in ecosystem monitoring programs makes it possible to inform the general public and decision-makers about the state of the environment (Tiwari et al. 2016; Venkatramanan et al. 2017; Bilgin 2018; Wu et al. 2020). This approach can also help to provide a benchmark for evaluating the successes and failures of management strategies in improving water quality, and it will also indicate what actions should be modified (Katyal 2011). Thematic maps based on water quality indices provide a comprehensive picture of a given environmental problem and are easily understood by those outside the scientific field (Stigter et al. 2006). Thus, the use of water quality indices has become a common practice in describing the status of surface and groundwater (Liou et al. 2004; Espejo et al. 2012; Bora and Goswami 2017; Bouslah et al. 2017; Jha et al. 2020).

However, water quality varies according to the type of use; therefore, acceptable water quality criteria depend upon the characteristic conditions, and these vary from time to time and from region to region. The time of sampling also significantly influences water quality parameters and hence the index value. However, it is extremely difficult to develop a universally acceptable general water quality index (Poonam et al. 2013).

After Horton’s first WQI (1965), numerous indices have been developed in the past decades which display water quality in a single value by comparing different parameters as per the standards. Such indices are the US National Sanitation Foundation Water Quality Index (NSFWQI) (Sharifi 1990), the Canadian Council of Ministers of the Environment Water Quality Index (CCMEwqi) (Lumb et al. 2006), the British Columbia Water Quality Index (BCWQI), the Oregon Water Quality Index (OWQI) (Abbasi 2002; Debels et al. 2005; Kannel et al. 2007), and the weighted average WQI developed by Brown et al. (1970).

Brown et al. (1970) suggested a multiplicative form of the index, where the weight of each parameter was created according to the subjective opinion of the author, based on the given research. The weight assigned to a given parameter (which reflects the significance of the parameter for a given use) has a significant effect on the index. The CCMEwqi compares observation values to a benchmark instead of normalizing observed values to subjective rating curves, where the benchmark may be a water quality standard or site-specific background concentration (CCME 2001; Khan et al. 2003; Lumb et al. 2006). Therefore, this is an advantage of the index, which can be applied by water agencies in different countries with little modification.

Various water quality indices have been examined in several studies, which state that almost all water quality indices depend upon normalizing data by parameter, according to expected concentrations and the interpretation of “good” versus “bad” concentrations (Liou et al. 2004). Pesce and Wunderlin (2000) concluded in their study that the water quality indices examined were generally correlated with the measured concentrations of different parameters. Štambuk-Giljanović (1999) also used different WQIs for the assessment of Dalmatian surface and groundwater supplies. They found that modified arithmetic indices were best suited for discriminating sites according to their water quality condition.

Many studies use different indices to analyze water quality. Backman et al. (1998) carried out a water quality classification in the interior areas of southwestern Finland and Slovakia, where the practicability of the pollution index developed by Rapant et al. (1995) was tested in two different geological areas. The index takes into account the hydrochemical parameters that exceed the contamination limit. Their study concluded that the pollution index is well suited to classifying water quality and cartographic representation. Soltan (1999) used the water quality index to calculate the water quality of the artesian wells in the Dakhla oasis, which is calculated using nine parameters. Stigter et al. (2006) conducted groundwater quality studies in Portugal. According to the water quality index, groundwater shows a high level of contamination due to agricultural activity. Boateng et al. (2016) evaluated the hydrochemical characteristics and quality of groundwater in the Ejisu-Juaben Municipality, Ghana, based on different indices, which revealed that the majority of the samples fall in the good to excellent category, suggesting that the groundwater is suitable for drinking and other domestic uses. Water quality indices have also been identified as excellent tools for evaluation and communication in agro-environmental policy (Alobaidy et al. 2010; Abbasnia et al. 2019; Khalid 2019; Solangi et al. 2019; Zhou et al. 2020). Adimalla et al. (2018) investigated the groundwater in Telangana for drinking and irrigation purposes. They found that, according to the water quality index, 90% of groundwater samples in the study region were well suited for irrigation. Thus, various water quality indices play an important role in water quality assessment, but they are not suitable for a detailed analysis and evaluation of the relationship between physical and chemical parameters.

In rural built-up areas, sewage disposal and landfill sites, as well as sewage and septic tanks, are considered one of the largest sources of pollutant discharge to the environment, which remains an unsolved problem not only in less-developed areas of the world but also in developed areas, as well (Reay 2004; Szabó et al. 2016; Mester et al. 2017; Schuler et al. 2019). In the countries of eastern-central Europe, due to inadequate sewage management and the lack of wastewater treatment systems, sewage infiltration into the groundwater is a crucial issue (Devic et al. 2014; Bugajski et al. 2019; Mester et al. 2019; Janža et al. 2020). As a consequence of this, the groundwater quality has significantly decreased in these areas (Balla et al. 2020).

With its accession to the European Union in 2004, Hungary ratified the Water Framework Directive (2000/60/EC) and the Urban Wastewater Treatment Directive (271/91/EEC) which regulate the issue of contamination originating from agriculture and domestic wastewater. The latter requires the establishment of a sewage system in every settlement with a population over 2000. The establishment of the sewage system in Hungary has accelerated over recent years. While in 2004 31.5% of the households with a public water supply system were not connected to the sewage system, this ratio decreased to 13.3% in 2018 (HCSO 2019). In the Northern Great Plain region—where the settlement investigated is located—this ratio was 14.7%. In the settlement under investigation, the operative works started in 2013, and the sewage system was completed in 2014. In 2017, more than 90% of households were connected to the sewage system; however, there were still households which had not fulfilled the legal requirements.

Although significant progress has been made in the construction of municipal sewerage networks in many countries around the world, no studies have been conducted that have shown changes in water quality in the post-sewerage period. Presumably, this may be due to the fact that these types of studies require several years of monitoring and it is absolutely necessary to assess the baseline condition from the period before sewerage. However, the vast majority of settlements do not have such a database covering the whole settlement. It is a well-known fact that the construction of a sewerage network is an extremely costly investment, and it can be justifiably expected that there be an accurate picture of the positive effects of the investment. In our study, we examined this issue using three water quality indices, during which we compared the quality of the groundwater resources of the settlement examined in the pre-sewerage period with the water quality of the post-sewerage period.

Before performing the measurements, the following hypotheses were made:

  1. 1

    As a result of the decades-long wastewater effluent from the uninsulated sewage tanks, the quality of the settlement’s groundwater resources has deteriorated sharply, which will be detectable on the basis of all three indices.

  2. 2

    In the post-sewerage period, there will be positive changes in water quality, which will also be reflected in a positive change in the pollution categories of the various indices.

  3. 3

    We hypothesized that despite the same input data, there would be differences in the categorization of each water sample for different indices, but they would correlate with each other for the entire database.

2 Material and Methods

2.1 Site Location and Characteristics

The settlement under investigation—Báránd—is located in the eastern part of the Great Hungarian Plain, in the Nagy-Sárrét region on the western part of the alluvial deposit of the Sebes-Körös River (Fig. 1), and has a population of 2611 (HSCO 2019). The altitude of the Nagy-Sárrét is typically 85–89 m, and the region is classified as a flat plain (relative relief 0–3 m/km2). The groundwater level can be found close to the surface, at a depth of 1–2 m; consequently, all the soil types have been formed under the influence of water (Michéli et al. 2006). In the study area, the most frequent soil types are Solonetz, Vertisol, Kastenozem, and Chernozem, and in the built-up area—as a result of anthropogenic effects—Technosol (Balla et al. 2016).

Fig. 1
figure 1

Location of the investigated settlement in Hungary

2.2 Field Sampling and Laboratory Analysis

In this study, we performed an analysis of the water samples collected during the summers of 2013, 2017, and 2018. The most characteristic pollutants of municipal wastewater were included in the analysis. Analytical measurements were carried out in the Geography Laboratory of the University of Debrecen. The pH and EC values were determined using a WTW 315i measuring instrument, the NH4+ concentrations were determined with Nessler reagent (HS ISO 7150-1:1992), the NO2 concentrations were determined with alfa-Naftil-amin (HS 448-18:2009), and the NO3 concentrations were determined with sodium-salicylate (HS 1484-13:2009) using a spectrophotometer. The PO43− concentrations were determined according to Hungarian Standard HS 12750-17: 1974, using a spectrophotometer. The CODps value was determined with potassium-permanganate, and Na+ was measured by PerkinElmer 3110 AAS.

2.3 Calculation of Water Quality Indices

2.3.1 The Water Quality Index Developed by Rapant et al. (1995)

The index reflects the combined effect of parameters harmful to groundwater, taking into account all parameters above the limit value. The index can therefore be considered the sum of the polluting factors. When determining the degree of contamination (Backman et al. 1998), calculations should be performed for each water sample. The calculations were performed according to the following equation:

$$ {\boldsymbol{C}}_{\boldsymbol{d}}={\sum}_{\boldsymbol{i}=\mathbf{1}}^{\boldsymbol{n}}\ {\boldsymbol{C}}_{\boldsymbol{fi}}\kern0.5em \mathrm{where}:\kern0.5em {\boldsymbol{C}}_{\boldsymbol{fi}}=\frac{{\boldsymbol{C}}_{\boldsymbol{Ai}}}{{\boldsymbol{C}}_{\boldsymbol{Ni}}}-\mathbf{1} $$

Cfi = contamination factor for the ith component, CAi = analytical value of the ith component, and CNi = upper permissible concentration of the ith component. The contamination index values were divided into 5 categories (the originally 4 categories with the excellent category have been expanded for comparability with other indices), which are summarized in Table 1.

Table 1 Index values and their categories

2.3.2 The Weighted Water Quality Index by Brown et al. (1970) and the Water Quality Status Calculated from It

Since the importance of different parameters depends on the use of a given water, Brown et al. (1970) proposed the use of a weighted arithmetic index, the calculation of which consists of the following steps:

$$ \mathrm{WQI}=\sum {Q}_n{W}_n/\sum {W}_n $$

where Qn is the quality rating of the nth water quality parameter, and Wn is the unit weight of the nth water quality parameter. The quality rating Qn is calculated using the equation:

$$ {Q}_n=100\left[\left({V}_n-{V}_i\right)/\left({V}_s-{V}_i\right)\right] $$

where Vn is the actual amount of the nth parameter present, Vi is the ideal value of the parameter (Vi = 0, except for pH (Vi = 7)), and Vs is the standard permissible value for the nth water quality parameter. The unit weight (Wn) is calculated using the formula:

$$ {W}_n=k/{V}_s $$

where k is the constant of proportionality and is calculated using the equation:

$$ k=\left[1/\sum 1/{V}_s=1,2,\dots, n\right] $$

The water quality status (WQS) according to the WQI is shown in Table 2.

Table 2 WQI range, WQS, and possible usage of the water sample (Brown et al. 1970)

2.3.3 Calculation of the CCME Water Quality Index

This is a rating system developed by the Canadian Council of Ministers of the Environment in 2001. The ranking system is based on a combination of three factors:

  • F1: Number of parameters tested that exceed the contamination limit (scope).

    $$ \mathrm{F}1=\left(\frac{\mathrm{number}\ \mathrm{of}\ \mathrm{failed}\ \mathrm{parameters}}{\mathrm{total}\ \mathrm{number}\ \mathrm{of}\ \mathrm{parameters}}\right)\ \mathrm{x}100 $$
  • F2: The percentage of failed tests (frequency).

    $$ \mathrm{F}2=\left(\frac{\mathrm{number}\ \mathrm{of}\ \mathrm{failed}\ \mathrm{tests}}{\mathrm{total}\ \mathrm{number}\ \mathrm{of}\ \mathrm{tests}}\right)\ \mathrm{x}100 $$
  • F3: The amount by which failed test values do not meet their objectives (amplitude). Factor 3 can be calculated in three steps:

    $$ {\displaystyle \begin{array}{c}{\mathrm{excursion}}_i=\left(\frac{\mathrm{failed}\ {\mathrm{test}\ \mathrm{value}}_i}{{\mathrm{Objective}}_j}\right)-1\\ {}\mathrm{nse}=\frac{\sum_{i=1}^n{\mathrm{excursion}}_i}{0.01\mathrm{nse}+0.01}\\ {}\mathrm{F}3=\frac{\mathrm{nse}}{0.1\mathrm{nse}+0.01}\end{array}} $$

After calculating all the three factors, the WQI can be determined by the following equation:

$$ \mathrm{CCME}\ \mathrm{WQI}=100-\left(\frac{\sqrt{\mathrm{F}{1}^2}+\mathrm{F}{2}^2+\mathrm{F}{3}^2}{1.732}\right) $$

The factor value of 1.732 is introduced to a scale index ranging from 0 to 100, where 0 is the “worst” and 100 is the “best” WQI value (Table 3).

Table 3 CCMEwqi values and their ratings

3 Results and Discussion

3.1 Categorization of Water Samples Based on Water Quality Indices

Water samples collected before (2013) and after (2017, 2018, 2019) the establishment of a sewerage were rated on a scale of 1 to 5 based on their quality, with 1 being the best and 5 the worst rating. The results of the ranking are shown in Table 4.

Table 4 Categorization of water samples based on water quality indices

For the 2013 pre-sewerage sampling, no water sample was included in category 1 “excellent,” 3 samples were included in category 2 “good” for both the Cd and the CCMEwqi, and no water sample was included in category 2 according to the WQS. The strong pollution of the settlement’s groundwater resources is well illustrated by the fact that 90% of the groundwater wells according to the WQS, 70% of the wells according to the Cd, and 80% of the wells according to the CCMEwqi fall into the 4–5 “polluted–heavily polluted” categories.

Our results showing significant pollution of the water resources of the settlement are consistent with the results of other studies carried out in a rural environment (Adekunle et al. 2007; Khan et al. 2017; Celestino et al. 2019; Muzenda et al. 2019). The results of the pre-sewerage sampling support our hypothesis that heavily polluted groundwater is also expressed on the basis of the indices applied; however, we did not expect that the WQS would place an average of 30% more wells in the worst category 5 than the other two indices.

In 2017, 3 years after the establishment of the sewerage, significant changes can be observed. For all three indices, the number of wells in category 5 indicating the most contaminated samples decreased significantly, while the number of samples in categories 2 (good) and 3 (acceptable) increased. For the WQS, the number of wells in the two categories increased from 4 to 26, for the Cd from 12 to 19, and for the CCMEwqi from 8 to 17 in the two categories (2, 3) compared to the reference year. Both the Cd and CCMEwqi had one well classified as excellent.

In the next year, no significant change was detected based on the Cd and the CCMEwqi. However, in the case of the WQS, the number of wells in category 2 decreased significantly, while in category 4, we saw an increase. The most significant change in 2019 compared to 2018 is that the number of wells in category 4 decreased for all three indices, while the number of wells in category 5 increased. However, their number still lags behind the pre-sewerage situation. The number of wells in categories 2–3 is around 50% for all three indices, compared to 10–30% in 2013.

To obtain a more detailed picture of the degree of contamination, we plotted the index values of each well on a boxplot diagram (Fig. 2).

Fig. 2
figure 2

CCMEwqi values in the years before (2013) and after sewerage (2017, 2018, 2019)

For the WQS and Cd indices, a lower value indicates better water quality, while for the CCMEwqi, a higher value on the hundred scale indicates better water quality.

Changes in the mean, minimum, and maximum values of all three indices indicate positive changes in water quality (Table 5). For the WQI, the pre-sewerage average of 156.76 decreased to 87.16 by 2019. For the CCMEwqi, the average rose from 50.17 to 59.55, an increase of nearly 10%. The maximum value in the year following the establishment of the sewerage is 100, which shows that no limit value was exceeded for any of the examined parameters.

Table 5 Descriptive statistics of index values

In order to determine whether the results before and after the sewerage can be separated, the results were subjected to discriminant analysis. The Wilks–Lambda test showed a significant value (p = 0.000). A total of 75.6% of the cross-validated values were successfully categorized into the appropriate category (Table 6). The correct classification of more than two-thirds of the samples indicates a significant difference between pre- and post-sewerage. The difference between the two periods for the different indices was assessed by using the Wilcoxon test (Table 7). Based on the results, it can be concluded that after the establishment of the sewerage system, the WQI values of the groundwater wells did not improve by accident; the reason for the improvement was the significant reduction in wastewater discharge.

Table 6 Results of discriminant analysis
Table 7 Differences between categories pre- and post-sewerage

The positive changes in water quality observed in the years following the sewerage could clearly be traced back to the cessation of wastewater outflow. The results support our hypothesis 2, according to which changes in individual hydrochemical parameters resulted in a positive shift in the water quality category of most of the wells.

3.2 Spatial and Temporal Distribution/Variation of Groundwater Quality

In order to explore and illustrate the spatial distribution of the degree of contamination (which is one of the advantages of the indices), the values for each well were also plotted on thematic maps (Fig. 3). We assigned a color code to each of the 5 categories established. Because a large number of samples were available, we also generated interpolated maps of the extent of contamination (Fig. 4).

Fig. 3
figure 3

Spatial distribution of water quality indices in the years before (2013) and after sewerage (2017, 2018, 2019)

Fig. 4
figure 4

Spatial distribution of water quality indices in the years before (2013) and after sewerage (2017, 2018, 2019)

Based on the thematic maps, it can be stated that in 2013, all three indices showed the highest pollution in the inner and southern parts of the settlement, while the northern areas of the settlement were less polluted. The main differences were in the extent of the heavily contaminated areas (WQS vs. Cd–CCMEwqi).

In 2017, the proportion of the most contaminated areas marked with a red color code decreased significantly. In the inner areas of the settlement, the degree of pollution decreased from levels 4–5 to levels 3–4. In the case of all three indices, the “good” water quality status appeared in varying degrees in the north-central and southern areas of the settlement as well. The extent of these areas was the largest in the case of the WQS, while a smaller extent was observed in the case of the Cd and CCMEwqi. The most significant difference compared to the WQS is in the SE part of the settlement. For the Cd and CCMEwqi indices, a very similar picture emerges in all the 4 years studied.

In 2018, the western part of the settlement can be considered heavily polluted again, on the basis of all three indices; however, the central and eastern parts showed significantly lower pollution (medium and good) for all three indices. In 2019, compared to 2018, an improvement in water quality can be observed; typically, we have shown the purification of the western areas. The extent of the low-pollution, north-south zone passing through the settlement area increased in the case of all three indices.

3.3 Comparison of the Water Quality Indices Applied

To determine the strength of the relationship between the indices, correlation studies were performed (Table 8). Based on the results, we found that there is a significant (p > 0.01) relationship between all three indices. The weakest relationship was between the Cd and the WQS (r = 0.552), while the strongest relationship (r = 0.853) was between the CCMEwqi and the Cd. The CCMEwqi–Cd matrix was also plotted on a scatter plot diagram (Fig. 5). The numbers at the points show the number of wells in the same category.

Table 8 Correlation matrix of water quality indices based on 4-year data
Fig. 5
figure 5

Scatter plot diagram for Cd vs. CCMEwqi categories

Correlation is significant at the 0.01 level

After categorizing the water samples, we calculated how differently the indices categorize the same sample (Table 9). During the study, we did not separate the data for 4 years, they were treated as one data set. The results show that the difference between the indices is—in the vast majority of cases—a maximum of ± 2 on a five-point scale. Of the 160 cases, in only 4 cases was the difference ± 3. In the case of the WQS–Cd, a difference of − 1.0, + 1 was detected in 81.6% of the cases. The equivalent figures for the WQS–CCMEwqi and the CCMEwqi–Cd are 85.8% and 99.2%, respectively. In the case of the CCMEwqi–Cd, 72.5% of the wells fell into the same category.

Table 9 Differences between categories for 4 years

Based on the above, it can be stated that according to our hypothesis 3, despite the same input data, different water quality indices classified some of the samples into different categories. Although a significant relationship was found between the categories created by the water quality indices applied (Table 8), significant differences in the strength of the relationship were found.

In order to determine the reasons for the differences as accurately as possible, we prepared the difference value maps of the indices (Fig. 6). Areas where both indices classified the sample in the same category were marked in white. The smallest difference for the CCMEwqi–Cd was detected in 2017. In the case of the WQS–Cd, in 2013, the WQS values in the northern part of the settlement were + 1 + 2 more polluted than the Cd values; however, in 2017, the WQS values in the SE part of the settlement were − 1–2 lower, than the Cd values. A similar pattern emerges for the WQS–CCMEwqi. The differences are due to the different weighting of the WQS index, as a result of which the index responds more sensitively to changes in ammonium and phosphate values. The other factor is that the WQS is less sensitive to nitrate changes, which are given more weight in the other two indices (Cd, CCMEwqi). Another influencing factor is that the Cd index does not take into account parameters below the limit, even if they are very close to it, while the WQS and CCMEwqi take these into account.

Fig. 6
figure 6

Difference value maps of the indices

4 Conclusions

In the present study, the groundwater quality changes of a settlement were evaluated after the construction of its sewerage network using different water quality indices. The focus of our research was on how different indices categorize water samples based on the same input data and how well they are suitable for accurately expressing changes in water quality.

The strong pollution of the settlement’s groundwater before the establishment of the sewerage (2013) is well illustrated by the fact that 90% of the groundwater wells according to the WQS, 70% of the wells according to the Cd, and 80% of the wells according to the CCMEwqi fall into the 4–5 “polluted–heavily polluted” categories. After the establishment of the sewerage (2017, 2018, 2019), significant changes can be observed. For all three indices, the number of wells in category 5 indicating the most contaminated samples decreased significantly, while the number of samples in categories 2 (good) and 3 (acceptable) increased.

Significant correlations between all three indices were found, but the strongest (r = 0.853) was between the CCMEwqi and the Cd indices. The two indices classified 71.9% of the wells in the same category on a five-point scale and 98.1% of the wells with a difference of a maximum of + 1–1.

Based on the indices, pollution maps of the settlement were created. In the period before sewerage, based on all three indices, the central and southern areas were the most polluted, the difference was only in the extent of the contaminated area. In the case of the measurements following the establishment of the sewerage, all three indices showed the purification of the central and southern areas. Based on the result, it can be stated that all three indices adequately reflect the status of pollution, well reflecting positive changes in water quality.

It was determined that the differences observed between the indices are due to the different weighting of the WQS index, as a result of which, the index responds more sensitively to changes in ammonium and phosphate values. The other factor is that the WQS is less sensitive to nitrate changes, which is given more weight in the other two indices (Cd, CCMEwqi). Another influencing factor is that the Cd index does not take into account parameters below the limit, even if they are very close to it, while the WQS and the CCMEwqi take this into account.

It is important to mention that when applying indices, the parameters that are expected to be important should not be excluded from the calculation of the indices. Accordingly, other parameters should be considered when assessing the status of an agricultural, industrial, or urban environment.

With the help of the indices, we determined that the process of groundwater purification in the settlement has started, although it will continue for years to come. Maps created on the basis of indices can help to illustrate purification processes more easily, which can be very profitably used to support further environmental measures.