1 Introduction

The good quality of drinking water is far behind for utmost populations throughout the world [1]. This is due to the reason that resources are contaminated at incessant rate due to unorganized industrialization and urbanization [2]. Presently, water quality is of key issue, because it is the source of poor health in major countries of the world [3, 4]. The countries which are growing, inappropriate access to clean drinking water is harmfully influencing the human health [5]. Population explosion and growing living standards are enhancing the requirement of water in urban areas and this requirement has put enormous pressure on the inadequate water resources [6]. The over utilization of groundwater associated with nominal renew leads to groundwater degradation and scarcity [7].

European Environment Agency framed policies regarding management of groundwater and to recognize point source pollutants as the primary initiative for the remediation. However, in many urbanized regions, pollutant plumes formed by single point sources commonly synchronize with prevalent pollution originated from diverse point sources [8]. The proper attention should be given to prevent and control the groundwater contamination in order to avoid expensive remediation, because groundwater contamination is invisible, complex and has prolonged effects. To assess the probable effects of any contaminant in the groundwater pollution is an efficient strategy for management of groundwater [9, 10].

Aquifer systems are affected by the spatial and temporal alterations of natural as well as human activities. The associations and spatial variability of these aspects, limits and hierarchy of their associations between the constituents, statistically elucidated [11,12,13]. Geostatistics is important for spatial mapping of groundwater parameters, variograms are employed to determine spatial relationship, and the kriging variance is a role of geometric pattern. The kriging determination variances are independent and are associated to model variogram and the spatial organization of the sampling statistics. Kriging interpolation is applied to give the good variable nearness value of an approximate in association to the real value employing least square approach [14, 15]. Multivariate techniques have been used to find the information from the large data and such techniques applied in exploratory data assessment as approaches to categorize samples and recognize pollution sources [16, 17]. Cluster analysis and principal component analysis techniques helps in interpretation of enormous parameters to accomplish a great understanding of the hydrochemical mechanisms engaged [18, 19]. To improve organization of water resources on a local level, the objective of this research is to recognize and assess the interfering natural and human factors concerning with groundwater hydrochemistry in an urban area. Because of the multifaceted interactions between human and geogenic features influencing groundwater hydrochemistry, this study is distinctive in its application of integrated techniques hydrochemical exploration, geostatistics, and multivariate techniques in urban areas. To protect and maintain the speedily diminishing groundwater resources in urban regions, sophisticated evaluation methods are required to better recognize the relationships among land-use and natural and, human causes of pollution.

2 Materials and methods

2.1 Study area

The present study was accomplished in Shiraz megacity, 5th populated city of Iran with 223.4 km2 area positioned at 29°37′8″N lat. and 52°31′14″E longitude (Fig. 1). About 1500 m is the surface elevation of this city from the sea level. The main direction of the Shiraz aquifer is from northwest to south east. The 1009 km2 Shiraz basin partly comprises of the watershed of the Khoshk seasonal river, which runs to Maharloo salt lake located to the south of the city. The hydraulic gradient of the eastern part is less than that of central and western parts. According to the Iran Meteorological Organization (IRIMO) reports, the annual mean temperature and precipitation are 18.7 °C and 324.2 mm respectively. The climatic condition in the surrounding study area is semi-arid. Approximate half of the water used for domestic and industrial purposes is supplied from deep and semi-deep wells dispersed in the area. Due to improper sewage system in the city, about 80–85% of the municipal water used, i.e., 250 L per cap. per day moves into the groundwater, mainly goes to Maharloo Salt Lake [20]. The area is waterlogged during wet years when the level of Khoshk River is high.

Fig. 1
figure 1

Study area and location of sampling sites

2.2 Hydrological settings and aquifer formation

The geophysical investigations in the Shiraz have established that Shiraz water bearing layer in more depths, it suffers from unsuitable quality. Studies showed that groundwater of Shiraz comprises of surface unconstrained aquifer and a deep aquifer [20]. Two seasonal rivers Khoshkrud and Chenarrahdar occured in Shiraz area. The seasonal Khoshkrud River enters in Golestan and Ghalat mountains heights after connecting with Khoshk River, it arrives in the Shiraz city. Agricultural runoff and industrial wastewater arrives into the river at the south eastern part of Shiraz. The geological information showed that the development that takes place in the area are from old to new such as Formation of Tarbor takes place during Campanian to Maastrichtian, Pabdeh–Gurpi developments occurred in Paleocene, formation of Sachun takes place in Paleocene, Jahrom formation occurred in Eocene, Asmari establishments takes place in Oligocene, Razak formation during Miocene, Agha Jari development occurred in Miocene to Pliocene, formation of Bakhtyari made in Pliocene to Pleistocen, and quarternary alluvial deposits [21].

Shiraz alluvial aquifer is formed during the Quaternary alluvium that comprises of grain sediments and moderate to fine grained clay particles carried by stream and flood wash of catchment geological units. The northern side comprises of coarse alluvium. Oligo-miocene carbonate formed from the Asmari-Jahrom, Miocene marl formed from Razak and Plio-Pleostocene conglomerate developed from the Bakhtyari form the catchment geological settings in Shiraz [22]. In addition to Shiraz alluvial aquifer, many karstic anticlines adjoining the Shiraz also provides drinking water to the peoples of city. The northern karstic aquifer is linked with the alluvial aquifer. Comprehensive pumping in the northern karstic aquifer has reversed the ground water flow route, taking place in alluvial aquifer water arriving the karstic aquifer [22].

2.3 Sampling and analysis

For the assessment of groundwater quality, samples were collected from 80 sites from the deep wells distributed throughout the area. Geographic coordinates of each sampling site was noted using geographic positioning system apparatus. All the samples were filtered and stored in 250-mL polypropylene bottles cleaned with water samples and kept at temperatures below 4 °C to avoid unforeseeable variations in chemical constitution [23]. pH was determined by using pH meter (Metrohm 827). EC and TDS were measured by using EC meter (Hanna EC214). SO42− and NO3 content were measured in the groundwater by using HACH DR 5000 UV–Vis spectrophotometer. Na+ and K+ were determined by using flame photometer (Feter Electronic 405). Ca2+, Mg2+ and HCO3 were measured by EDTA titration method. Cl was measured in the groundwater samples base on ASTM [24] method. Alkalinity was measured in the samples by using ETDA titration. Total hardness (TH) was calculated by TH (mg/L) = 2.497 Ca + 4.115 Mg [25].

2.4 Spatial maps

Spatial distribution maps for all the groundwater parameters were made in ArcGis software v. 10 using kriging approach.

2.5 Statistical analysis

The results were statistically analyzed by using descriptive statistics in PAST software v. 3.15. The data was also analyzed by using various multivariate techniques such as Pearson’s correlation analysis R programming software v. 3.1.3, Cluster analysis (CA) and principal component analysis (PCA) by SPSS software v. 16 and non-metric multidimensional scaling (NMDS) by PAST software v. 3.15. CA is mainly applied in hydrology with the objective to find out relationship among the variables [26, 27]. In present work, we conducted a hierarchical CA following Ward’s method and Euclidean distance [28, 29]. Ward’s method indicates that proximity between two clusters is the enhancement in squared error. The outcomes are shown in the form of dendrograms [30]. PCA is conducted to decrease the data and determine the common arrangements within a large data [31]. In hydrology, PCA is mainly applied in hydrochemical and hydrogeological works [32]. It permits decreasing the dimensions of data with a great number of correlated variables. The decrease is made by converting original data into a new set of data called principal components. The data was rotated by varimax rotation to enhance the precision of results [33].

3 Results and discussion

The physiochemical characteristics and statistical analysis for all the 80 groundwater samples were determined. Table 1 presents the mean, minimum, maximum, standard deviation, skewness, kurtosis and coefficient of variation for the groundwater samples. The pH of the groundwater samples ranged from 6.8 to 8.31, indicating acidic to slightly alkaline nature of water. The pH values of present study were found within the range of world health organization [34] for drinking and public health purposes. The electrical conductivity varied from 518 (µS/cm) to 12160 (µS/cm) with average value of 1877.74 (µS/cm) for the present study. The maximum permissible limit of EC prescribed by WHO [34] is 1500 (µS/cm). About 62.5% samples exceeded WHO [34] limit for drinking water. The higher EC values indicate enrichment of salts in the groundwater [35]. The K+ content ranged from 39 to 2952.3 mg/L with mean value of 246.14 mg/L. All the samples for K+, exceeded the value of WHO [34] for drinking water, i.e., 12 mg/L. The Na+ varied from 8.28 to 963.7 mg/L with average value of 113.6 mg/L. About 12.5% samples exceeded the WHO [34] limit for drinking water. The maximum allowable limit prescribed by WHO [34] is 200 mg/L. High EC contributes to saline soil formation, while high Na+ concentration forms alkaline soil [36]. Ca2+ ranged from 36.8 to 134 mg/L with average value of 63.13 mg/L. The limit prescribed by WHO [34] for drinking water are 75 mg/L. 30% samples exceeded the WHO limit for Ca2+ in the present study. Mg2+ varies from 33.6 to 874.8 mg/L, and about 97.5% samples exceeded the limit of WHO [34] for drinking water. HCO3 content ranged from 146.4 to 463.6 mg/L with average value of 358.07 mg/L. For all the sampling sites, HCO3 content was found lower than the WHO [34] for drinking water, i.e., 500 mg/L. Cl content varied from 28.4 to 2272 mg/L with average value of 246.6 mg/L. The maximum allowable limit for Cl is 250 mg/L as recommended by WHO [34] and about 28.7% samples exceeded the WHO [34] limit. SO42− content ranged from 4.8 to 2457.6 mg/L with mean value of 280.56 mg/L. The limit prescribed by WHO [34] for SO42− is 250 mg/L and about 40% samples exceeded this limit for SO42− content. NO3 content varied from 0.16 to 8.96 mg/L with average value of 4.31 mg/L. The limit recommended by WHO [34] for NO3 is 45 mg/L, and all samples were found below this limit for NO3 content. TDS content ranged from 331.5 to 7782.4 mg/L with average value of 1201.7 mg/L. The limit prescribed by WHO [34] for TDS is 500 mg/L, and 97.5% samples exceeded this limit for TDS. The hardness values varied from 230 to 3980 mg/L with average value of 681.69 mg/L. The level of hardness is categorized as: 0–75 soft, 75–150 moderately hard, 150–300 hard, and > 300 very hard [37]. 97.5% samples showed that hardness content is very hard for the present study. The alkalinity values ranged from 144 to 456 mg/L with mean value of 352.2 mg/L. The alkalinity values were found lower than the maximum limits recommended by WHO [34] for drinking water such as 600 mg/L. The skewness and kurtosis of pH, EC, K+, Na+, Mg2+, Cl, SO42−, hardness and TDS were found greater than one representing right handed skewness and leptokurtic [38]. K+ followed by SO42−, Cl and Na+ showed greatest variation in the groundwater samples of the present study. Figure 2 presents the spatial distribution of pH, EC, SO4, NO3, Cl, HCO3, alkalinity, hardness and TDS for the present study on the basis of kriging method. From the spatial maps, it was predicated that both anthropogenic activities like cement factory, gas power plant, Rishmark factory and vegetable Oil Company, and natural processes like rock weathering are responsible for the contents of groundwater parameters.

Table 1 Descriptive statistics of groundwater parameters from Shiraz
Fig. 2
figure 2figure 2figure 2

Spatial distribution of groundwater parameters (pH, EC, Cl, SO4, NO3, alkalinity, hardness, TDS, Na% and SAR)

Pearson’s correlation analysis was applied to groundwater parameters to find relationship among different parameters (Fig. 3). They presented efficient method to indicate the associations between diverse variables and consequently useful in understanding the contributing aspects and sources of chemical constituents. pH showed correlation with HCO3 and alkalinity. Correlation of NO3 found with Ca. The correlation of EC existed with K, Na, Ca, Mg, Cl, SO4, Hardness, TDS and SAR. The K, Na, Ca and Mg showed correlation with Cl, SO4, hardness, TDS and SAR. Cl, SO4, hardness, TDS and SAR showed correlation with each other. The correlation of EC and TDS is attributed by the fact that dissolved constituents cause enhanced ionic content and EC content [39]. The correlations results of different ions showed that these ions involve diverse physical as well as chemical processes such as oxidation/reduction reactions as well as ion exchange in groundwater aquifers, indicating that similar factor have strong influence on them [40, 41]. The correlation of Cl and SO4 usually used to recognize the processes of salinity in arid and semiarid places [42].

Fig. 3
figure 3

Pearson’s correlation analysis of groundwater parameters

3.1 Sodium (Na)% and Sodium absorption ratio (SAR)

Na% is a widespread index calculated to evaluate the acceptability of groundwater. Na concentration in groundwater is exchanged with Ca2+ and Mg2+ of clay particles of the soil, which leads to decrease of soil permeability, internal drainage and air circulation. % Na is computed by following Wilcox [43]:

$${\text{Na}}\% = \frac{{\left( {{\text{Na}}^{ + } + {\text{K}}^{ + } } \right)}}{{\left( {{\text{Ca}}^{2 + } + {\text{Mg}}^{2 + } + {\text{Na}}^{ + } + {\text{K}}^{ + } } \right)}} \times 100$$

Higher percentage of Na in groundwater decreases the permeability of the soil. The Na% varies from 5.25 to 38.59 with average value of 24.05. The 31.2% samples showed < 20 Na%, indicating excellent for irrigation and agricultural practices, and 68.7% samples were fall in the category of 30–40 Na%, indicating good category of groundwater and suitable for irrigation and agricultural practices. Figure 2 presents the spatial distribution of % Na for the present study on the basis of kriging approach.

SAR is vital parameter to evaluate the groundwater quality. It represents the impact of relative cation content on Na2+ accumulation in the soil, is applied to assess the Sodicity risk of water. Soil permeability reduces with the enhancement in Na2+ in relation to Ca2+ and Mg2+ concentrations. It is computed by following Hem [44]:

$$SAR = \frac{{{\text{Na}}^{ + } }}{{\sqrt {\frac{{{\text{Ca}}^{2 + } + {\text{Mg}}^{2 + } }}{2}} }}$$

SAR levels were presented in meq/L and varied from 0.21 to 6.64 with mean value of 1.76. SAR values were found below < 10, belongs to excellent category. Richards [45] suggested following grades to classify the SAR: < 10: excellent, 18: good, 18–26: doubtful and > 26: unsuitable. The skewness and kurtosis of SAR values were also recorded greater than one indicating right handed skewness and leptokurtic [38]. SAR values for all the samples were excellent for irrigation and agricultural practices. Figure 2 presents the spatial distribution of SAR values for the present study on the basis of kriging approach.

3.2 Base‑exchange indices (BEI) and meteoric genesis indices (MGI)

The BEI and MGI were determined by following Matthess [46], Soltan [47, 48] and are calculated by following equations:

$$BEI = \frac{{{\text{Na}}^{ + } - {\text{Cl}}^{ - } }}{{\mathop {{\text{SO}}_{4} }\nolimits^{2 - } }}$$
$$MGI = \frac{{[({\text{Na}}^{ + } + {\text{K}}^{ + } ) - {\text{Cl}}^{ - } ]}}{{\mathop {{\text{SO}}_{4} }\nolimits^{2 - } }}$$

On the basis of these indices, groundwater is grouped as: Na+–HCO3 type if BEI > 1 and Na+–SO42− type if BEI < 1. If MGI < 1, then water type is of deep meteoric type and if MGI > 1, the it is of shallow meteoric percolation type. The results of BEI and MGI are presented in Table 2. From the results of MGI, it was found that 83.75% sampling sites showed shallow meteoric percolation type, while 16.25% sampling sites indicated deep meteoric percolation type. 83.75% sampling sites showed BEI values less than one and indicated Na + SO4, while 16.25% sampling sites showed Na + HCO3 water type.

Table 2 Groundwater classification according to base-exchange index (BGI) and meteoric genesis index (MGI) criteria

3.3 Multivariate statistical analysis of groundwater parameters

Cluster analysis (CA) was prepared to classify the different parameters (Fig. 4). CA mainly categorized the groundwater parameters into two groups such as alkalinity and HCO3 in one group and rest of the parameters are included in other cluster which is further grouped into two subgroups: subgroup I (NO3, pH, Na% and SAR) and subgroup II (EC, TDS, Ca2+, Na+, Mg2+, SO42−, Cl, hardness and K).

Fig. 4
figure 4

Cluster analysis of groundwater parameters

CA was also prepared for different sampling sites on the basis of groundwater parameters (Fig. 5). Mainly three clusters were formed for sampling sites which are further categorized into subgroups. Cluster 1 comprises of sites (1, 24, 13, 2, 8, 5, 45, 53, 74, 78, 6, 19, 77, 60, 76, 11, 14, 25, 9, 18, 17, 20, 10, 23, 26, 15, 16 and 21) which is categorized in four sub-clusters (1, 24, 13, 2 and 8), (5, 45, 53, 74 and 78), (6, 19, 77, 60, 76, 11, 14 and 25), and (9, 18, 17, 20, 10, 23, 26, 15 and 16). Site 21 formed a separate cluster 2 which may be attributed to high Na, K, Cl, SO4, hardness and alkalinity of the groundwater. Rock containing minerals are responsible for the contribution of these parameters in the area. Cluster 3 comprises of three sub-groups: (3, 12, 36, 40, 50, 80, 71, 7, 30, 50, 52, 64, 75, 73, 27, 41, 49, 55, 65, 70, 22, 69 and 62), (28, 58, 79, 43, 48, 44, 68, 46, 29, 51, 31, 37, 63, 34, 56 and 57), and (4, 32, 42, 33, 39, 66, 35, 54, 38, 47, 72, 61 and 67). From the results of CA, it was established that combined anthropogenic activities like cement factory, gas power plant, Rishmark factory and vegetable Oil Company, and natural processes like rock weathering are responsible for the source apportionment of groundwater parameters. The content of various physicochemical parameters of groundwater for six clusters are presented in Table 3. Cluster 1 showed high content of EC, Ca2+, Mg2+, SO42− and hardness. Cluster 2 is dominated by high content of alkalinity and HCO3. Cluster 3 is contributed by high concentrations of K+, Na+, Na (%) and SAR, while cluster 4 is dominated by Cl and NO3. Among the entire parameters cluster 5 is controlled mainly by pH. Cluster 6 showed alterations in the content of all the parameters.

Fig. 5
figure 5

Cluster analysis of sampling sites

Table 3 Physicochemical parameters of groundwater for six clusters

Principal component analysis (PCA) was applied to the groundwater parameters (Table 4). The first three PCs accounted for 89% of the total variation. Three components were extracted on the basis of eigenvalues of more than one [49]. The loadings of the PCs in the component matrix explained that PC1 had maximum loadings on EC, K+, Na+, Ca2+, Mg2+, Cl, SO42−, hardness and TDS, and accounted for 64.8% of the variation. EC and TDS loadings control the overall mineralization. Loading of Ca may be ascribed to its usage in the cement factory located in the study area. K+ and Na+ are generally contributed by rocks and anthropogenic sources. Cl containing minerals contributes Cl content in the groundwater [50]. Atmospheric deposition and power plants contributes SO42− content in the area [51]. Hardness occurs due to chemical and mining industry wastage or high usage of lime. The PC1 is mainly contributed by anthropogenic activities, i.e., cement factory, gas power plant, Rishmark factory and vegetable Oil Company existed in the Shiraz city as well as natural processes. 15.7% of the variation explained by the PC2 and have maximum loadings on HCO3 and alkalinity. The HCO3 and alkalinity are contributed mainly by rock sources to this PC. NO3 had maximum loading on PC3 and accounted for 8.4% of the variance. The application of agricultural fertilizers contributes to this PC. The similar type of loadings was obtained for different groundwater parameters after varimax rotation. Selvakumar et al. [52] in their studies on groundwater in Tamil Nadu, India reported loadings of EC, Ca2+, Mg2+, Na+, K+ and Cl on PC1 which supports our results. Our results find support from Ravikumar and Somashekar [53] in their research on groundwater of Karnataka and reported loadings of EC, TDS, hardness, Ca2+, Mg2+, K+, HCO3, SO42− and Cl on PC1. Non-metric multidimensional scaling (NMDS) was done on 80 different sampling sites on the basis of groundwater parameters. Correlation was used as the similarity measure. NMDS is a nonparametric multivariate technique. The ranks were compressed in two dimensions. NMDS scatter plot and Shepard plot for the sampling sites are presented in Fig. 6a, b. Shepard plot indicated a stress level of 0.0078. A stress level less than 0.05 is a good fit of the data. The stress level formed in NMDS showed that levels of sampling points are maintained on data compression, indicating that differences in the sampling points are dependable on groundwater quality parameters.

Table 4 Principal component analysis of groundwater parameters
Fig. 6
figure 6

a NMDS scatter plot (95% ecclipse) of groundwater parameters using correlation, b NMDS Shepard 2-D plot of groundwater parameters

4 Conclusion

The study area comprised of surface unconstrained to deep aquifers. Due to shallow water table groundwater may become susceptible to discharge of pollutants from the surface, and about 80 samples were collected and analyzed for various physiochemical parameters. The mean values of EC, K+, Mg2+, SO42− and TDS were found above the permissible limits of WHO for drinking water. 97.5% samples for hardness were falls in the category of > 300, indicating very hard water. SAR and Na% results indicated that water is acceptable for irrigation as well as domestic purposes. The MGI results indicated that 83.75% sampling sites showed shallow meteoric percolation type, whereas 16.25% sampling sites showed deep meteoric percolation type. 83.75% sampling sites showed BEI values less than one, while 16.25% sampling sites showed BEI values greater than one showing Na + HCO3 water type. Multivariate statistical analysis (CA & PCA) showed that both anthropogenic activities such as cement factory, gas power plant, Rishmark factory and vegetable Oil Company, and natural processes like rock weathering are responsible for the contents of groundwater parameters.