Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Non-point source (NPS) pollution in the vadose zone (simply defined as the layer of soil extending from the soil surface to the groundwater table) is a global environmental problem. The knowledge and information required to address the problem cross several technological and sub-disciplinary lines: spatial statistics, geographic information systems (GIS), hydrology, soil science and remote sensing (Corwin 1996; Corwin et al. 1997; Coppola et al. 2014).

As discussed by Stewart and Loague (2003), the main issues encountered in NPS groundwater vulnerability assessment are the large spatial scales, the complex processes that govern fluid flow and solute transport in the unsaturated zone (Comegna et al. 2010; Coppola et al. 2011), the absence of unsaturated zone measurements of diffuse pesticide concentrations in 3D regional-scale space (as these are difficult, time-consuming and prohibitively costly) and the computational effort required for solving the nonlinear equations for physically based modelling in heterogeneous media at regional scale (Coppola et al. 2014). This results in significant simplifying assumption in NPS contaminant leaching models.

Currently, existing regional-scale leaching models can be grouped into four main categories, ranging from qualitative models that rely on index and overlay techniques, over simple drainage algorithms, to stream tube models coupled to process-based numerical simulations (Stewart and Loague 2003; Coppola et al. 2014). The large datasets of physical factors involved in the leaching process required for regional-scale studies limit the use of deterministic methods that would result in more realistic estimates of solute concentration. Besides, the use of overlay models encounters less computational and data lacking issues, though resulting in relative output information in terms of risk factor.

As a compromise solution, an approach is presented here which is based on coupling of texture-based transfer function (TFtb) and GIS modelling. TFtb are texture-based travel time probability density functions describing a characteristic leaching behaviour for soil profiles with similar soil hydraulic properties. They actually represent the result of an upscaling procedure applied to Jury’s transfer function model (TFM) (Jury and Roth 1990).

GIS represents a spatially enabled database management system (DBMS) that is able to depict real-world geographic features of relevance to the leaching process, serving both as a baseline data depot for hydraulic modelling and a final gateway for output representation and interactive delivery.

With these premises, the main objective of this study was developing a regional-scale simulation methodology for vadose zone leaching that is capable of overcoming the limits of both fully deterministic and fully qualitative models in that it relies on easily available and accessible data and on affordable computational efforts and finally offers quantitative answers to groundwater vulnerability to agrochemical leaching at regional scale within a defined confidence interval.

This result was pursued through (1) the design and building of a spatial database containing environmental and physical information regarding the study area, (2) the development of the TFtb for layered soils and (3) the final representation of results through digital mapping.

One side GIS modelled environmental data in order to characterise, at regional scale, soil profile texture and depth, land use, climatic data, water table depth and potential evapotranspiration. On the other side, such information was implemented in the development of a set of TFtb, each describing the leaching behaviour of soil profiles with specific hydraulic properties. The latter, in turn, were estimated by area-specific pedotransfer functions developed on our own texture–hydraulic properties datasets coming from several sites in the investigated area. A wide area (about 12,000 ha) in the Metaponto plain in southern Basilicata, Italy, was completely characterised by a pedological point of view by digging several soil profiles. The textural properties of soil horizons of each soil profile were converted to the corresponding hydraulic properties by using a PTF specifically calibrated for the soils of the area (mainly silty loam, silty and silty clay). The solute travel times to water table for specific soil profiles were then imported back into GIS, and finally estimation of groundwater vulnerability for each soil unit was represented into a map.

2 Materials and Methods

2.1 Study Area

For this study, the area of interest was subbasin of Metaponto agricultural site, located in southern Basilicata, Italy, across the municipalities of Policoro, Scanzano Jonico, Montalbano Jonico and Pisticci, approximately 11,698 ha in size and crossed by two main rivers, Sinni and Agri, and from many secondary water bodies. Figure 1 shows the location of the study area.

Fig. 1
figure 1

Study area overview (circles indicate soil profile sites)

Topographically, the area is characterised by fairly distinct variations in elevation in the western part of the basin, away from the coast, and extremely flat terrain in the nearshore portions of the basin, to the south and east. The main soils in the area are the “soils of the alluvial plains” and the “soils of the Ionian coastal plain”. The soils of the alluvial plains are those formed along the fluvial channels of the rivers crossing the area. Therefore, they are soils on alluvial and lake deposits, with variable grain size from clayey to stony. The soils of the Ionian coastal plain consist of marine deposits of different ages, from Pleistocene to Holocene, and of alluvial deposits of variable grain size. The soils of the internal areas, less extensive than those described above, are those that formed on a substrate of sandstones with alternations of marl and clay.

Much of the basin is used for agricultural purposes. Major crops and land uses receiving applications of nutrients and chemicals include cereals, vegetables and fruit orchards. Soils that support these land uses range from the loam to clay loam, in the northern portions of the basin, to the fine silty loams in the southern basin.

2.2 Geo Database Implementation

A spatial digital database for the study area was established through the assembly of various publicly available physiographic data sets (land use, soils, climate, depth to groundwater, elevation).

Such datasets characterised features considered to be directly or indirectly involved in the leaching process, thus characterising boundary conditions. Data was structured into feature datasets and raster datasets, in that both discrete and continuous data types were implemented in this study (Table 1).

Table 1 Summary of the GIS datasets

Data manipulation and spatial analysis were performed to finally produce output datasets as described in the previous table.

2.2.1 Land Use

Land use dataset, published within Progetto Sigria, INEA 2000, was used to describe cropped species at parcel level and furthermore to define the spatial extent of the whole GIS project in this work. Table 2 shows cropped surfaces in hectares summarised by aggregated land use classes.

Table 2 Land use classes from the original map are summarised into 10 main categories

Fruit orchards land use dataset, published within Progetto Sigria, INEA 2000, was used to describe cropped species at parcel level and furthermore to define the spatial extent of the whole GIS project in this work. Fruit orchards cover almost 44 % of total study area, followed by ploughed areas covering 15 %, vegetables 10 % and cereals 10 %. Non-vegetated areas, such as urban or water bodies, make up for 15 %.

Soil data were originally acquired from regional geology agency in tabular format for 52 soil profiles sampled across the study area (see circles in Fig. 1. For each profile, a fictitious soil system was adopted, assuming the soil to be composed of only two layers, A and B, the first being superficial and 40 cm thick (this was the average depth of the first horizon for all the soil profiles and the latter reaching the water table). A fictitious soil profile was obtained for each real soil profile by averaging textural data with a weighted procedure using horizon depths as weights.

Data were then imported into GIS and spatial structure of sand, clay and silt contents, and depth measures were analysed with Exploratory Spatial Data Analysis (ESDA) tools, by means of classic statistical and geostatistical analyses. After interpolation, continuous surfaces for sand, clay, silt and soil profile depth were produced in raster format for both A and B soil layers (maps not shown).

Sand, silt and clay raster datasets were concurrently queried with a map calculator using a set of SQL statements, each defining a texture constraint for each soil class according to USDA classification system, in order to produce two final texture maps (Fig. 2), one for each fictitious soil layer.

Fig. 2
figure 2

Texture map for layers (a) and (b)

2.2.2 Precipitation, Temperature and Evapotranspiration

Climate data on a daily basis for the 1999–2009 period for all the climatic stations localised in the area of interest were provided by Regional Agriculture Services. The Voronoi algorithm was used for spatial partitioning of the study area according to the subarea of influence of each rainfall station. The Voronoi cells for the area under study are shown in Fig. 3.

Fig. 3
figure 3

Positions of meteorological stations and Voronoi polygons of the study area. Alphanumeric codes identify the soil profiles used for this study

2.2.3 Elevation

Contour lines on IGM 1:5,000 base maps were digitised, and a triangulated irregular network was created from them and finally converted into a raster dataset. From elevation grid, slope grid was derived as well.

2.2.4 Depth to Groundwater

The laboratory of Soil and Contaminant Hydrology of the University of Basilicata provided tabular data for 192 measuring wells, 51 of which were located within the study area. Once imported into GIS as a point feature dataset, groundwater depth at measuring stations was interpolated using Kriging, thus resulting in a continuous surface (map not shown).

2.3 Geoprocessing and Data Handling

The role of GIS in this study was to provide hydrological modelling with a spatial database carrying all the necessary information for leaching assessment: textural class, soil depth, water table depth and net water recharge.

In order to achieve such goal, datasets carrying spatially distributed information of these data, collected as described above, were aggregated at homogenous soil units level on one common layer enabling further data manipulation and derived variable creation, such as water recharge.

Homogenous soil units (being areas characterized by sufficiently similar leaching behaviour) were defined intersecting soil texture information for layers A and B through geoprocessing tools, which finally resulting in a polygon feature vector dataset. Each polygon defines a combination of two overlaying textural classes; in all there were 31 possible texture combinations classes.

Finally zonal attribute transfer was then performed from land use, rain, freatimetry, temperature, slope and elevation datasets to soil unit feature dataset, in order to associate to each soil unit such spatially distributed information averaged for each polygon surface. Furthermore for each soil unit potential evapotranspiration (ETp) was calculated using the Thornthwaite formula (USDA 1975).

At this point the output spatial database was set up with all calculated, estimated and derived physical attributes related to each polygon representing an instance of a homogeneous soil unit. In Table 3 are shown three explanatory records out of the total 7,872. Such database was implemented in hydrological modelling as discussed in the following section.

Table 3 Spatial database for three explanatory records

2.4 Stochastic Development of TFtb

2.4.1 Texture-Based Hydraulic and Solute Travel Time Distributions

A large dataset of hydraulic properties was already available for the textural classes of the area. They were measured in the laboratory on undisturbed soil samples (490 samples) collected at the soil surface during several previous measurement campaigns.

Soil water retention was described by the unimodal θ(h) relationship proposed by van Genuchten (1980) and expressed here in terms of the scaled water content (S e) as follows:

$$ {S}_{\mathrm{e}}=\frac{\theta -{\theta}_{\mathrm{r}}}{\theta_0-{\theta}_{\mathrm{r}}}={\left[1+{\left|{\alpha}_{\mathrm{VG}}h\right|}^n\right]}^{-m} $$
(1)

where α VG (cm−1), n and m are curve-fitting parameters and h the pressure head.

Mualem’s expression was used to calculate relative hydraulic conductivity, K r (Mualem 1976); assuming m = 1 − 1/n, van Genuchten (1980) obtained a closed-form analytical solution to predict K r at a specified volumetric water content

$$ {K}_{\mathrm{r}}\left({S}_{\mathrm{e}}\right)=\frac{K\left({S}_{\mathrm{e}}\right)}{K_0}={S}_{\mathrm{e}}^{\tau }{\left[1-{\left(1-{{\mathrm{S}}_{\mathrm{e}}}^{1/m}\right)}^m\right]}^2 $$
(2)

where τ is a parameter accounting for the dependence of the tortuosity and the correlation factors on the water content and K 0 is the hydraulic conductivity at h = 0.

For each soil sample a detailed particle-size distribution (PSD) was also available. PSD data were used as a basis for estimating soil water retention (and the corresponding parameters in Eq. (1) of soil horizons of each of the observation soil profiles by using the physico-empirical PTF approach proposed by Arya and Paris (1981). Hereafter, such an approach will be referred to as the AP approach. The original formulation based on a single optimisation parameter (αAP) was thus made more flexible by assuming a variable αAP(h) with the pressure head h.

Arya et al. (1999) also derived an expression to compute K(θ) directly from the PSD, based on the same soil structure model leading to the θ(h) function. We opted for using the method only for estimating the saturated hydraulic conductivity. The whole hydraulic conductivity curve was estimated by using Eq. (2), with retention parameters and K 0 estimated by the AP method and setting τ = 0.5. A specific αAP(h) curve was obtained for each of the textural class present in the investigated area. The measured hydraulic properties were partly (200 samples) used for the PTF calibration, by keeping the remaining data for the PTF validation.

In synthesis, site-specific PTF allowed estimating a complete set of hydraulic parameters (θ 0, θ r , α VG, n, K 0 and τ = 0.5), for each of the textural classes found in the area. A Kolmogorov–Smirnov test showed that all the parameters were normally distributed. The mean and the covariance matrix for the five parameters (all but τ) were computed for each of the textural classes encountered along the soil profiles. Thus, random field of the five parameters was produced with a Monte Carlo procedure from the correlated multivariate normal distribution for any textural classes by generating a vector x of independent standard normal deviates and then applying a linear transformation of the form x = μ + Lr n , where μ is the desired vector of means and L is the lower triangular matrix derived from the symmetric covariance matrix V = LL T decomposed by Cholesky factorisation. In other words, the procedure generated random field with correlated parameters by multiplying the lower triangular Cholesky decomposition of the covariance matrix with vectors, r n , containing five N(0,1) randomly distributed numbers and by summing up the result to the mean of the parameters. We recall that using the statistical moments of the parameters of the hydraulic functions for generating the random field to be used in Monte Carlo simulations implies the assumption that soil hydraulic variability can be described by the statistical distribution of such parameters. This widely used approach is conceptually simple and is based on the idea of approximating stochastic processes by a large number of equally probable realisations. In this study, 400 sets of equally probable hydraulic parameter realisations were generated for each of the textural classes.

As for solute transport, solute travel time distributions were deduced by applying the method proposed by Scotter and Ross (1994), which estimates breakthrough curves of a tracer at a given depth for a given soil starting with the hydraulic conductivity function of that soil. According to the transfer function model (TFM), the flux concentration at a depth z, C f(z, I), given a time-varying flux concentration at the input surface C f(0, I) is given by

$$ {C}^{\mathrm{f}}\left(z,I\right)={\displaystyle {\int}_0^z{C}^f\left(0,I-{I}^{\prime}\right)}{f}^f\left(z,I\right)d{I}^{\prime } $$
(3)

where f f(z, I) is the steady-state travel time distribution (travel time pdf) defining the changes in the normalised concentration in the drainage as the cumulative drainage I builds up. For steady-state flow conditions, I = qt, where q is the steady-state flow rate and t is the time. Scotter and Ross (1994) assumed a gravity-induced water flow, a conservative and nonreactive solute and a purely convective flow, thus ignoring any convective mixing of solute flowing at different velocities and the effects of molecular diffusion. With these assumptions, the \( {f}^{\mathrm{f}}\left(z,t=I/q\right) \) for a pulse input of solute can be obtained as

$$ {f}^{\mathrm{f}}\left(z,t\right)=\frac{\mathrm{d}{C}^f\left(z,t\right)}{\mathrm{d}t}=-\frac{1}{q}\frac{\mathrm{d}K\left(\theta \right)}{\mathrm{d}t} $$
(4)

For a log-normal distribution of the cumulative drainage (or of travel times) the analytical expression for the pdf is a log-normal density function

$$ f(I)=\frac{1}{\sqrt{2\pi}\sigma I} \exp \left[-{\frac{\left( \ln I-\mu \right)}{2{\sigma}^2}}^2\right] $$
(5)

in which μ and σ are the parameters of the log-normal pdf.

For the case of a stochastic–convective with log-normal distribution of travel time (CLT) model, if the f f(z, I) is known at a given depth z 1, then the TFM model allows for scaling that pdf to a depth z 2 according to the equation

$$ {f}^{\mathrm{f}}\left({z}_2,I\right)=\frac{z_1}{z_2}\left({z}_1,I\frac{z_1}{z_2}\right). $$
(6)

This means that z 1 = z 2 and z 2 = z 1 + ln(z 2/z 1). If, to the contrary, the transport process obeys to the advection–dispersion (AD) model, z 2 = 0.5z 1(z 1/z 2) and z 2 = z 1 + ln(z 2/z 1) + 0.5z 1 2(1 − z 1/z 2).

For each of the hydraulic parameter random vectors obtained for each textural class, a corresponding fictitious breakthrough curve, f f(z, I), at an arbitrary depth of z = 40 cm was calculated according to Eq. (4) for a solute pulse injection at the surface. An inert, non-adsorbed (a tracer) solute was selected for simulation purposes. In order to calculate the cumulative drainage I, an hourly inflow rate was calculated by assuming that all the net recharge (the rainfall infiltrated in the soil minus the evapotranspiration) was uniformly distributed over the year and that storage and surface run-off were negligible.

By simply averaging over the 400 simulated breakthrough curves f f(z, I), an upscaled probability density function was obtained. Effective parameters μ(μ ef) and σ(σ ef) at 40 cm for each textural class were estimated by fitting Eq. (5) to the upscaled curve.

2.4.2 Upscaled Solute Travel Time Distribution for Textural Sequences

By assuming the independence between two successive layers along a textural profile, assumed to consist of two layers A and B, the mean E(I, z = 80 cm) and variance VAR(I, z = 80 cm) were obtained by summing up the E(I, z = 40 cm) and VAR(I, z = 40 cm) of the two textural classes for any textural sequence. The E(I, z) and the VAR(I, z) were calculated as

$$ \begin{array}{l}E\left(I,z\right)= \exp \left({\mu}_{\mathrm{z}}+0.5{\sigma}_z^2\right)\kern2.25em \\ {}\mathrm{V}\mathrm{A}\mathrm{R}\left(I,z\right)= \exp \left(2{\mu}_{\mathrm{z}}+{\sigma}_z^2\right)\ \left[ \exp \left({\sigma}_z^2\right)-1\right]\kern2em \end{array} $$
(7)

Once the upscaled 80 cm pdf was obtained, it was scaled with depth down to the water table according to the following hypothesis:

  1. 1.

    The transport mechanism in the second layer down to the water table is the CLT (the effective parameters were scaled according to the CLT model).

  2. 2.

    The transport mechanism in the second layer down to the water table is the CDE (the effective parameters were scaled according to the CDE model).

3 Results and Discussions

The outputs of the numerical simulations, carried out in a stochastic framework, were interpolated for producing continuous maps of the modal travel time, along with the corresponding uncertainty levels. Modal travel times for each of the textural sequences and for both the assumed transport mechanisms are synthesised in the maps in Fig. 4a, b; they exhibit remarkable differences in terms of travel time estimations for same soil textural classes due to the underlying transport approaches and soil layer conceptualisations.

Fig. 4
figure 4

(a) Modal travel time map for CLT process; (b) modal travel time map for AD process

Independently on the mechanism assumed, the spatial variability of textural layers was a major factor influencing the field water and solute transport in alluvial soils. Results showed large differences in the magnitude of the different travel times and related uncertainties among different profiles. The lower or higher vulnerability was found to be mainly related to the average silt content of the soil profiles. Higher travel time uncertainty was mainly related to the clay content in the range 20–40 %.

As for the CDE mechanism, travel times are generally higher than for the CLT by an average of 88 %, a minimum of 30 % (loam on loam) and a maximum of 220 % difference (silty clay on clay). Highly vulnerable areas, being characterised by a shorter travel time, showed to be more subject to estimation value fluctuations depending on the transport mechanism compared to more protected areas.

Of course, the local travel time should be interpreted according to the specific local conditions, especially in terms of land use, crop, water table depth and rainfall. Referring to CDE transport mechanism, travel time values were reclassified using natural breaks classification method into three main vulnerability classes: low risk, medium risk and high risk. In figure 5 is showed a comparison between land use classes versus vulnerability risk classes, using a spatial join constraint. Very low vulnerability resulted associated with areas not subject to intensive agrochemical inputs such as forest, to unutilized areas and to fruit tree orchards; fruit and vegetables together with olive orchards and cereals classes showed an average medium risk. Finally grape orchards and grassland/pasture land are associated with high risk.

Fig. 5
figure 5

Vulnerability risk for land use classes

Spatial distribution, fragmentation and perimeter to area ratio of land use patches were taken into account to ensure data homogeneity during generalisation of travel times and subsequently land use class risk rankings (Batty and Longley 1994).

4 Conclusions

All the past efforts to evaluate solute travel times to groundwater at the regional scale have frequently been hindered by the problem of how to account for the variability of soil layering. Past studies tended to decompose a profile into several (usually three to four) functional horizons and assumed that they are identical within a certain area when the water flow and the solute transport are modelled. This method may be suitable for some genetic soils that consist of intrinsic genetic horizons and do not vary significantly in the thickness of every horizon within a certain area. However, for alluvial soils, which are widely distributed in the alluvial plain investigated in this paper, the textural layering is very complex. Accordingly, to accurately quantify the solute transport process at regional scale, we took the spatial variability of textural layers explicitly into account. Information on soil textural profiles was coupled with a texture-based transfer function solute transport model to conduct a stochastic analysis of the solute transport in the Metaponto area (Basilicata region, South Italy). The aim was to assess the effect of spatial variability of textural layers on the solute travel times, along with their probability distributions. The strength of the methodology becomes apparent especially if compared to qualitative models that, while being the most common solution for regional studies, rely uniquely on empirical conceptualisations of chemical leaching processes and give as outputs only general qualitative indication rankings without quantification of risk in terms of travel times.