1 Introduction

Soil acts as an immobilizing system of trace elements, checking their passing to the groundwater. However, this ability is a function of the physic-chemical soil properties and environmental requirements [1, 2]. Usually, heavy metals, such as As, V, Cr, Ni, Cu, Z, Cd and Pb, occur naturally as “trace” in soil (< 1000 mg kg−1) by pedogenetic processes of parent materials, but also by some human activities and industrial processes, such as land application of fertilizers or pesticides, presence of landfills, disposal of high metal wastes, municipal solid waste plants, mine tailings, coal combustion residues, etc. [3, 4]. Generally, the surface sediments hold more heavy metal of the than the waterbody, so it is necessary to examine their contents in the surface sediments [5, 6]. Normally, the highest levels of trace elements originated from the parent rock are chromium, manganese, nickel, cobalt, copper, zinc and lead. Instead, arsenic, cadmium and mercury are usually present at deficient concentrations [7, 8]. In the environmental sector, trace elements are ubiquitous due to some phenomena such as chemical adsorption and physical precipitation in the sediments of groundwater. These phenomena can cause changes in the water or sediments condition in terms of pH, electrical conductivity (EC), oxidation–reduction potential (ORP), and chemical oxygen demand (COD) [9,10,11]. However, these can cause changes for chemical forms (speciation) and bioavailability of trace elements [3]. These changes can pose risks and hazards to humans and ecosystem, through direct ingestion or contact with contaminated soil, the food chain, drinking of contaminated groundwater, reduction in food quality via phytotoxicity and in land usability for agricultural production causing food insecurity [12]. Therefore, a global environmental issue results due to the crucial importance of food production and security. So, it is necessary an adequate protection and restoration of soil ecosystems contaminated by heavy metals [3]. The calculation of environmental geochemical baselines is necessary to assess the soil pollution status and to provide guidelines and quality standards in environmental legislation, especially for the evaluation of contaminated soils and environmental risk assessment [13]. However, the geochemical baseline assessment is complex and not always possible, so this paper proposes a statistical procedure to determine the geochemical baseline values. The statistical approach (cumulative frequency and normalization method) proposed, combined with the evaluation of two pollution indicators (geoaccumulation index and enrichment factor), allow to establish the soil pollution status.

1.1 Soil pollution status: the definition of background value (BVs) and geochemical baseline concentrations (GBCs)

The geo-availability controls the behavior of trace elements geogenic in soils. It is the fraction of the total content of a chemical element that can be freed to the environment by mechanical, chemical, or biological processes. Generally, the anthropogenic sources cause the more mobility of heavy metals in the soil than pedogenic or lithogenic ones [14]. In order to determine if a soil is polluted, it is essential to characterize trace elements presence for geogenic or anthropogenic sources [15]. The measurement of the rates of trace elements in the soil surface horizons cannot provide evidence about the soil contamination status, because not support the distinction between natural background and anthropogenic enrichment [16]. In fact, it is necessary to define:

  • Background value (BVs): natural contents of substance in the soil dependent on the compositional and mineralogical characteristic of the parent/source geological material;

  • Geochemical baseline concentrations (GBCs): concentration range of an element in a specific area dependent both on the parent geological/source material and on the historic diffuse release [17, 18].

The former one represents the natural concentrations of an element in the soil [19], a measure performed to differentiate between the natural compound concentrations and the concentrations with an anthropogenic influence on an environmental sample [20]. Once the natural background of the metals has been defined, the geochemical anomaly is defined as the deviation of the metal content from the limit value defined by the legislation [21]. The background value depends on the geological substrates, and the natural processes performed the soils [22]. Still, due to atmospheric deposition that can contaminate soils with certain trace elements, it is almost impossible to learn natural background levels [23, 24]. It is difficult to determine the background value in areas subject to anthropogenic influence or marked by high spatial variability of the pedological/geological characteristics of the soils. Moreover, the absence of information relating to the soil geochemical composition in a study area is also a strong limit for the assessment of soil contamination. The definition of the natural background, often, occurs by international reference values, but they derived from other studies and the reference may cause severe uncertainty. Therefore, the geochemical baseline concentration assessment is more useful, since it represents conditions where a certain human impact on the environment already exists [25]. As already mentioned, the geochemical baseline concentrations include the geogenic natural content and the anthropogenic contribution in the soils [17, 18]. However, a geochemical baseline should be determined separately for each trace elements in the study area, otherwise the limit values for contaminated soils may be lower than the natural concentrations calculated for an extensive area [26]. Although referring to a Regional Geochemical Baseline Mapping is a valid procedure, it would be better to determine the local geochemical baseline, because it is more representative since the trace elements anomalous values recorded to particular sites could be geogenic [2]. In fact, to determine local geochemical baseline, its necessary sampling of topsoil at depths not exceeding 1 m of soils (subsurface horizon) [15, 27, 28]. The surface layers (from 0 to 20 cm) provided more information on the pollution levels caused by the geogenic processes of soil formation or anthropogenic sources. Instead, the subsurface layers (from 40 to 100 cm) exclusively represent the lithogenic contributions, since there is little probability of contamination through atmospheric deposition, and for this reason the subsurface horizon can be used to determine the local geochemical baseline [13]. In the hypothesis that it is not possible to carry out a subsurface soil horizon samples and the Regional Geochemical Baseline Mapping is not available, some studies [29,30,31] identify statistical methods (cumulative frequency and normalization method) as useful tools to define the geochemical baseline values. In order to assess the soil pollution status, some studies [28, 32,33,34,35,36] use some quantitative indexes: the geoaccumulation index (Igeo) and the enrichment factor (EF). By comparing the concentrations of a trace element in the topsoil and the locally natural background for the study area, these indexes allow the “health of a soil” assessment. These indexes identify, numerically, pollution level soils and normally they are calculated on the soil exchangeable fraction because it represents the real bioavailable fraction. The bioavailable metal content in soil exerts a decisive impact on soil quality and it is used in food production [16]. This fraction is obtained by applying the first step of Tessier et al. [37] procedure and optimized by Frankowsky et al. [38].

The objectives of this paper are:

  • to define a procedure able to evaluate the soil contamination status, by some Italian case studies in Lombardy and Lazio;

  • to assess the exchangeable fraction of topsoil, associated with the calculation of the geoaccumulation index (Igeo) and metals enrichment factor (EF);

  • to propose a statistical approach to estimate the geochemical baseline values.

2 Proposed method

The pollution by trace elements is one of the most critical environmental risks to health. Generally, trace elements can be present at low concentrations (mg kg−1) in soils, but sometimes, they can be toxic to the environment. The Environmental Protection Agency (EPA) has presented a table of pollutants with some trace elements, such as silver, arsenic, beryllium, cadmium, chromium, copper, mercury, nickel, lead, antimony, selenium, thallium and zinc. Among these elements, the most widespread in contaminated soils are As, Cd, Cr, Cu, Hg, Pb, and Zn [2]. Contamination is linked to the presence of a substance, for example, a trace element, at concentrations higher than the natural background values, because the contaminants are dangerous only under specific conditions in the environment [39]. Once an element passed from soil to water, to be hazardous it must be found in abnormal concentrations, i.e. higher than naturally background values. In the environment, the element could be available, and the hazard will depend on its “bioavailability”. It is the degree to which a contaminant is possible to uptake in soil or groundwater, i.e. the movement into or onto an organism [40]. Hence, it seems useful to show a method to assess trace elements concentration in soils and to explain their presence due to natural enrichment or anthropogenic contamination (Fig. 1).

Fig. 1
figure 1

Flow-sheet diagram to assess trace elements contamination in soils

Firstly, the study area must be presented, highlighting the geological and hydrogeological settings. In order to characterize an area, it is also important to consider the study area location: the presence of infrastructure, industrial areas, quarries or mining areas, land uses, agricultural areas, plants. In order to assess the soil contamination by trace elements, a sampling survey should be performed according to Italian government regulation. At the laboratory, chemical analysis is followed by inductively coupled plasma mass spectrometry (ICP-MS). The trace elements distribution and their relative mobility in the different soil phases can be pursued through extraction method, specifically sequential chemical extractions. The method uses different specific extractants, based on complex reactions. The result is an operational speciation leading to the trace elements mobility and their partitioning into the main soil phases [37]. In order to assess environmental issues, this method allows the determination of “total” (aqua regia extraction) and “exchangeable” (acetic acid, ammonium acetate and ethylenediaminetetraacetate—EDTA extraction) fraction of trace elements. The “exchangeable” fraction represents the real bioavailable fraction on the soils [2, 16]. Then, the proposed method considers some in situ parameters: pH, redox potential (Eh), electrical conductivity (EC), temperature (T) and dissolved oxygen (DO). These parameters, in fact, may undergo variations due to trace elements concentrations beyond the threshold values by Italian Legislative Decree 152/06. In order to study the trace elements distribution in the soils, a statistical analysis is necessary, that allows: (1) to compare the trace elements concentrations with the threshold values of Italian Legislative Decree 152/06, (2) to define background values, according to ISPRA Procedure [41], (3) to verify the spatial and temporal distribution of trace elements concentrations, (4) to perform statistical correlations between the trace elements concentrations (Boxplot diagram, Spearman correlations) and correlations also with the chemical-physical elements. The proposed method, at the end, identifies geoaccumulation index (Igeo) and enrichment factor (EF) as useful tools to assess the potential pollution risks of trace elements in the soil. These indexes, defined on the bioavailable fraction of the trace elements, can give an indication if the contamination is due to anthropogenic causes or natural enrichment, therefore linked to geogenic phenomena occurring locally.

3 Materials and methods

3.1 Sampling and chemical analysis

Sampling is a basic phase of each environmental assessment and the results quality is closely linked to the sampling itself. It is therefore a complex step affecting the results of all subsequent operations. Therefore, it is necessary to prepare a Sampling Plan aimed at collecting representative samples for a correct contamination phenomena evaluation. The following method is widely used in environmental geochemistry and recommended by Italian government regulation. In general, to assess the topsoil natural enrichment, the samples are collected to a maximum depth of 30 cm from the surface. At the laboratory, soil samples were placed in plastic containers and dried at a constant temperature of about 40 °C. Once dry, the samples were sieved manually using a 2 mm mesh sieve. Chemical analysis is performed on a fraction of soil (Ø < 2 mm), according to the Legislative Decree 120/2017. For each soil sample, a granulometric fraction of less than 2 mm weighing 1 g is powdered using an agate mortar [42]. The chemical analysis is followed by Thermo Scientific Application Note 40619 (US EPA SW-846 Method 6020A Using the XSERIES 2 ICP-MS). The solubilization procedure by “acqua regia” (HCl-HNO3) allows to determine “pseudo-total” content of trace elements. It can enable the estimation of the maximum amounts of trace elements that are potentially available or mobile in soils, in fact it does not dissolve silicate matrices [43]. The procedure uses 250 mg of soil sample and it mixes with 9 ml of HCl plus 3 ml of HNO3 (“aqua regia”) and placed on a hot plate at 50 °C for 2 h. Once the solubilization is performed and cooled for 20 min, the concentrations of minor and trace elements are measured using an ICP-MS (X Series 2, Thermo Fisher Scientific) following filtration (0.45 μm) and acidification in the field (HNO3, 3%). The analytical accuracy of these methods ranged from 2 to 5%. An internal standard, Rh, was used to correct the ICP-MS instrumental drift. Ultrapure water (Millipore, Milli-Q, 16 M X cm) was used to prepare blanks, standard solutions and sample dilutions [37]. Instead, there are a lot of methods to define “exchangeable” fraction of trace elements, such as calcium chloride (CaCl2), calcium nitrate (Ca (NO3)2), ammonium nitrate (NH4NO3) or ammonium acetate (CH3COONH4) [44, 45]. It allows to assess the mobility and availability of trace elements in soils. Once the solubilization is performed, the concentrations of trace elements can be measured using an ICP-MS (X Series 2, Thermo Fisher Scientific). On the contrary solubilization procedure by “acqua regia”, the definition of “exchangeable” fraction is not required by Italian government regulation.

3.2 Geochemical baseline values assessment

In order to assess the soil pollution, it is necessary to define Geochemical Baseline Values in the study area. Some studies [29,30,31] have shown the need to use statistical methods to determine the geochemical baseline values. According to Zhou et al. [29], the used statistical approach considers the combination between the cumulative frequency curve (CFD) and the normalization method through a linear regression. A cumulative frequency distribution (CFD) curve is plotted. The curve is generally characterized by:

  • Two inflexions. In this case, the lower data represents the upper limit of natural origin concentrations and the higher ones are the limit of abnormal concentrations. The extreme points are deleted until the remaining data meet the criteria for the linear regression (R2 > 0.95). These data are selected as average metal concentrations to define local geochemical baseline;

  • One inflexion. In this case, the geochemical baseline is obtained from the data below the inflexion value;

  • No inflexion. If the curve is approximately linear, all the data are used to calculate the baseline.

According to some studies [29,30,31], the geochemical baseline is defined by arithmetic mean or median value. For the normalization method, a conservative reference element is selected because they are not influenced by anthropogenic inputs. Generally, the commonly used reference elements are: aluminum, iron, manganese, rubidium and scandium [46, 47]. The normalization method involves a linear regression equation between “n” trace element (Cn) and the reference element (Cref) by following way:

$$C_{n} = a*C_{\text{ref}} + b$$
(1)

where a and b are regression constants. In Eq. (1), the natural sediment is defined by the data points falling inside the 95% confidence band. Instead, data falling outside the 95% confidence band reflect probably anthropogenic input [31, 32]. Data points inside the 95% confidence band are used to calculate the geochemical baseline of “n” trace element (Bn) as follows:

$$B_{n} = c*\bar{C}_{\text{ref}} + d$$
(2)

where \(\bar{C}_{\text{ref}}\) is the mean concentration of the reference element. The geochemical baseline value at each sampling site can be calculated according to Eq. (2). A high dataset with a greater number of data, with a homogeneous distribution over the study area, makes the use of the statistical method more effective, stable and statistically valid.

3.3 Pollution indicators

Some pollution indices can be used to provide a relative ranking of contamination levels. In fact, geoaccumulation index (Igeo) and enrichment factor (EF) can be employed to assess the pollution status and to estimate the impact of anthropogenic activities [16, 33, 48,49,50]. These indicators are used to assess the presence and intensity of anthropogenic contaminant on topsoil. They are calculated by the normalization of a trace element concentration in the topsoil respect to the concentration of a reference element [16].

3.3.1 Igeo values

The geoaccumulation index (Igeo) has been used since the 1970s as a measure of contamination of bottom sediments [51] and has been used evaluating the contamination of trace elements on soils [16, 52,53,54]. It is used to evaluate enrichment of the upper horizons respect to “local background”. The Igeo was calculated (in the fraction ≤ 2 μm) by following way:

$$I_{\text{geo}} = \ln \left( {\frac{Cn}{{1.5*B_{n} }}} \right)$$
(3)

where Cn is the concentration of “n” trace element in topsoil and Bn is the geochemical background concentration of “n” trace element. The factor 1.5 is the background matrix correction factor due to lithogenic effects and anthropogenic influence. According to Muller [55], the geoaccumulation index (Igeo) consists of 7 classes: unpolluted (Igeo ≤ 0), unpolluted to moderately polluted (0 < Igeo ≤ 1), moderately polluted (1 < Igeo ≤ 2), moderately to heavily polluted (2 < Igeo ≤ 3), heavily polluted (3 < Igeo ≤ 4), heavily to extremely polluted (4 < Igeo ≤ 5), or extremely polluted (Igeo ≥ 5).

3.3.2 Enrichment factor

The enrichment factor (EF) is used to identify the contribution of anthropogenic sources in topsoil [56]. The enrichment factor is expressed as follows:

$${\text{EF}} = \frac{{\left( {C_{n} /C_{\text{ref}} } \right)}}{{\left( {B_{n} /B_{\text{ref}} } \right)}}$$
(4)

where Cn is the concentration of the “n” trace element in topsoil, Cref is the concentration of reference element in topsoil sample, Bn is the geochemical background concentration of the “n” trace element in topsoil (background value) and Bref is the geochemical background concentration of reference element in topsoil sample. In fact, this index allows to evaluate the variation of a potentially toxic element respect to a reference element. A reference element is an element particularly stable in the soil, which is characterized by absence of vertical mobility and/or degradation phenomena. The commonly used reference elements are: aluminum, iron, manganese, rubidium and scandium [46, 47]. According to Zhang et al. [57], EF values between 0.05 and 1.5 indicate that the trace elements enrichment is due to crustal materials or natural processes, whereas EF values higher than 1.5 suggest that the contamination is linked to be anthropogenic sources. In fact, generally the soils can be classified as deficiency to minimal enrichment (EF < 2), moderate enrichment (2 ≤ EF < 5), significant enrichment (5 ≤ EF < 20), very high enrichment (20 ≤ EF < 40) or extremely high enrichment (EF ≥ 40) [47].

4 Results

This paper presents some Italian case studies: (1) the Bevera Valley Basin (Lombardy), (2) the Appian Way Regional Park, located in the center of the city of Rome (Lazio), (3) Borgo Montello and Cisterna di Latina in Pontina Plain, located to the southeast of Rome (Lazio) and (4) Roccasecca located within Latina Valley (Lazio).

4.1 The Bevera Valley Basin in Lombardy

The Bevera Valley Basin is located in the Municipality of Arcisate (VA), in the northeast part of Lombardy Region (Fig. 2). In the southeast part of the Municipality of Arcisate, there are three big quarrying areas: Rainer Quarry and Femar Quarry, both non-operative since the 90s, and Valli Quarry (operative).

Fig. 2
figure 2

Localization of Arcisate (VA) in the northeast part of Lombardy Region

During the sampling survey, from April 2017 to March 2018, the Arsenic, Manganese and Iron concentrations in groundwater exceeded the threshold values of the Italian Legislative Decree 152/2006. In order to verify possible interactions with groundwater, two topsoil samplings were performed in Rainer Quarry and Femar Quarry and they were analyzed according to procedure by Italian Legislative Decree 152/2006 (Fig. 3).

Fig. 3
figure 3

Trend concentrations trace elements for Rainer and Femar Quarry

Figure 3 shows an analogy of the trends in the two quarries. The trends are linked, in fact, to the common geological origin of soil constituting two deposits, with high values for aluminum and iron.

4.2 Case studies in Lazio region

The studied areas are the Appian Way Regional Park (RM), Borgo Montello (LT), Cisterna di Latina (LT) and Roccasecca (FR) (Fig. 4). In southeast Rome, there is the Appian Way Regional Park, located in an urban environment within the Campagna Romana. This area is mainly characterized by pyroclastic deposits, coming from the Tosco-Laziale volcanic activity. The second and third areas are localized in the Pontina Plain, at the southeast of Rome, positioned in the geologically transitional zone between the Lazio volcanic deposits and the terrigenous coastal deposits [58]. Roccasecca, instead, is situated inside the Latina Valley and it is characterized mainly by alluvial deposits. Borgo Montello, Cisterna di Latina and Roccasecca are positioned near landfill sites [35].

Fig. 4
figure 4

Localization of case studies in Lazio Region

The topsoil units were selected: 9 samples were collected in the Appian Way Regional Park (Fig. 5), 16 samples in Borgo Montello (Fig. 6), 10 samples in Cisterna di Latina (Fig. 7), and 5 samples in Roccasecca (Fig. 8). The concentrations of V, Mn, Fe, Co, Ni, Cu, As, Cd and Pb in topsoil and background samples were analyzed.

Fig. 5
figure 5

Trend concentrations trace elements in the Appian Way Regional Park

Fig. 6
figure 6

Trend concentrations trace elements in Borgo Montello

Fig. 7
figure 7

Trend concentrations trace elements in Cisterna di Latina

Fig. 8
figure 8

Trend concentrations trace elements in Roccasecca

Figure 5 shows that in topsoil samples the Al, Fe and Mn trace elements have high concentrations. Instead, low values of Cd are showed in all samples, with concentrations lower than 1 mg kg−1.

Like the trace elements concentrations trend of Appian Way soils, Al, Fe and Mn are the major concentrations detected in Borgo Montello area (Fig. 6). Furthermore, Cd concentrations are lower than 1 mg kg−1.

The results (Fig. 7) for Cisterna di Latina topsoil indicate that Al, Fe and Mn are the main constituents, with low values for Cd.

Figure 8 shows that Fe and Mn are also the main constituents of the Roccasecca topsoil, with Cd concentrations lower than 1 mg kg−1. Figures 5, 6, 7 and 8 show similar trend of trace elements concentrations for topsoil samples in each considered case studies, with high values of manganese, iron and aluminum. These elements are associated to soil origin; in fact, these are common constituents of soils especially for soil of sedimentary deposits [59].

5 Discussions

The potential pollution risks of trace elements in topsoil were evaluated by method of geoaccumulation index (Igeo) and enrichment factor analysis (EF). The objective is verify that some trace elements (i.e. As, V, Cr, Ni, Cu, Co, Cd and Pb) with higher concentration in topsoil samples compared to the background values can be possibly linked to some anthropogenic activities in addition to the content from parent soils.

5.1 The Bevera Valley Basin in Lombardy

Negative values of Geoaccumulation Index suggest that the trace metals may be entirely from natural weathering processes [57]. Instead, an Igeo > 0 value indicates that a portion of the trace metals derives from non-crustal materials [47], for example by pollution sources. Table 1 shows the results of the Geoaccumulation Index in Rainer and Femar quarries.

Table 1 Geoaccumulation index (Igeo) for Rainer and Femar Quarry

The Igeo results (Table 1) highlight the absence of contamination, with negative values, for Rainer and Femar quarries. In order to distinguish the geogenic enrichment or anthropogenic contamination of trace elements in Rainer and Femar quarries, the enrichment factor (EF) was evaluated (Fig. 9).

Fig. 9
figure 9

Enrichment factor (EF) for Rainer and Femar Quarry

The aluminum (Al) is chosen as reference element to assess the enrichment factor (EF), because it is a highly conservative element and an important constituent of clay minerals [60]. In order to calculate the geoaccumultaion index (Igeo) and the enrichment factor (EF), the geochemical background concentration of ‘n’ trace elements has been considered by “Progetto di Monitoraggio Ambientale su tutto il Territorio della Regione Lombardia—Progetto Soil” [61], equal to the 95th percentile. It is possible to verify that Rainer Quarry is enriched more in Arsenic (As) and Copper (Cu). Other previous studies [62, 63] have shown that the significant enrichment of Arsenic in Rainer Quarry is linked to the clayey nature of the soil, with peat lenses, in the study area. This evaluation makes it possible to state that the two sites are not contaminated and, therefore, the trace elements are not linked to anthropogenic inputs, but exclusively to natural enrichment.

5.2 Case studies in Lazio region

Samples are assembled at a depth of 1 m to assess the geoaccumulation index and the enrichment factor and test and differentiate between natural enrichment or anthropic contamination of the trace elements. For studies cases, it was possible to select one background value for all samples, except for the Appian Way Regional Park (one background value every three samples) and Roccasecca (only one background value for all samples) due to the closeness of the samples and the similarity of the features.

Table 2 presents Igeo results for the Appian Way Regional Park, Table 3 shows Igeo values for Borgo Montello soils, Table 4 presents Igeo values for Cisterna di Latina soils and Table 5 presents Igeo values for Roccasecca soils. Alternatively, Fig. 10 shows EF results for the Appian Way Regional Park; Fig. 11 shows EF values for Borgo Montello soils, Fig. 12 shows EF values for Cisterna di Latina soils and Fig. 13 shows EF values for Roccasecca soils.

Table 2 Geoaccumulation index (Igeo) for Appian Way Regional Park
Table 3 Geoaccumulation index (Igeo) for Borgo Montello
Table 4 Geoaccumulation index (Igeo) for Cisterna di Latina
Table 5 Geoaccumulation index (Igeo) for Roccasecca
Fig. 10
figure 10

Enrichment factor (EF) for Appian Way Regional Park

Fig. 11
figure 11

Enrichment factor (EF) for Borgo Montello

Fig. 12
figure 12

Enrichment factor (EF) for Cisterna di Latina

Fig. 13
figure 13

Enrichment factor (EF) for Roccasecca

Tables 2, 3, 4 and 5 show negative Igeo values for each trace elements in almost all samples, showing the absence of anthropogenic enrichment of topsoils. In a few topsoil samples by Appian Way Regional Park (Table 2) and Cisterna di Latina (Table 4), the geoaccumulation index has positive values for some trace elements (V, Cr, As, Cd and Pb), indicating weak or light enrichment. Instead, in some topsoil samples by Borgo Montello (B7, B8, B12, B13 and B15—Table 3) and Roccasecca (R1 and R8—Table 4), there are Igeo values between 1 and 2 for some trace elements (As, V, Cr, Cd, Co) that indicate moderate conditions of contamination. This could be linked to the proximity of the Borgo Montello and Roccasecca to a municipal solid waste landfill and a waste treatment plant. The wastes with trace elements such as V, As, Cr, Cd and Co are transported to landfills daily and they may be carried to the soil by wind and may, therefore, be subject to atmospheric deposition. The calculation of the enrichment factor confirms the results defined for these case studies (Figs. 10, 11, 12, 13). The aluminum (Al) is chosen as a reference element to assess the enrichment factor (EF), because it is a highly conservative element and an essential constituent of clay minerals [60]. Boxplots were used to show the results of the enrichment factor (Figs. 10, 11, 12, 13), in order to represent the statistical distribution synthetically, using lighter and darker colors to highlight the increase in the enrichment factor. Borgo Montello (Fig. 11) is characterized by high values of the enrichment factor, with amounts almost equal to 25 for the Cd, thus confirming anthropic contamination. The enrichment factor results for the Roccasecca (Fig. 13) also confirm significant values, especially for the As, probably linked to the presence of municipal solid waste landfill. Instead, for Appian Way Regional Park (Fig. 10) and Cisterna di Latina (Fig. 12), there are enrichment factor values less than 5, highlighting a moderate enrichment in trace elements, probably associated with the geogenic nature of the soils.

5.2.1 Assessment of geochemical baseline values in Borgo Montello

According to Zhou et al. [29], the statistical approach is applied to Borgo Montello case study. Borgo Montello is the case study in Lazio region with the greatest number of data, with concentrations above the threshold values of the Italian Legislative Decree 152/2006. The results provided by the statistical approach are incomplete, due to the small number of samples available (16 surface sediment samples), but it may be useful for subsequent processing with more topsoil samples. The Pearson correlation matrix (Table 6) is used to determine the relationship between trace elements.

Table 6 Pearson correlation matrix for Borgo Montello

The statistical approach is characterized by two steps:

  1. 1.

    Processing of the cumulative frequency curve (CFD) for the trace element whose geochemical baseline value is to be checked;

  2. 2.

    Normalization (a linear regression) through the expressions 1 and 2 between two elements: the trace element and a reference element present significantly in the soils. A conservative reference element is chosen because it is not affected by anthropogenic inputs and its high behavior is linked to geogenic enrichment of the soils.

For Borgo Montello, manganese is chosen as reference element, because it is present with high amount in soils [46, 47].The statistical approach was applied to vanadium, chromium, cobalt and nickel to define geochemical baseline values, because significant correlations between manganese and the trace elements are highlighted by the Pearson matrix (Table 6). This results from the biological and geochemical properties of this element and from the fact that in soils chromium is bound mainly by the organic compound.

In this study, the arithmetic mean of V is 142.34 mg kg−1, Cr is 105.46 mg kg−1, Co is 12.97 mg kg−1 and Ni is 31.2 mg kg−1. Therefore, the cumulative frequency curve method is applied to V (Fig. 14a), Cr (Fig. 14b), Co (Fig. 14c) and has Ni (Fig. 14d): their curve has one inflexion.

Fig. 14
figure 14

Cumulative frequency curve method for vanadium (a), chromium (b), cobalt (c) and nickel (d) in Borgo Montello

Thus, the geochemical baseline for V, Cr, Co and Ni was obtained from the data below the inflexion value, considering the arithmetic mean of the remaining values. The calculated baseline concentration of V is 78.24 mg kg−1, Cr is 84.10 mg kg−1, Co is 6.67 mg kg−1 and Ni is 23.7 mg kg−1. For normalization method (Fig. 15), Mn is selected as the reference element.

Fig. 15
figure 15

Normalization method between manganese and vanadium (a), manganese and chromium (b), manganese e cobalt (c), manganese e nickel (d) in Borgo Montello

The correlation coefficients increase after points falling out of the 95% confidence band are removed. The remaining data points were used to calculate the geochemical baseline as follows:

$$B_{\text{V}} = - 0.1469* \overline{\text{Mn}} + 178.92,\;r = 0.83$$
(5)
$$B_{\text{Cr}} = - 0.042* \overline{\text{Mn}} + 99.294,\;r = 0.88$$
(6)
$$B_{\text{Co}} = 0.0418* \overline{\text{Mn}} - 6.68,\;r = 0.97$$
(7)
$$B_{\text{Ni}} = - 0.0549* \overline{\text{Mn}} + 50.58,\;r = 0.81$$
(8)

The calculated geochemical baseline value of V is 123.07 mg kg−1, greater than the result obtained by cumulative frequency curve method (78.24 mg kg−1), Cr is 82.10 mg kg−1, similar to the value obtained using the cumulative frequency curve method (84.10 mg kg−1), Co is 9.41 mg kg−1 and Ni is 29.70 mg kg−1, both quite similar to values obtained using cumulative frequency curve method (respectively for Co is 6.67 mg kg−1 and Ni is 23.7 mg kg−1). The number of topsoil samples available is too small to make a more correct estimate, in fact according to Zhou et al. [29] the number of topsoil samples used it is approximately 40. It is however possible to verify that the chromium, cobalt and nickel geochemical baseline values established using both the cumulative frequency method and the normalization method are similar. Instead, the calculation of vanadium geochemical baseline values through statistical approach determines irregular values.

6 Conclusions

The article proposes a procedure to determine the origin of trace elements in topsoil, in order to distinguish natural enrichment, due to parent material and geogenic origin, and anthropogenic contamination, caused by activities such as presence of landfills, municipal solid waste plants, industrial areas, non-operative quarries, etc. In order to verify the origin of the trace elements in the soils, the procedure highlights the need to distinguish between the total fraction and the exchangeable fraction of the trace elements in the soils. In fact, the exchangeable fraction identifies the rate of an element adsorbed on the main constituents of the soil (clayey material, iron oxides or hydroxides, organic material) and environmentally it represents the bioavailable quote of the trace elements can enter in contact with environment (soil or groundwater) and with food chains, thus causing actual pollution phenomena. The proposed procedure also highlights the need to determine the local geochemical baseline values in order to assess the soil pollution status. In the hypothesis of absence of a Regional Geochemical Baseline Mapping or of the impossibility of subsurface soil horizion sampling at a maximum depth of one meter, a statistical method is considered to assess the geochemical baseline. Furthermore, it is essential to consider some pollution indicators that allow to verify the possible origin of trace elements in the soils. In fact, the proposed procedure involves the definition of the geoaccumulation index (Igeo) and the determination of the enrichment factor (EF). These indices have been applied to several Italian case studies with the aim to verify the origin of the trace elements, if the concentrations are above the threshold values of the Italian Legislative Decree 152/06. In the case of Bevera Valley Basin in Lombardy, the geoaccumulation index and the enrichment factor allowed to verify the absence of anthropogenic contamination, but an arsenic geogenic enrichment in Femar and Reinar querries due to peat lenses in the study area. However, for the case studies in the lower Lazio region, in the Borgo Montello and Roccasecca sites, contamination is plausibly associated with the presence municipal solid waste landfill and a waste treatment plant. On the contrary, for the cases of the Appian Way Regional Park and Cisterna di Latina, a natural enrichment of the soils is confirmed due to the geogenic origin. Moreover, the statistical approach to assess the geochemical baseline values was applied to Borgo Montello case study. The results highlight the need to have a high number of topsoil samples and homogeneously located in the study area. Although the small amount of data available for Borgo Montello, it is possible to verify for chromium, cobalt and nickel a good correlation between the geochemical baseline value obtained by the cumulative frequency method and the normalization method with manganese. The result highlights also the biological and geochemical properties of this element and from the fact that in soils chromium is bound mainly by the organic compound.