Introduction

Conventional agriculture has been applying pesticides to control pests such as weeds, insects, and fungi and maintain high levels of productivity (Hvězdová et al. 2018, Li 2022). According to the Food and Agricultural Organization of the United Nations (FAO), the global agricultural use of pesticides in 2020 was 2.7 Mt of active ingredients (ai), and the average application per crop area was 1.8 kg/ha (FAO 2022). The Americas applied the highest level of pesticides in agriculture with an average of more than 1 Mt pesticides/year and also topped the levels of pesticides per cropland with 2.8 kg/(ha × year) during 1990–2020 (FAO 2022).

Despite the benefits of pesticides in terms of food production and crop yields (Silva et al. 2019), their intensive and long-term application can negatively impact the environment (Sabzevari  and Hofman 2022, Wang et al. 2020). Currently used pesticides (CUPs) are considered more environmental friendly, effective and safe than for instance the banned organochlorine pesticides (Hvězdová et al. 2018). However, some CUPs can still persist in soil (Geissen et al. 2021, Riedo et al. 2021), appear in remote areas (Hvězdová et al. 2018), contaminate plant tissues (Kalyabina et al. 2021), affect food safety (Carvalho 2017, Seneff 2021, Zikankuba et al. 2019), occur in drinking or surface waters (de Souza et al. 2020, Syafrudin et al. 2021), and generally represent a risk for human health (Lewis et al. 2016, Seneff 2021, Sidhu et al. 2019), and the environment (Carvalho 2017, Lewis et al. 2016, Md Meftaul et al. 2020, Tudi et al. 2021, Zikankuba et al. 2019).

Once the pesticides are applied, agricultural soils are among the first exposed environmental media. Recently, a number of articles of residues of CUPs in agricultural soils of temperate regions such as Europe have been published (Hvězdová et al. 2018, Padilla‐Sánchez et al. 2015, Riedo et al. 2023, Sabzevari and Hofman 2022, Silva et al. 2019). However, in tropical areas, monitoring of CUPs is limited (Tan et al. 2020), and research was mainly conducted in Asian countries (Sabzevari and Hofman 2022), first of all China (Tan et al. 2020) and India (Kumari et al. 2008, Murugan et al. 2013). In tropical areas of Latin America and the Caribbean, only a few reports exist of Brazil (Bortolozo et al. 2016, Guarda et al. 2020), Mexico García-Hernández et al. 2021, Velasco et al. 2012), and Argentina (Primost et al. 2017).

Although the access to pesticides in Cuba is limited due to the trade and financial embargos, crops such as potato, tomato, and beans having a high national demand are prioritized when it comes to the distribution of CUPs, which happens on a national level. The most recent monitoring of pesticide residues in Cuba dates back to 1996, where concentrations ranged from 90 (desmetryn, a triazine) to 1960 μg/kg (diuron, a phenylurea) in soil 60 to 164 days after harvest (Dierksmeier 1996). Limits of detection of the 11 compounds analyzed in soil ranged from 20 to 300 μg/kg. Dierksmeier (1996) already used multi-residue techniques almost 30 years ago, but emphasized the need for more sensitive methods especially when it comes to food samples whose regulated maximum residue levels (MRL) need to be analytically met. The author (Dierksmeier 1996) concluded, after having monitored Cuban soil, water, and some cash crops such as beans, potato and tomato, that the island needs more monitoring studies in zones of agricultural impact.

Therefore, the aim of this paper was to develop and validate a multi-residue method for CUPs in agricultural, tropical soils, or different types. The method is based on QuEChERS (quick, easy, cheap, effective, rugged and safe) extraction (Anastassiades et al. 2003a), compounds are measured by gas chromatography coupled to tandem mass spectrometry (GC-MS/MS) and quantified with internal standard method. The target analyte list includes ai approved and mostly used in Cuban contemporary agriculture and some of their transformation products (TPs) with a focus on potato being a crop to which most pesticides are applied in Cuba. Whenever available and applicable in MS/MS, isotope-labeled analogues were used as internal standard (IL-IS). The method should be easy to transfer to Cuban laboratories and similar equipment such as GC-single quadrupole MS, GC-electron capture detector (ECD), and GC-nitrogen phosphorus detection, that are more available in developing and tropical countries. It was applied to a set of 30 soil samples of 150 from a 4-year-monitoring campaign between 2018 and 2022, with the purpose to quantify CUP residues in different agricultural soils of potato production. The results of this campaign will serve to derive regulatory guidance values (RGVs) for Cuban soils (Jennings and Li 2014).

Material and methods

Reagents and standards

Table S1 of the supporting information (SI) lists all pesticides, TPs, and isotope-labeled internal standards (IL-IS) used in this study, their CAS numbers, providers and purities ranging from 93 to 100%. Most of the compounds were dissolved in acetonitrile (ACN), except prosulfocarb and dimethomorph that were in ethanol (EtOH), to obtain stock solutions and stored at −20 °C. All solvents used (HiPersolv Chromanorm for HPLC-ultra LC-MS) were purchased from VWR Chemicals BDH (Dietikon, Switzerland). Primary secondary amine sorbent (PSA, 40 μm, Bondesil) was purchased from Supelco, Bellafonte, USA. Magnesium sulfate (MgSO4, anhydrous reagent Plus ≥ 99.5%), dried in a muffle oven at 500 °C for at least 4 h, and sodium acetate (CH3COONa, puriss p.a. ACs reagent, and anhydrous) were purchased from Sigma-Aldrich (Merck & Cie, Schaffhausen, Switzerland), sodium chloride (NaCl, for analysis) from Merck (Darmstadt, Germany), and formic acid (FA, Analar NORMAPUR, 99-100%) from VWR Chemicals BDH (Dietikon, Switzerland).

Selection of pesticides

The compound list (Tables S1 and S2) contained 38 ai of which 39% were fungicides (F), 24% herbicides (H), and 18% insecticides/acaricides (I/A). Additionally, some TPs (18%) as referred to in the database of pesticide properties (PPDB 2023) and based on availability on the market were added to the list (Tables S1 and S2). The CUPs were selected according to the official list of pesticide applications to potato crop in Cuba (MINAG 2016, 2022) but also comprised legacy compounds such as endosulfanes (α and β) and their TP (endosulfane sulphate). They were included because, although banned in Cuba since 2013 (Pérez-Consuegra and Montano-Pérez 2021), they continued in use (MINAG 2016) until 2022 (MINAG 2022). Prosulfocarb and carfentrazone-ethyl have not been approved for use in Cuba at least for 10 years (MINAG 2008) but were studied as negative control because they have frequently been applied in temperate agroecosystems (EC 2019, Hvězdová et al. 2018). Table S2 shows some physicochemical properties of all analytes.

Soils, sampling, and sample preparation

The soil type rhodic ferralic nitisol (RFN, classification according to World Reference Base for soil (WRB 2015)) is predominant in the Mayabeque province of Cuba and appropriate for agriculture (Febles-González et al. 2013). However, an agricultural soil may contain ai and/or TPs that limit its use for method development and validation (i.e. quantification by a matrix-matched (MM) calibration and as a negative, blank control). Thus, other soils possibly without residues were additionally needed. Table 1 lists the different soil types (skeletic regosol (SR), RFN, xanthic ferralic nitisol (XFN), and dystric cambisol (DC)) taken for MM calibration to preliminarily quantify pesticide residues and to study effects of the matrix. Rhodic ferralic nitisol was included twice, i.e., as organically and conventionally managed, respectively, because soil parameters were not the same in the two of them. The SR, a forest soil, was used for the validation because it originates from a natural park where pesticides were never applied.

Table 1 Physico-chemical characteristics of the soils used for method development and validation

The soils were sampled in the Mayabeque province, Cuba (Fig. S1) between 2014 and 2021 (Table S3) according to the Swiss soil monitoring network NABO, which is described in Gubler et al. (2015). In a 10 m × 10 m area 100 soil cores were taken with an auger of 1.8-cm diameter at a soil depth of 20 cm. Twenty-five of the soil samples were bulked to one sub-sample, immediately brought to the lab, dried in an air-conditioned room of 25 °C at 5% humidity for 7 days until constant weight, crushed, and sieved over a 2-mm sieve and stored in the dark until analysis.

Preparation of target analytes, IL-IS, syringe standard, and calibration solutions

Stock solutions of each ai or TP around 2000 µg/mL were dissolved in ACN (or EtOH, see “Reagents and standards”). A working mixture standard solution was diluted by mixing the stock solutions of each compound with ACN to a concentration of 1 µg/mL. The 10 IL-IS (Table S1) were individually dissolved in ACN obtaining stock solutions with a concentration around 1000 µg/mL. Then, an IL-IS mixture standard solution was produced in ACN at 0.5 µg/mL. Triphenyl phosphate (TPP) was used as syringe standard. It was dissolved in ACN at a concentration of 2000 µg/mL and diluted in ACN at 0.5 µg/mL. All solutions were prepared gravimetrically.

Analyte solutions for calibration (0, 5, 10, 25, 50, 75, and 100 ng/mL) were prepared from dilutions of the working standard mixture in ACN with FA 2.5% (ACN/FA). The IL-IS mixture and TPP (Table S1) were added in each calibration point at a concentration of 10 ng/mL each. All solutions such as stock, intermediate, and the working solutions were stored at −20 °C in the dark until use.

Extraction and clean-up by the QuEChERS method

The methods of Acosta-Dacal et al. (2021) and Rösch et al. (2023) were applied with modifications. Briefly, soil prepared as described in “Soils, sampling, and sample preparation” was mixed vigorously for 5 min in a Turbula shaker-mixer (Willy A. Bachofen AG, Muttenz, Switzerland) before taking a subsample of 5 g. It was put into a centrifuge tube (15 mL), and 10 ng/g IL-IS mixture was added (0.1 mL IL-IS solution). After 30 min of rest to allow the ACN to evaporate, the sample was rehydrated with 5 mL of Milli-Q water (Valverde et al. 2021), vortexed for 1 min, and kept at room temperature for 1 h. Next, 5 mL ACN/FA was added to the sample in the centrifuge tube, vortexed for 3 min, and sonicated for 15 min at 35 kHz with 80% microprocessor control (Sonorex Digital 10 P, Bandelin from IG, Zurich, Switzerland). Then, 2 g of MgSO4, 0.5 g NaCl, and 0.5 g CH3COONa were added to the mixture (Mahdavi et al. 2021), vortexed immediately for 1 min, homogenized with a Turbula for 10 min, and sonicated for 15 min with the same settings as indicated above. Subsequently, the sample was centrifuged for 10 min at 4200 rpm in a 5804 R Eppendorf centrifuge (Eppendorf, Hamburg, Germany). One mL of supernatant extract was filtered through a 0.20-μm Chromafil® PET filter (Macherey-Nagel, Düren, Germany). Finally, 0.02 mL of TPP corresponding to 10 ng/g was added as syringe standard before injection. An aliquot of 0.5 µL of the extract (see below) was directly injected into GC-MS/MS. The pesticide extraction with the procedure described before was compared to one with an additional clean-up step (0.05 g PSA, 2 g MgSO4, and 0.5 g CH3COONa). No differences in the pesticide concentrations were observed applying these extraction methods. However, we choose to apply the additional clean-up step for protecting the GC capillary column and the MS under routine analysis.

Instrument conditions

GC-MS/MS system

A GC-MS/MS (GC-2030 system MS-TQ8050 NX model, Shimadzu Corp., Kyoto, Japan) with an AOC-20i+s Plus (auto-injector and auto-sampler coupled to the GC) and an electronic flow controller were used in this study. Instrument control, data acquisition, and processing were performed using the Shimadzu GCMSolution software (version 4.45). Data were quantified by the LabSolutions Insight (version 3.1).

The analytes were separated on an SH-Rxi-5ms capillary column (30 m × 0.25 mm id and 0.25-μm film thickness; Shimadzu, USA). Helium 5.0 (> 99.999% purity, PanGas AG, Dagmersellen, Switzerland) was used as carrier gas (head pressure was set at 92 kPa with total flow at 26.6 mL/min, column flow at 1.35 mL/min, linear velocity at 46.2 cm/s, and purge flow at 5 mL/min). The injector temperature was at 280 °C (Shimadzu recommendation: 200–300 °C, Shimadzu GCMS-TQ series Jul 2018, 225-38195) and injection took place at high pressure of 250 kPa for 1 min onto a Topaz Liner with glass wool, split 3.5 mm × 5.0 × 95 for Shimadzu GCs (Cat #23319 from Restek, BGB Analytik, Boeckten, Switzerland). The oven temperature gradient program was as follows: hold an initial temperature of 80 °C (1.5 min, Shimadzu recommendation: 40–100 °C, Shimadzu GCMS-TQ series Jul 2018, 225-38195), ramp to 300 °C with 25 °C/min, and hold for 11 min. The total run time was 21.3 min. The solvent delay was 4 min; the retention time of the compounds is between 6 and 14 min (Table S4); and the column is reconditioned at 300 °C for 8 min.

The MS analyses were conducted in positive electron ionization (EI+) mode with a filament current set at 60 μA, ionization energy at 70 eV, and detector voltage at 1.9 kV. The ion source and transfer line temperatures were heated at 300 °C each. For ion fragmentation, argon 5.0 (> 99.999% purity, PanGas AG, Dagmersellen, Switzerland) was used as collision-induced dissociation gas. The optimization of chromatographic and mass spectrometric conditions from injection until finding the optimal transitions of the analytes is explained in the SI (“Optimization of chromatographic and mass conditions”).

Quantification of pesticides and transformation products

Quantification was done with the internal standard method (normalization of analyte peak area against one of the corresponding IL-IS) with a MM calibration and concentrations of 0, 5, 10, 25, 50, 75, and 100 ng/mL. All calibration regressions were calculated with the linear least square method and NOT forced through the origin. Ideally, each compound would have a corresponding structure identical (si) IL-IS, which was not possible. The parent ions of deuterated IL-IS were often the same as for native compounds due to the strong ionization in the source of the GC-MS/MS. Hence, only 10 IL-IS had different parent ions than their native analytes and could be added to the compound list in Table S4 for identification and quantification of compounds with si IL-IS. Please note that the IL-IS n-methyl-metribuzin-D3 was used for both, metribuzin desamino (DA), and desamino diketo (DADK) as si IL-IS (Table S4, IL-IS group 4). For this reason, there are 10 IL-IS but 11 analytes with si IL-IS. All other compounds (27) are quantified with a non-structural (nsi) IL-IS, where the IL-IS group is given in Table S1 and the corresponding analyte in Table S4.

Method validation

The validation of the method was primarily preformed according to the newest SANTE guideline for soil matrices (SANTE 2020). For reproducibility, the SANTE guideline for food and feed (SANTE 2021) was used. While individual validation parameters are additionally evaluated for all of the soils listed in Table 1, the complete validation is carried out for SR due to the reasons outlined in “Soils, sampling, and sample preparation.” Analytical parameters evaluated were ME, slopes of MM calibration curves of the different soils, repeatability and reproducibility (as measures of precision), absolute and relative recovery (the latter as a measure of trueness (SANTE 2021)), limit of quantification (LOQ), linearity, selectivity, and specificity. Each of them is briefly described in “Matrix effect” to “Selectivity and specificity” and “Statistics and data presentation,” and corresponding results presented and discussed in chapters “Matrix effects of different soils, matrix matched calibrations and the role of isotope-labeled internal standards,” “Precision (repeatability (RSDr) and reproducibility (RSDR)) and trueness for skeletic regosol (SR),” “Limit of quantification (LOQ) and linearity for SR,” and “Selectivity and specificity for SR.”

Matrix effect

While a comparison between internal and external standards (without normalization of analyte peak area against one of the corresponding IL-IS) method in soil extracts was carried out for all compounds, and more specifically for compounds with si IL-IS (Hartmann et al. 2007), to show ME, the routine analysis exclusively used the former. The concentrations of the standards in the ACN/FA solutions were the same as in MM calibrations.

The influence of co-extracts on the chromatographic signal, i.e., the ME, was evaluated by comparing the slopes of the calibration curves used for external (ext) standard method (Eq. (1a)), and internal (int) standard method (Eq. (1b)), respectively, of the individual compounds in MM calibration from the five soils (Table 1) with those in ACN/FA. The ME was quantified with Eqs. (1a) and (1b) and considered significant if |ME| > 20% according to the SANTE guidelines (SANTE 2020). A suppression is an ME smaller than −20% and an enhancement larger than 20% (SANTE 2020).

$${{\text{ME}}}_{{\text{ext}}} \left(\mathrm{\%}\right)=\left(\frac{{\mathrm{slope\;of\;MM\;calibration}}_{\mathrm{ext }} }{{\mathrm{slope\;of\;calibration\;in\;solvent}}_{{\text{ext}}} }-1\right)\times 100$$
(1a)
$${{\text{ME}}}_{{\text{int}}}\left(\mathrm{\%}\right)=\left(\frac{{\mathrm{slope\;of\;MM\;calibration}}_{\mathrm{int }} }{{\mathrm{slope\;of\;calibration\;in\;solvent}}_{{\text{int}}} }-1\right)\times 100$$
(1b)

Precision and trueness as a measure of accuracy

Precision and trueness are two complementary sides of accuracy, the closeness of an analytical result to a true value, where the former covers random and the latter systematic analytical errors (SANTE 2021). Precision was calculated as relative standard deviation (RSD) of quadruplicate analyses at each of three concentration levels (10, 25, and 50 ng/g), analyzed at the same day, and at three different days, to yield repeatability (RSDr), and reproducibility (RSDR), respectively (Table 2). The highest concentration (50 ng/g) corresponds to the acceptable LOQ for pesticide residue analyses in soil (EC 2010). Reproducibility was only included in the SANTE guideline (SANTE 2021) for food and feed with ≤ 20%. The Horwitz function describes the relation of RSDr and RSDR (originally: coefficients of variation) of multiple analyses as a function of analyte concentration, which serves here as an additional evaluation criterion (Horwitz 1982).

A significant ME (“Matrix effect”) may lead to low or high analyte recoveries, which can, however be accounted for by a MM calibration (Anastassiades et al. 2003b, Łozowicka et al. 2017). To reduce the variability of the recoveries (absolute and relative) induced by different soil types in a set of samples, MM calibrations were applied. Absolute recoveries of the 10 IL-IS determined from four replicates at the three fortification levels were quantified by dividing the ratio of the area of IL-IS (nominal concentration 10 ng/g, ISO-13876, 2013) (ISO 2013) spiked before extraction over the area of TPP (nominal concentration 10 ng/g, ISO-13876, 2013) (ISO 2013) spiked before injection over the ratio of areaIL-IS/areaTPP from the MM calibration. The IL-IS/TPP spiking time shift compensated for losses of the correspondent native compounds during preparation and clean-up and for ME. In contrast, relative recovery determined for all target analytes is the relation of analyte concentration quantified with MM calibration over the one spiked at a given concentration into the soil prior to extraction (Table 2). As such, it is a measure of trueness.

LOQ and linearity

The LOQ was the lowest detectable concentration of the MM calibration (0.1, 0.25, 0.5, 1, 5, 10, 25, 50, 75, and 100 ng/mL), with a signal of the quantifier ion transition over the noise ratio (S/N) >10 and the S/N ratio of the qualifier ion transition > 3. MM calibration curves were checked for linearity in the concentration levels (5, 10, 25, 50, 75, and 100 ng/ml). The 0 ng/mL was not included in the calibration curve because the data acquisition program did not allow a “non”-concentration. The range of analyzed concentrations was within the range of LOQ to 500 × LOQ. The calibration was carried out by the IL-IS method (IL-IS concentration: 10 ng/mL). The linearity was expressed as coefficient of determination R2 and RSD.

Selectivity and specificity

Selectivity is the unique identification of an analyte by its retention time, which was the middle of the base of the peak ± 0.5 min, and the quantifier to qualifier ion transition ratio, given by the respective MM calibration ± 50% for a sample. Specificity is the differentiation of the analyte from components in the matrix or other compounds of the method. This parameter is especially important when working, as in our case, with deuterated IL-IS and the corresponding native compound in mass spectrometer techniques where the ionization is harsh and m/z of their fragments often the same. Abundance and concentration of each pesticide of (i) the lowest MM calibration level (5 ng/g), (ii) SR blanks, and (iii) SR fortified at the lowest level for each matrix (10 ng/g) were compared qualitatively (Fig. S4). The chromatograms should show a clear peak/resolution especially between the 5 and 10 ng/g concentrations and no peaks in the SR blank, and the latter should, according to the SANTE (2020) guidance, not be higher than 30% of the LOQ. Otherwise, detailed justification has to be provided. Both parameters were not only part of the validation but contribute to quality control and quality assurance (QA/QC) under routine analysis.

Method application to real Cuban soil samples

The validated method was applied to 30 samples of Cuban soils taken from 14 different potato fields (Fig. S1; Table S3). The samples represent a subset of a large monitoring campaign with different pesticide use of conventional (n = 26) and organic (n = 4) farming of RFN soil type. Sampling and preparation of soil samples were carried out as described in “Soils, sampling, and sample preparation.” Two extraction series were conducted, consisting each of 15 samples, three spiked RFNo (10, 25, 50 ng/g, n = 1 for each concentration), non-spiked RFNo in triplicates (n = 6 in total), three non-spiked sand samples (n = 6 in total), and 12 non-spiked SR (blank SR from the reproducibility batch of 3 days in quadruplicates/day). All non-spiked samples served as blanks. Reproducibility as a measured of extended method precision in RFNo was assessed by the corresponding coefficient of variation (RSD) of relative recoveries for each spiked level (10, 25, and 50 ng/g). Ten ng/g of IL-IS was spiked before extraction, and TPP were added before injection to each sample according to “Extraction and clean-up by the QuEChERS method.” All concentrations were calculated using a MM calibration curve with RFNo from the organic potato field as its soil was essentially pesticide-free as well, but this was only known after the method had been developed. The results are discussed in “Application of the method to real samples.”

Statistics and data presentation

Descriptive statistics (Figs. 1, 2, and 3) and statistical data evaluation were carried out with R version 4.0.2 (2020-06-22). The slopes of the calibration curves of all compounds of SR, the soil used for the validation of the method, were compared against all other soils (Table 1) with the null hypothesis H0 that the slope β1 = β2 or β1β2 = 0 to evaluate its suitability. A Student’s t-test according to Eq. (2) described in Andrade and Estévez-Pérez (2014) assuming unequal variances was used for comparison.

$$t= \frac{{b}_{1}- {b}_{2}}{\sqrt{{{s}^{2}}_{b1}+ {{s}^{2}}_{b2}}} \sim T\left({n}_{1} {+ n}_{2}\right)$$
(2)

where b is the slope of the MM calibration of the different soils, and s2b1, s2b2 are the variances of the respective slopes of the first order linear regressions. In most cases of compound calibrations, sample size was equal (n1 = n2), and t-test results were the same whether variances were pooled or not (Andrade and Estévez-Pérez 2014).

Results and discussions

Method validation

Matrix effects of different soils, matrix-matched calibrations, and the role of isotope-labeled internal standards

In GC-MS, along the way from the injector to the detector, different processes occur that enhance or suppress the signals of the target analytes, leading to a ME. On the one hand, the presence of matrix components protects the analytes from adsorption or degradation during evaporation in the inlet (Acosta-Dacal et al. 2021, Asensio-Ramos et al. 2010). On the other hand, both the chromatographic separation and signal-to-noise ratios of target analytes, as well as their ionization efficiency in the ion source might be disturbed by overwhelming matrix constituents.

The ME manifests directly and pronouncedly in the comparison of matrix-matched external calibrations. Figure 1 A and C present exemplary calibration curves of azoxystrobin and boscalid, respectively, in a pure solvent (ACN/FA) and in extracts of different soil types. In all cases, the slopes of the latter exceeded those of the former, by up to a factor of four. Such ME can effectively be compensated for by normalizing peak areas (and respective concentrations) to those of a suitable IL-IS, as illustrated in Fig. 1 B for azoxystrobin and its si IL-IS azoxystrobin-D4. No si IL-IS was available for boscalid, but normalizing it to azoxystrobin-D4 as a nsi IL-IS (Fig. 1D) reduced slope variabilities of different soil types by roughly a factor of two. Slopes and corresponding ME of both external and internal calibrations of all analytes are listed in Tables S5 and S6, respectively.

Fig 1
figure 1

Matrix-matched external (panels A and C) and internal (panels B and D) calibration curves of azoxystrobin (panels A and B) and boscalid (panels C and D). Azoxystrobin-D4 was used as a structure identical isotope labeled internal standard (IL-IS) for azoxystrobin (panel B), and as a non-structure identical IL-IS for boscalid (panel D). Matrix-matched calibrations are from different soil types: skeletic regosol (SR, black dots), rhodic ferralic nitisol, organically managed (RFNo, transparent triangle), and conventionally managed (RFNc, star), xantic ferralic nitisol (XFN, transparent diamond), and dystric cambisol (DC, black triangle). The solvent was acetonitrile with 2.5% (v/v) formic acid (ACN/FA, gray box).

Applying Eq. (1a) of the external standard method for all soil types leads to the corresponding MEext. The overall mean of the MEext of 38 compounds over the five soils was 104% and the standard deviation (STD) ±108%. Hence, the majority of the soils exhibited a significant ion enhancement (MEext > 20%, Fig. 2, white boxes). While the RFNo had the lowest mean MEext over all compounds with 34%, the other soils had mean MEext ranging from 89% (RFNc) to 146% (SR, Table S5). First and third quartiles of the MEext ranged from 11% in RFNo to 190% in SR (Fig. 2, Table S5). Hence, half of the values (1st to 3rd quartiles) span from 57% (RFNo) to as much as 254% (DC, please note the different y-axis scales in Fig. 2). Ion enhancement occurred in 45 to 97% of the 38 compounds in all soil types and went as high as 839% (Fig. 2), with dicofol exhibiting maximum MEext in all soils (MEext from 263 to 839%, Table S5). Ion suppression (ME < −20%) of all soils and compounds was not as relevant as enhancement except for pyraclostrobin representing the minimum MEext or the highest ion suppression in all soil types. The RFNo had the lowest number of significant MEext (16 out of 38 analytes) and SR and DC had all analytes with a significant MEext (Table S5). The MEext of individual analytes in the different soils was compared with their respective properties (Table 1), in particular their OM content, but no correlation with any of them could be observed. This result is unlike the one for another method recently developed in our lab, also with different soils but measured with liquid chromatography MS/MS, where the ME increased with increasing OM and concomitantly shifted towards ion suppression (Rösch et al. 2023).

Fig 2
figure 2

Matrix effect (ME) of skeletic regosol (SR), rhodic ferralic nitisol organically managed (RFNo), RFN conventionally managed (RFNc) xanthic ferralic nitisol (XFN), and dystric cambisol (DC). Please note that the y-axes have different scales. White boxes represent ME of all 38 analytes quantified with external standard calibrations without isotope labeled internal standards (IL-IS) according to Eq. (1a), and blue ones ME determined with the internal standard method and IL-IS with Eq. (1b). Eleven compounds had a structure identical isotopically labeled internal standard (si IL-IS), and 27 compounds were quantified with non-structure identical (nsi) IL-IS. The boxes represent the 25th to the 75th percentiles; the whiskers are the 10th and 90th percentiles; the dots are outliers; and horizontal, black bars in the boxes are medians.

Secondly, MEint by Eq. (1b) was calculated using the internal standard method, which entails the normalization of analyte peak areas against the ones of IL-IS. As expected, with a mean of 8% and a STD ± 50% for all 38 compounds over the five soils, the MEint were drastically smaller than the MEext. Similarly, the median MEint of compounds with si IL-IS (n = 11) were closer to 0% than those of analytes with nsi IL-IS (n = 27) (Fig. 2, blue boxes, Table S6). However, as MEint are normalized values, they do not discriminate between suppression or enhancement anymore. Means of MEint of compounds with si IL-IS ranged from -17% (XFN) to 7% (RFNo) and laid more in the negative than in the positive but within the non-significant span demanded by SANTE guideline (SANTE 2020). The means of MEint of compounds with nsi IL-IS span from 3 to 27%, again meeting the SANTE requirements, except for RFNo with 27%. First and third quartiles of the MEint of compounds with si IL-IS ranged from −30% in XFN to 14% in RFNo (Fig. 2, Table S6). Hence, the span (difference) of the 1st to the 3rd quartiles of the MEint for compounds with si IL-IS of SR, RFNo/c, and DC was as narrow as 14 to 17%, except for XFN that was 37%. Although the 1st to the 3rd quartiles of the MEint for compounds with nsi IL-IS was broader (−31 to 63%, both in XFN) than for the ones with si IL-IS, the behavior was the same because the span of MEint was 34 to 54% for SR, RFNo/c, and DC with the exception of XFN (94%). Of course, the measurement by GC-MS/MS (“Quantification of pesticides and transformation products”) invoked the different number of compounds in the IL-IS groups, 11 versus 27, which obviously also made the variation of the MEint smaller in the former than the latter group. However, the comparison of MEint of nsi IL-IS with the MEext (Fig. 2; no IL-IS) revealed the advantage of using IL-IS, even if nsi: the MEext were factors bigger than those of analytes with nsi IL-IS, except for RFNo (Fig. 2). Hence, the use of IL-IS, irrespective of si or nsi, did not only compensate the ME more than 10 times in comparison to the external standard method (see above) but also its variability, which is in line with that of Homazava et al. (2014).

The assumption for single analytes was again that the MEint for compounds with si IL-IS would be smaller than for those with nsi IL-IS. Indeed, no significant (according to SANTE) MEint were observed for 2,6-dichlorobenzamid, metribuzin desamino (DA), metalaxyl, fluopicolide, epoxiconazole, and azoxystrobin in all five soil types. These compounds, except epoxiconazole, had an si IL-IS. In contrast and as expected, compounds with nsi IL-IS as prosulfocarb and oxyfluorfen showed a significant MEint in all five soils (Table S6) and fluazifop-p-butyl, trifloxystrobin CGA, benalaxyl, endosulfane sulfate, tebuconazole, bifenthrin, fenamidone RPA, and pyraclostrobin in four soil types. Pyraclostrobin showed significant MEext and MEint in all soils of the former and four soils of the latter method. Between 12 and 20 compounds per soil type had no significant |MEint| (< 20%) (Table S6) which is according to what other authors reported (Bortolozo et al. 2016; Łozowicka et al. 2017).

The differences of slopes in calibration curves evoked above by different soil matrices could be substantially reduced by IL-IS, be it si or nsi (Fig. 1). However, the slopes of the MM calibration curves calculated with the internal standard method of the different soils still needed to be compared to know whether SR, the soil validated here was suitable, especially as RFNo, is the one used in routine. This was statistically expressed in “Statistics and data presentation” by the H0 that the slope β1 = β2 or β1β2 = 0 of an analyte in soil1 and soil2, where the desired outcome would be H0 not rejected and calculated according to Eq. (2). The slopes of compounds with a si IL-IS agreed in more cases, meaning they showed no significant difference in the t-test, than compounds with nsi IL-IS. Specifically, SR versus RFNo slopes showed in 82% of compounds with si IL-IS (n = 11) and no significant difference, and in 30% with nsi IL-IS (n = 27), versus RFNc slopes 64% and 37%, versus DC 82% and 56%, and versus XFN 18% and 11%, respectively. Slopes of SR with XFN correlate the least, which is also seen in Fig. S5. The smallest correlation coefficient r for the other soils was with the solvent (ACN_FA), which again underlines the necessity for MM calibrations. Furthermore, 64% of compounds with si IL-IS show no significant differences of slopes when RFNo is compared to RFNc and 33% of nsi IL-IS. This is also reflected in the correlation coefficient r = 0.89 in Fig. S5 when RFNo and RFNc are correlated. Hence, even the same soil type but managed differently exhibited significantly different slopes, which might be due to the different soil properties of the RFN (Table 1). This slope comparison including the aforementioned findings emphasize the use of as many as possible IL-IS even for analytes with nsi IL-IS when dealing with different soils and relativized the question of SR’s suitability.

Precision (repeatability (RSD r) and reproducibility (RSD R) and trueness for skeletic regosol (SR)

Table 2 lists accuracy expressed as precision and trueness obtained with the here presented method. The means of repeatability (RSDr) for the 10, 25, and 50 ng/g fortification levels were 9.2, 7.3, and 5.5%, respectively, with a clear tendency of higher precision with higher fortification concentration. These results met well the quality criterion of the SANTE guideline (SANTE 2020) of RSDr ≤ 20%. Only metribuzin DADK and bifenthrin slightly exceeded 20% RSDr at 10 ng/g (20.4 and 20.8%), while their RSDr of 25 and 50 ng/g complied with the quality criterion (SANTE 2020). The RSDr at different fortification levels were lowest for 2,6-dichlorobenzamid (1.6, 2.5, and 2.8 ng/g) and highest for bifenthrin (20.8, 15.3, and 9.3 ng/g). The precision in terms of reproducibility (RSDR) is generally lower, i.e., has a higher percentage than RSDr because it is obtained under changing conditions (SANTE 2021), in this case at different days. Indeed, the RSDR means of all compounds and fortification levels were higher than for RSDr and were 15.5, 11.8, and 7.3% again with increasing precision for increasing concentration. Minima ranged from 3.1% (metribuzin DADK at 50 ng/g) to 4.3% (2,6-dichlorobenzamid at 25 ng/g), and maxima from 15.4% (dicofol at 50 ng/g) to 36.6% (bifenthrin at 10 ng/g). The SANTE guideline for food and feed (SANTE 2021) sets the within-laboratory reproducibility (RSDwR, which is the same as our RSDR) at ≤ 20%. All of the values met this criterion at the highest fortification level, 34 out of 38 compounds at 25 ng/g and 30 out of 38 at the lowest level. This is a satisfactory performance for concentrations at the ppb level. Horwitz considers the within-laboratory RSD (RSDwl), which is corresponding to our RSDR, to be two-thirds of the between laboratories (RSDbl (%) = 2(1−0.5×logC), where C is the concentration) and thus around 30% for organic compounds (Horwitz 1982). So, also according to Horwitz, the RSDR of this study were good with only three compounds at the lowest fortification level (10 ng/g) exhibiting an RSDR >30%: endosulfane sulfate (35.1%), bifenthrin (36.6%) and pyraclostrobin (30.9%). Such precision ranges also compare well with those of earlier papers in this domain (Łozowicka et al. 2017, Rösch et al. 2023).

Table 2 Method validation figures of merit with relative standard deviation (RSD) of repeatability (RSDr) and reproducibility (RSDR) as measure of precision, relative recovery as measure of trueness, limit of quantification (LOQ), and linearity of matrix-matched calibration curves in skeletic regosol. Means were derived from quadruplicates of each fortification level and validation parameter. Compounds, type of pesticides, and the corresponding isotopically labeled internal standards (IL-IS) group are shown in columns 2–4 from the left.

Absolute recoveries of IL-IS spiked to SR were 92% ± 20% (mean ± STD of the means of three fortification levels in Table S7). Although the SANTE guideline (SANTE 2020) sets the limit for relative recoveries, we consider 70–120% for absolute recoveries as satisfactory too. Exceptions were n-methyl-metribuzin-D3 with an absolute recovery of 43% ± 4% and azoxystrobin-D4 of 123 ± 23% (Table S7). Absolute recoveries from RFNo under routine method execution are presented in “QA/QC in RFNo soils under routine operation.” The application of a larger number of IL-IS, be it si or nsi, and reporting of their absolute recoveries represents an advantage over other multi-residue methods that only used a few internal standards or IL-IS. For instance, Vu-Duc et al. (2023) established a method for 21 organochlorinated compounds by GC-MS/MS in different soil samples (n = 6) using α-HCH-D6 as internal standard, but absolute recovery was not evaluated. Acosta-Dacal et al. (2021) presented a method for 51 pesticides by GC-MS/MS, and only two IL-IS were used. However, the paper does not report any result of absolute recoveries to compare our results with.

For relative recoveries, representing accuracy in terms of trueness, 100% corresponds to the “true” value. The means of relative recovery from fortified samples in SR ranged from 89.9 to 91.2% (min = 50.2%, max = 121.2%, Table 2). The SANTE guideline (SANTE 2020) proposes relative recoveries between 70 and 120% for matrices other than food/feed of plant and animal origin. Hence, the relative recoveries were satisfactory. Almost all analytes met the criterion, except bifenthrin that presented a mean relative recovery of 62% and was similarly low in all three fortified levels (67, 61, and 58%, Table 2). This might be due to its high octanol-water partition coefficient (log KOW) of 6.6 (Table S2) indicating strong sorption to, and/or incomplete extraction from the soil’s organic carbon. Metribuzin DADK (50% at 10 ng/g), chlorothalonil (121% at 25 ng/g), dicofol (65% at 10 ng/g), and pyraclostrobin (65% at 50 ng/g) showed relative recoveries outside the acceptable range for individual spike levels. However, their overall mean recovery of these analytes was still in the range of 70–120%.

These results are similar to the ones other authors reported. In contrast to the SANTE guideline (SANTE 2020), some researchers considered the method for routine analysis in GC-MS/MS fit for purpose when compounds had a mean relative recovery between 60 and 140% if the compound had a good precision (Acosta-Dacal et al. 2021, Łozowicka et al. 2017). For instance, Łozowicka et al. (2017) employed the QuEChERS method without cleanup for 216 pesticides and metabolites, analyzed the extract with GC-MS/MS, obtained a recovery in a range of 71–120% for all pesticides, except for five of them (recovery between 65 and 69%) and considered this result satisfactory. Acosta-Dacal et al. (2021) found recoveries in the range of 70–120% for 59% of the analytes quantified by QuEChERS and GC-MS/MS (51 compounds in total) for all fortification levels ranging from the highest concentration (50 ng/g) to the LOQ set for each analyte. However, 18 of them exhibited recoveries over 120% (121–131%), and three had recoveries below 70% (60–69%). These compounds were nevertheless included in the method for routine analysis.

In summary, precision (RSDr ≤ 20% and RSDR ≤ 30%) and relative recoveries (60–140%) (Acosta-Dacal et al. 2021, Łozowicka et al. 2017, Horwitz 1982) were satisfactory, for mostly compounds. Some minor exceedances for relative recoveries were recorded for metribuzin DADK, chlorothalonil, and dicofol but are nevertheless trustful due to the good precision. If relative recovery and precision for a compound were low (i.e. bifenthrin and pyraclostrobin), concentrations between LOQ and lowest spiking limit (10 ng/g) need to be interpreted with care for SR. Therefore, soil type specific QA/QC should be included when applying method for soil monitoring.

LOQ and linearity for SR

The LOQs of the different pesticides spiked to SR ranged from 0.1 to 25 ng/g with a mean of 5 ng/g (Table 2). All values were in the same order of magnitude as those of other soil studies using tandem MS coupled to GC (Acosta-Dacal et al. 2021, Łozowicka et al. 2017, Słowik-Borowiec et al. 2022). All compounds had an LOQ between 2 and 500 times lower than required by the SANTE guideline (50 ng/g) (SANTE 2020), except metribuzin DADK showing a peak in the unspiked SR (red line of first panel in Fig. S4) close to LOQ (10 ng/g, Table 2), and are thus fit-for-purpose as a pesticide soil exposure assessment tool. Linearity data (equation and R2) of calibration curves were from concentrations ranging from 5 to 100 ng/mL in the SR matrix obtained in triplicates using the internal standard method (Table 2). Coefficient of determination (R2) were satisfactory (i.e., > 0.99) for all analytes.

Selectivity and specificity for SR

The lowest fortification level (10 ng/mL) in SR was compared with the lowest calibration point in solvent ACN/FA (5 ng/mL) and with an uncontaminated SR soil (blank, Fig. S4). Transitions of qualifier ions were depicted too (Fig. S3). Each analyte could be detected without interference from impurities (< 30% LOQ), degradation products or excipients present in the matrix except metribuzin DADK (“LOQ and linearity for SR”).

Application of the method to real samples

QA/QC in RFNo soils under routine operation

The above outlined method was routinely applied to a subset of Cuban soils, and the results are presented here. Real samples were mainly from RFN soil with the majority RFNc, but QA/QC was performed with RFNo due to reasons outlined in Soils, sampling, and sample preparation. Absolute recoveries from RFNo (IL-IS mix spiked before extraction, TPP spiked before injection) were satisfactory (overall mean ± STD = 102% ± 10%; derived from of the means of two extractions series) for all IL-IS (Table S8) and in the same range as those of SR (see above), but with a smaller STD. Relative recoveries of most analytes in RFNo laid within the satisfactory range of 60 to 140%, with good reproducibility (RSDR < 30%) (Table S9). The lowest spiked level (10 ng/g) had the highest number of compounds with recoveries < 60% or > 140% (metribuzin DADK, trifloxystrobin CGA, carfentrazone-ethyl, benalaxyl, fenamidone, fenamidone RPA, and spirotetramat). Spirotetramat showed a low recovery (< 60%) and high reproducibility value (RSDR > 30%) at 10 ng/g. Concentration of compounds with recoveries (< 60% or > 140%) and RSDR (> 30%) out of range need to be interpreted carefully (Table S9).The SR showed slightly less compounds outside the desired range (Table 2), which is a better performance than RFNo. The LOQ of RFNo ranged from 0.1 to 25 ng/g, which were the same min and max as for SR, but the former soil had a slightly lower mean (Table S9 and Table 2). Different blank samples in RFNo showed signals below LOQ, except for metribuzin DADK. Similar to SR (“LOQ and linearity for SR” and “Selectivity and specificity for SR”), Sand and RFNo blanks run under routine revealed metribuzin DADK clearly above the LOQ of 5 ng/g for RFNo (Table S10). Sources of cross contamination were searched, but no evidences were found. Due to the good validation results for this analyte, precision and accuracy in the required ranges, R2 > 0.99, selectivity and specificity given, we did not cancel it from the compound list in the routine analysis but referred to the sand and blank samples in every batch of routine analysis. Thus, concentrations of metribuzin DADK in real samples were only reported when higher than sand and RFNo (blank) samples.

In summary, QA/QC for both, SR and RFNo were all within the SANTE guidelines (SANTE 2020, 2021). The comparable figures of merits (i.e., absolute and relative recoveries, LOQ, spans of the 1st to the 3rd quantiles of ME for compounds with si and nsi IL-IS) obtained for RFN (i.e. the most common, though frequently pesticide contaminated, and agricultural soil type) and SR (a Cuban soil-type free of pesticides) lend credit to our decision to use the latter for method validation.

Routine analysis

Based on the validation results and quality control presented above, our method proofed fit-for-purpose for 38 analytes. These were monitored in agricultural soils for potato production from Mayabeque, Cuba. A total of 29 compounds were detected and quantified above the LOQ in 30 soil samples (Fig. 3, RFNc n = 26 and RFNo n = 4). Only four of these compounds presented recoveries < 60% or > 140% (Table S9), with good precision (RSDR < 30%) at the lowest spiked level (10 ng/g). Their concentrations between LOQ and 10 ng/g were interpreted with care. However, all compound’s concentration values were kept and used (Fig. 3) as such for further data interpretation. Ninety percent of the soil samples had at least one CUP and/or TP residue in a range of 0.3–306 ng/gdry weight (dw) (median = 10 ng/gdw). Pesticide residues most frequently detected (% in soil samples, min–max concentration, Fig. 3) were dicofol (73%, 0.8–35 ng/gdw), s-metolachlor (70%, 1.5–306 ng/gdw), dimethomorph (60%, 1.7–59 ng/gdw), and azoxystrobin (57%, 0.5–198 ng/gdw). Only three (fluopicolide, chlorothalonil, and dimethomorph) of the ten most frequently applied pesticides in potato in Cuba (mancozeb, chlorothalonil, potassium phosphite, propineb, dimethomorph, fluopicolide, propamocarb hydrochloride, valiphenalate, folpet, and glyphosate, (Barroso Planas 2014)) coincide with the investigated ai (marked with a “+” in Fig. 3). All soil samples of conventionally managed fields exhibited co-occurrence of three or more CUP and/or TP residues. In contrast, only two CUPs could be detected in one soil sample of the organically managed sites: chlorothalonil (3.9 ng/gdw) and azoxystrobin (8.1 ng/gdw). The residues detected and quantified in this study were magnitudes lower than reported by Tan et al. (2020) who quantified 41 pesticides in a tropical soil of China in a concentration range from not detected to 11.7 mg/kg, with a median of 0.20 mg/kg. The CUP concentrations in this study were also lower than the ones reported by Silva et al. (2019) in temperate soils from 11 countries of Europa (median = 0.15 mg/kg, maximum = 2.9 mg/kg). Additionally, the sum of co-occurring pesticides exceeded in 63% (n = 19) and 67% (n = 20) of soil samples the Czech limit (100 ng/g) (MoECR 1994) and the Netherlands limit (70 ng/g) (VROM 2006), respectively. Currently, soil guidance values are yet to be established in most of countries, including Cuba.

Fig 3
figure 3

Pesticide residues quantified in 30 Cuban soil samples. Blue bars (left y-axis) indicate how many times a respective compound was detected. The right y-axis reports median (empty red circles) and maximum (empty black triangles) concentrations (ng/gdw) in the soil samples. From the top-ten most applied pesticides to potato crops according to Barroso Planas (2014) compounds that coincided with the investigated ai are marked with a “+” in x-axis labels.

In summary, the here developed analytical method could be successfully applied to real soil samples to quantify CUPs and TPs from Cuban agroecosystems. This method will now be applied to a larger monitoring campaign where soils of agricultural fields with potato plants were sampled over 4 consecutive years. The results will be published in due course and time.

Applicability of the method to other low-income countries

A QuEChERS extraction and GC-MS/MS based method was thoroughly validated for 31 CUPs and seven TPs in SR, a tropical soil from Cuba. Additionally, MEext/int were evaluated for five different Cuban soils. All figures of merit proved satisfying for all compounds. The method was successfully applied to 30 soil samples (all RFN) and demonstrated to be well suitable for monitoring pesticide residues in this matrix. We consider it fit to transfer to other laboratories, contributing to the establishment of soil monitoring programs in developing and tropical countries, in which studies are limited. Often, no MS is available in such labs but given the importance of internal standards, other, nsi, thermostable compounds with similar volatility as the analytes of interest could be used. Therefore, monitoring studies are crucial for future establishment of RGV in agricultural soils of tropical countries, including Cuba. Moreover, we present a solid research tool with less means than in high-income countries at comparably higher demanding environmental conditions, such as tropical humidity and temperature.