Background

There is a growing debate on the role of soil ecotoxicology and biodiversity in risk assessment [1,2,3]. Soils contain abundant and diverse microbial populations that facilitate ecosystem functions such as nutrient cycling [4]. The registration and approval of substances such as chemicals, pesticides and biocides therefore requires tests for potential effects on these organisms [5,6,7]. Such tests focus on the evaluation of microbial nitrogen transformation as per OECD Guideline 216 [8]. This approach provides information about the populations of organisms capable of ammonification and nitrification.

Following a request from the European Food Safety Authority (EFSA), the Panel on Plant Protection Products and their Residues prepared an opinion document on the science behind the risk assessment of plant protection products, evaluating the tests used to determine effects on soil-dwelling organisms [1]. The document identifies specific protection goals to which various phylogenetic and functional microbial groups contribute, including biodiversity, genetic resources, cultural services, nutrient cycling, and pest control. The document recommends that the most relevant community-based microbial test systems should cover the broadest possible range of metabolic processes without compromising test sensitivity. However, current community-based functional tests for effects on nitrogen transformation (mandated in Europe and North America) and CO2 evolution as a measure of general microbial activity (mandated in North America) are crude and lack the sensitivity to indicate toxic effects against essential microbial processes carried out only by a small number of specialized microorganisms.

The EFSA document refers specifically to the MicroResp test system, which measures microbial respiration by detecting the evolution of CO2 [9]. Although this system has not yet been used to study the effects of pesticides on soil microbial processes, the EFSA document recommends further evaluation. The impact of soil type, assay duration and soil water content on MicroResp results have been investigated thoroughly, and the method is considered suitable for soil ecotoxicity studies [10]. The MicroResp system measures CO2 evolution in a microtiter plate format. It was developed for short-term measurements (4–6 h) with initial and endpoint readings [9, 11]. However, OECD Guideline 217 [12] recommends that soil respiration should be measured continuously or at hourly or two-hourly intervals for 12 h after the addition of glucose, and is based on the testing of larger quantities of soil (~ 100 g), which limits the number of replicates. In contrast, the 96-well format of the MicroResp system works with smaller samples and thus allows many different carbon sources to be used in parallel to test for substrate-induced respiration activity, thus creating a metabolic profile of the microbial population.

Other miniaturized tests, such as those complying with ISO test guidelines [13, 14], focus on the measurement of exoenzyme activity. Enzymes in the soil are often associated with proliferating cells, but they may also be free in solution or bound to humic colloids and clay minerals [15]. The dynamics of these different enzyme pools and their contribution to ecosystem processes remain uncertain [16]. Enzyme stability is influenced by geochemical factors, by the presence of proteases, and by the intrinsic properties of the enzyme itself. For example, β-glucosidase is a stable enzyme, whereas the levels of α-glucosidase fall quickly after cell death indicating that constant synthesis is necessary to balance its rapid degradation [17]. An inter-laboratory comparison of the MicroResp system and a multi-enzyme assay using fluorogenic substrates showed that both methods are prone to inter-laboratory variation [18]. However, following the same strict protocol can minimize such differences [19].

The development of miniaturized, high-throughput test systems has increased our understanding of microbial diversity in the environment [20,21,22]. However, the relationship between microbial diversity and soil functions remains unclear because the same functions can be fulfilled by many taxonomic groups [4, 23, 24]. Confining the functions of specific phylogenetic units is still a challenge, and the EFSA opinion paper states that although various methods can be used to measure soil microbial properties in terms of abundance, functions and community structure, further adjustments are necessary for use in risk assessment [1].

The registration and approval of substances currently requires tests to determine the impact on soil nitrogen transformation, as per OECD test guideline 216 [8]. Soil is incubated with the test substance and an organic nitrogen source, and periodic measurements are taken to record the concentrations of ammonium, nitrite and nitrate. These nitrogen-containing inorganic compounds are generated by several transformation processes (e.g., microbial ammonification, nitrification and nitrogen fixation) and the test therefore integrates a number of microbial activities. In contrast to the method described in the OECD test guideline, the potential ammonium oxidation (PAO) test [25] is suitable for the assessment of toxic ion-releasing substances [26]. A soil sample is mixed with the test substance, and a nutrient source (NH4+) is added at three time points so that NO2 (as indicator of transformation activity) can be determined after a short incubation, generating a profile of stimulation and inhibition during the 28 days of the test.

To assess the suitability of different tests for functional and genomic microbial diversity in the context of risk assessment, we compared the PAO test [25] with the MicroResp system [9] and a colorimetric test for exoenzyme activity [13]. For the MicroResp and exoenzyme tests, we selected four substrates (two of which were directly comparable between the tests). We complemented the three functional tests with a survey of bacterial genomic diversity, sampling the 16S rRNA gene by next-generation-sequencing (NGS). In light of current harmonization efforts, the tests currently applied to plant protection products will probably extend to cover other substances, including biocides and chemicals. Therefore, instead of a conventional pesticide, we selected a silver nanomaterial as an environmentally relevant model substance [27] with well-known antibacterial activity [28,29,30].

Methods

Soil

The experiments were carried out using the reference soil RefeSol 01A (sieved ≤ 2 mm), which is a loamy, moderately acidic sand (73% sand, 22% silt and 5% clay; pH 5.6) with a low organic matter content (Corg = 1.1%). Appropriate amounts of soil were sampled 1–4 weeks before the test. If the soil was too wet for sieving, it was dried at room temperature to 20–30% of the maximum water holding capacity (WHCmax) with periodic turning to avoid surface drying. If the tests did not start immediately after sieving, the soil was stored in the dark at 4 °C under aerobic conditions.

Spiking and incubation

The silver nanomaterial NM-300K was used as proposed by the OECD Sponsorship Programme [31]. This is a colloidal silver dispersion with a nominal silver content of 10% (w/w) and a particle size of ~ 15 nm with a narrow size distribution (99%). A second particle size of 5 nm, which is much less abundant (1%), was identified by transmission electron microscopy [32]. The particles are dispersed in a mixture of a stabilizing agents (NM-300K DIS) comprising 4% (w/w) each of polyoxyethylene glycerol tri-oleate and polyoxyethylene sorbitan monolaurate (Tween-20). We added 8 mL of deionized water to a vial containing 2 mL NM-300K and homogenized the suspension in an ultrasonic bath. The required amount was mixed with 10 g air-dried soil and added to 990 g soil (dry matter) which had been adjusted to 10–15% WHCmax. After thorough mixing, the water content was adjusted to 45% WHCmax. Five test concentrations of NM-300K were prepared (0.19, 0.56, 1.67, 5 and 15 mg/kg) as well as two replicates of the untreated control. The soil (1000 g dry matter per replicate) was incubated under aerobic conditions at 20 ± 2 °C for 4 weeks. Every 7 days, any evaporated water was replaced. Microbial activity was determined after 7, 14 and 28 days of incubation. Genomic diversity was measured after an incubation period of 28 days.

Microbial test systems

PAO test

The short-term PAO test was performed according to ISO guideline 15685 [25]. Four 250-mL Erlenmeyer flasks per treatment were filled with 25 g dry matter of spiked soil along with four matching controls. The vessels were incubated in the dark at 20 ± 2 °C for 24 h and then mineral test medium (0.56 mM KH2PO4, 1.44 mM K2HPO4, 5 mM NaClO3, 1.50 mM (NH4)2SO4) was added to make up the volume to 100 mL. The slurries were incubated on an orbital shaker at 25 ± 2 °C, and 10-mL samples were removed after 2 and 6 h. The samples were supplemented with 10 mL 4 M KCl and filtered, and the nitrite levels in the filtrate were determined by spectrophotometry at 530 nm.

MicroResp system

Substrate-induced respiration activity was measured using glucose and cellobiose to represent the carbon cycle, and the amino acids alanine and cysteine to represent the nitrogen and sulfur (cysteine) cycles. The test was performed according to the manufacturer’s protocol. For each substrate, eight replicates were prepared. The absorbance of the detection plates containing a detection gel with an indicator dye was measured at 570 nm before contact with the Deepwell plates. After an incubation period of 6 h at 25 °C, the detection plates were scanned again, and the quantity of evolved CO2 was calculated.

Exoenzyme test

The activities of β-glucosidase, arylsulfatase, arylamidase and phosphatase were measured using colorimetric substrates in 96-well plates according to the ISO test guideline 20130 [13]. For each test, we transferred two replicate aliquots of soil dry matter (4 g) to Erlenmeyer flasks and added 25 mL deionized water. The mixture was homogenized for 10 min using a magnetic bar. We transferred 125-µL aliquots from these mixtures to the wells of microtiter plates, with separate plates for each enzyme. We prepared four wells per sample for each enzyme, one as a control and three for the substrate. The plates were incubated for 30 min (phosphatase), 1 h (β-glucosidase) or 4 h (arylsulfatase) before adding 25 µL CaCl2 and 100 µL Tris (pH 12) as stop reagents to each well, and 23 µL of substrate to the control. The plates were then centrifuged (5 min, 1500g, 20 °C) and we transferred 200 µL of the supernatant from each well to new plates before reading the absorbance at 405 nm. The reaction with arylamidase was stopped after 2 h by adding 150 µL 96% ethanol. After adding substrate to the control followed by centrifugation as described above, we transferred 100 µL of the supernatant from each well to new plates, added 100 µL acidified ethanol and 100 µL p-dimethylaminocinnamaldehyde (DMCA), incubated the plates for 20 min at room temperature in the dark and read the absorbance at 540 nm. Enzymatic activity was calculated according to ISO test guideline 20130.

Statistical analysis of microbial activities

Statistical analysis and the calculation of EC10 and EC50 values were carried out using ToxRat Pro v3.3.0 for ecotoxicity response analysis (ToxRat Solutions, Alsdorf, Germany). The normal sigmoid model (three parameters) was selected for the calculation.

Microbial genomic diversity

DNA was extracted from each soil sample (~ 0.25 g fresh weight) in duplicate using the PowerSoil DNA Isolation Kit (MoBio Laboratories, Carlsbad, CA, USA). The yield and purity were measured using a NanoDrop ND-2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). The duplicate DNA extracts for each soil sample were mixed and pooled to generate a unique sample for NGS analysis. DNA from each treatment was submitted to Eurofins Scientific (Luxembourg) for 16S amplicon sequencing using the Illumina Miseq platform. Briefly, amplicons of the hypervariable regions V3–V4 were generated by PCR using forward primer 5′-TAC GGG AGG CAG CAG-3′ [33] and reverse primer 5′-CCA GGG TAT CTA ATC C-3′ [34]. In a second round of PCR, the adapter overhang nucleotide sequences for the forward and reverse index primers were added, before amplicons were purified using AMPure CP beads (Beckman Coulter, Brea, CA, USA) and amplified by PCR using Nextera CT index primers (Illumina, San Diego, CA, USA). The indexed samples were purified using AMPure XP beads (Beckman Coulter), quantified and pooled in equal quantities. The pooled library was quantified using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) and sequenced using v3 chemistry (2 × 300 bp paired-end reads). The sequences were then demultiplexed, primer sequences were clipped, forward and reverse reads were merged, and quality filters were applied. Reads containing ambiguous bases and chimeric reads were removed [35]. The remaining high-quality reads were processed using minimum entropy decomposition [36, 37], allowing dataset decomposition at single-nucleotide resolution. Taxonomic information was assigned to each operational taxonomic unit (OTU) using DC-MEGABLAST (reference database: NCBI_nt). Further processing of OTUs was performed using QIIME [38].

Microorganisms were identified up to the order level. Based on these values, the Shannon index (H′) and the species evenness (EH) were calculated to describe diversity, and Stander and Steinhaus similarity indices [39] were determined in Community Analysis v4.3. To identify changes due to the NM-300K treatment, each order in the treated and untreated samples was normalized to the mean value of the controls. Deviations between the treatments and controls were marked using colors representing six ranges (< 0.01, < 0.1, < 0.5, 0.5–2, > 2, and > 10, the first three values corresponding to reductions of > 99%, > 90% and > 50% in the treated samples vs controls, and the last three values representing fold-increases in the treated samples vs controls). EC10 and EC50 values were calculated as described above.

Results

Functional tests

A clear concentration–effect relationship was observed in the PAO test (Fig. 1, Additional file 1: Table S1) with significant effects recorded at NM-300K concentrations ≥ 1.67 mg/kg. At a concentration of 1.67 mg/kg, the effect of NM-300K exposure increased over time, but the increase was less pronounced at higher concentrations (5 and 15 mg/kg) because strong effects were already observed at the start of the test. Given the strong inhibition of PAO activity at the two highest NM-300K concentrations, the EC values declined only slightly during the incubation period of 28 days, with narrow but nevertheless overlapping confidence intervals (Table 1).

Fig. 1
figure 1

Effect of the silver nanomaterial NM-300K on potential ammonium oxidation activity. Statistical significance (α = 0.05) is indicated on the bars representing inhibition: *0.01 ≤ p < 0.05; **0.001 ≤ p < 0.01; ***p < 0.001

Table 1 EC10 and EC50 values [mg/kg] of the silver nanomaterial NM-300K for the inhibition of potential ammonium oxidation activity

We also observed concentration–effect relationships for the substrate-induced respiration activities measured using the MicroResp system. The relationships were clear for the tests on all four substrates at the start of the experiment, but were less pronounced for the amino acid substrates at the end (Fig. 2, Additional file 1: Table S2). The concentration–effect relationships for the two sugars were similar at all three sampling points, with clear concentration–effect curves observed after incubation periods of 7 and 28 days. The concentration–effect relationships for the two amino acids were similar after 14 and 28 days but differed on day 7. After 14 days, effects on all four substrates were observed mainly for the two highest test concentrations. Due to the limited concentration–effect relationships, EC10 and EC50 values could not be calculated for every carbon source and sampling interval. The values for glucose and cellobiose indicated slightly decreasing toxicity, with overlapping confidence intervals (Table 2).

Fig. 2
figure 2

Effect of the silver nanomaterial NM-300K on substrate-induced respiration activity. Statistical significance (α = 0.05) is indicated on the bars representing inhibition:*0.01 ≤ p < 0.05; **0.001 ≤ p < 0.01; ***p < 0.001

Table 2 EC10 and EC50 values [mg/kg] of the silver nanomaterial NM-300K for the inhibition of substrate-induced respiration activity

The effects on exoenzyme activities varied over time and were specific for each enzyme (Fig. 3, Additional file 1: Table S3). Concentration–effect relationships were clearly observed for the glucosidase and arylsulfatase activities, with decreasing effects during the incubation period for the two lower test concentrations. EC10 values could not be calculated for every sampling interval, and EC50 values ranged from 2 to 22 mg/kg for both enzymes (Table 3). The effects on arylamidase and phosphatase activities varied over time, but there was no evidence of concentration–effect relationships so we were unable to calculate EC values. Medium concentrations of NM-300K had the greatest impact on arylamidase activity, with a concentration-dependent decrease at lower and higher concentrations at the end of the test. Phosphatase activity was not affected at the two lowest NM-300K test concentrations after 28 days, whereas the three higher concentrations each caused ~ 25% inhibition despite the ~tenfold concentration range (1.67–15 mg/kg).

Fig. 3
figure 3

Effect of the silver nanomaterial NM-300K on microbial exoenzyme activity. Statistical significance (α = 0.05) is indicated on the bars representing inhibition: *0.01 ≤ p < 0.05; **0.001 ≤ p < 0.01; ***p < 0.001

Table 3 EC10 and EC50 values [mg/kg] of the silver nanomaterial NM-300K for the inhibition of the enzymes glucosidase and arylsulfatase

Genomic diversity test

We used NGS to survey bacterial diversity by sequencing the 16S rRNA gene in each soil sample (Fig. 4, Additional file 1: Table S4). Bacterial orders differing in representation between samples spiked with NM-300K and untreated controls are highlighted in Table 4. The total abundance of bacteria across all samples varied by a factor of 1.7, but there was no overall concentration–effect relationship. In the two control replicates, there was a < 15% difference in abundance between samples in 60% of the bacterial orders. Increasing concentrations of NM-300K led to the depletion of some orders but the enrichment of others. For example, the abundance of Desulfovibrionales in the control samples was low, and this order was eliminated in all samples containing NM-300K. Propionibacteriales were depleted by the nanomaterial in a concentration-dependent manner, with effects observed even at the lowest test concentration, whereas Rhodospirillales were more resilient and depletion was only evident at higher test concentrations. For three orders (Propionibacteriales, Rhodospirillales, and Gemmatimonadales) the progressive depletion with increasing concentrations of NM-300K allowed us to calculate reliable EC50 values of 0.72–19.5 mg/kg, confirming the order-specific differences in sensitivity (Table 5). As well as showing clear concentration–effect relationships, some orders (e.g., Sphingobacteriales and Cytophagales) showed the maximum deviation from the control at medium concentrations of NM-300K. The Steinhaus similarity index (based on absolute values) and the Stander similarity index (based on normalized values) both indicated slight differences from the controls at the two highest test concentrations (Fig. 5). However, we observed no differences in the Shannon index or evenness as indicators of diversity (Additional file 1: Figure S1).

Fig. 4
figure 4

Effect of the silver nanomaterial NM-300K on microbial genomic diversity following an incubation period of 28 days (c = class; o = order)

Table 4 Effect of the silver nanomaterial NM-300K on bacterial orders, marked according to the fold-difference from the mean value of the two normalized controls (decreases are indicated in red and yellow, and increases in blue)
Table 5 EC10 and EC50 values [mg/kg] of the silver nanomaterial NM-300K for different bacterial orders
Fig. 5
figure 5

Steinhaus and Stander similarity indices [39] for the taxonomic distribution of bacteria in control samples and samples treated with different concentrations of the silver nanomaterial NM-300K

Discussion

Overview

To address the recommendations published in a recent EFSA scientific opinion document [1], we compared the sensitivity of three tests for microbial activity in response to the effects caused by a model silver nanomaterial, NM-300K. We compared the PAO, MicroResp and exoenzyme tests for microbial functions, as well as the NGS-based analysis of bacterial genomic diversity. Although the EFSA document refers specifically to pesticides, current efforts to harmonize regulatory guidelines indicate that the same recommendations may, in the future, be applied to the risk assessment and regulation of other kinds of chemicals, including silver nanomaterials. The assessment of microbial test results in the context of substance regulation differs according to the nature of the test substance. If agrochemicals are tested, the effects in treated and control samples are compared with a threshold value of 25% [40]. For other substances, endpoints such as EC values are calculated [8, 12, 41]. Concentration–effect tests are also recommended for agrochemicals to determine the magnitude and temporal scale of their effects [1].

The PAO test

The PAO test focuses on a small group of specialized microorganisms with low diversity. We observed statistically significant differences between control and test samples at NM-300K concentrations ≥ 1.67 mg/kg, with a slightly increasing effect over time. These results, and the corresponding EC values, are in agreement with our previous studies, confirming the reproducibility of the PAO test [42,43,44]. Despite the advantages of the PAO test, OECD Guideline [8] sets out a different procedure to be used for the registration or approval of substances. This involves the mixing of spiked soil samples with a complex nitrogen source, and as well as nitrification it also detects the degradation of organic carbon sources. The inorganic ammonium, nitrite and nitrate concentrations used as endpoints therefore reflect the activity of a diverse microbial population and multiple microbial transformations. Accordingly, the PAO test is suitable as a specific indicator for toxicity whereas the nitrogen transformation method described in OECD Guideline 2016 provides more general information on the microbial population and nutrient cycles in the soil, at the expense of assay sensitivity. The PAO test is also more generally applicable, being suitable not only for testing organic compounds but also ion-releasing substances, whose toxicity is underestimated when the soil contains significant quantities of organic matter because the latter adsorbs ions and therefore reduces their bioavailability [26].

Substrate-induced respiration test

The EC10 and EC50 values in the MicroResp system revealed that substrate-induced respiration is less sensitive than PAO activity when testing for the effects of silver nanomaterials, and the difference between the two methods increased over time. The PAO test showed an increase in toxicity with longer incubation periods, reflecting the continued detrimental effects of exposure to NM-300K among the specialized microbes capable of PAO activity. In contrast, glucose-induced respiration showed a decline in toxicity over time, reflecting widespread functional redundancy for this process within the microbial community. Species that are less sensitive to NM-300K are likely to metabolize glucose, and their proliferation would lead to the recovery of microbial respiration.

One of the drawbacks of the MicroResp system is the 6-h interval between measurements. According to OECD Guideline [12], the respiration rate should be measured continuously or at hourly/two-hourly intervals for 12 consecutive hours in order to calculate a mean rate. Furthermore, ISO 17155 [45] states that respiration rates used to calculate the mean must be near constant, thus excluding the lag phase and exponential growth phase. The MicroResp system measures the amount of CO2 produced at the test start and endpoint. There is no information about the constancy of respiration in the interim period of 6 h. For example, heavy metals can increase the lag time prior to the mineralization of amino acids [46]. If a fixed incubation period is used, an extended lag time results in higher toxicity compared to evaluations where only the linear part of the respiration curve is considered. We therefore recommend that results obtained with the MicroResp system are compared with those obtained from tests with additional measurements and more frequent testing intervals.

When cysteine was used as the substrate, the inhibition of respiration remained constant on day 28 regardless of the NM-300K concentration, whereas the other substrates showed concentration-dependent effects. Campbell et al. [47] described positive color reactions when cysteine was used as a substrate, which may make it unsuitable, but we found no evidence for positive color reactions after 7 and 14 days, in agreement with Wakelin et al. [48].

The miniaturized MicroResp format may also affect the results. The PAO test uses soil samples of 25 g, which generated better concentration–effect relationships than the MicroResp and exoenzyme tests, both of which are carried out in multiwell plates with small soil samples. We were able to calculate EC values with confidence intervals at all sampling points in the PAO test but not in the others, and the small sample size may contribute to this issue.

Metabolic profiles based on complementary carbon sources can provide rich information about functional microbial diversity in soil samples, and can be used for the characterization of soils and the effects of contamination [49,50,51]. The EFSA opinion paper does not recommend the use of any specific carbon sources, and a standard test set covering different metabolic pathways remains to be defined. The selection of carbon sources must be considered carefully because this determines the discriminatory power and ecological relevance of the respiration profiles [52]. Our results indicate that glucose and cellobiose are equivalent and it is unnecessary to use both. The selection of additional carbon sources addressing further metabolic transformations should be based on environmental relevance, such as carboxylic acids, alcohols, fatty acids, polymers, aromatic structures and aliphatic compounds. The substances should be bioavailable (and therefore preferably soluble in water) due to the short incubation time of the test. Creamer et al. [53] used a set of seven substances comprising d-(+)-galactose, l-malic acid, γ-aminobutyric acid, N-acetylglucosamine, d-(+)-glucose, α-ketoglutarate, citric acid and water for basal respiration measurements. This was selected from among the 15 substances used by Campbell et al. [9], which included further sugars, amino acids, dicarboxylic acids and the aromatic compound protocatechuic acid, representing diverse carbon sources reported as constituents of root exudates and suitable for the discrimination of soils [47]. Linolenic acid, malic acid, oleic acid, arginine, serine, lactic acid, oxalic acid, trehalose, stearic acid and β-methyl d-glucoside were identified as the 10 carbon sources with the highest power to distinguish among different sites [47].

Exoenzyme activity test

In the exoenzyme activity test with four enzymes, β-glucosidase and arylsulfatase showed concentration–effect relationships at the three measuring time points. The EC values for β-glucosidase indicated that the sensitivity of the test was similar the substrate-induced respiration test with glucose or cellobiose. Arylsulfatase had no direct counterpart among the substrates used in the respiration test, but the sensitivity was comparable to β-glucosidase. Dick et al. [54] reported that the application of ammonium-based nitrogen fertilizer inhibited soil amidase activity, indicating a specific feedback mechanism, whereas no correlation was observed for β-glucosidase or arylsulfatase. It is unclear whether the different profiles of β-glucosidase and arylsulfatase versus arylamidase activity in our test also indicate different effects of NM-300K and corresponding feedback mechanisms. Based on the investigation of eight exoenzymes involved in various transformation steps (acid phosphatase, arylsulfatase, catalase, dehydrogenase, fluorescein diacetate hydrolase, protease, urease, and β-glucosidase), Chae et al. [55] recommended β-glucosidase for the measurement of soil ecosystem health. The different temporal profiles of three enzymes (leucine aminopeptidase, phosphatase, and β–glucosidase) in the presence of metals during an incubation period of 21 days [56] and of four enzymes (acid phosphatase, arylsulfatase, fluorescein diacetate hydrolase, and urease) affected by fluorine [57] indicate that more than one enzyme is required for risk assessment applications.

Analysis of genomic diversity by NGS

Functional assays provide information about microbial activity under the test conditions, whereas NGS provides information about the diversity of cultivable and non-cultivable microorganisms and can reveal changes in population structure that are not detected by activity assays due to community-level metabolic redundancy [58] or changes in the microbial community that occur during the functional assay [59]. The link between genomic and functional diversity, or between species richness and ecosystem services, has been widely discussed [4, 60]. We found that NM-300K caused a shift in the microbial community, enriching the soil for microbial groups representing the Proteobacteria (Caulobacterales, Burkholderiales, and Xanthomonadales), Cytophagales and Sphingobacteriales. Sphingobacteria and Proteobacteria are often found in soils contaminated with polyaromatic hydrocarbons [61]. Although the ability to degrade such compounds was not investigated in this study because the model contaminant was a metal, the enrichment of these microbial groups may indicate a disturbance in the microbial population resulting in the proliferation of organisms insensitive to the test compound. The depletion of particular microbial groups can slow down the transformation functions they normally provide. In this study, we observed a link between the depletion of Nitrosomonadales and the reduced PAO activity, whereas for the other bacterial orders there was no one-on-one relationship to specific functions. As stated by EFSA [1], functional redundancy should contribute less to insensitivity when studying more specialized functions.

The analysis of genomic DNA reveals the presence of bacteria without providing information about their activity, and extracellular DNA that persists in the soil may also be detected [62]. We found that our functional tests and genomic diversity analysis achieved comparable sensitivity in detecting the effects of NM-300K on soil microbes. The concentrations of NM-300K that affected microbial activity in the test systems also triggered clear changes in the composition of the microbial community. Significant effects at an NM-300K concentration of at least 1.67 mg/kg were observed in all four tests. The EC50 values calculated for the genomic survey were also comparable to those observed in the functional tests. Similar results were reported for the metagenomic and functional impact on soil microorganisms of a Cu(OH)2 nanopesticide [63] and four frequently used organic herbicides tested at a concentration corresponding to the upper recommended rate [64]. None of the herbicides affected the microbial richness or evenness, or the composition of the bacterial and archaeal communities, and from the functional perspective there was no significant effect on enzyme activities or the ability to degrade 15 substrates. The only exception was a short-term effect on the ability of soil microorganisms to utilize three organic acids and one amino acid. However, studies indicating no effects are of limited use when comparing the sensitivity of different test methods. Diversity indices such as the Shannon index and species evenness are often used to measure the impact of a test substance on whole communities, providing information on changes in species richness and taxonomic distribution [39]. However, samples with the same index can feature different communities. Such changes can be demonstrated by similarity indices, which measure the degree of equality or difference in the community [39]. Our study showed that NM-300K did not affect the overall diversity of the community because there was no change in the Shannon index or species evenness. However, changes became visible when calculating similarity indices, showing that NM-300K does not reduce diversity per se, but modifies the composition of the diverse microbial community in the soil.

Conclusion

In our study we compared the sensitivity of four tests for their ability to detect the effects of a model silver nanomaterial on soil microorganisms. We based our selection on EFSA recommendations to (i) evaluate the MicroResp system and (ii) consider diverse microbial processes as well as structural diversity [1]. We therefore evaluated the MicroResp substrate-induced respiration test system and exoenzyme test with four substrates each, as well as the PAO test for nitrogen transformation. We found that the sensitivity of all three functional tests was comparable for the model silver nanomaterial. However, several issues remain to be addressed before we can decide on the suitability of these approaches for the registration or approval of substances. For example, the tests should be evaluated with a broader range of model substances, including non-ionic compounds. This will indicate whether different substrates or different taxonomic levels influence the discriminatory power of the tests. Finally, the amount of soil in each replicate appears to influence the validity of the observed concentration–effect relationships, and this particular aspect must be investigated in more detail.