1 Introduction

Human-associated fecal contaminants and pathogens have been detected in rivers during storm events (Schiff et al., 2016), and fecal contamination to ocean water from river outfalls after rain events has been associated with increased rates of gastrointestinal illness and multiple infections amongst surfers (Arnold et al., 2017). Microbial source tracking (MST) methods such as the detection of the HF183 gene target of Bacteroides are increasingly being used to monitor human fecal pollution in surface waters (US EPA, 2019), but the use of these markers by themselves are not sufficient to separate out the relative contributions of different sources of human fecal pollution, especially from non-point sources. Non-point sources of human fecal pollution to surface waters include leaky sanitary sewers (exfiltration), runoff from open defecation, septic system leach fields, sanitary and combined sewer overflows, and unknown pollution inputs to some municipal storm sewer systems.

Sanitary sewer overflows (SSOs) introduce untreated wastewater to nearby soils or waterways during both wet and dry weather (Baresel & Olshammer, 2019). Sanitary sewer overflow events are quite common, potentially resulting from storms that could be classified as annual storm frequency events across all durations (Day & Seay, 2020). The most recent reporting by the California State Water Resources Control Board (2015) lists a total of 4,580 SSOs in fiscal year 2014–2015, resulting in more than 11 million gallons of sewage reaching the environment, and there are likely at least 23,000 to 75,000 SSO events in the USA each year (ASCE, 2017). Sanitary sewer overflows have been associated with increases in emergency room visits for gastrointestinal illnesses, which tended to peak 10 to 14 days following the sewer overflow events (Jagai et al., 2017; Potera, 2018). Furthermore, multiple studies have predicted that climate change and urbanization cause significant increases in the occurrence, volume, and duration of sanitary and combined SSO events (Astaraie-Imani et al., 2012; Mahaut & Andrieu, 2019; Tavakol-Davani et al., 2016). In addition to the implementation of efforts to reduce the frequency of sewer overflows and to monitor the frequency of their occurrence, there are needs to regulate or restrict access to sites impacted by SSOs in order to protect public health and, more specifically, to know how long access should be restricted. Traditional fecal indicator bacteria, such as E. coli and enterococci, do not strictly indicate human fecal pollution, while MST markers such as HF183 and pepper mild mottle virus (PMMoV) are more specific to human fecal pollution (Symonds et al., 2018). However, molecular methods such as qPCR and RT-qPCR, which are used to detect HF183 and PMMoV, respectively, do not provide information about viability and may pick up DNA and RNA from nonviable microorganisms. At the same time, viruses such as PMMoV may have different persistence in the environment than bacteria, such as E. coli, enterococci, and Bacteroides. As such, it is necessary to better understand the persistence of different types of microbial pollutants and pathogens in soils and sediments impacted by SSOs.

Sewer infrastructure also deteriorates over time due to structural failure, aging, and physical, biological, or chemical factors that contribute to deterioration of the pipe materials. In the USA, there are over 800,000 miles of public sewers and 500,000 miles of private laterals, much of which is in need of being replaced, according to a study by the American Society of Civil Engineers (ASCE, 2017). The ASCE gave wastewater collection and treatment infrastructure a grade of D + , which means that the infrastructure is in “fair to poor condition and mostly below standard, with many elements approaching the end of their service life” (ASCE, 2017). A study done by the US Environmental Protection Agency (Amick & Burgess, 2000) concluded that the risk of sewer exfiltration was probably limited to US cities with deeper groundwater, mainly located west of the Mississippi River and along the Appalachian Mountains. Measurements done in the late 1980s showed exfiltration rates that ranged from 5,000 to more than 50,000 gallons per inch diameter per mile per day, accounting for 12–30% of the total flow in the pipes (US EPA, 1989). In other locations throughout the world, exfiltration rates may be significantly higher—for example, sewer exfiltration was measured to account for 74% of the sewage inflow during the dry season in Hue, Vietnam (Watanabe et al., 2019). Sewer exfiltration also is responsible for the introduction of nutrients, pathogens (Reynolds & Barrett, 2003; Rutsch et al., 2008; Gotkowitz et al., 2016; Roehrdanz et al., 2016), and chemical contaminants (Roehrdanz et al., 2016), including pharmaceuticals and personal care products, into shallow groundwater (Chisala & Lerner, 2008) or into the vadose zone, from which baseflow or interflow of stormwater (Pinongcos, 2020; Mladenov et al., 2020) may transport these contaminants into waterways. Viruses in particular can travel long distances through the subsurface compared to larger bacteria and protozoa (Hunt & Johnson, 2017; Schijven et al., 2017), and relationships between sewer age and the detection of enteric viruses in groundwater below sanitary sewers has been reported (Bradbury et al., 2013; Gotkowitz et al., 2016). Due to their prevalence in wastewater and persistence in water matrices, novel viral markers, such as PMMoV, have been proposed as viral water quality monitoring tools (Bivins et al., 2020).

In June 2019, the San Diego Regional Water Quality Control Board (2019) issued an investigative order to identify and quantify the sources and transport pathways of human fecal pollution to the San Diego River, identifying possible sources to include SSOs, discharges and exfiltration from sewer collection systems and laterals, discharges from on-site wastewater treatment systems, illicit connections and discharges to stormwater infrastructure, and runoff mobilizing waste associated with homeless encampments. However, the relative contributions of each of these sources is unknown.

Leakages from sewer infrastructure in the form of sanitary or combined sewer overflows and sewer exfiltration result in the seepage of pathogenic bacteria, viruses, protozoa, and other microorganisms as well as organic carbon and nutrients into adjacent soils, where they may remain until a future rain event. These microorganisms and sewage-derived constituents may remain in soils for long periods, especially in arid Mediterranean climates, where > 80% of the precipitation is constrained to the wet winter season (Aschmann, 1973).

This study seeks to evaluate the persistence of fecal indicator bacteria and microbial markers of human contamination in water flushed through soil under two conditions: (1) after multiple flushing events (referred to as the “flushing treatment”) that simulate stormwater flushing contaminants from the soil surface or vadose zone via interflow and (2) after long-term drying and subsequent single flushing (referred to as the “decay treatment”) that simulates the persistence of microbial indicators under these conditions during prolonged periods without stormwater flushing. Specifically, we measured the concentrations of Escherichia coli, fecal enterococci, the HF183 genetic marker (Bacteroides dorei), PMMoV, dissolved organic carbon (DOC), and total dissolved nitrogen (TDN) in flushed water with respect to the number of consecutive times the soil was flushed and with respect to the number of days the sewage-spiked soil was left to dry.

2 Materials and Methods

2.1 Experimental Conditions

Soil was collected at a distance of ~ 10 m from the south side of Alvarado Creek (32.7778866, − 117.0665544), which is a tributary of the San Diego River in San Diego, CA, USA. An area away from sewer manholes and known sewer lines and in between canopy with no anthropogenic debris was chosen, and the top 5 cm of the soil were scraped off with a sterilized scoopula to remove pebbles and large debris. Approximately 4 kg of soil was collected with sterilized scoopula from an area of 1,861 cm2, placed into a sterilized bag, and brought back to the laboratory. The soil was sieved through a 2-mm sieve and homogenized by hand on the same day of sample collection and stored in a desiccator prior to use. A portion of soil was analyzed for organic content using the loss on ignition method (Beaudoin, 2003). The organic matter content of the soil was determined to be 6.9%, and soil sieve analysis indicated that the soil sample was 10.4% gravel, 76.8% sand, and 12.8% fine sediment (by weight). Approximately 10 mg of soil was dried in a drying oven at ~ 105 °C and homogenized by hand with a sterile trowel before being placed into duplicate sterilized 50-mL centrifuge tubes. The depth of the soil in the centrifuge tubes was approximately 1 inch, and the mass of the soil in each tube was 10 mg. Although the soil was not autoclaved, it is possible that the use of the drying oven may have affected the viability of the natural microbial community in the soil.

Screened untreated wastewater for soil spiking was collected as a single grab sample from the San Elijo Water Reclamation Facility (Cardiff, CA, USA) during the morning of 29 May 2019 and stored at 4 °C until use. Within 24 h of collection of the wastewater, two experiments were initiated, (1) a “consecutive flushing experiment” (Fig. 1a), meant to simulate flushing of sewage-contaminated soils during storm events, and (2) a “time-delay flushing experiment,” meant to simulate the flushing of soils that had been previously contaminated by sewage and flushed by a rainfall event occurring after long periods of drying (14 days, 28 days, 60 days, and 121 days). Experiments were conducted in duplicate, and controls for the experiments comprised duplicate soil samples that were not spiked with wastewater, but were subjected to flushing, settling, and decanting at the same time as the spiked samples. The supernatant of the experimental controls was analyzed for all parameters in the same manner as the spiked soil samples for the consecutive flushing experiment, with the only difference being that the controls were only flushed once on the first day of the experiment.

Fig. 1
figure 1

Experimental simulations of a surface runoff or interflow flushing soils contaminated with untreated wastewater and b persistence and decay of microbial and chemical constituents in water flushed from soils contaminated by untreated wastewater after being left idle for 14, 28, 60, and 121 days

For both experiments, a single 5 mL aliquot of untreated wastewater (sewage) was pipetted into each of the centrifuge tubes, and this volume was sufficient to completely saturate the soil with untreated wastewater. The centrifuge tubes were then lightly capped to allow air to flow in and out of the tube, then placed inside a Class-II biosafety cabinet to dry at ambient temperature (~ 20 °C). After 24 h of spiking soils with wastewater, tubes were removed in duplicate and flushed once by adding 45 mL of autoclaved synthetic rainwater (0.16 mM CaCl2, 0.02 mM Ca(NO3)2, and 0.12 mM Na2SO4; pH = 7.17, EC = 27.2 μS in ultrapure water) (as per Smith et al., 2002) to the centrifuge tube. The tubes were turned twice using an end-over-end motion and then set aside to settle for 10 min. Supernatant was then removed from each duplicate tube by careful pipetting. After this first flush, additional consecutive flushes of 45 mL each were performed for a total of 20 flushes (“consecutive flushing experiment”; Fig. 1a) of each duplicate tube. On average, 38 mL of supernatant was recovered from each flush, and recovery volumes ranged from 33 to 44 mL. The decanted supernatants from the first, second, fifth, tenth, and twentieth flushes were reserved for chemical and microbial analyses. The remaining soil fractions were not analyzed, as we were more interested in quantifying pollutant loadings in the flushed water (e.g., the supernatant), as it was meant to simulate stormwater interflow that could potentially contaminate surface waters. For the “time-delay flushing experiment” (Fig. 1b), duplicate sample tubes containing 10 mg of soil spiked with 5 mL of untreated wastewater were left loosely capped in the biosafety cabinet for 14, 28, 60, and 121 days after spiking (a total of 8 tubes); then they were sampled destructively as follows: the duplicate tubes were each flushed once with 45 mL of sterile synthetic rainwater on the prescribed day, using the same end-over-end motion described previously, and the supernatants were collected for analysis.

2.2 Quantification of Microbial Constituents

All samples were analyzed for E. coli and enterococci on the same day of collection using the Colilert and Enterolert methods, respectively, with the IDEXX Quanti-Tray 2000 system, as described in Standard Methods 9223B (APHA, 2012). Samples were vortexed for 10 s at maximum speed between dilutions to attempt to dislodge suspended particles. Samples anticipated to have high concentrations were diluted serially, either ten-fold or 100-fold at a time, vortexing between serial dilutions, with a maximum dilution of up to 1:10,000, and the concentrations were corrected for the dilution factor. Samples with suspected low concentrations were analyzed with a dilution of 1:100, resulting in a limit of detection of 100 MPN/100 mL for both E. coli and enterococci.

Bacteria and viruses in samples were concentrated from volumes between 30 and 40 mL, using membrane filtration and the adsorption-extraction method, respectively (Ahmed et al., 2015; Symonds et al., 2014; Verbyla et al., 2016). Briefly, MgCl2 was added to samples (final concentration of 25 mM), and the pH was adjusted to 3.0–3.5 with 1.0 M acetic acid. Then, samples were vacuum-filtered through 0.45-μm, 47-mm mixed cellulose ester filters (HAWP type, Millipore). Filters were placed into bead-beating tubes (lysis matrix E, MP Biomedical) with 600 µL of lysis buffer from the AllPrep PowerViral RNA/DNA Kit (Qiagen, Germantown, MD, USA) and 10% β-mercaptoethanol, and vortexed at high speed for 10 min. After centrifugation at 13,000 g for 1 min, filter lysates (typically ~ 500 µL) were then collected with a pipette and then transferred to clean microcentrifuge tubes, as described previously (Ahmed et al., 2015). Extracted nucleic acids were then purified using the AllPrep PowerViral RNA/DNA Kit, following the manufacturer’s recommended protocols, with final elution in a volume of 50 µL molecular grade water. Then, 8 µL of the extracted and purified nucleic acids were reverse transcribed with random hexamers using the SuperScript IV First-Strand cDNA Synthesis System (Invitrogen), following the manufacturer’s recommended protocol.

Molecular targets were quantified with the Bio-Rad QX200 droplet digital polymerase chain reaction (ddPCR) system, in a 20 µL reaction with 3 µL of purified DNA or 2 µL of cDNA containing the ddPCR Supermix for Probes (Bio-Rad), as well as primers and probes for the HF183/BacR287 or PMMoV assays (Supplementary Information) at final concentrations of 900 nM (primers) and 250 nM (probes). Thermocycling conditions were set to 95 °C for 10 min followed by 40 cycles of 94 °C for 30 s and then 60 °C for 1 min. After completion, samples were held at 98 °C for 10 min. The calculation of the assay limits of detection is described in detail in the Supplementary Information. Quality assurance and quality control recommendations were followed as described by Huggett et al. (2013) and are described in detail in the Supplementary Information.

Prior to spiking, a sample of the homogenized soil used for the experiments was analyzed for E. coli, enterococci, HF183, and PMMoV, using the methods described above, with the following differences: (1) E. coli and enterococci were eluted from ~ 20 g of soil with 40 mL of DI water (Julian et al., 2015) by vortexing for 1 min, the collecting the supernatant, which was diluted by a factor of 1:10 and analyzed using the IDEXX protocol, as described above; (2) nucleic acids for HF183 and PMMoV analysis were extracted directly from the soil using the RNeasy PowerSoil total RNA and DNA kits (Qiagen), following the manufacturer’s recommendations. Neither E. coli, enterococci, HF183, nor PMMoV were detected in the unspiked soil. Concentrations of E. coli, enterococci, HF183, and PMMoV in the wastewater used for spiking were multiplied by the volume spiked (5 mL) to calculate microbial loadings in the spiked soil. Concentrations of E. coli, enterococci, HF183, and PMMoV in the flushed water were multiplied by the flush volume (45 mL) to calculate the microbial loadings from each flush. For example, if a concentration of 7.93 × 103 gc/mL of HF183 was detected in the flush water, then that concentration would be multiplied by the volume of water flushed through the sample (45 mL) to get the loading (7.93 × 103 gc/mL × 45 mL per flush = 3.57 × 105 gc per flush).

2.3 Quantification of Chemical Constituents

All samples for DOC and TDN analysis were filtered through 0.7 μm (nominal pore size) precombusted (500 °C for 4 h) glass fiber filters and preserved immediately with hydrochloric acid. These samples were analyzed within 2 weeks of sample collection. Dissolved organic carbon (DOC) and total dissolved nitrogen (TDN) were measured (at a dilution of 1:20) using a Shimadzu TOC-L Total Organic Carbon Analyzer with a chemiluminescent nitrogen detector. The instrument uses a high temperature combustion method, which reports DOC concentrations as non-purgeable organic carbon. Each individual sample was analyzed with three sample injections. If the threshold for the coefficient of variation (20%) was exceeded, the sample was run again until the CV met the threshold, and the mean of those results was reported. In addition, each sample was run in duplicate, and standard deviations were within 10% of the mean. DOC and TDN concentrations in sewage used for spiking were 59.4 mg C/L and 88.7 mg N/L, respectively. When diluted with 45 mL of synthetic rainwater for flushing, this resulted in an additional 6.6 mg C/L and 9.9 mg N/L added to the C and N already in water flushed from unspiked soil.

2.4 Statistical Analysis

Changes in microbial loadings in rainwater flushed through sewage-spiked soils with respect to the number of flushes (i.e., volume of flushed water) were plotted and linear regression was performed. Differences in the rate of decrease in microbial loadings in rainwater flushed through sewage-spiked soils with respect to the number of days since contamination were assessed using analysis of covariance (ANCOVA) to test the equality of slopes for log-linear regression curves for pseudo-first-order decay. All statistical tests were performed at the alpha level of 0.05 using R v3.4.2 or Microsoft Excel.

3 Results and Discussion

3.1 Concentrations of Chemical Constituents

In the consecutive flushing experiment, concentrations of DOC and TDN in sewage-spiked soils had steep decreases, from > 80 mg C/L and ~ 8.0 mg N/L for the first flush to < 10.0 mg C/L and < 0.6 mg N/L by the 5th flush (Figs. 2a and 2c). After the 5th flush, DOC and TDN concentrations changed very little, with < 10% of the initial concentration remaining. The decrease in both DOC and TDN as a function of flushing volume (for the first 5–10 flushes) followed a decrease that appeared to be consistent with second-order rate processes (Supplemental Figure S1 shows that relationships of 1/concentration vs. flush volumes had r2 values ranging from 0.98 to 0.999). This result reveals that the initial stages of wetting and flushing are most important for mobilizing carbon and nutrients from soils. Also, it is worth noting that controls (without sewage spiking) did also experience decreases in DOC and TDN concentration, which are naturally found in soils, but the starting concentrations were lower than in spiked soils, with 55 mg C/L and 3.7 mg N/L measured in the first flush; therefore, the decrease in concentrations during the first 5 flushes was not as pronounced as in the spiked soil flushing experiments. For DOC, the second flush of the control soils only removed 6–9% of the initial DOC concentration (compared to > 70% removal in the spiked soils), and the 5th flush still had > 25% of the initial DOC concentration remaining. It is possible that sewage spiking increased the soil moisture (even though soils were left to dry for 24 h after spiking). This increased soil moisture may have “primed” soils for more facile and efficient flushing of solutes.

Fig. 2
figure 2

Concentrations of dissolved organic carbon (DOC; a and b) and total dissolved nitrogen (TDN; c and d) for synthetic rainwater flushed through soils contaminated with sewage with respect to the number of consecutive flushes (a and c) and with respect to the number of days since the soils were contaminated (b and d); dashed lines in a and c show concentrations in controls without sewage spiking, and dashed lines in b and d show predicted concentrations for day 1 based on concentrations in unspiked soil and sewage used for spiking

Results from the time-delay flushing experiment show variability in both DOC and TDN concentrations, which make it difficult to determine whether there is any degradation of either solute over long-term dry conditions. Due to destructive sampling for this experiment, each set of duplicate tubes did contain a new soil sub-sample (from the original homogenized soil sample), and some, even minor, amount of heterogeneity in the soil organic matter content may explain the variability in both DOC and TDN concentrations. The flushing experiment, by comparison, had all samples withdrawn from the same centrifuge tube, so no variability in concentrations is observed among flushes. Nonetheless, variable concentrations that sometime exceeded the predicted concentrations (from mass balance calculations) and the lack of a clear decreasing trend in DOC and TDN concentrations in the time-delay flushing experiment suggest that there was little decay of these solutes over time. This was not the case for microbial markers, as will be discussed next.

3.2 Concentrations of Microbial Constituents

The measured changes in the concentrations of microbial constituents with respect to each subsequent flush and with respect to the number of days since contamination are shown in Fig. 3. The concentrations in the sewage used to spike the soils were 2.2 × 107 MPN/100 mL for E. coli and 3.0 × 106 MPN/100 mL for enterococci, and 5 mL of sewage was spiked into the soil, which means that a total of 1.1 × 106 MPN E. coli and 1.5 × 105 MPN enterococci were added to the soil on average (dashed lines in Fig. 3a and b). For the consecutive flushing experiment, the geometric mean initial concentrations of fecal indicator bacteria in the water from the first flush, one day after soils were spiked with sewage, were 5.5 × 107 MPN/100 mL and 3.4 × 106 MPN/100 mL for E. coli and enterococci, respectively. Based on these values, the loadings of E. coli and enterococci flushed away from the soil for flush 1 were both an order of magnitude greater than the loadings originally added. Given that E. coli and enterococci were not detected in duplicate flushes of the original, unspiked soil samples (both < 1000 MPN/100 mL), it is possible that microbial growth may have occurred overnight. This finding of potential regrowth of both E. coli and enterococci in duplicate spiked sample tubes is important considering the extremely high FIB concentrations found in urban waterways (Hathaway & Hunt, 2011). For example, FIB concentrations are regularly orders of magnitude higher during storm events than during baseflow in the San Diego River (Pinongcos, 2020). Therefore, regrowth upon soil wetting is an important additional consideration when trying to evaluate the elevated FIB concentrations.

Fig. 3
figure 3

Loadings (a and b) and log10 differences in the loadings (c and d) for E. coli, fecal enterococci, HF183, and PMMoV for synthetic rainwater flushed through soils contaminated with sewage with respect to the number of consecutive flushes (a and c) and with respect to the number of days since the soils were contaminated (b and d); dashed lines show the loadings from the sewage spiked into the soil on day 1. Unspiked soils had no detectable concentrations of E. coli, enterococci, HF183, or PMMoV. Data from samples analyzed for E. coli after the 20th flush were non-detects because they were analyzed at too high of a dilution, so loadings were below 100,000 MPN per flush

Subsequent flushes yielded substantially lower loadings of E. coli and enterococci. Although loadings of all microbial markers tailed off by the end of the experiment, there were still 2.3 × 103 MPN of enterococci and 1.5 × 102 GC of HF183 detected in the supernatant (45 mL) of the 20th consecutive flush. The dilution (1:1000) used for enumerating E. coli in the supernatant of the 20th flush was too high and resulted in no detection; therefore, the final concentrations of E. coli were below the limit of detection (< 1000 MPN/100 mL) for both replicates. The rapid decrease in microbial loadings of all markers between the first flush and the 5th flush approached a 2-log10 (99%) difference, which is an order of magnitude greater than for DOC and TDN (> 90% by the 5th flush). This result also supports that the early stages of wetting and flushing are probably most important for mobilizing both microorganisms and solutes from soils.

As was the case for E. coli and enterococci, neither HF183 nor PMMoV were detected in the original, unspiked soil samples. On the other hand, concentrations of HF183 and PMMoV in the sewage used to spike the soils were 5.9 × 106 gc/100 mL and 3.4 × 105 gc/100 mL, respectively. Given that the soils were each spiked with 5 mL of sewage, the total numbers of HF183 and PMMoV added to the soils were approximately 2.9 × 105 and 1.7 × 104 gene copies, respectively, in both experiments (dashed lines in Fig. 3a and b). The geometric mean initial concentrations of HF183 and PMMoV in the water from the first flush, one day after soils were spiked with sewage, were 5.0 × 105 MPN/100 mL and 7.0 × 103 MPN/100 mL, respectively. Subsequent flushes yielded substantially lower loadings of HF183 and PMMoV, which tailed off with a similar pattern to E. coli and enterococci (Fig. 3c). Interestingly, PMMoV was not detected after the 10th flush, which may be due to the lower starting concentration of PMMoV in the sewage used to spike the soils. Moreover, the smaller size of viruses compared to bacteria has an important influence on their mobility (Hunt & Johnson, 2017), which may result in the faster flushing of PMMoV compared to the bacterial markers.

The retention, sorption, and desorption of fecal bacteria by soil have been previously studied (Jamieson et al., 2002; Stevik et al., 2004). Henry and Dillaha (2004) reported that 78% of E. coli sorbed to topsoil within 1 h, and Mankin et al. (2007) reported that the sorption of E. coli to soil was reversible and dependent on the soil type—nearly 100% of E. coli desorbed from sandy soil with one flush, but less than 15% desorbed from topsoil after three flushes. We observed much higher quantities of E. coli and enterococci in the first and second flush relative to the number of bacteria added to the soil, indicating possible regrowth, which is consistent with previous studies, according to a review article (Jamieson et al., 2002). We also found that the quantity of bacteria in the second and fifth flushes was more than 25% of the quantity flushed away initially (in the first flush), potentially indicating that the desorption of fecal bacteria from certain riverbank soils may be a prolonged process.

With respect to the number of days after contamination until the soils were flushed (time-delay flushing experiment), the different microbial markers underwent attenuation over the 121-day experiment, which may be due to decay in the soil or sorption of microorganisms to the soils. The attenuation of microorganisms, as indicated by the log10 difference in microbial concentrations, ranged from ~ 1-log for PMMoV (~ 90% reduction) to ~ 6-log (99.9999% reduction) for E. coli over the 121-day experimental period (Fig. 3d). Wastewater-spiked soils continued to leach detectable concentrations of E. coli, enterococci, HF183, and PMMoV, even after 4 months. In the case of E. coli, previous studies have shown its persistence and survival in soils (Brennan et al., 2010b), which was attributed to an advantageous physiology that favors the soil environment (Brennan et al., 2010a). Similarly, E. coli, enterococci, and HF183 were found to persist in sewage-inoculated sediments kept over a 21-day period, but PMMoV was not studied (Zimmer-Faust et al., 2017). 

There were also substantial differences in the loadings for the different microbial groups (Fig. 3b  and d), indicating differences in the persistence of these microbes or the related gene targets in contaminated soils. For instance, the loading of E. coli was highest in the flush that occurred one day after contamination (its concentration in sewage was also the highest). However, in the flush performed 4 months after contamination, the loading of E. coli in the flushed water was the lowest, indicating that it likely decayed faster than the rest of the microorganisms. By contrast, PMMoV had the lowest loadings in flush water on the first day but had the highest loading in the flush water after 4 months. The attenuation rate for E. coli was higher than the attenuation rates for HF183 and enterococci, which were higher than the attenuation rate for PMMoV. It is important to note that PMMoV and HF183 were measured using ddPCR, while E. coli and enterococci were measured using culture-based methods, so the gene targets detected for HF183 and PMMoV did not necessarily come from viable microorganisms. Also, it is important to note that we did not directly measure microbial decay, since the decrease in concentrations after different idle times could have been a result of bacterial die-off or irreversible adsorption.

The attenuation rates appeared to follow first-order kinetics for the first month or two, with possible subsequent decelerating attenuation rates (tailing effect). Assuming a first order decrease for the first month only, the pseudo-first-order attenuation rate coefficients (calculated on a natural log basis) were 0.103 day−1 for E. coli, 0.100 day−1 for enterococci, 0.121 day−1 for HF183, and 0.022 day−1 for PMMoV. This equates to T90 values of ~ 10 days and T99 values of ~ 20 days for E. coli, enterococci, and HF183 (T90 and T99 values are the times required for 90% and 99% reduction, respectively). The T90 and T99 values for PMMoV could not be interpolated but are expected to be > 120 days based on the data.

The persistence and decay of fecal bacteria in soils has also been previously studied (Jamieson et al., 2002; Vinten et al., 2002; Brennan et al., 2010b; Parsai et al., 2018), mostly in the context of manure or sewage sludge biosolid application to agricultural soils. Rogers et al. (2011) reported approximately 2-log10 reduction of the GenBac3 qPCR gene target after 120 days in manure-amended soil at 25 °C. The data from their study showed a two-stage, first-order, and biphasic decay pattern, indicating a tailing effect starting at around 10 days, which is similar to the tailing shape of the curve of HF183 flushed away from soil in our study after different lengths of time. Underthun et al. (2018) was unable to detect E. coli in a sandy loam soil at 30 °C after 56 days when the concentration on day 1 was 107 CFU/g. Lang and Smith (2007) reported that the decay of E. coli in air-dried sandy loam at 15 °C was slower (with less than 1-log10 reduction after 3 months) compared to moist sandy loam at the same temperature, where there was a 5-log10 reduction of E. coli after 3 months. The choice to let our soil samples air dry after spiking with sewage may have led to slower decay of fecal bacteria in the time-delay flushing experiment. Decay of the viral and bacterial indicators in the soil is not directly measured in our study, as we did not measure microbial indicators in the soil, and therefore we cannot separate out the effects of decay vs. desorption. It is possible that fewer bacteria would have flushed away in our time-delay flushing experiment if the soil was not air dried. Nevertheless, our results do provide an indication of how long these different indicators can be expected to be detected in the interflow of polluted soils after a sewage spill event.

Previous studies of the decay of microbial source tracking markers have mainly evaluated persistence in water matrices and were nearly an order of magnitude faster than those we report for spiked soils that were allowed to dry. For instance, in a study of the persistence of E. coli and the HF183 marker in river water with 10 g/L of river sediments, Dick et al. (2010) reported T99 values of 2.80 days for E. coli and 1.88 days for HF183. Liang et al. (2012) reported 99% decay of the HF183 marker in 2.7 days in river water at ambient temperature in a greenhouse at 15 °C (under natural light exposure), and Hamza et al. (2011) reported reductions of 1.1 log10 at 25 °C and by 0.5 log10 at 4 °C for the concentrations of PMMoV in river water after 21 days (under dark conditions). In sewage-inoculated sediments, Zimmer-Faust et al. (2017) found substantially faster decay rates for HF183 compared to E. coli or enterococci, but PMMoV was not studied.

In the present study, the decay rate of the HF183 marker was found to be faster than that of PMMoV by about an order of magnitude, meaning that the PMMoV marker would likely persist longer than the HF183 marker in soil by about a factor of ten. This characteristic is a crucial aspect of the relationship between these two markers. Indeed, the persistence of PMMoV in groundwater was found by Morrison et al. (2020) to be an important characteristic to support its potential use in evaluating virus log reduction values. Due to their differing persistence, PMMoV has a greater capacity for accumulating within an environment, assuming sources of this marker into the environment are constant. In this sense, HF183 may be used to indicate more recent sources of contamination, while PMMoV may be a better conservative surrogate for older fecal contamination, which may contain enteric viruses that have higher persistence in the environment.

It should be noted that some of the previous studies left samples exposed to sunlight, which can increase the inactivation rate for bacteria and viruses, especially in a water matrix where sunlight interacts with dissolved compounds to form reactive intermediates (Nelson et al., 2018). Our experimental setup simulated only the below-ground portion of a sewage spill, for example, when a SSO infiltrates soils or when exfiltration from belowground sewer pipes occurs. This simulation constrained our experimental setup to dark, temperature-controlled conditions and was not meant to address the influences of diurnal or seasonal temperature changes, ultraviolet (UV) light or sunlight exposure, or native microorganisms present in both soil and water. These factors have been shown in the literature to reduce fecal indicator and host-associated genetic marker degradation, but mainly in water matrices (Bae and Wuertz, 2012; Korajkic et al., 2014; Korajkic et al., 2019). The soil used in our experiments was native riverbank soil, and while it was not autoclaved prior to the experiment, the soil was dried in a drying oven at 105 °C and sieved to remove larger debris, which may have affected the native microbial community in the soil. Therefore, there is still a need to better understand the persistence, under environmentally relevant conditions, of bacterial and viral markers in dry soils that are later flushed, such as in the vadose zone where sewer lines are typically located. In addition, performing experiments in centrifuge tubes likely hampered the natural processes of drainage and ventilation of soils. This limitation may have resulted in greater retention of microorganisms in the soil. Future experiments conducted in situ with lysimeters (as in Horswell et al., 2010) or using soil-filled columns for spiking (as in Nakamura et al., 2004 or Paredez et al., 2017) are recommended. Other limitations of this study are that the sewage used to spike the soils came from a single grab sample, which may not be characteristic of all sanitary sewer collection systems. Furthermore, the protocol used to flush soils may not be representative of actual in situ flow through the subsurface. Future experiments should compare flushing when alternative laboratory set-ups are used, such as soil columns.

Despite the limitations of this study and the need for future studies that evaluate persistence under the conditions mentioned above, our findings have important applications in soil and water quality monitoring. For example, consider a situation in which the presence of either of these markers (HF183 or PMMoV) in runoff water collected from a particular patch of soil is being used to assess potential wastewater contamination of the soil matrix. If the initial concentration of both HF183 and PMMoV is known in the suspected contaminant (for example, wastewater from surface sewer exfiltration or overflow), then the ratio of these markers in the runoff water can be used to roughly estimate the relative age of soil contamination events and to gain some clues about the origin of human fecal contamination (e.g., whether it is from a fresh source or an older source). In our study, the ratio of PMMoV:HF183 on day 1 was 1:100, whereas the same ratio on day 121 increased to ~ 20:1; this change in persistence over time reflects the older age of our experimental soil contamination.

4 Conclusions

In this study, we simulated the decay and flushing of solutes, DOC and TDN, fecal indicators E. coli and enterococci (measured by culture-based methods), and microbial source tracking markers HF183 and PMMoV (measured via ddPCR and RT-ddPCR, respectively), from riverbank soils subjected to SSO events, sewer exfiltration, or other types of sewage contamination. The presence of molecular markers alone may be insufficient to substantiate a health risk; however, the differences in flushing rates and decay and sorption behaviors between both molecular and culture-based markers are nonetheless critical to consider fully. Flushing of both solutes (DOC and TDN) and microorganisms proceeded rapidly, with most solutes (90%) and microorganisms (99%) being removed from soil by the 5th flush. Our decay experiments further demonstrated that E. coli, enterococci, HF183, and PMMoV could all be detected in flushes of sewage-spiked soils, even after 4 months of dry conditions. The contrasting decay rates we determined for viral and bacterial markers in dry soils, with PMMoV degrading 5–6 times more slowly in dry soils than bacteria, imply that PMMoV is a useful indicator of legacy sewage contamination in soils, whereas HF183 indicated more recent sources of contamination. If information on the concentrations HF183 and PMMoV in suspected contamination sources is known, then the ratio of PMMoV:HF183 could provide a valuable tool for evaluating the relative age of soil contamination events. Overall, our results suggest that sewage spills or leaks into soil can be long-term sources of microbial pollutants and that the decay of different microbial markers in the soil matrix is wide ranging.

5 Supplementary Information