Introduction

Surface water ecosystems, especially those in rivers, serve as receptor for effluents from wastewater treatment plants (WWTP) and are essential for the biochemical cycling, nutrient recycling, and energy flows (Jin et al. 2018). The rich diversity of microorganisms in these ecosystems, closely linked to physicochemical parameters, plays a crucial role in the functioning and preservation of these processes. The increasing use of surface water for drinking and recreational activities requires microbial analysis to ensure water quality and human health. Various studies have investigated the microbial diversity of surface water (Bryant et al. 2016; Ibekwe et al. 2012). However, the microbial diversity in urban surface water, particularly in the case of the Seine River, remains poorly understood. Only one reference is available in the 2011 activity report (phase 6) of the Piren-Seine program, which investigated the microbial composition and diversity of the Seine River water. The report showed a diverse range of phyla, with Actinobacteria, Proteobacteia, and Bacteroidetes predominating. The study focused on three sites: Marnay, an upstream site, and Bougival and Triel, sites downstream of the Seine River (George et al. 2012), in relation to the Paris conurbation. These sites were situated away from the Paris conurbation where the impact of urbanisation is very significant. Moreover, no correlation was found between microbial and physico-chemical parameters.

The Seine River, which is heavily impacted by human activities, faces challenges in terms of urban wastewater discharges, which are still far from sufficient in relation to the river’s dilution and self-purification capacity. Around 100 km of the river, on the scale of the Parisian conurbation, is subject to significant anthropogenic pressure from treated effluent from WWTPs during dry weather, and treated effluent, combined sewer overflows and urban runoff during wet weather (Flipo et al. 2021). Numerous studies have highlighed the impact of treated, partially treated or untreated effluents on the physicochemical and microbiological quality of surface water, due to the persistent presence of pharmaceutical and cosmetic contaminants (Behera et al. 2011; Luo et al. 2014; Paijens et al. 2022, 2021) and pathogenic microorganisms (Castro-Hermida et al. 2008; Rhymes et al. 2015). Microorganisms, part of the world’s most diverse and abundant biological group, are an essential part of the river biocenosis and play a crucial role in maintaining the functionality of river ecosystem (Findlay 2010). More specifically, the bacterial community within a river contributes significantly to various biogeochemical processes and essential ecosystems services, including pollutant and organic matter degradation (Schmidt 2006), showing sensitivity to environmental factors such as pH, temperature, nutrient levels, and chemical pollutants (Rath et al. 2019; Wang et al. 2023, 2015). Bacterial populations often exhibit rapid-response spatio-temporal variations (Mai et al. 2018; Wang et al. 2015) with bacterial diversity reflecting ecological function within river ecosystems. Consequently, bacterial communities are emerging as promising biological indicators (Hooper et al. 2005; Xie et al. 2016). At the same time, climate change represents a serious threat to the stability and functioning of river ecosystem worldwide, impacting microbial diversity and composition (Weiskopf et al. 2020). Despite extensive research into the ecological impacts of climate change, there remains a notable gap in understanding its specific effects on river microbial communities. Current studies neglect long-term trends and fail to incorporate advanced analytical techniques, preventing us from understanding how climate change affects river microes. This work aims to fill these gaps by providing a comprehensive assessment of the impact of climate change on river microbes. Recent advances in high-throughput, next-generation sequencing methods enable comprehensive characterization of microbial communities in diverse environments, thanks to their capacity to generate large number of sequences. Given this information, local authorities can consider monitoring microbial diversity to assess water quality. The Seine River Bathing plan, launched in 2018 for the Paris 2024 Olympic and Paralympic Games introduces advanced oxydation to further reduce chemical and biological contaminants in the WWTP effluent upstream of Paris to better preserve the river’s water quality, a major concern for local authorities. Understanding microbial diversity and establishing ecological benchmarks before and after treatment are therefore essential to fully characterize new technologies. We therefore studied microbial diversity of the Seine River water to assess the impact of effluents from WWTPs and of tributaries. Excessive volumes of mixed wastewater and stormwater are discharged directly into the river during wet weather, which can have an impact on microbial metabolism and diversity (Drury et al. 2013; Guo et al. 2019). Conversly, during dry weather, high temperatures and low oxygen concentrations also affect microorganisms (Lestel et al. 2021). Consequently, we conducted four sampling campaigns during the two constrating seasons of summer and autumn in consecutive years 2020 and 2021, to get information into microbial communities. Five sampling sites along the Seine River where effluents are discharged from WWTPs, as well as two sites just before the confluence of the Oise and Marne Rivers, were studied to understand the WWTP impact. Microbial abundance, activity and diversity were analyzed using cytometry (BactoSense), enzyme assays and NGS technology, respectively.

Material and methods

Field sites and sample collection

Water samples were collected at seven points along the Seine River, from upstream to downstream of the Parisian conurbation. These sampling sites, which are depicted in the Fig. S1 (supplementary) were chosen to identify the potential impact of both WWTP discharges and tributaries on the quality of the Seine River. In the Ile-de-France region, up to 2.5 million m3/d of effluent during dry weather and up to 5 million m3/d, including a mix of raw wastewater and rainwater, during wet weather, are discharged into the Seine River (Servais et al. 2009).

Seven sampling points were selected, including (i) five points on the Seine River from upstream to downstream of the Paris conurbation, starting at Juvisy-sur-Orge (JSO), followed by Choisy-le-Roi (CLR), Vitry-sur-Seine (VSS), Bougival (BOUG), and Triel-sur-Seine (TSS), (ii) one point on the Marne River at Champigny-sur-Marne (CSM) before the confluence with the Seine River and located just after the discharge of the Marne aval WWTP, and (iii) one point on the Oise River at Conflans-Sainte-Honorine (CSH) before the confluence with the Seine River. GPS coordinates are given in Table 1. Samples were collected from a bridge, using a sampling bucket at the center of the width of the river. Water samples were collected at a depth of approximately 50 cm. All the samples were taken in triplicate and then combined to produce a composite sample. These composite samples were then put in sterile bottles and immediately stored at 6 °C. Afterwards, the samples were then transported to the laboratory for further analysis. The samples were taken in the summer (July) and autumn (November) of two successive years, 2020 and 2021.

Table 1 GPS latitude and longitude coordinates of sampling sites

Physico-chemical analysis

Parameters including conductivity (EC), pH and temperature were determined on-site after collection of water samples, using a Hach multiprobe portable meter (sensION+, Hach Company, Germany). Ammonium [NH4+], nitrate [NO3], and nitrite [NO2] were determined according to NF EN ISO 11732, NF EN ISO 10304-1 and NF EN ISO 10304-1 standards, respectively. Orthophosphate [PO43−] was determined folowing NF EN ISO 6878 standard. The chemical oxygen demand (COD), biochemical oxygen demand (BOD), and total organic carbon (TOC) were analysed using the appropriate analytical methods (ISO 15,705 for COD, NF EN 1899 for BOD, and NF EN 1484 for TOC). The hydrometeorological parameters taken into account include the flow of the Seine River based on records from Vigicrues monitoring stations in France (https://www.vigicrues.gouv.fr/), ambient air temperature and relative humidity data collected from Infoclimat stations (https://www.infoclimat.fr/).

Abundance of microbial cells

The microbial cell count in water samples was determined using the BactoSense automated flow cytometry (FCM) instrument (bNovate, Switzerland). BactoSense is a fully automated flow cytometer using microfluidic detection technology. The sample is injected into the center of a sheath flow containing reagents with fluorescent dyes: SYBR Green I (emitting fluorescence at 535 nm) and propidium iodide (PI, emitting fluorescence at 715 nm). While both dyes are capable of staining DNA, SYBR Green 1 is able to penetrate all cells, whatever the integrity of their cell membrane, whereas PI can only penetrate cells with damaged membranes. Microfluidic flow cytometers use the principle of hydrodynamic focusing to present laser-stained cells through a capillary flow chamber. A single laser diode, emitting light at 488 nm, is used. Fluorescence is measured simultaneously by three detectors: at a wavelength of 525 nm for FL1 (SYBR Green I), at 715 nm for FL2 (PI), and at 488 nm laterally for the Side Scatter (SSC). The results are made available within 20 min.

For pratical reasons, homogenized water samples (2 ml) were transferred to a plastic screw-capped microtube (bNovate, Switzeland) for analysis. The instrument automatically took an aliquot (260 µl) of the sample, of which 90 µl was used for analysis and 150 µl for instrument cleaning. A cleaning step was performed before and after each measurement. Each sample was measured in triplicate. Vittel bottled water was used as a standard to check the condition of the BactoSense after every five measurements. The collected aliquot is labelled with SYBR Green I and PI, and incubated at 37 °C for 10 min before counting. The process is automated to ensure consistent measurements for all samples.

The raw FCM data files were then analysed using customized software that enabled batch processing of the large datasets generated in this study. In brief, FCM gates were constructed to separate bacterial cell signals (total cell concentration, TCC) from background signals and to differentiate intact cells, stained only by SYBR Green I, from cells with altered membranes, positive to both two fluorochromes, i.e. SYBR Green I and PI.

Microbial activity

The enzymatic activity of the microorganisms was measured using the colorimetric assay technique. This method is based on the hydrolysis of fluorescein diacetate (FDA). The microorganisms produce multiple enzymes (esterases, lipases and proteases), leading to the hydrolysis and consequent release of fluorescein (Adam and Duncan 2001). The absorbance of the released fluorescein is then measured at 490 nm. The higher the intensity, the greater the amount of FDA hydrolysed, resulting in increased microbial activity. The assay is conducted in 96-well microplates using the modified method of (Taylor et al. 2002). A 100 µl water sample was added to each well, followed by 100 µl of Mac Ilvain buffer (0.1 M citric acid and 0.2 M Na2HPO4) and 50 μl of fluorescein diacetate (FDA, 4.16 g/l). The microplates were then incubated under agitation (150 rpm) at 37 °C for 2 h. After the incubation period, the microplates were then centrifuged at 4 °C at 15,000 rpm for 10 min. Then, 100 μl of the supernatant of the centrifuged solution were transferred to a new microplate containing 100 µl of Mac Ilvain buffer. Optical density (OD) was measured using a spectrophotometer (Xenius, Safas, Monaco) at a wavelength of 490 nm. The enzyme activity is expressed in µg of fluorescein per ml and per hour, in accordance with the standard range established beforehand.

Bacterial diversity analysis

DNA extraction and polymerase chain reaction (PCR) amplification

To collect the microorganisms, water samples (500 ml) were filtered through a vacuum pump system with 0.22 µm nitrate-cellulose Whatman filtres (Sartorius, France) in a microbiological safety cabinet (BIOII, ADS Laminaire, France). The vacuum pumping system and the filters were sterilized at 121 °C for 21 min. After filtration, the filters containing the microorganisms were placed in labelled sterile Petri dishes in a laminar flow hood for 30 min to dry. DNA was extracted directly from these filters using the DNeasy PowerWater Kit (QIAGEN) according to the manufacturer’s intructions. The amount of genomic DNA was assessed on 1% agarose gel and quantified with Quant-iT™ PicoGreen dsDNA assay kit according to the manufacturer’s intructions and rapidly stored at − 20 °C for future analysis.

Bacterial 16S rRNA gene libraries were constructed using the “16S metagenomic sequencing library preparation” protocol provided by Illumina for the Illumina MiSeq System. The V3–V4 variable regions of the 16S rRNA gene of the bacterial communities were amplified with the forward primer (5′-TAC GGG AGG CAG CAG-3′) (Turner et al. 1999) and the reverse primer (5′-CCA GGG TAT CTA ATC C-3′). Each primer set contains the Illumina adapter sequence: forward primer target sequence (5′- ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′) and reverse primer target sequence (5′- GACTGGAGTTCAGACGTGTGCTCTTCCGATCT-3′). PCR conditions were as follows: 94 °C for 5 min, 30 cycles of 94 °C for 30 s, 54 °C for 30 s, 72 °C for 45 s, and finally 72 °C for 10 min. DNA amplification was verified by agarose gel electrophoresis (1.5%). DNA purity and concentration were assessed using Nanodrop ND-1000 (Thermo-Fisher Scientific). Amplified DNA was purified using the QIAquick96 PCR purification kit (Qiagen, CA) according to the manufacturers’instructions. Purified amplicons were normalized to 120 ng/µl into a new low-retention microtube prior to sequencing. The amplified DNA was sequenced using 2 × 250 paired-end Illumina MiSeq platform by Eurofins Genomics (Germany) on an Illumina MiSeq plateform according to the standard protocols. The raw sequence data have been deposited in the NCBI Sequence Read Archive under the BioProject PRJNA1016134.

Sequence analysis and data processing

The raw sequences (.fastq) for all of the samples from Eurofins were imported into the FROGS pipeline Find Rapidly OTU with Galaxy Solution) implemented on a galaxy instance (v.2.3.0) (http://sigenae-workbench.toulouse.inra.fr/galaxy/). Pair-end reads were merged using the SoftWare FLASH (v1.2.6) with an average merging rate of 98% (Magoč and Salzberg 2011). Barcode and primer sequences were removed with cutadapt (Martin 2011). Sequences were then clustered into operational taxonomic units (OTU) with SWARM (Mahé et al. 2014), which generates the OTU abundance table with an OTU being defined at the 97% sequence similarity level. Chimeras were removed using the VSEARCH tool (Rognes et al. 2016). The taxonomic assignation of each OTU was performed using the BLAST tool against with the database SILVA-132 16S rRNA (Quast et al. 2013). The Shannon index and the Chao1 index were calculated to estimate alphadiversity (Magurran 2004). The Shannon index was used to estimate species richness and evenness, while the Chao1 index used to estimate the number of species in a community, taking into account both the singletons and doubletons species (Chao 1984).

Statistical analysis

Before performing the various analyses to test for significant differences between the treatments, the normality of the data matrices was tested using the Kolmogorov–Smirnov test implemented in R-3.6.1 software (R Core Team 2020).

To compare the variation between sites and seasons, Bartlett’s Test was conducted to assess the homoscedasticity of the dataset. When the variances were found to be equal, the Anova test was selected (Zar 2014). Differences between the physical and chemical parameters, the relative abundance of microbial cells, the enzyme activity, and the diversity indexes were analyzed by one-way ANOVA (Fisher’s post hoc test) with a 95% confidence interval.

Further analyses, principal component analysis (PCA), non-metric multidimensional scaling analysis (NMDS), redundancy analysis (RDA), and correlation tests, were performed using R software (R Core Team 2020). To visualize the spatio-temporal effect on physico-chemical characteristics, a PCA was performed by with the ade4 package (Dray and Dufour 2007). A permutation test was used to determine whether the differences observed between the modality were large enough to reject the null hypothesis. Values were considered different if the probability of a null hypothesis was less than 0.05. NMDS and RDA were performed using the “vegan” package in R (Oksanen et al. 2022) to explore the correlations between bacteria at the phylum level and environment variables. For the correlation test between microbial activity and abundance, depending on the results of the normality test, we selected either the Pearson or Spearman test. If both variables exhibited a normal distribution, Pearson’s test was chosen; otherwise, Spearman’s test was used.

Result

Physico-chemical parameters

The results for the physico-chemical parameters of the surface water of the Seine River during summer and autumn 2020 and 2021 are presented in Table S1. The pH ranged between 7.8 and 8.2, no variability was observed between sampling sites nor sampling periods (Table 2). Besides, N-NO3 exhibited the highest concentration compared to N-NO2 and N-NH4+ for all samples, ranging from 16.1 to 29.3 mg/L. The results of the Two-ways Anova analysis revealed a significant spatial variability in NO2, NO3, and NH4+ concentrations, while no seasonal variability existed between the sampling periods (Table 2). The total organic carbon (TOC) content varied significantly among sampling sites from 1.8 to 5.1 mgC/L (Table 2). The TSS site, situated downstream of the Parisian conurbation, recorded the highest TOC value; while the lowest value was observed at the CSM (Champigny-sur-Marne) site, located on the Marne River, upstream Paris. COD ranged between 5.8 mg/L (TSS in summer 2021) and 15.1 mgO2/L (CSM in summer 2020). BOD ranged from 0.5 mgO2/L (CLR in summer 2021) to 5.1 mgO2/L (TSS in summer 2021). Both BOD and COD maximum were observed during summer 2021 at TSS site. The results of the two-way Anova analysis releaved that there were no significant differences in COD and BOD concentrations along the spatial axis, while a significant seasonal variability was noted (Table 2). P-PO43− concentrations ranged from 0.2 mg/L (JSO in summer 2021) to 0.39 mg/L (TSS in autumn 2021) and varied significantivly between seasons, whereas there was no significant difference between the sampling sites. The highest [PO43−] values were recorded during autum 2021 for all sites. The water temperature (T °C) did not differ between summer and autumn in 2020, however it was significantly different between the seasons in 2021. Indeed, the average water temperature was approximately 20.5 °C during summer and 9.8 °C in autumn (Table S1).

A principal component analysis (PCA) was conducted on the physico-chemical data of all parameters for the 7 sampling sites and the 2 seasons (summer and autumn) in 2020 and 2021 (Fig. 1). The PCA revealed that there was no significant disparity between the Seine River and its tributaries, i.e., Marne and Oise Rivers (Fig. 1A, B, C). When compared to the other sites, TSS exhibited significantly higher concentrations of [NO3], [NO2], [NH4+], and TOC (Fig. 1D), and appeared separated from the other sites. All the samples, regardless both the site and the year, collected during summer displayed higher concentrations of PO43− and COD compared to those collected during autumn. The constrast in seasonal changes (summer vs. autumn) during 2021 was clearly apparent; whereas this distinction was less noticeable during 2020.

Table 2 Effects of sampling sites and seasons on physico-chemical properties and FDA microbial activity of the Seine River water samples tested by a two-way Anova
Fig. 1
figure 1

PCA analysis of physico-chemical parameters of the surface water of the Seine River for the two seasons (summer and autumn) 2020 and 2021. A Correlation circle. Ordination of the samples in the plane defined by axes 1 and 2 of the PCA highlighting, B the similarity of the 3 rivers, C the differences between the samples, D the differences between the sampling sites, E the differences between two seasons, and F the differences between the season and the year. N = 7 sites for all parameters and 2 seasons of 2020 and 2021. TOC total organic carbon, BOD biological oxygen demand, COD chemical oxygen demand, T°C water temperature. JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, CSM Champigny-sur-Marne, BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

The mean flow of the Seine River in 2021 (> 200 m3/s) was higher than that in 2020 (< 200 m3/s). This was the case regardless of the location from April to December (Fig. 2A, B).

Fig. 2
figure 2

Flow of the Seine River (m3/s) upstream and downstream of the Parisian conurbation in 2020 (A) and 2021 (B)

Abundance of intact/damaged cells

Total microbial cells (TTC) and damaged microbial cells (DMC) were quantified and are presented in Fig. 3A for TTC and Fig. 3B for DMC. TTC ranged between 1 × 106 and 6 × 106 cells/mL, DMC represents an average of 6% of TTC. CSH samples had the highest TTC, observed in autumn 2020 at 5 × 106 cells/mL. Conversely, the lowest TTC, 1 × 106 cells/ml, was recorded at JSO in summer 2021. The two-way Anova results showed that no significant differences were observed for TTC and DMC between sampling sites and seasons.

Fig. 3
figure 3

Microbial cell counts of the Seine River in summer and autumn 2020 and 2021 at seven sampling sites. N = 12 for each site and each season. JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, CSM Champigny-sur-Marne, BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

Microbial activity

The values of microbial activity ranged between 12 and 30 µg fluorescein/ml/h (Fig. 4). The FDA activity in the samples collected during summer was significantly higher than that for sample collected during autumn. These findings were confirmed by the two-way Anova analysis (***p < 0.01) (Table 1). However, no significant difference between the sampling sites and no interaction between the sampling sites and seasons were observed as shown Table 1. Besides, Fig. S2 shows that there is a strong correlation between microbial activity and the number of microbial cells.

Fig. 4
figure 4

FDA activity (µg fluorescein/ml/h) of the Seine River for the 7 sampling sites in summer and autumn 2020 and 2021. N = 12 for each site and each season. JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, CSM Champigny-sur-Marne, BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

Bacterial community composition

After eliminating low-quality sequences and chimeras, a total of 6 million high-quality 16S rRNA sequences were obtained. Subsequently, all samples were then randomly subsampled to 25,700 sequences, which were subjected to further statistical analysis. The mean number of Operational Taxonomic Units (OTU) in CSM samples (approximately 5,500 OTUs) was significantly lower than that found in the other sites (approximately 7,800 OTUs), regardless of the season or year. Additionaly, the average number of OTUs in all samples during both summer and autumn of 2020 was significantly higher than that of 2021. The classified OTUs belonged to 40, 43, 34, and 25 phyla in the autumn and summer of 2020 and 2021, respectively. It has been observed that changes in the relative abundance of phyla between years (2020 and 2021) were much more significant than those between seasons (summer and autumn) and sites (Fig. 5). Whatever the site, there was a significant difference in the number of classified OTUs between summer and autumn 2021 (**p < 0.05), between summer 2020 and 2021 (***p < 0.001), and between autumn 2020 and 2021 (***p < 0.001), as determined by a two ways Anova analysis. Moreover, the OTU count in the downstream samples was significantly higher than that in the upstream samples (Fig. 6). Figure 5 illustrates the predominant groups present in each sample, with abundances exceeding 1%, based on their relative abundance. The dominant phyla observed were Proteobacteria, Actinobacteriota, Bacteroidota and Cyanobacteria, which accounted for approximately 90–95% of the total abundance. Other phyla, comprising Chloroflexi, Firmicutes, Verrumomicrobiota, Campilobacterota, Bdellovibrionota, and Patescibacteria, constituted 5–10% of the abundance.

Fig. 5
figure 5

Relative abundance (%) histograms of the dominant phyla (> 1%) for the Seine River from seven sampling sites in autumn and summer 2020 and 2021. JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, CSM Champigny-sur-Marne, BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

Fig. 6
figure 6

Comparisons of OTU number—Richness Index (A), and diversity indexes including Chao1 (B), Shannon index (C) and Shannon Equitability (D) were made between seasons (three figures above) and localizations (three figures below) at five sampling sites along the Seine River during autumn and summer 2020 and 2021. Upstream sites: JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine and downstream sites: BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

Regardless of site, the most dominant phylum Proteobacteria was significantly less abundant in summer and autumn 2020 (75.5% in summer and 75.4% in autumn) when compared to 2021 (37.8% in summer and 40.5% in autumn) (Fig. 7A). The second most abundant phylum, Actinobacteriota, accounted for 10–30% of the total abundance. The occurence of this phylum was significantly higher in 2020 than in 2021 for both seasons, in contrast to the results obtained for Proteobacteia (***p < 0.001). There were no significant differences observed for Bacteroidota, between seasons and years, and they accounted for approximately 18% of the total abundance. The Cyanobacteria phylum was detected at a relative abundance of around 10% in the samples taken throughout 2020. This level of abundance has been consistently decreased over the seasons, while its presence in samples taken during 2021 was insignificant, at less than 1%. A significant difference in the abundance of Cyanobacteria between seasons was observed (*p < 0.05). The other phyla, including Chloroflexi, Firmicutes, Verrumomicrobiota and Campilobacterota were detected only in the samples collected in 2020, with a minimal relative abundance of less than 1%. For the comparison between sites, we noted a higher relative abundance of the phyla Bacteroidota and Patescibacteria in the downstream sites (i.e., BOU and TSS) compared to the upstream samples (i.e., JSO, CLR, and VSS). Conversely, the relative abundance of Firmicutes and Cyanobacteria was lower in the dowstream sites than upstream (Fig. 8).

Fig. 7
figure 7

Comparisons of the relative abundance of the bacterial community at both phylum (A) and class (B) levels during summer and autumn 2020 and 2021. *, significant difference between season/year; ns, not significant difference

Fig. 8
figure 8

Comparisons of the relative abundance of the microbial community at phylum (A) and class (B) levels between five sampling locations along the Seine River during autumn and summer 2020 and 2021. Upstream sites: JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, and downstream sites: BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

To better undrstand the microbial composition in the Seine River water at different locations and seasons, we examined the relative abundance and classification of OTUs from the four most prevalent phyla including Proteobacteria, Bacteroidia, Actinobacteriota and Cyanobacteria at the class level (Fig. 7B). The analysis carried out to detect spatial discrepancies (between seasons and years) revealed the existence of two classes, Alphaproteobacteria and Gammaproteobacteria, both of which belong to the Proteobacteria phylum, in the Seine River. The abundance of Gammaproteobacteria was higher than that of Alphaproteobacteria in all samples. Meanwhile, there were no significant seasonal differences in the relative abundance of Alphaproteobacteria. However, in 2021 samples, the relative abundance of Gammaproteobacteria was significantly higher than that for 2020 samples, whatever the season. Furthermore, the relative abundance of other classes such as Cyanobacteriia (members of Cyanobacteria) and Acidimicrobiia (members of Actinobacteria) was significantly greater in the 2020 samples than in the 2021 samples. When comparing the relative abundance between the studied sites at phylum level, it is worth mentioning that there was no difference in the relative abundance of the Proteobacteria phylum between downstream and upstream samples. However, when comparing the classes α- and γ-Proteobacteria (two important members of Proteobacteria), the downstream samples showed a higher abundance of alpha Proteobacteria than the upstream samples, and conversely, a lower abundance of gamma (Fig. 7B). Hence, no significant difference was detected at the phylum level. Within the Actinobacteriota phylum, the abundance of the class Actinobacteria was higher in downstream samples than upstream; and conversely Acidimicrobiia exhibited a lower abundance in upstream samples. Moreover, the downstream samples showed a higher abundance of classes such as Bacteroidia, Clostridia, and Saccharimonadia compared to the upstream samples (Fig. 8).

In addition, we analyzed the OTUs classification and relative abundance at the genus level (Fig. 9). The dominant genus abundance varied both for year and season, with Microtrichales_CL500-29 marine group, Sporochthyaceae_hgcI clade, unclassified Actinobacteriota, Cyanobium PCC-6307, and unclassified-Cyanobacteriia being dominant in samples from 2020 in both seasons. Conversely, Methylophilaceae_OM43clade, Methylotenera, and Pseudomonas were dominant in 2021 samples. During 2021 summer of, Sphingorhabidus and Massilia were the two most common genera. Flaviobacterium was present throughout all seasons and particularly plentiful at Bougival as illustrated Fig. 9. It is worth noting that Methylophilus accounted for more than 30% of the Bougival samples during the summer of 2021. Additionally, the aquatic_group Burkholderiales_MWH-UniP1 was detected in all samples and was more widespread in the 2021 samples than in the 2020 ones.

Fig. 9
figure 9

Relative abundance of the dominant genera in the Seine River during summer and autumn 2020 and 2021 at different sampling sites from upstream to downstream of the Parisian conurbation. JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, CSM Champigny-sur-Marne, BOUG Bougival, CSH Conflans-Sainte-Honorine, TSS Triel-sur-Seine

General bacterial community characteristics were assessed at phylum level in all samples collected at different sites and seasons in 2020 and 2021, using non-metric multidimensional scaling analysis (NMDS). Figure 10A shows that axis 1 separated four sampling sites along the Seine River (excluding TSS, which is the furthest site downstream from Paris) from CSM, situated on the Marne River and CSH located on the Oise River. Thus, each river has its own distinct microbial community. The NMDS analysis outcomes demonstrated a clear differentiation between bacterial communities in summer and autumn (Fig. 10B). This was further confirmed by the analysis of data from individual sites (Fig. 10C). However, when the year variable (2020 and 2021) was added to the data (Fig. 10D), samples from both years were segregated along axis 1, irrespective of the seasons. This suggests that the effect from one year to the next, as demonstrated in our findings between 2020 and 2021, carries more weight than the impact of seasonal variations.

Fig. 10
figure 10

Nonmetric multidimensional scaling plot (MDS) analysis of bacterial community at the phylum level in water for all sampling by comparison between sites (A), between seasons according to the sites (B), between seasons according to the year (C) and between seasons including summer and autumn (D). JSO Juvisy-sur-Orge, CLR Choisy-le-Roi, VSS Vitry-sur-Seine, BOUG Bougival, TSS Triel-sur-Seine (located on the Seine River), CSM Champigny-sur-Marne (located on the Marne River), CSH Conflans-Sainte-Honorine (located on the Oise River)

Diversity indices

Diversity indices, including Chao1, Shannon, and Shannon Equitability were calculated using the OTU count and their presence or absence. A higher richness index was found in the 2020 samples compared to those from 2021 (Fig. 6A). The 2021 samples for both seasons exhibited a significantly higher Chao1 than those of 2020 (**p < 0.05), as depicted in Fig. 6B. The Shannon index of the 2021 samples has decreased compared to that for 2020 samples (*p < 0.01) both for both summer (*p < 0.01) and autumn (**p < 0.05) (Fig. 6C). The Shannon equitability (Fig. 6D) showed a higher value for the 2020 samples compared to that for the 2021 samples, regardless of the seasons.

From the comparison between the upstream (i.e., JSO, CLR, VSS) and downstream (i.e., BOU and TSS) sites along the Seine River, it clearly appeared that the richness index values of the downstream samples were higher than those of the upstream samples. In addition, the Shannon index and Shannon equitability values in the upstream samples were higher than in the downstream samples.

Multivariate correlation analysis between water bacterial communities at the phylum level and environmental variables

RDA (Redundancy analysis) was performed to assess the correlation between environmental factors and the family-level composition of bacterial communities in water samples from seven sampling sites during summer and autumn 2020 and 2021. Unsurprisingly, different bacterial phyla responded differently to nutrients, temperature, and pH at different sampling sites, rivers, and seasons according to the year (Fig S3). RDA analysis has revealed that the relative abundance of four phyla, namely Proteobacteria, Cyanobacteria, Bacteroidota and Actinobacteriota, varies across the sampling sites, based on different parameters, including N-NO3, temperature, BOD and P-PO43−, respectively. Proteobacteria and Bacteroidota were found to be dominant phyla in the Seine and Oise samples, due to various nutrient parameters such as N-NO3, N-NH4+, N-NO2, TOC, DOC and temperature. At last, Actinobacteria and Cyanobacteria were the predominant phyla identified in the 2020 samples, despite seasonal variations. However, Proteobacteria was the predominant phylum in 2021. No particular environmental factor was fully identified to provide a satisfactory explanation for this variability.

Discussion

Microorganisms are highly sensitive to environmental conditions such as pH, oxygen, nutrients, contaminants, and temperature (Savio et al. 2015; Wang et al. 2023, 2018, 2012). These factors act as selective pressures, shaping the microbial communities within ecosystems. We first present the results of physico-chemical parameters of the water in the Seine River. In previous works (Bagagnan et al. 2024), we noted the stability of the surface water physico-chemical properties over the last 10 years. Comparing with global river data reveals similar pH levels in the Seine River (approximately 8.0) and the Qin and Yongding Rivers in China (7.8 and 8.3, respectively), despite high human impacts (Li et al. 2022; Zhao et al. 2022). Nutrient parameters (N-NO3 at 22.8 mg/L, N-NH4+ at 0.12 mg/L and P-PO43− at 0.13 mg/L) fall within ranges observed in other rivers like the Yellow Delta River, and Maozhou River (Lv et al. 2016; Ouyang et al. 2020). The shift from one season to another and from one year to the next results in remarkable alterations of the physico-chemical properties of the water. However, no changes between the sampling sites was observed when each site was analysed separately. Seasonal variation in water temperature in the Seine River and its tributaries is a natural occurrence typical of temperate climates, closely tied to atmospheric temperatures. Meanwhile, nitrogen-based nutrients and other parameters remain relatively stable between seasons, with disparities observed between upstream and downstream samples. Downstream sites, particularly TSS, exhibit a high nutrient concentrations (NO2, NO3, NH4+, and TOC), due to upstream accumulation, similar to observations in the Maozhou River (Ouyang et al. 2020). In addition, nitrifying biofilters are increasingly adopted in dense urban areas, such as the Paris conurbation, due to their increased nitrogen treatment capacity with limited footprint, but the trade-off is a higher nitrite input to watercourses (Raimonet et al. 2017; Rocher et al. 2015). Seasonal orthophosphate fluctuations attributed partially to reduced river flows during summer (Liu et al. 2013) underscore the important of understanding nutrient dynamics in aquatic ecosystems. Reduction in river flows during the summer is a common phenomenon in many regions, with significant implications for nutrient dynamics in rivers. When flow rates are low, the residence time of water in the river increases. As a result, nutrients like orthophosphates may become more concentrated in the water. Changes in environmental factors can significantly impact the abundance and composition/diversity of microorgnisms.

According to the results obtained from high-throughput sequencing technology, the total number of phyla varied from 25 to 43, a range commonly reported in similar studies analyzing bacterial communities in rivers worldwide. Comparable numbers of phyla have previously been identified on the Danube in Europe (Savio et al. 2015), the Apies in South Africa (Abia et al. 2018), the Mississipisi in the USA (Korajkic et al. 2015; Staley et al. 2013), the Kalamas in Greece (Meziti et al. 2016), the Yangtze in Japan (Liu et al. 2018), and the Maozhou in China (Ouyang et al. 2020). The dominant phyla identified included Proteobacteria, Actinobacteriota, Bacteroidota, and Cyanobacteria, consistent with previous findings (George et al. 2012). These phyla were predominant at three sites of the previously cited French PIREN-Seine study, at Marnay-sur-Seine upstream Paris, and Bougival and Triel-sur-Seine downstream Paris. Their presence is persistent with observations made in other urban rivers (Mai et al. 2018; Ouyang et al. 2020; Savio et al. 2015; Wang et al. 2023), indicating commonalities in microbial communities under urban pressure. Each phylum likely contributes uniquely to the river ecosystem’s overall ecological functioning. Proteobacteria are metabolically versatile bacteria involved in nitrogen cycling, organic matter decomposition, and pollutant degradation (Goetghebuer 2019; Sun et al. 2017). Actinobacteria contribute to nutrient recycling by decomposing complex organic compounds, including plant material and recalcitrant organic matter (Mikhailov et al. 2019). Bacteroidota degrade complex carbohydrates (i.e., glycanes) and play a role in organic matter breakdown, influencing nutrient cycling (Kruczyńska et al. 2023). Cyanobacteria are photosynthetic bacteria that act as primary producers, contributing to oxygen production, but can also form toxic algal blooms under specific conditions, negatively impacting ecosystem (Soto Ramos et al. 2023). Regarding microbial abundance, the Seine River water contained approximately 106 cells/ml, consistent with findings from the PIREN-Seine program reported by (George et al. 2012). This abundance level is similar to that reported for the Wensum River in England (Albaggar 2014), ranging between 0.2 and 5.3 × 106 cells/ml, depending on the sampling sites and higher to that measured on the Lancang River in China (Luo et al. 2019), lying between 0.3 and 0.5 × 106 cells/ml, depending on the seasons.

Spatial and temporal changes of bacterial communities in water

Comparing bacterial communities across seasons revealed significant differences overall. However, there was substantial variation inter-year, overshadowing seasonal intra-year variations. Regardless of the seasons, we noted fascinating results as we compared the bacterial compositions between those years. The finding that bacterial communities in the water of the Seine River were more diverse and evenly distributed in 2020 compared to 2021 is very interesting. The hypothesis, that the COVID-19 lockdown in 2020 may have played a role in this disparity, warrants further investigation. During the lockdown period, numerous human activities were suspended, resulting in a decrease in wastewater production and fewer contaminants being discharged into the Seine River. Another explanation could be linked to the lower water flow in the Seine River in 2020 compared to 2021. Unlike the even distribution between the phyla in 2020, in 2021, Proteobacteria constituted more than 60% of the relative abundance. This dominance was primarily driven by genera such as Methylotenera, Methylophilaceae-OM43clade, Pseudomonas, and Massilia, which are recognized as opportunistic pathogens, with their presence potentially linked to insufficient removal at the WWTP level, as suggested by multiple authors (Pandey et al. 2014; Shao et al. 2019). Conversely, in 2020, there was a significantly higher relative abundance of the genera Cyanobacteriia_unclassifier and Cyanobium from the phylum Cyanobacteria. The presence of Cyanobacteria in 2020 can be explained by the low river flow conditions as Cyanobacteria are often associated with slower-moving or stagnant water bodies where they can form blooms (Shang et al. 2023). High water flow prevents stratification and and favors the mixing of suspended matter, which can hinder the acquisition of nutrients by Cyanobacteria and their optimal positioning in the water column for light absorption. These conditions led to a change in microbial composition, marked by a high abundance of Actinobacteria. Interestingly, (Tu et al. 2022) demonstrated that a Cyanobacteria bloom accelerates the conversion of organic phosphorus to soluble inorganic phosphorus from sediment to water, facilitating the development of Actinobacteria. This possibility can be supported by RDA analysis, which highlights the orthophosphate content as a significant environmental factor influencing the developement of Actinobacteria. However, despite year-to-year bacterial composition changes, microbial activity and abundance remained relatively stable. However, microbial activity and abundance were higher in summer than in autumn for both years, with high correlation between microbial abundance and activity, likely due to factors like temperature, light, and nutrient availability favouring microbial growth and metabolic activity (Crump and Hobbie 2005; Sun et al. 2017; Wang et al. 2016).

In our study, the spatial variation has been taken into account to differentiate between sampling sites (seven sites), between before (JSO with respect to Seine-Valenton WWTP) and after (CLR, VSS with respect to Seine-Valenton, and BOU with regard to Seine-Aval) WWTP discharge, between upstream (JSO, CLR and VSS) and downstream (BOU and TSS) sites, between rivers (Seine, Marne and Oise). Comparing sites before and after WWTP discharge, we observed significantly higher microbial abundance and activity downstream than upstream of the WWTP. This suggests that the WWTP effluent dischage can contribute to an increase in microbial activity and abundance in the river. Additionally, microbial abundance was higher upstream of Paris compared to downstream, although no changes in activity were noted. This constrasts with our observations of the physico-chemical parameters, where we noticed a nutrient accumumation from upstream to downstream, consistent with findings from previous studies (Gücker et al. 2006; Marti et al. 2004; Spänhoff et al. 2007; Waiser et al. 2011). Several factors may contribute to these variations. The low-charge activated sludges used in the Seine Valenton WWTP, an upstream site, may be linked to the higher microorganisms abdundance upstream of Paris. Changes in microbial activity can lag behind changes in abundance, as microorganisms need time to adapt to new conditions or to recover from environmental disturbances (Sveen et al. 2024). Thus, no overall activity difference may be observed despite higher upstream abundance. Variability between sampling sites is less pronounced, indicating the dominant factors shaping the microbial communities in the Seine River are primarily related to seasonal and annual changes rather than spatial differences. According to available data, the effluents were well regulated for all physico-chemical parameters before being discharged into the Seine River. Thus, the ecosystem of the Seine River, in the Paris conurbation, may have relatively uniform environmental conditions across the various sampling sites, leading to reduced variability in microbial communities among sites. Regarding downstream and upstream samples, downstream samples displayed lower bacterial diversity compared to upstream samples, as indicated by Shannon and Chao1 indices, suggesting potential effects of accumulation phenomena. Downstream water showed a higher abundance of Bacteroidota and Patescibacteria phyla compared to upstream. Bacteroidota, commontly found in human intestines (Dowd et al. 2008; Shanks et al. 2011), may indicate the influence of excessive faecal matter and contaminant accumulation from high population density areas in the Paris conurbation. Patescibacteria groups are known for encoding active enzymes that degrade complex organic matter into simple molecules, a process observed in previous studies (Dong et al. 2021; Gu et al. 2023). Specifically, the dominance of Patescibacteria was associated with the presence of complex organic compounds like PolyPh and PolyAr. While the latter are significantly or completely removed during the water treatment process, other compounds promoting the development of this phylum may be introduced by the effluents from the Seine Centre, Seine Aval and Seine Grésillons WWTPs. At the phylum level, Proteobacteria showed similar relative abundance between upstream and downstream samples, yet significant variations were noted at the class level. Notably, the Gammaproteobacteria class dominated all surface water samples of the Seine River, consistent with findings by (Ge et al. 2021). Their study revealed that Gammaproteobacteria is predominant in the urbanised area, followed by Alphaproteobacteria and Betaproteobacteria. This is because many members of the Gammaproteobacteria originate from anthropogenic or zoonotic sources and are widely considered transient passengers (Newton et al. 2011; Sun et al. 2017) that may have been transported by the WWTP effluent. It is interesting also to note that no class was detected for Betaproteobacteria, despite its significance and abundance in other studies, particularly in surface water in the Dongjiang River in China (Liu et al. 2012; Sun et al. 2017). This absence can be explained by varying selection pressures. Betaproteobacteria are fast-growing, nutrient-dependent, making them vulnerable to predation, whereas, Alpha- and Gamma-proteobacteria are better adapted to low nutrient environments and can degrade complex organic compounds such as humic substances (Sun et al. 2017). Given the nutrient-rich nature of the Seine River, which mainly contains recalcitrant organic compounds from WWTP effluents, and their ability to degrade them, Alpha- and Gamma-proteobacteria are likely to thrive. The downstream sites exhibited greater abundance of Gammaproteobacteria, while the upstream sites showed higher abundance of Alphaproteobacteria. This finding is in good agreement with similar results presented by (Marti and Balcázar 2014), who found higher Gammaproteobacteria abundance downstream in the Ter River; constrasting with (Yuan et al. 2023), who observed the opposite trend in the North Canal River, China. These authors explained those variations between upstream and downstream sites for this class as being related to WWTP, due to the presence of residual contaminants in the effluent. To better understand these variations, the measurement of contaminant levels in term of emerging organic contaminants in water and the correlation between microbial diversity and contaminant need to be further investigated.

Conclusion

Changes in the bacterial communities were evaluated using high-throughput sequencing, and the relationships between microbial parameters and physico-chemincal properties were investigated. The dominant phyla identified in the Seine River were Proteobacteria, Actinobacteriota, Bacteroidota, and Cyanobacteria. The presence of the Alphaproteobacteria and Gammaproteobacteria suggests that the Seine River water contains nutrients in the form of recalcitrant organic compounds. The results show a strong temporal variability and a lower spatial variability in the bacterial community structure of surface water in the Seine River. It is interesting to note that the bacterial composition of the Seine River in 2020 was clearly different from that in 2021, probably due to the lower water flow rate, which allowed Cyanobacteria to develop, leading to changes in the composition of the bacterial communities. Higher microbial activity and abundance were found in summer compared to autumn, likely due to temperature and orthophosphate content. Moreover, spatial variation in bacterial composition was observed between upstream and downstream sites, and between before and after of the Seine Valenton-WWTP, suggesting a contribution due to either the wastewater treatment processess or an accumulation phenomena. For example, Gammaproteobacteria were higher in the upstream sites than downstream, potentially linked with the presence of residual contaminants in the effluent. Additional measures of emerging contaminants and other pollutants are needed to better understand these variability. Our results provide a basic understanding of the microbial community in the Seine River, serving as a reference for assessing the impact of the recent introduction of wastewater disinfection by performic acid.