Introduction

While the microplastic (MP, plastic items  <5 mm) contamination of the environment is constantly growing, MP has meanwhile been detected in all ecosystems reaching from aquatic [1,2,3] to aerial [4,5,6] to terrestrial systems [7,8,9] and has also been found in remote areas far from population centres such as Arctic ice [10, 11] and remote mountain lakes [12, 13] but also in biota [14, 15]. For an accurate risk assessment reliable analytical methods are thus urgently required to appropriately identify MP isolated from environmental matrices qualitatively and quantitatively [16]. The field of MP research arose from MP detected in coastal waters in the 1970s, initially focusing on rather large MPs (sampled with plankton tow nets (mesh size: 333 µm)) [17, 18]. Due to their size, manual handling of prospective large MP items for analysis is possible and is common practice. Commonly applied methods for larger MP particles are visual identification [19], Nile red staining [20, 21], the hot needle method [22] or through attenuated total reflectance (ATR)-Fourier transform-infrared (FTIR) spectroscopy [23,24,25] which allows for chemical identification of the respective polymer. However, most of these methods are prone to human bias, leading to an over- or underestimation of the MP abundance. While analysis with ATR-FTIR also requires a manual preselection of prospective MP and is thus also susceptible to bias, the chemical identification of polymers is reliable which makes this method favourable for the analysis of larger MP.

Recent studies have shown that the abundance of small MP (S-MP,  <500 µm) in the environment is much higher and thus should not be neglected when evaluating the environmental MP contamination [26, 27], especially when considering the increased potential toxicity of small MP [28]. Due to their miniscule size and high abundance, manual analysis of S-MP is not feasible. Thus, rapid, automated and highly accurate analytical methods are required that allow for a high-throughput rate of samples. Well-established analytical methods include pyrolysis–gas chromatography (py-GC) coupled with mass spectrometry (MS) [29,30,31] or thermal extraction desorption (TED) GC-MS [32, 33] which, however, only give information on polymer composition and polymer masses in a sample. In contrast, spectroscopic methods such as focal plane array (FPA) based micro-Fourier-transform infrared (µFTIR) imaging [24, 25, 34] and µRaman mapping [25, 35, 36] have proven to be highly efficient methods for MP analysis of sample filters, yielding information not only on polymer type but also on item count, shape and size distributions.

This study focuses on FPA-µFTIR analysis of S-MP derived from environmental samples as this technique facilitates chemical imaging by simultaneously recording thousands of spectra within one measurement in a reasonable timeframe, which makes it a powerful tool in MP analysis of whole sample filters [34]. Previously, the IR-spectra of each item on the filter containing the purified sample were compared manually to reference spectra, which is prone to human bias. Due to the prevalence of non-plastic residues post purification and the potentially high abundance of S-MP in environmental samples, this, however, is extremely time-consuming and therefore not practical, especially for monitoring studies where a high number of samples need to be analysed. To facilitate a rapid analysis without manual screening, a broad range of algorithms have been developed to automate the process of spectroscopic MP data analysis. As explained by Hufnagl et al. [37], these can be categorised into model-based [38,39,40,41,42,43,44] and instance-based [2, 45,46,47,48,49,50,51,52] machine learning approaches. Model-based approaches are based on a statistical model from spectroscopic reference data which is then applied to unknown spectra. These are then assigned to predefined classes which may include anything from polymer types to matrix components such as sediment, plant or animal debris. Instance-based approaches, in contrast, directly apply the reference data (i.e. the “instance”) to identify unknown spectra based on similarity measures. Here, hit quality indices (HQIs) are computed, e.g. by measures such as the Pearson correlation coefficient [37, 48]. The advantage of this approach is that the spectroscopic reference data can easily be enhanced or adapted, e.g. by adding relevant spectra to the existing library [53]. For model-based learning, however, a high degree of expert knowledge is required which make application-specific changes more difficult. On the other hand, it has the advantage of much shorter analysis time which allows for a higher analytical throughput [37].

In the field of MP research, it is well recognised that the comparison of results from different studies is often hampered by the lack of standardisation concerning sampling methods, sample processing and analytical methods. Discrepancies can of course also arise from different automated data evaluation algorithms at the end of the analytical pipeline. Thus, as a first step towards harmonisation in this regard, the aim of our study is a comprehensive comparison of the output of two frequently used and well-established FPA-µFTIR data analysis algorithms: (1) siMPle (systematic identification of MicroPlastics in the environment) (which is freely accessible on www.simple-plastics.eu) [48] in combination with the image analysis tool MPAPP (MicroPlastic Automated Particle/fibre analysis Pipeline), with its script having been published by Primpke et al. [54] (available for download as executable here: https://drive.google.com/drive/folders/1fWIGp7MgJZJcy7NWI5Vri0eUYJw0Qvrz?usp=share_link), and (2) the Bayreuth Particle Finder (BPF), which is based on the methodology presented in Hufnagl et al. [42] and is an integrated module in the Epina ImageLab Engine (www.imagelab.at). BPF is the preliminary version of the Purency Microplastic Finder developed by the Purency GmbH, which is commercially available (https://www.purency.ai/microplastics-finder). The main difference between both approaches is that siMPle is an instance-based machine learning approach and relies on a dual database search using two different similarity measures. HQIs are computed through Pearson correlation [48]. BPF on the other hand is a model-based machine learning approach. Here, a combination of spectral descriptors and random decision forest (RDF) classifiers is applied [42]. Both pipelines allow for the analysis of whole filters, avoiding the bias which would occur during extrapolation of results obtained from randomly preselected subareas of a filter [55]. Furthermore, they have been applied frequently in a multitude of different studies, analysing MP from various environmental matrices. BPF has, for instance, been applied in different studies focusing on MP contamination in freshwater environments, such as Frei et al. [56] and Schrank et al. [57]. It has further been applied in studies focusing on the analysis of airborne MP [6], MP in soil [58] but also MP in animal tissue [59]. Over the course of the years, the approach based on RDF classifiers after Hufnagl et al. [42] has undergone different development stages to improve on certain aspects of the classification, resulting in the latest version which was released in 2021 [37] and applied in Dong et al. [60]. For the present study, the version released in 2019 was chosen, as it had been applied in the above-mentioned previous studies. The analysis software siMPle also finds a broad range of applications, e.g. in the analysis of drinking water [61], wastewater [62] or marine waters [63]. In combination with MPAPP or its precedent version APA (automated particle analysis, after Primpke et al. [2]), siMPle has been applied in recent studies on river surface water [64, 65], wastewater effluents [53, 66] and deep sea sediments [67].

Within this study, we have analysed two sample sets containing ten measurement files each, which were both analysed by the algorithms siMPle/MPAPP and BPF. The resulting data output was compared with respect to (a) MP abundance, (b) polymer composition and (c) size distribution. Hereby, the present work can enhance the understanding of similarities and differences of MP analysis pipelines, with the ultimate goal of forming a basis for a better comparability of MP datasets from different studies. It is to be noted that the two sample sets used for the comparison of the two aforementioned analysis algorithms are based on environmental samples. Thus, the actual amount of MP in the samples is unknown, but the comparison of the output data is nevertheless valid. We chose to work with environmental samples in order to demonstrate the complexity of the analysis of these samples, representing all environmentally relevant polymer types, shapes and sizes as well as environmentally aged MP, potentially containing matrix residue and remains of biofilm. As mentioned above, S-MP is much more abundant in the environment and poses a higher ecotoxicological risk. Thus, the importance of evaluating methods suitable for the analysis of S-MP is enhanced and our study aims at shedding light on this matter.

Materials and methods

In order to compare the output of the two analysis pipelines, two MP sample sets (size range: 11–500 µm) — sample set A containing ten riverine samples and sample set B containing ten estuarine samples — were analysed with the BPF [42] and in parallel with the MP analysis tools siMPle and MPAPP [48, 54]. The measurement files re-analysed in the present work had been generated in the framework of the joint project PLAWES (Microplastic Contamination in the Weser-Wadden Sea-National Park Model System: an Ecosystem-Wide Approach), which aimed at a comprehensive assessment of MP in the river system Weser-Wadden Sea. Herein, sample set A contains water samples from the Upper and Middle Weser and is subject of a study by Moses et al. (unpublished data). Sample set B contains water samples collected in the Lower/Outer Weser and Jade Bay and was initially analysed by Roscher et al. [64] (Tab. S1).

After enzymatic-oxidative sample purification based on the protocol presented in Löder et al. [68], all samples had been filtered onto Anodisc™ filters (aluminium oxide, pore size: 0.2 µm, diameter: 25 mm, Whatman, UK). Measurements using µFTIR imaging (Bruker Hyperion 3000, Bruker Optik GmbH, Germany) were performed with a 64 × 64 FPA detector in transmission mode with a 3.5 × IR lens in the wavenumber range 3600 − 1250 cm−1 (spectral resolution: 8 cm−1) using 32 co-added scans (background on pure filter: 32 and 64 scans, respectively), resulting in a pixel resolution of 11 µm. Data were saved as OPUS-measurement files (operating software OPUS 7.5, Bruker Optik GmbH, Germany). Details on the subsequent data analysis through siMPle/MPAPP and BPF are provided in the following sections.

siMPle/MPAPP

OPUS-measurement files of sample set A and B were first processed with the software OPUS 7.5 (Bruker Optik GmbH, Germany), in order to transfer spectral data into the file format.dx in preparation to the following analysis steps [48]. In siMPle (version 1.1.β), the.dx files were converted into the .spe format, allowing for the subsequent automated comparison to our in-house polymer database [47]. Within this process, each spectrum is compared twice with the database (siMPle_database_Version 1.0.1), first using the untreated spectra and a second time using the 1st derivatives for spectral correlation calculation. Only if both processes determine the spectrum of the same polymer type, it is labelled as correctly identified for later image analysis, following the approach from Primpke et al. [2]. The data processing in siMPle was followed by the final image analysis via MPAPP. Here, the determined image containing the x,y coordinates on the filter, the combined hit quality and assigned polymer type are first analysed against polymer-specific quality control threshold values [69] for each polymer type. This is followed by a majority voting filter analysis and a series of image analysis tools to separate fibre like items from particles (see Primpke et al. [54] for the exact details of the procedure). Sample set B underwent an additional processing step, where the pixels assigned to the polypropylene (PP) support ring of the Anodisc filters containing the sample were removed in OriginPro.2017G (OriginLab Corporation, USA) (size of measurement field: 20 × 20–22 × 22 FPA tiles). This step was not necessary for sample set A, as a smaller measurement field had been applied (17 × 17 FPA tiles), which did not cover any border area of the filter. The final analysis using MPAPP provided information on numbers, sizes and polymer composition of particle and fibre-like MP items [54].

Bayreuth Particle Finder (BPF)

Both datasets analysed with BPF were also first processed with the software OPUS 7.5 (Bruker Optik GmbH, Germany), in order to transfer spectral data into the file format.hdr while the optical image was extracted as .jpeg file. The .hdr file containing the spectral data and the infrared image of the analysed filter were then imported to the program ImageLab (Epina GmbH) [42]. The optical image was then aligned with the infrared image and carefully adjusted manually. After calibration with measurement parameters from the OPUS measurement file, the BPF algorithm was applied to the measurement data. The database used to train the machine learning model contained 22 polymer types (for details see Table 1). In the classification step, each pixel is classified by the machine learning model resulting in the assignment to either a polymer type or a class describing the matrix (“non-plastic”) or the filter surface (“background”). MP items are detected as neighbouring pixels of the same polymer class. The results are presented in a list containing their properties such as the size (longest and shortest dimension), polymer type, position on the filter etc. All material identified and categorised as “non-plastic” and “background” were deselected. The spectra of the remaining items were checked twice by experts. This conservative step is optional but was included in our routine for QA/QC and more reliable results. In Hufnagl et al. [37], cases are discussed, where expert intervention can significantly improve questionable results, e.g. coming from total absorption and overlapping MP items. For this, the spectra of items identified as plastic were compared using a built in reference database. Furthermore, the size of the identified polymer was verified with the optical and infrared image and if necessary adapted using the editing tools. This allowed to correctly assign items that are partially covered with organic material post-purification that may mistakenly be registered as smaller items. Furthermore, due to the round cross section of synthetic fibres and resulting distorted IR spectra, oftentimes only individual pixels of fibres are identified as plastic. Therefore, the entire length and width of the fibres were audited to allow an exact size measurement. The generated output file delivers information on polymer type, shape, length and width of the identified items.

Table 1 Harmonised polymer types compared in this study, as well as excluded ones, which were only present in one of the analysis pipelines. Abbreviations: A: acrylates; CA: cellulose acetate; CMC: chemically modified cellulose; EVA and EVAc: ethylene vinyl acetate; PA: polyamide; PC: polycarbonate; PP: polypropylene; POM: polyoxymethylene; PEEK: polyether ether ketone; PVC: polyvinylchloride; PUR/PU: polyurethane; PMMA: polymethyl methacrylate; PS: polystyrene; ABS: acrylonitrile butadiene styrene; PEST: polyester; PET: polyethylene terephthalate; PBT: polybutylene terephthalate; PSU: polysulfone; PLA: polylactic acid; PLA-PBAT: polylactic acid/poly(butylene adipate-co-terephthalate) blend; PE: polyethylene, V: varnish

Data evaluation

In order to compare the datasets generated by the siMPle/MPAPP and BPF analysis, MP polymer data were harmonised. For this, comparable polymer types (e.g. containing same functional groups, such as terephthalates or styrenes) were identified, and grouped into clusters whenever necessary (Table 1). For instance, the terephthalates, polybutylene terephthalate (PBT) and polyethylene terephthalate (PET), are identified separately in the BPF. The siMPle output data, however, shows results on the cluster polyester (PEST), which includes both aforementioned terephthalates [47]. This resulted in the harmonised polymer cluster PEST, which for the BPF output data includes PET and PBT. In contrast, some polymer types were only present in one of the analysis pipelines (e.g. polyacrylonitrile (PAN), polyimide or polychloroprene; (Table 1)) and were therefore excluded from the comparison.

Results

Exclusion of further polymer clusters from analysis

When comparing the data output of BPF and siMPle/MPAPP, we noticed that the polymer type ethylene vinyl acetate (EVA) was relatively prominent in the siMPle/MPAPP data (n = 55 in both sample set A and B, respectively), whereas no assignments to EVA were reported after BPF analysis (see Tables S2 and S4). The same was true for cellulose acetate (CA), with no detections after BPF analysis, but 14 assignments (sample set B) after siMPle/MPAPP analysis. With respect to EVA, 10 exemplary spectra assigned to EVA by siMPle were re-inspected together and rejected in agreement of both operators, due to poor hit quality. Furthermore, strong discrepancies in the data output were also recorded for the acrylates/polyurethanes/varnish (A/PUR/V) cluster, with much higher counts after siMPle/MPAPP analysis (n = 1086 and n = 1016 in sample set A and B, respectively) compared to after BPF analysis (n = 4 and n = 108 in sample set A and B, respectively) (Fig. 1). Thus, the clusters EVA, CA, and A/PUR/V were excluded from further analysis. Reasons for the differences will be discussed in detail in the “Effects of differences in the general methodological approach” section.

Fig. 1
figure 1

Data output after BPF (a, b) and siMPle/MPAPP (c, d) analyses of sample set A and B before exclusion of EVA, CA and A/PUR/V (cf. the “Exclusion of further polymer clusters from analysis” section). For abbreviations refer to Table 1

MP abundance, polymer composition and size distribution

Comparison of the analysis output of siMPle/MPAPP and BPF after exclusion of previously mentioned polymer types/clusters (EVA, CA and A/PUR/V) revealed similar results for sample set A (Fig. 2a, c), with a difference in polymer count detected by both pipelines being on average ∆n ~ 6 (mean ± standard deviation (SD) = 6 ± 8), with ∆n ranging from 0 for A-02 and A-03 to 27 for A-08. Samples with a difference of ∆n > 6 were A-04 (∆n = 10), A-05 (∆n = 7), A-06 (∆n = 8) and A-08 (∆n = 27). Within both pipelines, PP and PE were identified as dominant polymer types and maximum item counts were recorded for sample A-08 (Fig. 2a, c). Furthermore, in all remaining samples (except for sample A-04), a similar trend was observed.

Fig. 2
figure 2

Data output after BPF (a, b) and siMPle/MPAPP (b, c) analyses of sample set A and B, under exclusion of EVA, CA and A/PUR/V (cf. the “Exclusion of further polymer clusters from analysis” section). For abbreviations, refer to Table 1

Generally, in the riverine sample set A, the siMPle/MPAPP pipeline detected more PA (present in five samples, 1–4 items per sample) and PEST (present in six samples, 1–5 items per sample) than the BPF pipeline (PA present in one sample containing 3 items; PEST present in one sample containing 9 items). Slight differences were also recorded with regard to PE: In samples A-04, A-05 and A-06, siMPle/MPAPP detected more PE than BPF (Fig. 2, ∆n = 5, 4 and 4, respectively). On the other hand side, BPF detected more PVC (four samples, 1–2 items per sample) than siMPle (one sample containing 1 item). Details can be found in Table S2 and Table S3. The differences in PE in the aforementioned samples were especially recorded for items  <100 µm (Fig. 3), and herein the majority of MP detected by siMPle/MPAPP had a size  <25 µm (85%). A similar observation was made for items identified as PA (70% < 25 µm, n = 7) and PEST (100% < 25 µm, n = 16) by siMPle/MPAPP within sample set A, which were rather present in larger size classes based on BPF results.

Fig. 3
figure 3

Assignments to PE in samples A-04 (a), A-05 (b) and A-06 (c) by siMPle/MPAPP (black) and BPF (grey), shown for different size classes. A share of 85% of the assignments to the 11–100 µm size class refers to MPs  <25 µm

With respect to the estuarine sample set B, a decreasing trend in the MP counts was recorded from sample B-04 to B-10 with both analysis pipelines (Fig. 2b, d). The other samples, however, showed a stronger variation between pipelines: samples B-01, B-02 and B-03 showed 0–1 MP item after BPF analysis, whereas siMPle detected  >10 MP (B-01 and B-02), and  >50 MP (B-03). Concerning polymer compositions, sample B-04 showed a high degree of similarity after BPF and siMPle/MPAPP analysis, with PE, PP and PS being predominant polymer types. The results of the remaining samples, however, showed higher discrepancies. For samples B-06 and B-07, e.g. BPF detected more PE, whereas after siMPle analysis, PP was predominant. Details can be found in Table S4 and Table S5. Here too, with regard to PE, the differences were mainly driven by the smallest size class 11–25 µm, with n = 6 (sample B-06) and n = 13 (sample B-07) MP items detected by BPF, while 0 and 3, respectively, were detected by siMPle/MPAPP. This contrasted with the findings for sample set A, where more small PE items were detected by siMPle/MPAPP (Fig. 3).

The overall size distribution presented in Fig. 4 confirms the impression of size-related discrepancies, which were already stated above for PE, PEST and PA. For sample set A, few counts in small size classes were recorded in the BPF results, and most MP were assigned to the 50–75 µm size range (Fig. 4a). In contrast, siMPle results were clearly dominated by MP items in the size class 11–25 µm (Fig. 4c), with PP, PEST and PE being dominant. With regard to sample set B, both analyses showed highest counts in the smallest size class (Fig. 4b, d). However, especially in this size class, polymer compositions differed strongly with PE being dominant in the BPF results, whereas PP but also PA were dominant in siMPle/MPAPP results.

Fig. 4
figure 4

Overall size distributions (length of detected MP in µm) recorded for sample set A and B through BPF (a, b) and siMPle/MPAPP (c, d) analysis. It is to be noted that all size bins cover 25 µm, except for the smallest category. The latter starts at 11 µm which is equivalent to the size of one FPA detector pixel

Discussion

Effects of MP size on spectral quality and automated analysis

This study provides a detailed comparison of two MP analysis pipelines, i.e. siMPle/MPAPP and BPF. It is to be noted that this comparison was conducted with environmental samples, containing an unknown amount and composition of MP items. Thus, it remains unclear which results of the aforementioned analysis pipelines are closer to the actual MP occurrence. However, this approach was chosen to shed some light on the complexity of the analysis of environmental samples. Working with internal standards or spiked samples has multiple limitations in terms of representativeness due to the limited availability of commercially available reference material in regard to polymer types, shapes and sizes. Additionally, the preparation of spiked MP samples is challenging, especially using S-MP < 100 µm [70]. As this size range, however, appears to be the most abundant in environmental samples and poses an increased ecotoxicological risk, it is especially important to represent this size class. The aim of our study, however, was to demonstrate the complexity of the analysis of environmental samples in its full spectrum, containing all relevant polymer types, shapes and sizes, as well as the interplay of matrix residues with aged microplastic being the real challenge for both algorithms. Furthermore, a well-established approach in other domains of chemometrics for comparing algorithms is the use of expert-annotated training data [71, 72]. However, within the considered size range, experimental difficulties (e.g. the handling of small virgin MP items) make it very challenging to ensure a correct assignment of the ground truth of spectra, which is why, as a first step, this study was limited to a relative comparison of results. Nevertheless, our study highlights the similarities and differences in results obtained with both tools which are essential for further efforts towards method optimisation and harmonisation.

Despite of the use of state-of-the-art methods, our results, however, also underline the uncertainties in MP analysis concerning polymer identification, especially at the lower end of the detection limit in terms of size. Depending on morphology and thickness of MPs in a size range below 50 µm, IR-spectra can be influenced by effects such as diffraction and Mie scattering [42], in tandem with low intensity of the original polymer signal. Thus, the interplay of these phenomena potentially results in low quality spectra and low signal-to-noise ratios. Due to a more conservative, time-intensive approach with BPF, possibly such low-quality polymer spectra were manually rejected. While for sample set A between 17 and 50% (mean ± SD = 37% ± 9%) of all items identified as MP were accepted after manual re-inspecting the data, only between 0 and 39% (mean ± SD = 13% ± 15%) of all MP hits were accepted for sample set B. For siMPle/MPAPP, such a process was performed former to this study in previous work on surface water samples collected in the North Sea [69], setting the minimum threshold to be reached per polymer type. Nevertheless, during application of siMPle/MPAPP, a fraction of such low quality spectra may have been included automatically. Although the problem of classification of MPs with low quality spectra at the lower end of the detection limit of IR spectroscopy has been demonstrated in our case on the example of two automated analysis algorithms, it also holds true for other automated classification solutions or pure manual classification. The severity of this phenomenon may differ between studies as it also depends on the specifications of the respective µFTIR system, filter type etc. used for MP sample measurement.

 Potential reasons for differences in MP abundance, polymer composition and size distributions

 Our results showed a high similarity between MP counts and polymer composition especially with respect to sample set A (average difference: ∆n ~ 6, maximum difference: A-08, ∆n = 27), where PP and PE were recorded as most abundant polymer types (Fig. 2a, c). Slight differences in polymer compositions, however, were observed: more PE and PEST were detected by siMPle/MPAPP than with BPF (samples A-04, A-05, and A-06). As stated above, these differences were mostly driven by the spectral features occurring in the smallest size classes.

With regard to sample set B, most samples showed similar MP counts after BPF and siMPle/MPAPP analysis (average difference: ∆n ~ 12). Additionally, sample B-04 showed strikingly high similarities also with regard to polymer compositions (Fig. 2b, d). Discrepancies in MP counts, however, were observed in samples B-01 to B-03 (maximum difference: B-03, ∆n  >50). For B-01 and B-02, this is possibly due to a matrix effect: these samples were rich in very fine sediments that could not be effectively eliminated during sample purification, which was underlined by high content of suspended particulate matter (SPM) (details in Table S1) recorded during sampling in the semi-enclosed Jade Bay (German North Sea) [64]. These characteristics, potentially resulting in matrix interferences, were in contrast to the other samples from sample set B and might have negatively affected the automated polymer identification by either under- or overestimation by the respective analysis pipelines. This problem can only be solved in future by further adjustments in the extraction and purification methodology. Sample B-03, however, did not show any obvious specific matrix-related characteristics which would explain the observed differences between both pipelines. Concerning the remaining samples of sample set B (B-04 to B-10), a general trend of decreasing MP counts was observed for both pipelines. However, differences in polymer compositions became evident especially in B-06 and B-07, where BPF detected more PE, whereas siMPle/MPAPP detected more PP. Randomly selected PP spectra detected through siMPle analysis were visually checked (exemplary spectra shown in Fig. S1) and accepted, as characteristic bands were present (i.e. stretching vibrations of CH3 and CH2 between  ~2830 and 2950 cm−1, and bending vibrations of mainly CH3 at  ~1450 and  ~1370 cm−1). However, existing noise in the spectra might have resulted in manual rejection during the quality check after BPF analysis, while it was considered valid due to double spectral confirmation within the siMPle process, as described in Primpke et al. [48]. The differences in PE counts, however, could not be explained in the framework of this study. Finally, our analysis showed that for the polymer clusters considered, BPF and siMPle/MPAPP generally are in accordance with regard to MP counts and polymer compositions, with exceptions likely caused by residues of complex sample matrixes post sample purification on the sample filters and discrepancies regarding the detection of small MP items  <25 µm.

Both pipelines found an overall dominance of MP in the smaller size classes. However, especially for sample set A, the overall size distribution differed between BPF and siMPle/MPAPP results, with the latter showing much higher counts in the smallest size class. Next to low quality spectra resulting from diffraction, Mie scattering and low signal intensity that may have led to an under- (BPF) or overestimation (siMPle/MPAPP) in the smallest size classes by the respective methods as described above, these discrepancies are potentially also due to different assessment approaches with regard to size classification in both algorithms. After the application of BPF, a manual QA/QC was performed for each classified MP item, and whenever necessary adjacent pixels were combined retrospectively if they clearly belong to the same item. During siMPle/MPAPP analysis, a closing step is implemented, where neighboured pixels are combined automatically [2, 54]. Thus, if the surface of an item is not uniform, potentially due to organic residues, i.e. the closing step may not be successful. This may lead to the detection of multiple counts in smallest size classes instead of the detection of one large item, which could be the reason for the observed differences in size classification. Nevertheless, a general high occurrence of small MP items was confirmed by Primpke et al. [73] using QCL-IR measurements and Cabernard et al. [74] with both µFTIR and Raman, with the latter showing even higher counts in the exact same sample. Thus it remains unclear, to what extent the aforementioned effect is relevant for the differences in size distributions observed in the present study.

Consequently, the observed differences between BPF and siMPle/MPAPP concerning size distributions underline the necessity of further comparative studies and should be focus of future harmonisation efforts with the final goal of a reliable assessment of environmental MP concentrations in all detectable size classes.

Effects of differences in the general methodological approach

To our knowledge, this is the first study to compare a model-based to an instance-based algorithm with respect to resulting MP abundances, polymer compositions and sizes, by applying both analysis pipelines on the data of the same two MP sample sets. For an accurate comparison of BPF and siMPle/MPAPP, a restructuring of the data output was performed, including the harmonisation of polymer types and the exclusion of those which are not targeted in both pipelines (e.g. silicone, which is only addressed by BPF, or polyimide, which is only included in siMPle/MPAPP; Table 1). This data handling step resulted in the data output presented in Fig. 1, showing that especially A/PUR/V was much more frequently detected by siMPle/MPAPP. Both, the PUR and PMMA class of the BPF approach was trained using only PUR and PMMA spectra. In siMPle/MPAPP, the individual spectra of different polymers were assigned by a label to polymer type clusters of similar substances [47] including other types of acrylate substances for A/PUR/V. While the former approach tries to mimic the real structure of the underlying polymer data by deriving a statistical model, the latter uses a combination of hierarchal cluster analysis and expert knowledge to generate generalised polymer type clusters separable in the available spectral range. The harmonisation step should thus be observed critically, as different design philosophies, with very different mathematical characteristics are compared. This potentially leads to a broader coverage of polymers by siMPle/MPAPP, while the BPF database allows that certain polymer types such as PUR and PMMA are differentiated.

Interestingly, BPF detected items falling into the A/PUR/V cluster especially in sample set B (Weser-Wadden Sea transitional zone), but almost none in sample set A (Upper and Middle Weser) (Fig. 1). Sample set B originates from sampling stations situated in an area with relatively high shipping activity, which represents a potential source for varnish-like items [64]. In this study, also L-MP items with a varnish-like morphology were recorded in this area. Sample set A, however, stems from an area less influenced by shipping activity in the middle and upper part of the Weser River, which is in accordance with very few (n = 4) validated assignments to the cluster A/PUR/V by BPF, but contrasts with the extremely high abundances after siMPle/MPAPP analysis (n = 1086). Beside the possibility of potential underestimation of this polymer cluster by BPF due to conservative manual rejection of low-quality spectra, another explanation could be an overestimation by siMPle/MPAPP, e.g. due to the missing awareness of specific natural materials causing systematic false positive assignments. This cluster was addressed with an extensive QA/QC procedure in a previous study on MP in WWTP effluents. It was found that this polymer cluster is strongly affected by a spectral interference with plant cuticles in sample matrixes present in high amounts of residual biological material [53] resulting in false positives with a high spectral match. Hence, in sample set A, an overestimation in samples with a high organic load by siMPle/MPAPP in the A/PUR/V cluster cannot be ruled out. Also, the polymer cluster EVA showed significant discrepancies, with relatively high abundances after siMPle/MPAPP analysis (sample set A: n = 55, sample set B: n = 55), and no counts after BPF analysis. As stated in the “Exclusion of further polymer clusters from analysis” section, respective siMPle-spectra were rejected after visual re-inspection by both operators as false positives. Also here, the manual inspection of the analysis results as routinely implemented after BPF analysis leads to exclusion and more conservative results with regard to this polymer type. Especially the lack of the ethylene signal at approx. 1370 cm−1 in the sample spectra (Fig. S2) led to rejection of most spectra assigned to EVA. Moreover, similar to the siMPle cluster A/PUR/V, also EVA was affected by matrix interferences in the study by Roscher et al. [53], further hinting towards the assumption that respective counts in the present study might also be due to false positive identification. Indeed, the two samples with highest EVA counts (A-08: n = 25; B-03: n = 39) showed high amounts of potentially biogenic material on Anodisc™ filters (Fig. S3) in comparison to other samples. These observations show that despite the high benefit of automated or semi-automated analysis pipelines, expert knowledge and manual QA/QC processes are highly necessary in order to allow for solid and unbiased datasets, as previously stated by Song et al. [75]. In siMPle/MPAPP, a QA/QC on the individual spectral level is implemented on a regular basis (see Lorenz et al. [69] and Primpke et al. [48]), whereas in BPF, the manual check of MP assignments was performed routinely for each dataset. Although the latter may provide a high certainty, it can also be time consuming, depending on the amount of potential MPs detected. For example, as stated above, in sample set A 50–73% (mean ± SD = 63% ± 9%) and in sample set B 61–100% (mean ± SD = 87% ± 15%) of all items identified as MP by the BPF algorithm were rejected after manual reinspection of the spectra. Here, especially samples with a high content of residual matter post-purification (such as fine sand or non-digestible matter such as plant pollen) appear to be critical. Due to an increased amount of material remaining on the filter, the time required for manual re-inspection of all spectra assigned to MP increases. These observations underline the great importance of an effective purification approach to produce final MP samples with as little sample matrix as possible present [68].

Due to the IR-transparency features of the filter material (aluminium oxide, Anodisc) used as substrate for FTIR imaging in this study, only the wavenumber range 3600 − 1250 cm−1 could be measured, where synthetic and biogenic substances partly share similar bands and discrimination can be challenging [53]. Thus, the additional assessment of data in the spectral fingerprint range  <1250 cm−1 should be further pursued for better distinguishing materials. In the case of µFTIR imaging in transmission mode, which results in the highest spectral quality [34], this is only possible if the used filter material aluminium oxide is replaced by material like silicon that is also IR transparent in the fingerprint region [76]. This filter substitute could help to improve classification especially for acrylate- and PUR-based polymers as well as EVA which would enhance the comparability of results as well as the general detection success and reliability of data.

Future implications

By comparing two currently well-established and frequently applied MP analysis tools, this study can act as a basis for future harmonisation and standardisation efforts in MP analysis. In general, BPF and siMPle/MPAPP showed similar results, with some discrepancies likely caused by matrix effects, and others explainable by the chemical characteristics of certain polymers which could be improved by a broader measurement range including the fingerprint region. On the whole, both pipelines are rapid and generate a detailed data output and therefore show great potential for a broad application in MP assessments. This study further underlines the importance of QA/QC, e.g. implemented by manual counter-checking by experts, in order to allow for the generation of high-quality datasets and underlines the importance of purification approaches that reduce the present sample matrix effectively. Additionally, our study also shows that all studies on MP contamination should be interpreted with caution, especially with respect to smaller size classes, since it remains unclear for all currently applied methods how correct the generated results are with respect to the actual occurrence of MP in the environment. Keeping this in mind, as a final consequence, we have to admit that even by the use of state-of-the-art methodology, the determination of the real environmental MP number is still a challenge which needs to be addressed by further research efforts. In order to have a clearer picture of how close the obtained results are to the actual numbers, it may perhaps be beneficial to work with spiked samples — although one has to be aware of the limitations in regard to available polymer types, shapes and sizes. Nevertheless, through the current ongoing development and improvement of the here applied analysis tools, both usability and reliability are being enhanced, e.g. by adaptations of underlying reference databases (siMPle) or optimised follow-up versions (“Purency Microplastic Finder” derived from BPF). These improvements and further optimisations will lead to analysis tools that — in the best case — produce data with high reliability without additional manual re-evaluation efforts.