Introduction

The deep sea is among the least investigated habitats on earth, harboring highly diverse benthic communities as well as mineral resources (Ramirez-Llodra et al. 2011). The Clarion Clipperton Fracture Zone (CCZ), located in the eastern equatorial Pacific (Fig. 1), has been targeted for the exploration of polymetallic nodules (Clark et al. 2013), which are a potential resource of copper, nickel, cobalt and rare earth elements (Wegorzewski and Kuhn 2014; Kuhn and Rühlemann 2021). There is general consensus that deep-sea mining will have adverse effects on the benthic communities (Gollner et al. 2017; Jones et al. 2017; Niner et al. 2018), and that recolonization of certain taxonomic groups will be slow (Miljutin et al. 2011). To preserve biodiversity in these deep-sea environments, it is not only important to define appropriate marine protected areas and preservation zones (Wedding et al. 2013; Jones et al. 2020; Uhlenkott et al. 2020), but also to develop tools for rapid biodiversity assessment to monitor the deep-sea fauna (Mohrbeck et al. 2015; Macheriotou et al. 2019). Biodiversity is generally high in the deep sea (George et al. 2014), but is often contrasted by low abundances of specimens (Smith et al. 2008; Ramirez-Llodra et al. 2010). Moreover, the actual biodiversity in the deep sea and specifically in nodule areas can currently only be predicted and is most likely underestimated (e.g. Christodoulou et al. 2019). Meiofauna studies conducted in the CCZ have so far focused mainly on the most abundant taxon Nematoda (Miljutina et al. 2010; Miljutin et al. 2015; Singh et al. 2016; Hauquier et al. 2019). Valid species descriptions of Copepoda from the CCZ only exist for some larger-sized species belonging to the orders Calanoida, e.g Aetideidae (Markhaseva et al. 2017), Siphonostomatoida (Mahatma et al. 2008) and Harpacticoida, e.g. Aegisthidae (Mercado-Salas et al. 2019). No large-scale evaluation of the copepod biodiversity in the CCZ has been published yet. Some new Kinorhynch species of the CCZ have been described in Sánchez et al. (2019) and Sánchez et al. (2022).

Fig. 1
figure 1

Bathymetric map of the study area positioned in the eastern Clarion Clipperton Fracture Zone (CCZ). The red dots represent multicore sampling sites

Until now, biodiversity assessment and species descriptions in the CCZ focused on morphology and DNA barcoding (Janssen et al. 2015; Herzog et al. 2018; Christodoulou et al. 2019; Mercado-Salas et al. 2019; Brix et al. 2020). This molecular method allows for species discrimination with little taxonomic expertise compared to pure morphological identification (Hebert et al. 2003; Tautz et al. 2003). Different algorithms exist for species delineation based on DNA barcoding, discriminating species either by distance-based or by tree-based approaches (Christodoulou et al. 2020; Paulus et al. 2022; Korfhage et al. 2022). Among the most commonly used methods is the distance-based Automatic Barcode Gap Discovery (ABGD), delimiting species according to higher within-species similarity compared to other species (Puillandre et al. 2012). The tree-based General Mixed Yule Coalescent method (GMYC) investigates a similar pattern, but is rather based on coalescence of branches in phylogenetic trees (Pons et al. 2006). The term reverse taxonomy describes the identification of specimens by analyzing their genetic information before carrying out a taxonomic investigation using morphological methods (Markmann and Tautz 2005). The first use of reverse-taxonomy in the CCZ using part of the mitochondrial cytochrome c oxidase 1 (CO1) gene focused on the macrofaunal taxa Isopoda and Polychaeta in two areas approx. 1300 km apart (Janssen et al. 2015). Other taxa that have been investigated using this molecular approach in addition to morphological examination include Tanaidacea (Jakiel et al. 2019), Ophiuroidea (Christodoulou et al. 2020) and Amphipoda (Mohrbeck et al. 2021; Jażdżewska et al. 2022).

In contrast to molecular barcoding, Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry (MALDI-TOF MS) investigates the proteome fingerprint, i.e. the mass and amount of a subset of cell molecules comprised mainly of small proteins and peptides. Although MALDI-TOF MS is most commonly used in microbiology (Singhal et al. 2015), it also proved to be useful in distinguishing a large variety of different marine metazoan taxa such as isopods, cnidarians, mollusks and fish (Holst et al. 2019; Rossel et al. 2020; Wilke et al. 2020; Paulus et al. 2022; Korfhage et al. 2022; Kürzel et al. 2022). For copepods, it has successfully been used to discriminate a large pool of harpacticoid meiobenthic copepods (Rossel and Martínez Arbizu 2018a, 2019) as well as calanoid epipelagic (Laakmann et al. 2013), mesopelagic (Bode et al. 2017) and bentho-pelagic (Renz et al. 2021) copepods. This method is especially useful as a monitoring tool due to its relatively fast processing times and low costs per investigated specimen (for further details, see Rossel et al. 2019). An automatic discrimination of species based on the mass spectra derived from MALDI-TOF MS based on different clustering approaches has successfully been applied for biodiversity estimates of harpacticoid copepods (Rossel and Martínez Arbizu 2020) and calanoid copepods (Renz et al. 2021).

In addition to the fact that there is no need for comprehensive taxonomic expertise and that processing times are significantly faster compared to morphological investigations, another advantage of DNA barcoding and MALDI-TOF MS is that both methods can be used to identify operational taxonomic units (OTUs) instead of validated species. Morphological OTUs are often error-prone (Janssen et al. 2015) and identification catalogues are rarely available even for the larger megafauna organisms (Horton et al. 2021). Identification to the species level is thus rare for deep-sea meiofauna. Even the most abundant taxon Nematoda is usually only determined to the genus level in the CCZ (Miljutina et al. 2010; Singh et al. 2016; Hauquier et al. 2019). Despite difficulties for some taxa, molecular methods can aid to identify potential species and, hence, allow for a higher resolution of biodiversity assessments. Also, molecular identifiers are usually archived in repositories such as BOLD for DNA barcodes (Ratnasingham and Hebert 2007) or Dryad for mass spectrometry data. Hence, the identification can later be verified or the taxonomic resolution can be improved based on future species descriptions. Most importantly, identifications can more easily be matched between studies. As deep-sea mining in the polymetallic nodule fields of the CCZ is approaching at a rapid pace, the development of appropriate tools to monitor the (changing) environment is vital. Hence, in this study, we aim at (i) evaluating the applicability of proteome fingerprinting for biodiversity estimation and (ii) estimating species richness of meiobenthic harpacticoids in the CCZ using proteome fingerprinting and COI barcodes. To evaluate the confidence in biodiversity prediction using these molecular markers, we inter-compare different approaches on species delimitation for both methods, i.e. the distance-based ABGD and the tree-based GMYC method for the COI data as well as two different clustering approaches (partitioning around medoids and consensus clustering) for the proteomic spectra.

Material and methods

Sample collection and preparation

Sediment sampling in the CCZ was conducted using a multicorer during the cruises MANGAN 2018 (SO262: 05/04 to 29/05/2018, Rühlemann and Shipboard Scientific Party 2019) and MiningImpact2 (SO268/2: 30/03 to 22/05/2019, Haeckel and Linke 2021), both on the German research vessel SONNE (Fig. 1). The study area is located within the eastern part of the German contract area for the exploration of polymetallic nodules, which has been licensed by the German Federal Institute for Geosciences and Natural Resources (BGR) from the International Seabed Authority (ISA). In 2018, 16 biological sampling sites within four clusters of environmental grid layers were randomly chosen from a relatively small, ca. 5 × 5 km sampling area (see Fig. 1) using the R package vegan (Oksanen et al. 2017). The spatial layers of environmental variables used to distinguish as many different habitats within the study area as possible were based mainly on bathymetry and backscatter values (Uhlenkott et al. 2019). In addition, nine sites were manually selected to include the full range of bathymetric conditions and oceanographic characteristics that could, for example influence the potential direction(s) of spreading of a mining-related sediment plume (Uhlenkott et al. 2019). In 2019, meiofauna samples were obtained at eleven sites in close vicinity, about 5 km to the south of the sampling area of 2018 (Fig. 1). An overview of sampling stations is provided in Table 1.

Table 1 Exact position, water depth and sampling date of the multicore samples analyzed in this study (also see Fig. 1)

Meiofauna was sampled using multicores with an inner diameter of 94–96 mm. Bottom water was sieved over a 32-μm sieve and fixed with 99.8% ethanol denatured with methyl ethyl ketone together with, in 2018, the upper 3 cm and in 2019 the upper 5 cm of sediment in a Kautex wide-neck bottle (1000 ml). All samples were re-fixed with the same fixative after 24 h and stored at −20°C. To extract all meiofauna organisms from the sediment, samples were centrifuged according to the differential flotation method (Heip et al. 1985) with the colloidal gel Levasil®. Centrifuged samples were transferred into a Kautex wide-neck bottle (100 ml) and further stored at −20°C in the same fixative. All copepods were sorted out of the supernatant under a dissecting microscope. Prior to molecular processing, all individuals were photographed to document their basic morphology, and the ontogenetic stage was determined.

Further processing was conducted according to two different protocols. In the first approach conducted on 58% of all available specimens, the individual was cut into two pieces. The posterior part was used for DNA barcoding, while the anterior part was used for investigations with MALDI-TOF MS. In the second, enhanced protocol conducted on 42% of the specimens, the individuals were first prepared for MALDI-TOF MS and then washed with 10 μl molecular grade water before they were processed for DNA barcoding, to increase biomass used for the MALDI measurements. The change of protocol only influenced the success rate of MALDI-TOF MS but had no influence on the resulting DNA barcode or the mass spectrum. Furthermore, the exuviae could be retained for potential morphological investigations in the future.

DNA barcoding

DNA was extracted in 20 μl chelex (InstaGene Matrix, Bio-Rad) for 50 min at 56°C, followed by a denaturation of the enzymes for 10 min at 96°C. PCR was conducted directly using the extract as DNA template with Accu Start (2× PCR master mix, Quantabio) (Suppl. Tab. 1) in a 20-μl starting solution. A variety of primers and primer combinations was used (Table 2). The success of all reactions was checked on a 1%-agarose gel; all PCR-products that produced a band were sent to Macrogen Europe, Amsterdam, Netherlands, for sequencing on an Applied Biosystems 3730XL sequencer. A negative control was used in all PCR runs.

Table 2 Table of primers that were used for amplification of COI-barcoding fragments. These primers allow various combinations for amplification of a COI-fragment ranging from 660 to 890 bp in size

Resulting sequencing reads were assembled in Geneious R7 v. 7.0.6. and checked for contamination (e.g. bacteria, fungi, non-crustacean taxa) using the basic local alignment search tool (BLAST) (Altschul et al. 1997). Subsequently, sequences were aligned in SeaView (Gouy et al. 2010) using the MUSCLE algorithm (Edgar 2004). Alignments were checked and misalignments by the algorithm were corrected manually. Possible nuclear mitochondrial DNA segments (NUMTs) were discarded by excluding sequences containing stop codons.

MALDI-TOF MS

The tissue was transferred with 5-μl ethanol into a 0.2-ml microcentrifuge tube. After the ethanol had evaporated at room temperature, 2.5-μl α-cyano-4-hydroxycinnamic acid (HCCA) was added and the tissue was incubated for at least 5 min. Thereafter, the extract with the HCCA was transferred to a metallic target plate and measured in a Microflex LT/SH System (Bruker Daltonics) using the method MBTAuto. Peak evaluation was carried out in a mass peak range between 2 and 10 k Dalton (Da) using a centroid peak detection algorithm, a signal-to-noise threshold of 2 and a minimum intensity threshold of 600. To create a sum spectrum, 160 satisfactory shots were summed up.

Raw spectra were imported to R and further processed using the R-packages MALDIquantForeign (Gibb 2015) and MALDIquant (Gibb and Strimmer 2012). Spectra were square-root transformed, smoothed using the Savitzky Golay method (Savitzky and Golay 1964), baseline corrected using the Statistics-sensitive Non-linear Iterative Peak-clipping algorithm (SNIP) (Ryan et al. 1988) and spectra normalized using the Total Ion Current (TIC) method. Repeated measurements were averaged by using mean intensities.

Signal-to-noise (SNR) cut-off values and peak detection half window size (HWS) were optimized by comparing unsupervised (without a reference library) biodiversity estimation using partitioning around medoids (PAM) sensu Rossel and Martínez Arbizu (2020) to the results of the DNA-based method ABGD. SNR and HWS values were chosen based on the highest adjusted rand index (ARI) between the PAM clustering calculated using the R-package mclust (Scrucca et al. 2016) and ABGD. A higher SNR-value will discard mass peaks of low intensity that are considered to be noise. A high SNR will result in more discarded mass peaks and thus in less mass peaks in the final dataset. The highest mass peak within a certain HWS will be picked as the most relevant mass peak in this range. A smaller HWS will result in more mass peaks in the final dataset.

Due to the current lack of standard objective methods, quality control was mainly carried out by expert opinion. In that context, mass spectra with strong noise, low peak intensities or an exceedingly low number of peaks were discarded.

Assessment of species richness

We applied four different unsupervised species delimitation approaches, two based on DNA barcoding and two based on proteomic spectra. To assign DNA sequences to molecular operational taxonomic units (MOTUs), ABGD was applied on the whole dataset (Puillandre et al. 2012) using the default setting on the ABGD web application (https://bioinfo.mnhn.fr/abi/public/abgd/abgdweb.html). As another method for species delimitation, GMYC analysis (Pons et al. 2006) was used by creating an ultrametric tree in BEAST. This tree was then used for GMYC analysis in R using the package splits (R-Core-Team, 2018; Ezard et al. 2021).

Using mass spectra derived from MALDI-TOF MS, PAM clustering previously tested on harpacticoid copepods by Rossel and Martínez Arbizu (2020) was applied using Hellinger-transformed data (Legendre and Gallagher 2001). For dimensionality reduction, a principal component analysis (PCA) was carried out on the data using command “prcomp” in R. We applied the silhouette index (Rousseeuw 1987) as an internal validation measure for the optimal clustering result. The silhouette analysis uses the difference between normalized separation, i.e. minimum of pairwise distances between clusters, and compactness, i.e. maximum of pairwise distances within the clusters. For each data point, a silhouette width is calculated and the average of these widths then provides the validation criterion. The largest silhouette was chosen as the best estimation for proteomic operational taxonomic units (POTUs) (Fig. 2a).

Fig. 2
figure 2

a Plot of silhouette values obtained by PAM clustering. Each dot represents the silhouette value (y-axis) obtained from a certain number of clusters (x-axis; varying from 2 to 726). The highest value was obtained for 499 clusters and is marked in red. b Ambiguous clustering plot for the designation of the number of stable clusters from consensus clustering. Each dot represents the proportion of ambiguous clustering (y-axis) per number of clusters (x-axis). The optimal number of clusters is marked in red

To evaluate stability and reproducibility of POTU delimitation, a second clustering approach was applied using consensus clustering based on hierarchical clustering (HC) with single linkage using the R-package ConsensusClusterPlus (Wilkerson and Hayes 2010) as applied in a previous study by Renz et al. (2021). A consensus matrix was calculated based on 100 repetitions of HC using Euclidean distance of Hellinger-transformed peak intensities based on 80% of randomly chosen compounds. Outer clustering of the consensus matrix was repeated using HC with single linkage. The number of stable and reproducible clusters was inferred from the consensus analysis using the proportion of ambiguous clustering (PAC) as internal validation measure (Șenbabaoğlu et al. 2014). PAC is defined as the fraction of sample pairs with consensus values in the interval above 0 (i.e. sample pairs that are never in the same cluster) and below 1 (i.e. sample pairs that are always in the same cluster). In a truly stable clustering, a consensus matrix contains only 0 and 1, and the PAC would have a score of 0. Here, we used 0.1 as lower and 0.9 as upper limit. From this, we inferred the number (n) of stable clusters by visual inspection of the first distinct minimum, i.e. when the difference in PAC between n and n+1 clusters approaches 0 (Fig. 2b).

For every resulting copepod community, Shannon’s diversity (H’) (Shannon 1948) using the ‘diversity’ function from the R-package vegan (Oksanen et al. 2017) and Pielou’s evenness (Pielou 1966) were calculated as a measure of biodiversity. Finally, communities based on DNA and MALDI-TOF data were compared using a Mantel test (Mantel 1967) applied to distance matrices calculated from community data.

Rarefaction curves were computed using the ‘rarecurve’ function from the R-package vegan (step = 20, sample = 100).

Inter-comparison of the species delimitation approaches

To estimate variability between the species delimitation approaches, all resulting species clusters were compared by calculating the ARI using the R-package mclust (Scrucca et al. 2016). Except for direct comparisons between two species delimitation tools, the DNA-based method ABGD was used as baseline identification, as methods for delimitation of proteomic data as well as ABGD are similarity based, making these more comparable. Thus, the ARI was always calculated in relation to ABGD results. Additionally, the SNR and the HWS were varied to investigate ARI and biodiversity variability in relation to data processing.

Results

MOTUs (Molecular Operational Taxonomic Units)

DNA sequences were obtained for 1296 out of 2115 copepod specimens. Applying ABGD resulted in 718 MOTUs (H’=6.30, J=0.96) (Table 3). Intraspecific JC69 distances for these MOTUs ranged from 0 to 0.15 (mean=0.02), and interspecific distances from 0.11 to 0.42 (mean=0.20). GMYC produced 794 MOTUs (H’=6.44, J=0.96) (Table 3).

From the 2115 specimens used in our study, quality-controlled MALDI mass spectra were retained for 1445 specimens. However, both COI and MALDI information was available for a subset of 727 specimens, which were used for the comparison of methods. For these, ABGD resulted in 440 MOTUs (H’=5.84, J=0.96) and GMYC in 489 MOTUs (H’=5.60, J=0.97) (Table 3).

Table 3 Genetic delimitation methods GMYC and ABGD compared to proteomic biodiversity estimation using Shannon diversity and species evenness for every approach

Quality control and processing of proteomic spectra

Using the dataset of 727 specimens for which both COI and MALDI information was available, we tested how data processing steps influence the accuracy of unsupervised biodiversity estimation using PAM clustering sensu Rossel and Martínez Arbizu (2020) based on the protein mass spectra in consistency with cluster formation based on the ABGD approach. We found that varying SNR and HWS had a major impact on estimation accuracy. However, the adjusted Rand index (ARI) of ABGD delimitation and PAM was found to span a narrow range from 0.48 to 0.56. The best ARI was found for SNR = 10 and HWS = 20.

POTUs (Proteomic Operational Taxonomic Units)

PAM clustering sensu Rossel and Martínez Arbizu (2020) of the 727 specimens with concurrent COI barcodes resulted in 499 POTUs (H’=6.06, J=0.98) (Table 3; Fig. 2a). Determination of POTUs using hierarchical clustering with consensus clustering (HC_CC) after Renz et al. (2021) resulted in 527 clusters (H’=6.09, J=0.97) (Fig. 2b).

Evaluation of clustering approaches

Consistency between the PAM-based POTU clusters and AGBD-based MOTU clusters described above was evaluated by adjusted rand (ARI = 0.57) (Table 3) and a detailed comparison of cluster composition. Over 70% of MOTU clusters with less than five specimens were correctly identified using proteome fingerprinting, while MOTUs containing more specimens were not retained completely. Figure 3 displays where wrong assignments occur frequently. Species clusters containing larger numbers of specimens are often split into smaller fractions, albeit retaining large numbers of conspecific specimens in the same clusters. Singletons often cluster non-specifically into other clusters. A Mantel test carried out on distance matrices from community tables resulting from PAM clustering and ABGD delimitation resulted in an r-value of 0.56 (p=0.001). Stability of clusters based on proteome patterns was evaluated using a consensus cluster approach based on HC after Renz et al. (2021). In total, 527 robust clusters (H’=6.09, J=0.97) were identified with an ARI of 0.60, with a high number of specimens being wrongly identified as singletons (n=215).

Fig. 3
figure 3

Alluvial plot of “relocation” of specimens, comparing ABGD species delimitation (middle) to the unsupervised delimitations based on MALDI-TOF MS data (left and right). Blue lines indicate that a specimen identified as a MALDI POTU is also correspondingly observed as ABGD MOTU. Red lines indicate relocation into a POTU with less than 50% of specimens found in its assigned ABGD MOTU. Magenta, turquoise and light blue indicate clustering with at least 50%, 66% or 80% of conspecific specimens, respectively

Even though occasional clustering of conspecific specimens of different ontogenetic stages was observed, this was not found to be a consistent pattern throughout all species. Such conspecific specimens were also frequently grouped into different clusters. These often differed by the average number of mass peaks (e.g. Fig. 4) and maximum intensities of the respective mass spectra.

Fig. 4
figure 4

ABGD MOTU 017 was found to be the species with the most specimens based on DNA barcoding, but was divided into three clusters by PAM clustering of proteomic data (a, c, d). Mass spectra of specimens displayed in a and c are visually similar, whereas those of d are quite different. The difference in average mass-peak number of these groups is displayed in b. a Cluster 78; c Cluster 65; d Cluster 472

Impact of data quality

The formation of clusters based on the intensity and the number of peaks indicates a large influence of mass spectra quality on the POTU delimitation. Since objective quality control tools are not available, a stricter visual quality control was carried out and the dataset was reduced to a total of 321 specimens (n = 235 species) to test if the accuracy of unsupervised delimitation methods can be enhanced using only high-quality mass spectra (Table 3). Again, SNR and HWS were varied to investigate the range of accuracy and to examine data-processing influence.

The ARI based on PAM ranged from 0.45 to 0.79 with the best ARI being detected at HWS = 10 and SNR = 19. This resulted in highly similar Shannon diversities (H’=5.37, J=0.98, nspec=243) in comparison to ABGD (H’=5.31, J=0.97). HC_CC resulted in a total number of 286 POTUs, with an ARI of 0.50. Shannon diversity (H’=5.59) and Pielou’s evenness (J=0.99) based on this method were a bit higher, especially because species with two specimens are frequently split into singleton clusters (Fig. 5, right side). In general, results from this high-quality dataset showed greater overlaps with barcoding delimitation (Fig. 5). A Mantel test carried out on distance matrices from community tables resulting from PAM clustering and ABGD delimitation resulted in a high r-value of 0.73 (p=0.001).

Fig. 5
figure 5

Alluvial plot of “relocation” of specimens for a reduced dataset after a strict visual quality control (see text for details). ABGD species delimitation (middle) is compared to the unsupervised delimitations based on MALDI-TOF MS data (left and right). Blue lines indicate that a specimen identified as a MALDI POTU is also correspondingly observed as ABGD MOTU. Red lines indicate relocation into a POTU with less than 50% of specimens found in its assigned ABGD MOTU. Magenta, turquoise and light blue indicate clustering with at least 50%, 66% or 80% of conspecific specimens, respectively

Implications of data processing for biodiversity assessments

Due to the lack of reference information (e.g. morphological identification or some molecular identifier) in studies where MALDI-TOF MS is applied as a standalone method, data processing (i.e. peak identification) cannot be optimized in a standardized way. Therefore, variability of evenness and biodiversity from PAM clustering were investigated varying the processing steps with the highest impact on results (SNR and HWS). Irrespective of data processing, results of PAM clustering are in high agreement with genetic results (Fig. 6). MALDI-TOF results from HC_CC overestimated diversity and evenness due to the high number of singletons compared to genetic tools.

Fig. 6
figure 6

Comparisons between adjusted rand index (ARI), Pielou’s evenness (J, in blue) and Shannon diversity (H’, in red) obtained by PAM clustering from datasets in which HWS and SNR were varied. Triangles display the corresponding value from HC_CC in the optimized dataset according to the comparison of PAM clustering and ABGD. Diamonds represent values obtained from GMYC. Lines are values obtained from ABGD species delimitation

Estimation of biodiversity

In total, a dataset of 1445 copepod specimens was analyzed by MALDI-TOF (Table 3). This dataset includes the 727 specimens with a concurrent barcode reference as described in the previous sections. For the remaining 718 specimens, a reference was not available. Using PAM clustering on this complete dataset, 815 POTUs were defined (H’=6.49; J=0.97; Table 3), while HC_CC resulted in 1023 POTUs and subsequently in H’=6.70 and J=0.97 (Table 3). This fits well with the high biodiversity and evenness values obtained for the previously analyzed datasets.

Furthermore, a dataset comprising 1296 DNA barcodes of benthic copepods from the sampling area was analyzed using ABGD and GMYC (Table 3). This dataset includes the 727 specimens for which a MALDI-TOF mass spectrum was also available. ABGD obtained 718 MOTUs with H’=6.30 and J=0.96. GMYC, on the other hand, produced 794 MOTUs with H’=6.44 and J=0.96.

Rarefaction curves based on the different methods applied in this study to the different datasets described above do not reach an asymptote and all emphasize that we are far from having discovered all species to be expected in the study area (Fig. 7).

Fig. 7
figure 7

Rarefaction curves generated for all tested datasets in this study. Numbers represent the size of the dataset for which the corresponding method applies (HC_CC; PAM; DNA_GMYC; DNA_ABGD). For each method, the number of defined species is shown in relation to the number of specimens analyzed

Discussion

The analyses carried out in this study show that unsupervised biodiversity assessment of deep-sea benthic copepods based on proteome fingerprinting is possible and is in general agreement with DNA-based methods. Hence, MALDI-TOF MS can be regarded as a potential tool for accelerated and standardized assessment of biodiversity, in addition to DNA barcoding which has already been applied in several studies on benthic biodiversity in the CCZ (Janssen et al. 2015; Jakiel et al. 2019; Christodoulou et al. 2020). So far, no assessment method applied in the CCZ was able to identify all collected specimens. However, it is important to mention that juvenile copepods were also identified in our approach. These are often not considered in morphological studies, although they make up one- to two-thirds of the specimens. In our study, a MALDI spectrum was successfully obtained for 68% of the specimens, whereas 61% of the specimens provided a DNA barcode. In previous studies from the CCZ, the success rate in obtaining COI-barcodes varied between 17 and 26% for benthic polychaetes (Janssen et al. 2019; Bonifácio et al. 2020), but was higher for Isopoda (42%, Janssen et al. 2019) and Ophiuroidea (57%, Christodoulou et al. 2020). One possibility to potentially handle these constraints in the comparative use of DNA barcoding and MALDI-TOF MS for biodiversity assessment would be to work with a pre-defined number of specimens. This method has also been applied to avoid unbalanced datasets in photographic surveys of benthic megafauna (Simon-Lledó et al. 2019). In this regard, the number of investigated specimens could be set in relation to the overall abundance of the analyzed taxa, as commonly applied in the investigation of nematodes (Hauquier et al. 2019).

The success rate of MALDI-TOF measurements on copepods (68%) was low compared to other studies, which usually reached 95 (Kaiser et al. 2018) to 100% (Renz et al. 2021). A high net efficiency identification rate due to low measurement failures coupled with relatively high accuracy has been discussed as a strong advantage of proteomic fingerprinting for species delimitation in copepods (Renz et al. 2021). The fact that we did not obtain mass spectra from 32% of our specimens in this study might be related to their small size and thus the small amount of biomass available for analyses. Additionally, some specimens were cut into separate parts for simultaneous DNA and MALDI analyses, which further reduced the amount of material available for the smaller specimens. Furthermore, juvenile stages were analyzed that induced additional biomass differences within one species. Subsequently, low biomasses may be a central cause for false partitioning of species into different clusters. This insufficient amount of biomass might also be an explanation for the variations in the mass-peak number for some specimens, which probably lead to larger Euclidean distances between specimens with different peak numbers and thus to their assignment to different clusters. This effect may have specifically influenced consensus clustering, where clustering based on subsets of markers resulted in higher dissimilarity between conspecific specimens and the prediction of a higher number of singleton clusters (Renz et al. 2021).

Previous studies on various animal groups such as insects, arachnids and crustaceans have already highlighted the influence of sample storage on protein mass spectra used for supervised identification (Mathis et al. 2015; Nebbak et al. 2017; Rossel and Martínez Arbizu, 2018b), in which identification success decreases with sample and data quality. This has also been emphasized by studies measuring the freshness of samples based on MALDI-TOF measurements (Ulrich et al. 2017). However, these studies have also shown that quality decrease does not always affect all specimens in a sample in the same way, resulting in good identification for some specimens while others could not be measured successfully anymore (Rossel and Martínez Arbizu 2018b). Although all samples were fixed in highly concentrated ethanol and stored at −20°C in this study, it cannot be excluded that reduced data quality might be related to degradation in some samples and specimens.

Even though results from different techniques as well as species delimitation methods are not completely congruent, performance as well as the resulting delimitation of taxonomic units is generally similar. Comparing the POTU delimitation of the high-quality proteome fingerprint dataset using the PAM algorithm to the MOTU delimitation using ABGD accounted for an ARI of 0.79. The comparison of GMYC and ABGD led to an ARI of only 0.84, although these delimitations are based on an identical dataset. Thus, with good-quality measurement as a prerequisite, proteome-based data are generally capable of providing an accurate picture of species richness, which is also supported on community level by the Mantel test. Also, even though high overlap of species defined by proteome fingerprinting and by COI DNA barcoding was shown various times (Bode et al. 2017; Rossel and Martínez Arbizu 2019; Renz et al. 2021; Yeom et al. 2021), none of the methods can claim to actually show natural species boundaries. Different methods can, even applied to the same data, return different species boundaries as is shown by the comparison of GMYC and ABGD. This is even more likely when delimitation methods rely on different kinds of data such as ABGD relying on DNA and PAM clustering relying on proteomic fingerprints. Furthermore, although the high overlap between genetic and proteomic data in the smallest dataset probably originates from better data quality, the difference in cluster sizes may also play a role, since the higher-quality dataset did not contain species equivalents with more than seven specimens.

Generally, the high number of rare MOTUs/POTUs and singletons poses difficulties on all species delimitation tools. The applied unsupervised methods depend on clustering approaches and, thus, on distances between and within clusters. If the cluster is only composed of a single specimen, no variability within the cluster can be derived and, hence, no distance within the cluster. Using MALDI-TOF MS data, no hard thresholds for within and between species variability were reported so far, in contrast to barcoding genes such as COI. Here, species were previously delimited solely based on percentage sequence divergence, e.g. in the program CD-HIT (Huang et al. 2010). A potential reason for the difficulty to define MALDI-TOF-based species delimitations is the higher variability of mass spectra signals, in turn originating from multiple factors such as ecological differences (Karger et al. 2019), quality disparities (Rossel and Martínez Arbizu 2018b) but also differences in data processing. Also, there is potential for more variability due to varying signal intensities and slight differences of masses in certain molecules in comparison to DNA, which can only exhibit four different possible stages at fixed positions along a sequence. Clearly, changing cut-off values such as the SNR and peak-picking half-window sizes will, even if only slightly, alter mass spectra and thus also distances between specimens. Hence, results of unsupervised delimitation tools such as the two clustering methods applied in this study (Rossel and Martínez Arbizu 2020; Renz et al. 2021) have to be treated with care. If proteome fingerprinting is applied as a stand-alone method for biodiversity assessment, optimization of the mass spectrometry data based on comparison with the DNA-based delimitation method ABGD for reference as applied in the study described here is not possible. However, our results suggest that even though data processing influences the outcome, the variability is not immensely high. Furthermore, general diversity as well as species evenness is relatively stable and similar compared to DNA-based results. These results are very promising and invite the inclusion of mass spectrometry data in reverse taxonomy approaches, which have been proposed as an acceleration and simplification of benthic-specimen and species identification in the CCZ (Janssen et al. 2015; Glover et al. 2016).

Economic interest in the polymetallic nodules of the CCZ is steadily increasing, due to their high content of metals such as manganese, nickel, copper and cobalt (Kuhn et al. 2017; Hein et al. 2020). Currently (2022), the International Seabed Authority has already issued 17 licenses for the exploration of polymetallic nodules in the CCZ. In this context, it is especially important to develop effective, fast and low-cost methods to initially investigate the baseline and potentially later to monitor the benthic fauna exposed to mining activities (Lins et al. 2021). MALDI-TOF MS could be especially attractive for quantitative assessments (specimen-by-specimen) of benthic communities as it does not require an extensive taxonomic knowledge and is considerably cheaper than DNA barcoding (Rossel et al. 2019). Furthermore, MALDI-TOF MS data can be used not only to distinguish species, but also the developmental stages within species (Rossel et al. 2022). Hence, it might be a useful, additional tool to monitor resettlement and dispersion strategies of individual taxa at impacted sites.

In this context, our unsupervised species delimitations can also be regarded as the first step towards the development of a mass-spectra reference library of meiobenthic Copepoda of the CCZ. However, considerations on the best form of data deposition of mass spectra are still ongoing, while DNA barcodes can easily be deposited and accessed via the BOLD database (Ratnasingham and Hebert 2007), which is also linked to GenBank (Benson et al. 2012), the most established database for sequence data. Should specimen identification based on MALDI-TOF MS be applied on a large scale, for example for monitoring purposes, a data repository for data from the CCZ will become mandatory to enable comparisons. Furthermore, the proteomic approach using MALDI-TOF MS requires well-adapted standardization. We were able to show that varying values such as SNR and HWS can have a major impact on species delimitation and biodiversity estimation. This optimization of the workflow, however, can only be carried out and adapted using morphological, genetic or other delimitation methods.

Conclusion

Unsupervised biodiversity assessments using MALDI-TOF MS can accelerate laborious morphological identifications in the context of biodiversity assessments of meiofauna in the CCZ and can be especially useful for the analysis of impacts associated with potential future deep-sea mining. We have shown that for benthic copepods, estimations of biodiversity, evenness and species numbers in comparison to those derived from DNA barcoding data are very similar, provided that the mass spectrometry data are of good quality. Still, variability in mass spectra quality is one of the main factors influencing the resulting species delimitation and needs to be further investigated and improved in the future. In our study area within the CCZ, with a high diversity of copepod species, we obtained a high number of singletons despite our large dataset of over 2000 specimens. In this context, the application of unsupervised methods needs to be undertaken with caution. To not only rely on unsupervised methods, future work needs to aim at collecting reference data for DNA-based as well as MALDI-TOF-based studies to allow species identification to the lowest possible taxonomic level.

During routine monitoring surveys where mere identifications of specimens are required, proteomic fingerprinting can be a valuable alternative to DNA barcoding. The low costs allow a far more extensive assessment compared to costly DNA barcoding. Also, there is no need to find a specific set of primers as the standard preparation was successfully tested for a variety of animal species, again saving costs and time. Our study shows that the assessment nevertheless has the capability to provide accurate species identifications once further optimized.