1 Introduction

The detection of adulterated and counterfeit food became a focus in many countries even before European competent food control authorities were challenged by the horse meat scandal and its associated media attention in 2013. Since 2011, Europol and INTERPOL jointly coordinate concerted operations called OPSON with many participating countries, in order to detect counterfeit and substandard food and beverages on the markets, and to dismantle the involved organized crime groups.Footnote 1 OPSON means “food” or refers to “the value-giving component of the food” in ancient Greek. In the first operation, 10 mainly European countries participated. Since then, the operations have grown and operation OPSON X (spanning from 2020 to 2021) involved 72 countries from all over the world. Besides uncovering food adulteration, the joint OPSON actions aim to strengthen inter-agency cooperations between food control, consumer protection authorities, law enforcement, and customs, both at the national and international levels. Germany participated in operation OPSON X within a national priority action on meat products with regard to the declared species.Footnote 2 Many processed meat products are nearly impossible to be differentiated by the naked eye, taste or smell with regard to the processed animal species. Because of different prices and availabilities of meat from different species, there is a significant potential for fraud, such as substitution of meat from an expensive species with a cheaper one, or mixing in meat from a non-declared species, in order to increase profit. In the top 10 list of food categories with potential suspicion of fraud, meat and meat products (other than poultry) are number 3, and poultry meat and poultry meat products are number 7, together with fats and oils in the EU (European Commission 2021).

Next-generation sequencing (NGS) methods are progressively replacing or complementing traditional protein and DNA-based analytical methods for the authentication of food products regarding the authenticity of the declared species as they compensate for some of their shortcomings (Haynes et al. 2019). A variety of NGS methods for species identification in meat products have been described. For example, DNA metabarcoding is based on the PCR amplification of genetic markers, which is followed by massively parallel sequencing of the amplicons (Dobrovolny et al. 2019; Liu et al. 2021; Mahama et al. 2022; Pan et al. 2020; Xing et al. 2019). In addition, PCR-free, shotgun-sequencing methods have been developed for the analysis of food products (Akbar et al. 2021; Hellmann et al. 2020; Jiang et al. 2022; Liu et al. 2017; Ripp et al. 2014). Despite the promising approach of NGS methods for meat product authentication and their application by individual competent food control laboratories, these methods still need to be sophistically validated and standardised to become reliable and easily implementable tools for a wider range of competent food control authorities.

Preckel et al. (2021) were first to validate a published DNA metabarcoding method based on short 16S rRNA gene (16S rDNA) targets (Dobrovolny et al. 2019) for the identification of species in mammal and poultry meat products. The applicability of the method was demonstrated by analysing 25 reference samples, 56 food products and 23 pet food products and by comparing the results with those obtained with commercial DNA chips and/or multiplex real-time PCR. Thus, it could be shown that the method is both robust and reproducible. Only recently, the method was validated through an interlaboratory ring trial with 15 participating laboratories (Dobrovolny et al. 2022), demonstrating that the method can be applied to different sequencing platforms and is suitable for implementation in routine food control analyses. However, for standardization by national bodies or European or International standardisation committees such as CEN or ISO, more information on the consistency of the results with the results of the standard analytical methods used and the robustness of the different analytical steps is crucial.

Here, the cooperation of the German National Reference Centre for Authentic Food (NRZ-Authent) with the German competent food control authorities regarding the analysis of meat products and declared species is described, in the framework of a laboratory cooperation within operation OPSON X. Sixty-three meat product samples collected by the authorities, analysed by using various standard analytical methods, and 5 commercially available proficiency test samples, 9 known mock mixtures, and 6 negative controls of 5 different types were subjected to the 16S rDNA metabarcoding protocol published by Dobrovolny et al. (2019), and analysed by 3 different bioinformatics data processing strategies.

The aim of this cooperation was not only to provide additional information regarding the species of the meat product samples for the assessment of authenticity, but also to further validate the 16S rDNA metabarcoding method regarding the comparability with diverse standard analytical methods used by the competent authorities. Moreover, the study aimed to gain information about the source of unspecific reads commonly observed in negative controls and mock mixtures, as well as about the robustness of the bioinformatic evaluation of the generated DNA sequences.

2 Materials and methods

2.1 Sample material

Monitoring samples were collected from the market or at producing companies within operation OPSON X, and analysed for authenticity of the declared species using various standard analytical methods. Aliquots of extracted DNA from 61 samples collected by 7 agencies from 4 Federal States were transferred to the NRZ-Authent for 16S rDNA metabarcoding. DNA of 2 additional meat product monitoring samples were provided by an 8th agency. The meat product samples comprised various product types from minced meat to goulash soup (Fig. 1). Furthermore, 8 DNA extracts from 5 samples produced for an interlaboratory comparison study in 2019 (LVU Lippold, Herbolzheim, Germany) and in 2021 (LVU Lippold) were provided by 2 agencies (DNA extracts from 3 of these samples were individually prepared in both laboratories). Table 1 and Online Resource 1 (Supplementary Information) show the standard analytical methods used by the food control authorities for the analysis of the samples. DNA concentrations of all provided DNA extracts were measured fluorometrically at the NRZ-Authent with the DeNovix dsDNA High Sensitivity Kit (DeNovix, Wilmington, DE, USA) on a DS-11FX µl volume spectrophoto/fluorometer (DeNovix, Wilmington, DE, USA). Aliquots of DNA extracts were prepared with concentrations of 10 ng/µl, samples with lower concentrations were left unchanged.

Fig. 1
figure 1

Number of meat product samples of different product types taken by food control authorities within OPSON X (n = 60) and 2 official control samples taken outside OPSON X (one kebab skewer sample and the only goulash soup sample) that were analysed with the 16S rDNA metabarcoding method in this study, in addition to conventional reference methods

Table 1 Analytical methods applied by the competent food control authorities for the authentication of food control meat products and proficiency test samples

Additionally, 3 types of mock mixtures with defined species shares were prepared:

  • 2 mock mixtures with animal species shares based on mitochondrial genome copies were provided by the Austrian competent food control authority

  • 6 further mock mixtures were prepared at the NRZ-Authent as mixtures of total DNA of individual species

  • 1 additional mock mixture sample was prepared as a mixture (volume/volume) from commercial goat’s and cow’s milk previously authenticated by real-time PCR

Aliquots of all mock mixtures were prepared with concentrations from 5 to 10 ng/µl.

2.2 PCR and sequencing reaction

16S rDNA metabarcoding analysis was performed according to Dobrovolny et al. (2019) with minor changes. Briefly, an approximately 120 bp region of the 16S rDNA was amplified in all samples (with 13 samples analysed in duplicates) by PCR (‘amplicon PCR’). Primers for mammal and poultry 16S rDNA were synthesized with adapters for subsequent index PCRs and sequencing (in italics) in “nGS scale” by metabion international AG (Planegg, Germany), and sequences were as follows:

Fwd_MaOH 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGACGAGAAGACCCTATGGAGC-3′, Rev_MaOH 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTCCGAGGTCACCCCAACC-3′, Fwd_POH 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGACGAGAAGACCCTGTGGAAC-3′, Rev_POH 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTCCAAGGTCGCCCCAACC-3′.

The amplicon PCR reaction mix (25 µl) contained 12.5 µl 2 × HotStarTaq Master Mix (Qiagen, Hilden, Germany), 3 µl 25 mM MgCl2, 1.75 µl of each primer (2.14 µM) and 2.5 µl sample DNA. In addition, 2 amplicon PCR negative controls were prepared with molecular biology grade water instead of sample DNA. PCR was conducted in a GeneExplorer 96 thermocycler (Bioer Technology, Hangzhou, China) with the following cycling protocol: 15 min initial denaturation at 95 °C, 25 cycles of 30 s denaturation at 95 °C, 30 s annealing at 62 °C and 30 s elongation at 72 °C, and final elongation of 10 min at 72 °C. Aliquots of 3 µl of each PCR were checked for successful amplification with agarose gel electrophoresis on 2% agarose gels. 20 µl of each PCR product were purified using 90 µl Mag-Bind TotalPure NGS magnetic beads (Omega Bio-tek, Norcross, GA, USA). The amplicons were incubated with magnetic beads in deepwell plates for 2 min with shaking and additional 5 min without shaking and the supernatant was discarded. The amplicons were washed twice with 200 µl ethanol (80%) and the ethanol was removed carefully. The amplicons were resuspended in 53 µl 10 mM Tris–HCl (pH 8.5) and incubated for 2 min with shaking and 2 min without shaking. Finally, the plate was incubated for 2 min on the magnetic adapter and the purified DNA was transferred to a new plate.

The purified amplicons were used for a second PCR (‘index PCR’) to index the amplicons from the different samples and to introduce adapters for binding the DNA to the sequencing flow cell. 5 µl of purified amplicon DNA were mixed with 5 µl each of two indices from the Nextera XT Index Kit v2 Set B (Illumina, San Diego, CA, USA), 25 µl 2 × HotStarTaq Master Mix (Qiagen, Hilden, Germany) and 10 µl molecular biology grade water (total volume of 50 µl). Two index PCR negative controls were prepared with molecular grade water instead of amplicon DNA. The index PCR cycling protocol started with an initial denaturation step (15 min at 95 °C) that was followed by 8 cycles of denaturation (30 s at 95 °C), primer annealing (30 s at 55 °C) and elongation (30 s at 72 °C) and the protocol ended with a final elongation step (10 min at 72 °C). The index amplicons were purified with magnetic beads as described above, but with the following changes: 50 µl of PCR reactions were used, the washed PCR products were resuspended in 28 µl Tris buffer and 25 µl of supernatants were transferred to a new plate. The success of the index PCR amplification was checked with electrophoresis on 2% agarose gels, and amplicons were quantified using the Qubit dsDNA BR Assay Kit (Life Technologies, Carlsbad, CA, USA) and a Spark multimode microplate reader (Tecan Group, Männedorf, Switzerland).

Each amplicon was diluted with 10 mM Tris–HCl (pH 8.5) to a concentration of 4 nM and then equimolarly pooled. One of the 2 amplicon PCR negative controls and 1 of the 2 index PCR negative controls were diluted at a ratio of 1:46, corresponding to the mean dilution factor for all samples. The other 2 PCR negative controls were used undiluted. 5 µl of the pooled library was denatured for 5 min with 5 µl 0.2 M NaOH (pH 8.5) and subsequently diluted with buffer HT1 (from the MiSeq Reagent Kit v2 (300-cycles), Illumina, San Diego, CA, USA) to a concentration of 8 pM. 270 µl of the diluted library were spiked with 30 µl PhiX solution (12.5 pM; Illumina). Finally, the library pool was loaded on a MiSeq Reagent Kit v2 flow cell and 126 bp were sequenced on a MiSeq sequencing instrument (Illumina) in both directions.

2.3 Sequence data processing

Before further data processing, sequencing adapters were removed from the reads. Demultiplexing was achieved with the software on the MiSeq instrument with default settings allowing one mismatch in the index sequences. Further data processing was performed on a locally installed Galaxy platform (Jalili et al. 2020). Three data processing workflows were tested in parallel.

For the ‘OTU clustering workflow’, the sequence reads were processed according to Dobrovolny et al. (2019) with minor changes. In brief, the qualities of the reads were checked with FastQCFootnote 3 and forward and reverse primer sequences were removed with Cutadapt (Martin 2011). Reads with lengths of less than 50 bp were deleted and bases with quality scores below 15 were trimmed with Trimmomatic (Bolger et al. 2014). Forward and reverse reads were joined with fastq-join (Aronesty 2013) with a maximum of 9% difference between matching segments, and joined sequences were converted to FASTA format with FASTQ to FASTA converter (Blankenberg et al. 2010). Reads were dereplicated and sorted with VSearch (Rognes et al. 2015), and sequences were clustered into de novo Operational Taxonomic Unit (OTU) sequences using a 97% threshold with the USEARCH-Tool Suite (Edgar 2013). OTU representative sequences were used to search a locally installed copy of the NCBI BLAST nucleotide database (version 2019-12-09) using blastn (Camacho et al. 2009; Cock et al. 2015) with a 97% identity cut-off and the top hit in each case was used for assigning taxonomy.

For setting up the ‘dereplication workflow’, the OTU clustering step was removed from the workflow above. Thus, the dereplicated sequences could be directly checked against the nucleotide database. All other steps were left unchanged.

A ‘dada2 workflow’ was set up using the software package DADA2 according to Callahan et al. (2016). To remove the primers, 21 and 18 bases were truncated at the start of each forward and reverse read, respectively, and all sequences were truncated after 80 bases. After determining the error model of the sequencing data, the denoised forward and reverse reads were merged with a minimum of 12 bases and without allowing mismatches in the overlapping segments. The resulting amplicon sequence variants (ASV) were purified from chimeric sequences and were queried against the same locally installed copy of the NCBI nucleotide database as in the two other workflows.

After the execution of each workflow, the read percentages per species were summed up in all samples. Species with less than 0.5% of total sample reads were considered negative and the remaining read percentages per species were re-adjusted to a total of 100%.

The demultiplexing of the sequencing raw data was repeated with the BCL software (Illumina) installed on a separate computer and with allowing zero mismatches in the index sequences as this was not possible with the software on the instrument. This time, only the reads in the negative control samples and in the mock mixtures were recorded.

3 Results

One of the 63 meat product samples from the competent food control authorities did not yield a PCR product after the amplicon PCR step, despite repeated reactions and therefore, could not be analysed. For the remaining 62 monitoring samples (8 samples measured in duplicates) as well as the 5 boiled sausage proficiency test samples (3 samples measured in duplicates) and the 9 mock mixtures (2 samples measured in duplicates), PCR amplicons were produced. Thus, together with 6 negative controls, a total of 95 samples were analysed in parallel on a flow cell of the MiSeq Reagent Kit v2, and data analysis was performed using 3 different workflows. The compositions of the mock mixtures were known before carrying out the experiment, whereas the compositions of the proficiency test samples were only disclosed after data evaluation. The actual compositions of the monitoring samples remained unknown, but the results of the metabarcoding analysis of these samples, as well as those of the proficiency test samples, were compared with the results of the standard analytical methods used by the competent food control authorities.

3.1 NGS read performance

In total, 16.16 m reads were produced with a Q30 quality score of 90.9%, a mean of 133 462 reads per sample (excluding negative controls) and a mean of 90.7% (dereplication), 88.7% (OTU clustering) and 96.1% (dada2) evaluated reads after executing the different bioinformatic workflows (Table 2). The negative controls yielded 171–264 reads per sample (205.8 reads on average), which could be reduced to 51–69 reads (57.8 reads on average) per sample by allowing zero mismatches in the index sequences during demultiplexing (Table 3). No trend was observed with regard to the type of negative control (undiluted or diluted amplicon PCR, undiluted or diluted index PCR, or simulated index combinations) and the resulting read numbers. The false-positive results of the mock mixtures with the known compositions were used for determining the read percentage threshold for species detection. The highest read percentages for particular false-positive species were 0.32–0.39% depending on the bioinformatic workflow used (Table 4). Although the total percentage of false-positive reads per sample decreased by demultiplexing with no mismatches in the indices, the maximum percentage of reads per given false-positive species in the mock mixture samples was not reduced. Thus, for the further analysis of the metabarcoding results, a threshold of 0.5% reads was used to rate a species as positive. Duplicate measurements carried out for 13 samples gave very similar results (Fig. 2). However, the measured species proportions in the duplicate proficiency test samples, for which the DNA extraction had been carried out in different laboratories and which were much more diverse in species composition, differed somewhat more than the duplicate measurements of the mock mixtures, for which only one DNA extract was used and which differed less in species variety.

Table 2 Total reads per sample after sequencing and recovery of reads and total number of different sequences, OTUs or ASVs per sample after different bioinformatic workflows
Table 3 Total reads in negative control samples for demultiplexing on the MiSeq device with max. one mismatch in index sequences and outside the MiSeq device with zero mismatches in index sequences
Table 4 Number of identified false-positive species in mock mixtures and read percentages per false-positive species
Fig. 2
figure 2

Read percentages per species in samples measured in duplicates. The first 8 duplicates (pairs of bars) show the results of OPSON X meat samples and one official control sample taken outside of OPSON X (OPSONX_070a/b). Sample OPSONX_078a was a mock mixture with DNA content-based shares of species, whereas samples OPSONX_097/OPSONX_098 were aliquots with species shares based on mitochondrial genome target copies. For these 8 samples, duplication started with performing 2 separate amplicon PCRs. The last 3 duplicates (boiled sausage samples) were from a commercial interlaboratory ring trial and DNA extractions were performed in 2 different laboratories. The different species are indicated by different greyscales

3.2 Mock mixtures and proficiency test samples

Nine mock samples were analysed containing 20 different species in total (Table 5) of which 2, Atlantic salmon and desert locust, were not target species for the metabarcoding approach. With applying a threshold of 0.5% reads, no false-positive species were determined. Also, almost all target species could be detected except for fallow deer (Dama dama), a finding which has been reported by Preckel et al. (2021) and is attributed to 2 mismatches between the reverse primer for mammals and the 16S rRNA gene of fallow deer (Fig. 3). In addition, read percentages for species that were present in low concentrations in the mock mixtures (0.5% and 0.1%, respectively) were below the chosen read percentage threshold, and the respective species were therefore also considered as negative. Although the 16S rDNA metabarcoding method was originally developed for mammal and poultry species, reads for salmon were also recovered. The forward primer for mammals matches Atlantic salmon DNA without mismatches and the two reverse primers match salmon DNA with 3 mismatches each (Fig. 4). This appears to be sufficient for at least partial amplification of salmon DNA, probably due to the positions of the mismatches in the primer Rev_POH at and near the primer start. Read percentages for desert locust, the other non-target species, were below the threshold. This was expected, as the primer sequences could not be found within desert locust DNA sequences from GenBank (data not shown). When the demultiplexing was repeated with allowing zero mismatches in the index sequences, no differences in recovered species above the threshold of 0.5% were found (Online Resource 1, Supplementary Information). However, the recovered percent reads per species do not necessarily reflect the actual share of species in the samples but differ in part considerably (Table 5).

Table 5 Composition and read percentages per species in mock mixtures and proficiency test samples
Fig. 3
figure 3

Comparison of amplicon primer sequences to fallow deer (Dama dama) DNA sequence. Fwd_MaOH: sequence of forward mammal primer; Fwd_POH: sequence of forward poultry primer; fallow deer: part of fallow deer 16S rDNA extracted from GenBank sequences KJ870163.1 and JN632629.1; Rev_MaOH: reverse-complementary sequence of reverse mammal primer; Rev_POH: reverse-complementary sequence of reverse poultry primer. Primer sequences are indicated without NGS adapters and positions with mismatches to fallow deer DNA are highlighted in white

Fig. 4
figure 4

Comparison of primer sequences to Atlantic salmon (Salmo salar) DNA sequence. Fwd_MaOH: sequence of forward mammal primer; Fwd_POH: sequence of forward poultry primer; salmon: part of Atlantic salmon 16S rDNA extracted from GenBank sequence KR476892.1; Rev_MaOH: reverse-complementary sequence of reverse mammal primer; Rev_POH: reverse-complementary sequence of reverse poultry primer. Primer sequences are indicated without NGS adapters and positions with mismatches to salmon DNA are highlighted in white

When applying the determined threshold of 0.5% reads to the proficiency test samples, all species were identified in the 5 meat mixtures (boiled sausages; three samples measured as duplicates) from the proficiency tests, with the exception of fallow deer which was present in 4 of the 5 samples. No false-positive species were detected in the samples, assuming possible contaminations with cattle in two proficiency test samples, as was described in the respective interlaboratory study reports (LVU 2019, 2021) (Table 5). However, it should be noted that the applied NGS method could not distinguish between wild boar (Sus scrofa) and domesticated pig (Sus scrofa domesticus/Sus domesticus/Sus domestica) (which was present in 2 samples).

3.3 Comparison of 16S rDNA metabarcoding with standard methods: food control samples and proficiency test samples

As most of the results provided by the food control laboratories were of qualitative and not quantitative nature, only the qualitative metabarcoding results for the 62 successfully investigated food control samples and the 5 boiled sausage samples from the interlaboratory comparison tests were compared to the results obtained by the food control agencies using standard analytical methods. As 3 proficiency test samples had been independently analysed by 2 food control agencies using different standard methods, these results were compared separately, resulting in a total of 70 comparisons. In 71.4% of the samples, exactly the same species (highlighted in green font in the Online Resource 1, Supplementary Information) were identified by the metabarcoding approach and the standard methods, regardless of the bioinformatic workflow used. The discrepancies between NGS and the standard analytical methods regarding the identified species in the other samples are described in the following sub-sections.

3.3.1 NGS detected species that were not target species for the used standard methods

See Online Resource 1 (Supplementary Information): species highlighted in purple font. In 5 control samples, particular species were discovered by metabarcoding, that were not target species for the standard analytical methods:

  1. a.

    In a kangaroo meat sample (OPSONX_021), about 11% reads for red deer were measured using the metabarcoding method, that were missed when applying Sanger sequencing of 2 distinct meat pieces of the sample, and could also not be detected using a multiplex real-time PCR for cattle, pig, sheep and equids as well as with ELISA kits for beef, poultry, pork, horse and sheep.

  2. b.

    In a red deer sausage with declared pork and red deer content (OPSONX_042), chamois (Rupicapra rupicapra) was detected by metabarcoding, which was not a target species for the DNA chip used (MEAT 5.0 LCD-Array).

  3. c.

    In a red deer salami sample (OPSONX_047), sika deer (Cervus nippon) was detected with about 14% of reads by all three bioinformatic workflows, that was not a target species of the DNA chip used (also MEAT 5.0 LCD-Array).

  4. d.

    In another red deer sausage (OPSONX_043), sika deer was identified with a much smaller read percentage (about 1%) only by the dereplication and the OTU clustering workflow but not by the dada2 workflow. Here, it is unclear, whether the reads for sika deer are due to sika deer traces in the sample or whether they resulted from amplification or sequencing errors that were removed by the denoising strategy implemented in the dada2 software package.

  5. e.

    In a liver sausage sample with declared wild boar/pork and deer content (OPSONX_040), the DNA chip used detected only “roe deer (Capreolus capreolus)” which is the European roe, whereas the metabarcoding also found Siberian roe (Capreolus pygargus) (4.1% of reads) in addition to European roe.

Moreover, 2 proficiency test samples contained elk (Alces alces), which was correctly detected by the metabarcoding method. These samples were independently analysed in 2 different food control laboratories using the MEAT 5.0 LCD-Array, but one laboratory did not detect elk (in OPSONX_067 and OPSONX_068), while the other laboratory identified elk (in OPSONX_044 and OPSONX_045) because the DNA chip MEAT Plus 3.0, which contains elk as a target species, was additionally used (Online Resource 1, Supplementary Information). In a third proficiency test sample (OPSONX_048), the metabarcoding method detected common wallaroo (Osphranter/Macropus robustus) in addition to red kangaroo and Eastern grey kangaroo that was not detected with the DNA chip, which only targets “Kangaroo (M. rufus, M. giganteus)” (the red kangaroo and the Eastern grey kangaroo).

Furthermore, for one sample (OPSONX_046=_069), the comparative method (DNA chip) reported the white-fronted goose (“Goose”, Ansa albifrons), whereas metabarcoding found the greylag goose (Anser anser) (Online Resource 1, Supplementary Information, highlighted in orange). However, a manually performed browser-based BLAST search in GenBank revealed identical sequence identities of the NGS sequence with the white-fronted goose and the greylag goose.

3.3.2 False-positive or false-negative metabarcoding results

See Online Resource 1, Supplementary Information: species highlighted in red font. As in the mock mixtures, fallow deer could not be recovered by metabarcoding in all 7 samples (OPSONX_012, _043, _044=_067, _045=_068, _046=_069, _047 and _049) that were judged positive for fallow deer with standard methods, presumably due to the mismatching NGS primer sequence (see above). In one sample (OPSONX_038) with declared egg content, chicken DNA was only determined with PCR–RFLP, real-time PCR and DNA chip, but not with ELISA or the metabarcoding method.

Moreover, in 4 samples (OPSONX_037, _051, _052 and _063) with high read percentages for cattle (Bos taurus), wild yak (Bos mutus) was also recovered in small proportions (max. 0.7% reads). Upon closer inspection of the BLAST hit description (Accession number CP027084.1), it was obvious that this GenBank sequence did not come from the mitogenome but was detected on a chromosomal region within an NGS whole genome assembly study. Furthermore, when submitting the identified NGS sequence from this study manually to a GenBank BLAST search, a hit for a chromosomal region of cattle was also found with 100% sequence identity (accession number LR962746.1) (Fig. 5). Thus, it is very likely that the wild yak reads derive from sequencing a cattle 16S rDNA pseudogene [nuclear mitochondrial DNA, NUMT; see (Gaziev and Shaikhaev (2010)] present not only in cattle but also in wild yak.

Fig. 5
figure 5

Comparison of cattle (Bos taurus) and wild yak (Bos mutus) 16S rDNA sequences and probable pseudogene sequences of the two species. ASV1 and ASV29 were amplicon sequence variants from sample OPSONX_037 analysed with the dada2 bioinformatic pipeline. Whereas the first hit of ASV1 was cattle mitochondrial DNA, the first hit of ASV29 was DNA from wild yak chromosome 16. However, the sequence of the chromosome 16 part of wild yak is identical to a corresponding chromosome 16 part of cattle. Sequences were extracted from GenBank sequences MN714195.1 (Bos taurus mitochondrion), MK033130.1 (Bos mutus mitochondrion), LR962746.1 (Bos taurus chromosome 16), CP027084.1 (Bos mutus chromosome 16)

3.3.3 Unresolved cases of discrepancy

See Online Resource 1, Supplementary Information: species highlighted in blue font. In 4 samples, particular species (OPSONX_001 and OPSONX_054: turkey; OPSONX_025: cattle; OPSONX_042: cattle, goat and sheep) were detected with the standard methods, but not with the metabarcoding method. However, as only qualitative but not quantitative information of the identified species was provided by the food control laboratories for most samples, it remains largely unclear, whether these findings were due to traces or substantial ingredients of the meat products. It seems likely that some species were missed by NGS due to very low concentrations of the DNA of the respective species, on the other hand false-positive findings by using the standard methods may also be a possible reason.

In 3 other samples, individual species (OPSONX_047 and _049: cattle; OPSONX_053: pig) were identified by metabarcoding in small percentages (0.9–1.5%), that were not reported by the food control laboratories, although the species were target species for the methods used. The reasons for these discrepancies (either false-positive metabarcoding results or false-negative results by standard methods) could not be clarified because the actual composition of the samples was not known.

3.4 Comparison of the different bioinformatic workflows

Overall, the dereplication workflow, the OTU clustering workflow and the dada2 workflow produced very similar results. In only 1 sample (OPSONX_043), the identified species differed between the workflows: Low percentages (0.8% and 1.0%, respectively) of sika deer were recovered by the dereplication and the OTU clustering workflow, but not with the dada2 workflow (see above and Online Resource 1, Supplementary Information). However, since the actual composition was not known and sika deer was not a target species of the standard method used, it cannot be determined which workflow was correct. Otherwise, only minor differences between the workflows were observed that did not affect the outcome of the identified species in the samples. For example, the recovery of reads after performing the data processing was higher with the dada2 workflow compared to the other workflows (Table 2), and the dada2 workflow produced less false-positive species in the mock mixtures compared to the other workflows (Table 4). However, since these species were below the threshold, this is considered as negligible. On the other hand, the dereplication workflow and the OTU clustering workflow produced substantially more different sequences per sample compared to the dada2 workflow. While for the dereplication workflow an average of 3,428 different sequences per sample had to be queried against the sequence database, the average number of sequences per sample decreased to 520 OTUs (15.2% of dereplicated sequences) and 15 ASVs (0.4% of dereplicated sequences), respectively (Table 2).

4 Discussion

63 meat product samples, 5 boiled sausage proficiency test samples, 9 mock mixtures with defined species compositions and 6 negative controls of 5 different types were analysed using a published 16S rDNA metabarcoding method. Using the resulting sequence data, the source of unspecific reads commonly observed in samples, the robustness of the bioinformatic data processing and the comparability with diverse standard analytical methods were investigated.

4.1 Source of unspecific sequences

No significant differences in read numbers were observed in the different types of negative controls (undiluted or diluted amplicon PCR, undiluted or diluted index PCR, or simulated index combinations). Therefore, it seems likely that the main proportion of reads in the negative controls resulted either from incorrect index molecules or from errors during amplification or sequencing of the indices and thus belong to other samples on the flow cell rather than originating from (cross-) contamination with DNA from the environment, other samples or other amplicons. Otherwise, much higher read numbers would be expected in the undiluted negative controls compared to the diluted controls. The same applies to the undiluted and diluted negative controls in comparison to the simulated negative controls where the index combinations were only used for the sequencing sample sheet and for which no PCR reaction had been prepared. Thus, assuming an average of about 133 000 reads per sample, of which about 206 reads were due to erroneous indices or sequencing errors in index sequences from other samples (see Sect. 3.1), one would expect about 0.16% of reads from other samples of the same run in the respective sample. This could be reduced to about 0.04% reads when allowing no mismatches in the index sequences. Unfortunately, the software on board the MiSeq instrument does not allow restriction to zero mismatches, so the demultiplexing with allowing zero mismatches has to be carried out independent of the machine. The use of longer indices (> 8 nucleotides), either from an alternative indexing kit or as internal parts of the amplicon primers in a one-step amplicon protocol (Kozich et al. 2013), could also reduce the number of reads from other samples.

However, as the maximum read percentages for false-positive species in the mock mixtures noticeably exceeded 0.16% or 0.04%, respectively, additional sources for unspecific reads besides false index sequences or cross-contaminations must be considered. It was observed that many false-positive reads with higher percentages were assigned to species closely related to those actually present in the samples, like urial (Ovis vignei, e.g. 0.37% in OTU workflow) and argali (Ovis ammon, 0.26% in OTU workflow) in samples containing sheep, or sambar deer (Rusa unicolor, e.g. 0.30% in OTU workflow) and Eld’s deer (Rucervus eldii, 0.12% in OTU workflow) in samples containing deer species. This suggests that single nucleotide sequence changes caused by the amplification or sequencing steps or incorrect sequences in the NCBI nucleotide database could be the cause for these non-specific reads. Unexpectedly, these related but non-specific species were also identified using the dada2 workflow to some extent, which has an implemented denoising step to derive error rates from the sequences and remove incorrect sequences from the dataset. Thus, these particular sequences could therefore also represent natural variations in the genomes of the animals present in the samples. Moreover, the probable sequencing of a pseudogene in cattle (see Sect. 3.4) was another reason for false-specific sequences in cattle containing samples. The same pseudogene appears to be present in wild yak, which was identified as first BLAST hit but also with the same sequence identity (100%) in cattle chromosomal DNA (see above). Therefore, the use of a customized database with only reliable mitogenome sequences, as described in Dobrovolny et al. (2019) and Preckel et al. (2021), is preferable, provided that it contains all species potentially present in meat products with sufficient sequence variation. In addition, the bioinformatic workflows should be adapted to consider not only the first BLAST hit but all hits within an appropriate identity range, so that species with identical sequences are reported only on genus level. Otherwise, the first hit could be wrongly interpreted as an undeclared ingredient, while the declared species would not be found. An appropriate bioinformatics pipeline, ‘FooDMe’, for the analysis of metabarcoding of food and feed samples can be downloaded from Zenodo (Denay 2022).

4.2 Robustness of the bioinformatic data processing

Although the three bioinformatic workflows pursued different strategies, the results were almost always identical. Thus, the data evaluation seems to be quite robust and different algorithms can be applied. Even the denoising step, the core of the dada2 tool suite, also produced almost no different results and could not remove the erroneous sequences that lead to the identification of small percentages of related species such as urial or sambar deer. However, as the dereplication and the OTU clustering workflow produced considerably more sequences per sample, these workflows are also much more demanding than the dada2 workflow in terms of computing power and computing time. With regard to this, the dada2 workflow seems more favourable, especially for the processing of large sequencing datasets.

4.3 Comparability of the metabarcoding approach with standard analytical methods

For most of the samples and especially for samples with less complex species compositions, the 16S rDNA metabarcoding approach identified exactly the same species. However, the main advantage of this metabarcoding approach compared to the species-specific methods such as for example real-time PCR, DNA chip, or ELISA is certainly the discovery of unexpected species. Although the DNA chips extensively used in this study cover a fairly wide range of target species, they could not detect elk, sika deer, Siberian roe, chamois or common wallaroo in contrast to the NGS method. Moreover, a combination of Sanger sequencing and multiplex real-time PCR was not able to identify an unexpected admixture of red deer in a kangaroo meat product. On the other hand, fallow deer cannot be identified by metabarcoding using the described primers from Dobrovolny et al. (2019). However, an additional reverse primer for the detection of fallow deer is described in the method specification for the detection of mammals and birds by DNA metabarcoding, which will be published by the Federal Office of Consumer Protection and Food Safety (BVL, Germany) through an update of the Official Collection of Methods of Analysis in 2023. As shown in this study and also by others (e.g. Preckel et al. 2021), the read percentages do not necessarily reflect the actual species shares of the samples accurately. It should therefore be borne in mind that the metabarcoding method is more of a screening method for compositional analysis in terms of species in complex food samples, but not for quantifying the exact proportion of the particular species (Defra 2020). As a screening method, it is a useful alternative to the DNA MEAT LCD-Arrays (Chipron, Berlin, Germany) that were used by many laboratories in this study but have recently (in May 2022) been withdrawn from the market (according to a customer letter from manufacturer).

For best results, samples should be screened using 16S rDNA metabarcoding and results confirmed with orthogonal species-specific methods, especially when quantitative results are required.

5 Conclusion

This study showed the successful cooperation of the competent German food control authorities with the German NRZ-Authent within the framework of operation OPSON X in the application and validation of a published 16S rDNA metabarcoding method. The NGS method proved to be a suitable method for the identification of mammal and poultry species, especially for food samples with complex species compositions, and was in some cases able to identify particular species that had been overlooked when using standard analytical methods like DNA chips, Sanger sequencing, or multiplex real-time PCR. As expected, fallow deer was the only species that could not be detected with the applied NGS method. False-positive hits which occurred with small read percentages were likely caused by indexing errors, introduced by PCR or sequencing error, or reflect natural variability in animal’s mitochondrial genome. Consequently, an appropriate read percentage threshold must be applied, but this also means that small admixtures of species can lead to read percentages that are below the threshold and thus give false-negative results. The bioinformatic data evaluation was shown to be very robust, as different strategies and algorithms led to nearly identical results. The dada2 workflow may be favourable when analysing large datasets as it is less demanding in computing capacities due to the output of significantly fewer sequences per sample to be matched with the database. In order to exclude pseudogenes or other non-meaningful hits from the results, the use of an audited sequence reference database (Neto et al. 2021) containing only validated mitogenome or 16S rDNA sequences is recommendable. Furthermore, it would make sense to include more BLAST hits than only the first for a valid identification of the species (or genus), as sometimes, two or more species share the same 16S rDNA sequences and a specification of the genus may be more valid.