Principal considerations
The suitability of the qualitative method was successfully evaluated in a preliminary ring trial with 10 participating laboratories (A–J) analyzing seven randomized sausages (3 or 4 samples without TG, and 4 or 3 samples with 1% TG, each). For the production of the three different test materials included in the main validation study, materials 1 and 2 were produced as emulsion-type sausages due to their high homogeneity. Material 3 was produced as raw hamburger patties, representing a compromise between the homogeneity of the sample material and the matrix similarity to raw restructured meat. A technical TG mixture with maltodextrin as the sole further ingredient (and no further protein additives such as caseinate or pork protein) was selected for the preparation of the test materials to ensure the highest possible matrix similarity of samples with and without TG. In total, 16 test samples (Table 1) with and without TG and one matrix sample (1% TG; for optimization) were produced for each of the 13 participating laboratories, whereas the sample numbers of the test samples were randomized. The concentration of technical TG added ranged between 0.2%, a concentration which has proven to be too low for meat binding (Jira and Schwägele 2017), and 2%, a concentration twice as high as recommended by the manufacturer. The samples with a TG concentration just sufficient for meat binding (0.5%) were used as positive control samples. This was necessary, since a relevant number of laboratories participating in this study had no or only limited experience in targeted proteomics. Therefore, minimal quality criteria for the overall analytical performance were announced to identify and exclude systematic analytical errors that are not method based. A determination of the protein content in technical TG (no additional protein ingredients; determination of the protein content in technical TG section) revealed protein contents of 0.67%, confirming the TG concentrations in this product found by Kütemeyer (2007). Therefore, the concentration of TG (pure enzyme) in the test materials ranged from 0.0013 to 0.0134% (Table 1). The technical TG was deactivated by cooking (Zhang et al. 2012) before the production of the test materials to avoid that the participants could identify the presence or absence of TG in the test samples based on different sample consistencies. From each matrix, a sample with 1% TG should be analyzed in duplicate determination.
The original published method (Jira and Schwägele 2017) identified six marker peptides for the detection of the predominantly distributed type of microbial TG in the EU (Kanaji et al. 1993; Kashiwagi et al. 2002). Further investigations into other commercially available TGs, in part from other strains of S. mobaraensis (e.g., NCBI accession AAV31068.1), revealed that only two of the marker peptides were common for all available TGs. Therefore, only the marker peptides VTPPAEPLDR (TG-1) and SPFYSALR (TG-2) were used in the inter-laboratory validation study.
In the schematic procedure of the analysis of samples (Table 2), blank samples were measured between each matrix sample to investigate a possible carry-over of marker peptides due to the fact that some peptides show a non-specific adsorption that can occur at every part of the HPLC–MS/MS system (Maes et al. 2014). However, in the inter-laboratory validation study, only one laboratory detected traces of TG-2 in blank samples demonstrating that both peptides do not show a relevant carry-over.
The detection of TG in a meat sample was considered to be positive, if (a) at least three mass transitions for TG-1 and TG-2, each, showed a signal-to-noise ratio (SNR) ≥ 3, (b) the results of the negative control samples were negative, and (c) the results of the positive control samples (sample with 0.5% TG) were positive.
Identification of the Most Intense Mass Transitions for Each Marker Peptide
According to the SOP, each participating laboratory had to identify the seven most intense theoretically explainable mass transitions for the two marker peptides, each, by direct infusion of the peptide standard solutions (synthetic peptide standards and test material for optimization section and identification of the most intense mass transitions and optimization of the MS/MS parameters section). However, three participating laboratories determined only a lower number of mass transitions (laboratory G: 3; laboratory H: 5; laboratory K: 4 for TG-1 and 3 for TG-2). For each of the marker peptides TG-1 and TG-2, three mass transitions were identified in all thirteen participating laboratories (see Table 5). These most prevalent fragment ions (TG-1: m/z 447.7 (y82+), 500.3 (y4), 797.4 (y7); TG-2: m/z 446.3 (y4), 609.3 (y5), 756.4 (y6)) were all y-ions with m/z > 400. They were determined among the most intense mass transitions also in the laboratories I and K using argon (instead of nitrogen) as collision gas. Furthermore, four mass transitions, each, were identified in at least seven laboratories. The most prevalent fragment ions were usually among the most intense fragment ions.
Table 5 Most abundant fragment ions of the TG marker peptides determined by the participating laboratories. Gray marked cells: mass transitions identified in all laboratories After MS/MS parameter optimization, the meat samples (Table 1) were analyzed for all previously identified mass transitions according to the SOP. Three samples (P03, P09, and P14) were further evaluated to determine whether the most abundant fragment ions also showed the highest SNR. Overall, the most abundant fragment ions also showed high SNR values in the sample materials.
Intra-laboratory Deviations of the Retention Times of the Marker Peptides
Deviations in retention times of the matrix-adjusted positive control samples between individual laboratories were analyzed to examine the general robustness and whether additional quality controls can be applied. For this purpose, the deviations of retention times between the (up to) three matrices were calculated for each laboratory and both marker peptides. The distributions of these deviations are displayed by means of kernel density estimation (KDE) in Fig. 1. Based on robust statistics (Q/Hampel), the 95% tolerance intervals for the respective deviations were calculated (vertical red lines in Fig. 1). For both marker peptides, the tolerance limits were frequently exceeding ± 0.1 min, but were always within ± 0.2 min. It can also be stated that the deviations of retention times for marker peptide TG-1, which elutes earlier and therefore has less interactions with the stationary phase, vary more than for marker peptide TG-2. This tentatively can also be ascribed to variations of the DMSO volume which was added to the sample after SPE before solvent removal and which is more pronounced for the less interacting peptide. The results obtained led to the quality requirement in the official method (Official Collection of Analysis Methods 2021) specification that the retention times of TG-1 and TG-2 in a sample must not deviate more than ± 0.2 min from the retention times of the marker peptides in the matrix-adjusted positive control sample. Considering also the variety of applied HPLC instruments and columns (see Table 3), this confirms the stability of the retention times of the marker peptides.
Statistical Analysis of the Detection of TG
The data basis for the statistical analysis of the detection of TG is given in Table 6. Corresponding ROD values and false-negative rates across laboratories are displayed in Table 7.
Table 6 Sample-specific results of laboratories; + = TG detected, –1 = TG-1 negative, –2 = TG-2 negative, –12 = TG-1 and TG-2 negative; gray marked cells = excluded from statistical analysis Table 7 Summary of the results across laboratories separately for each TG concentration Applying the identification criterion that results are only to be considered as verified, if TG was detected in the 0.5% TG sample (positive control sample), the complete results of laboratories E, I, and M as well as the results for hamburger patties (material 3: P13–P16) of laboratory J must be excluded from the calculations of the false-positive and the false-negative rates and for the POD curve. The laboratories I and J had no experience in targeted proteomics. Laboratory I reported about a low sensitivity of the mass spectrometer and laboratory M had problems with the SPE cleanup and assumed as a consequence a noticeable suppression of the ESI ionization. The incorrect detection of TG in the positive control samples of laboratory E (despite having experience in targeted proteomics and having no problems in the preliminary ring trial) referred exclusively to the missing detection of TG-2. Laboratory J did not detect both TG marker peptides in any samples of the hamburger patties. Therefore, a systematic error during sample preparation of this matrix is probable, which could not be further specified. Nevertheless, even after exclusion of the 3 or 4 laboratories mentioned above, the remaining number of laboratories was still sufficient to fulfil the requirements for collaborative study procedures to validate an analytical method (Appendix D of AOAC Official Methods of Analysis 2005; ISO 2020).
The false-positive rate was calculated based on the three blank samples P01, P07, and P12 (Table 1). Regardless of the sample matrix, there was no misclassification, so that a false-positive rate of 0% could be achieved. The false-negative rate was calculated based on the samples containing a TG concentration of at least 0.5%. In total, two false-negative results were reported: one for material 2 and one for material 3 which both contain 1% TG. This resulted in an overall false-negative rate of 2.0% and in false-negative rates of the specific TG concentrations as shown in Table 7.
Probability of detection and LOD
95%
Since the conventional statistical approach for the validation of quantitative methods (according to ISO 2019) is not feasible for the validation of a binary qualitative method, the statistical analysis was based on the POD approach, which is currently being discussed to become an international validation standard (ISO 2021).
The probability of detection (POD) across laboratories as a function of the TG concentration could be modeled independently of the matrix, since no significant effects between the three matrices in terms of detection probability were detected. The resulting POD curve across laboratories with associated 95% confidence interval as well as the 95% prediction range and the laboratory-specific ROD values are shown in Fig. 2.
The evaluation range of the POD curve is 0.2–2.0% TG. Outside this range, extrapolation is necessary, so that the value of 0.03% TG for the LOD50% (upper confidence limit 0.12% TG) can only be regarded as an estimation value. The LOD95% for the median laboratory is 0.31% TG (upper confidence limit 1.16% TG). However, it should be noted that the qualitative method in most cases yields a positive result at all investigated concentration levels (above 0%) provided that laboratories with systematic problems are disregarded. The determination of method performance parameters is therefore difficult and associated with considerable uncertainties.
For the laboratory standard deviation \({\sigma }_{L}\), a high value of 0.81 (upper confidence limit 3.26) is obtained, which corresponds to a relative laboratory standard deviation of 81% according to the standards of a quantitative method (ISO 2019). There are several reasons: (1) for laboratory B, the laboratory-specific LOD95% is outside the measured range; and (2) the determination of the threshold for the detection of TG is not fully standardized and both systematic and random differences between laboratories are to be expected. Therefore, for the practical implementation of the method, it is recommended to regularly use suitable control samples with a TG concentration of 0.5% as a positive control. In addition, upon introduction of the method into a laboratory, it is recommended to perform a verification study in which six independent measurements of a sample containing 0.5% TG are performed. If all six results of this verification are positive, it can be concluded that the laboratory is capable of determining a level of 1% TG with a probability of at least 0.8.
Robustness of the Method
In some laboratories, sample processing and protein extraction deviated slightly from the SOP. Three laboratories used blending devices with lower maximal processing speeds (24,000 rpm at D, 15,000 rpm at F, 9500 rpm at E) than the 25,000 rpm specified in the method. Laboratories E, F, and M did not use a ball mill, whereas laboratories C and K only used a ball mill for some samples. Laboratory H used a mortar and pestle on the defatted and dehydrated samples. Despite the differences in sample preparation, each laboratory obtained a fine powder. Laboratory D recovered the finished protein extracts with centrifugation at 5000 rpm instead of 8000 rpm. In addition to the deviations from the SOP, the method itself provided the laboratories with some liberties in processing steps and especially in the analysis. Therefore, the tryptic digestion was conducted using trypsin from five different manufacturers and for SPE, two different systems were used. In terms of analysis, the laboratories could use any suitable HPLC–MS/MS system after optimization. Despite all these variations, no negative impact on the overall characterization of the samples was observed in the validation study.
Comparison of the Results of Triple-Quadrupole (MRM) and Quadrupole Time-of-Flight Instruments (Pseudo-MRM)
An important aspect for the transfer of mass spectrometric methods to routine applicability is their platform independence. Although quantitative analyses in routine applications are still mainly established on triple-quadrupole instruments, the benefits of high-resolution MS (HRMS) are increasingly exploited (Domon and Gallien 2015; Faktor et al. 2017; Higgs et al. 2013). It is important to note that the mechanisms of targeted approaches are different on triple-quadrupole and quadrupole time-of-flight (Q-ToF) instruments. Triple-quadrupole instruments are mainly run in multiple reaction monitoring (MRM) mode. In Q-ToF HRMS, pseudo-MRM transitions (also referred to as MRM-HR or PRM (parallel reaction monitoring)) are generated by the instrument as it cycles through predefined sets of precursor ions and collects full scan fragment ion spectra of each precursor. MRM transitions are reconstructed by software and are rather extracted ion chromatograms in high resolution. The major advantages are enhanced specificity due to high-resolution and the availability of full scan fragment spectra.
To demonstrate the platform independence of the developed method, the pseudo-MRM approach using two Q-ToF HRMS instruments (Sciex TT4600 and Sciex TT6600, respectively) was included in the inter-laboratory study for the detection of TG. All samples in the analyzed concentration range from 0.2 to 2% TG were classified correctly in both laboratories using pseudo-MRM. Furthermore, it was shown that the SNRs for TG-1 and TG-2 were comparable for the triple-quadrupole and Q-ToF instruments in all analyzed matrices (TG concentration 0.5%). These convincing results are also reflected in the fact that the LOD95% for the hybrid MS and the triple-quadrupoles were at least comparably good. Both laboratories that employed Q-ToF instruments in the inter-laboratory study achieved ROD values of 1 (Fig. 2). Overall, these results demonstrate that the Q-ToF mass spectrometers used in the inter-laboratory comparison proved to be completely suitable for the method application.