Introduction

The aim of nuclear safeguards is the verification of the non-diversion of fissile material from its intended and declared use in line with the Treaty on the Non-Proliferation of Nuclear Weapons (NPT) [1] and the EURATOM Treaty [2]. To achieve this goal, a reliable nuclear material accountancy system has to be established by the plant operator and a reliable verification system by the international or regional safeguards authority. Safeguarding nuclear materials involves a quantitative verification of fissile materials by independent measurements. The effectiveness of these verifications depends to a great extent upon the comparison of the measurements achieved by both the facility operator and the safeguards inspectorate. Measurement results accompanied with measurement uncertainty estimations and a traceability statement are best suited for making safeguards evaluations [3]. Measurements of amount content and isotope ratios of uranium and plutonium in samples taken from proliferation-sensitive stages of the nuclear fuel cycle such as enrichment and reprocessing are of great importance [4]. Laboratories carrying out such measurements need to comply with rigorous quality goals and demonstrate their measurement capabilities. One possibility for a laboratory to demonstrate technical competence in line with ISO/IEC 17025:2005 is the successful participation in an interlaboratory comparison (ILC) [5].

The EC—Joint Research Centre—Institute for Reference Materials and Measurements (JRC-IRMM) is an accredited provider of interlaboratory comparisons according to ISO/IEC 17043:2010 [6]. The Regular European Interlaboratory Measurement Evaluation Programme (REIMEP) was established at the JRC-IRMM in 1982 for carrying out external control of the quality of the measurements of nuclear materials that match samples routinely analysed in the nuclear fuel cycle. Previous REIMEP campaigns included samples such as uranium oxide, uranium in nitric acid, uranium in the form of UF6, plutonium oxide [7]. The focus in REIMEP-17 was on the plutonium and uranium amount content, and isotope amount ratios in synthetic dissolved spent nuclear fuel solutions. It was jointly organized by JRC-IRMM and EC—Joint Research Centre—Institute for Transuranium Elements (JRC-ITU) for the EURATOM safeguards laboratories, the IAEA Network of Analytical Laboratories for nuclear material analysis (NWAL), laboratories from industry and experts in the field. Two test samples with different concentrations of plutonium and uranium were prepared to suit laboratories with different objectives. One of the test samples, REIMEP-17A had the uranium and plutonium concentration typical for an undiluted spent nuclear fuel solution, and the other test sample, REIMEP-17B was a diluted fraction of it. A dilution was used to prepare the REIMEP-17B sample purely for practical reasons, resulting in the same isotopic composition for both test samples. The measurands in REIMEP-17 were plutonium and uranium amount contents and n(238Pu)/n(239Pu), n(240Pu)/n(239Pu), n(241Pu)/n(239Pu), n(242Pu)/n(239Pu) and n(234U)/n(238U), n(235U)/n(238U), n(236U)/n(238U) amount ratios (mol mol−1). The participants were requested to apply their standard analytical procedures and report the measurement results with associated measurement uncertainties. The results were evaluated against the independent assigned values and international quality goals by means of z scores and zeta scores in compliance with ISO 13528:2005 [8] and with ISO 17043:2010 [6]. The quality goals for nuclear material analysis are defined in the document International Target Values for Measurement Uncertainties in Safeguarding Nuclear Materials (ITV2010) [9]. They represent the estimates of the “state of the practice” which should be achievable under routine measurement conditions in a typical industrial laboratory or during actual safeguards inspections [9]. The list of participating laboratories in REIMEP-17 is presented in Table 1.

Table 1 Laboratories participating in REIMEP-17 interlaboratory comparison

Test samples

Participants received two test samples, REIMEP-17A and REIMEP-17B, with different uranium and plutonium amount contents. The REIMEP-17A sample solution was prepared by dissolution of a sample of an unirradiated mixed oxide fuel in nitric acid and the addition of natural uranium, aiming at the mass fraction of uranium and plutonium of about 200 mg g−1 and 2 mg g−1, respectively. Other impurities (e.g. fission products) were not added for transport purposes. The REIMEP-17B sample solution was prepared by a 400-fold dilution of REIMEP-17A resulting in a mass fraction of uranium and plutonium of about 500 mg g−1 and 5 μg g−1, respectively. The isotope ratios of uranium and plutonium were in the range of depleted uranium [0.67 g/(100 g) 235U] and high burn-up plutonium. The solutions were dispensed into glass ampoules and laser-sealed. Each ampoule contained about 6 mL of solution. The preparation of the test samples and shipment to the participants were carried out by JRC-ITU.

The assigned (reference) values for the plutonium and uranium amount content and isotope amount ratios were established at JRC-IRMM in combination with homogeneity and stability assessment by isotope dilution mass spectrometry (IDMS) and thermal ionization mass spectrometry (TIMS) on randomly selected ampoules. Five ampoules of each fraction (REIMEP-17A and REIMEP-17B) were analysed in three replicate measurements. Isotope ratio measurements were performed by total evaporation (TE) on a Triton TIMS (Thermo Fisher Scientific, Bremen) [1012] after chemical purification by anion exchange. Details on the preparation and characterization of the test material can be found in the REIMEP-17 report [13]. The assigned values for REIMEP-17A and REIMEP-17B are presented in Table 2.

Table 2 REIMEP-17: plutonium and uranium amount content and isotope amount ratios in synthetic dissolved spent nuclear fuel solution assigned values

Evaluation of laboratory performance in REIMEP-17

All REIMEP-17 participant results were evaluated against the assigned values by means of z scores and zeta scores in compliance with ISO 13528:2005 [8]. The ITV2010 target values were used as a standard deviation (\(\hat{\sigma }\)) for proficiency assessment to score the participant results.

$$z = \frac{{x_{\text{lab}} - X_{\text{ref}} }}{{\hat{\sigma }}}$$
(1)
$$zeta = \frac{{x_{\text{lab}} - X_{\text{ref}} }}{{\sqrt {u_{\text{lab}}^{2} + u_{\text{ref}}^{2} } }}$$
(2)

where

x lab :

is the measurement result reported by a participant,

X ref :

is the assigned value,

u ref :

is the standard measurement uncertainty of the assigned value,

u lab :

is the standard measurement uncertainty reported by a participant, and

\(\hat{\sigma }\) :

is the standard deviation for proficiency assessment

z scores can be interpreted as follows:

  • satisfactory performance for |z| ≤ 2,

  • questionable performance for 2 < |z| ≤ 3

  • unsatisfactory performance for |z| > 3. An identical ranking applies to zeta scores

Measurement performance evaluation in REIMEP-17 was done according to the purpose of the measurement and the possible use of the result. The ITV2010 values are expressed as relative combined standard uncertainties; therefore, in addition to scoring as recommended in ISO 13528:2005, a performance assessment criterion for a minimum and maximum acceptable measurement uncertainty was applied to complete satisfactory performance that take reported measurement uncertainties into account.

The range of acceptable standard measurement uncertainty reported by a participant with a satisfactory performance expressed by zeta score has been evaluated as such:

For all |zeta| ≤ 2, it is evaluated whether u min < u lab ≤ u max, where u min = 0, u lab = u lab;rel and the respective ITV2010 values serve as u max [9].

As mentioned earlier, the respective ITV2010 values were used as the standard deviation for proficiency assessment in REIMEP-17. These depend on the instrumental/analytical technique and the type of spike and conditions applied in the analysis [9]. Table 3 summarizes the ITV2010 values applied in the evaluation of REIMEP-17 results. According to the IUPAC International Harmonised Protocol, participants can apply their own scoring settings and recalculate the scores if the purpose of their measurements is different [15].

Table 3 The ITV2010 values expressed as combined relative standard uncertainties

Participant results

The laboratory results were coded A through J. There is no correlation between the laboratory code and the order of the laboratories presented in Table 1. One laboratory received two sets of test samples and submitted results for both sets. Therefore, 10 codes were assigned for a total of 9 laboratories. As shown in Figs. 1 and 2, the individual results are presented in graphs as a relative deviation from the assigned reference value. The results from the stability assessment (with expanded uncertainty, k = 2) are shown together with the participants’ results. The stability measurements were carried out by JRC-IRMM about a year after the production of the test samples to demonstrate the stability of the sample solution during the interlaboratory comparison. One ampoule of each fraction, REIMEP-17A and REIMEP-17B, was analysed in three replicate measurements. All stability measurements confirmed the REIMEP-17 assigned values.

Fig. 1
figure 1

Participant results of the U amount content, v(U), in REIMEP-17A with measurement uncertainties (error bars) as reported by the participants (laboratories A and I reported k = 1; all other results and the stability value were reported with expanded uncertainties, k = 2). The grey band represents the assigned value with expanded uncertainty (k = 2). The dashed line represents the respective ITV2010 value. The dashed squares indicate the laboratories applying ISO GUM [14] methodology for the evaluation of the measurement uncertainty

Fig. 2
figure 2

The z and zeta scores for the U amount content in REIMEP-17A. The dashed and dotted lines indicate z or zeta = 2 and z or zeta = 3, respectively

Isotope dilution in combination with thermal ionization mass spectrometry (ID-TIMS) or inductively coupled-plasma mass spectrometry (ID-ICP-MS) was utilized by the participants for the determination of the amount content. One laboratory used X-ray fluorescence (XRF) for determination of the amount content in the REIMEP-17B sample. All participants performed a chemical separation prior to the isotope ratio measurements. The results for the uranium amount content in REIMEP-17A are shown in Fig. 1.

Depending on the analytical approach chosen by laboratories, two different ITV2010 criteria can be applied for the calculation of z scores and zeta scores for the amount content (see Table 3). However, in this paper, the results evaluated with the ITV2010 criterion 0.28 % are presented because the majority of laboratories analysed the REIMEP-17 samples under conditions appropriate to this value. A detailed evaluation of the participants’ results can be found in the REIMEP-17 report [13]. The z and zeta scores for the uranium amount content in REIMEP-17A are shown in Fig. 2.

Good performance was observed among the participants for the determination of the U amount content in REIMEP-17A. Laboratories A, B, C, F, I and J achieved satisfactory performance in terms of z and zeta scores as shown in Fig. 2. Laboratory D reported a result exceeding the ITV2010 value and achieved unsatisfactory performance. Figures 1 and 2 show that the laboratories G and H underestimated their measurement uncertainty. This led to unsatisfactory performance in terms of zeta scores for both laboratories. A possible explanation for this could be in an incomplete uncertainty estimation, where the laboratory reported an estimate for the measurement precision under a set of repeatability conditions instead of a more comprehensive measurement uncertainty estimation [16]. Uncertainties reported by the participants were also evaluated to see whether they were within the acceptable uncertainty range for the respective measurands. The maximum acceptable measurement uncertainty for the uranium amount content expressed as a relative standard deviation was 0.28 % (ITV2010). All laboratories with satisfactory performance (zeta scores) reported their relative measurement uncertainty below this threshold except laboratory I, which drastically overestimated its measurement uncertainty.

Three laboratories did not submit results for REIMEP-17B. Among the remaining laboratories, good performance for the uranium amount content was observed. Only one laboratory achieved unsatisfactory performance (z and zeta scores). One laboratory reported the “less than” value [v(U) < 2.5 µmol g−1]; however, no scores could be calculated for this laboratory. Similar to the measurements on the REIMEP-17A samples, in some cases, the reported uncertainties appeared to be somewhat overestimated.

The overall performance for the plutonium amount content in REIMEP-17 was good; however, fewer laboratories achieved satisfactory performance compared to the the measurements of uranium. For REIMEP-17A, four laboratories achieved satisfactory performance in terms of z and zeta scores, and among them, three reported their measurement uncertainties within the maximum acceptable range. For REIMEP-17B, only two laboratories achieved satisfactory performance. The results for the plutonium amount content in REIMEP-17B are shown in Fig. 3.

Fig. 3
figure 3

Participant results of the Pu amount content v(Pu) in REIMEP-17B with measurement uncertainties (error bars) as reported by the participants (laboratory I reported k = 1; all other results and the stability value were reported with expanded uncertainty, k = 2). The grey band represents the assigned value with the expanded uncertainty (k = 2). The dashed line represents the respective ITV2010 value. The dashed squares indicate the laboratories applying ISO GUM [14] methodology for the evaluation of the measurement uncertainty

The result by laboratory I included a significant bias: the reported value was by two orders of magnitude higher than the assigned value. This could be due to a contamination problem or a calculation error. Laboratories G and H also reported the results outside the ITV2010 criterion; therefore, in all three cases, unsatisfactory performance in terms of z scores was achieved. Laboratory E reported the “less than” value [v(Pu) < 0.2 µmol g−1]. Laboratories A, F and J did not submit their results because the amounts of uranium and plutonium in REIMEP-17B were too low to be treated as a routine nuclear material sample in their laboratories. Some inconsistency in the estimation of measurement uncertainty was observed for the laboratories G and H. Reported uncertainties appeared to be overestimated for the REIMEP-17B, but underestimated for the REIMEP-17A test sample for the same measurand.

For the measurement of the amount ratios, participants applied either TIMS or ICP-MS with five laboratories utilizing the total evaporation technique. For the measurement of the n(238Pu)/n(239Pu) ratio, five laboratories applied alpha spectrometry. The results for the n(235U)/n(238U) ratio in REIMEP-17A are shown in Fig. 4.

Fig. 4
figure 4

Participant results of the amount ratio n(235U)/n(238U) in REIMEP-17A with measurement uncertainties (error bars) as reported by the participants (laboratory A reported k = 1; all other results and the stability value were reported with expanded uncertainty, k = 2). The grey band represents the assigned value with the expanded uncertainty (k = 2). The dashed line represents the respective ITV2010 value. The dotted circles indicate the laboratories using the total evaporation technique

Very good performance was observed for the n(235U)/n(238U) amount ratio. From Fig. 4 it can be taken that the laboratory G showed a bias of about 0.8 % and therefore obtained unsatisfactory performance in terms of z score. Laboratory G also showed a bias in the n(234U)/n(238U) amount ratio in the same test sample. Contamination with natural uranium could explain this deviation, or it could also be a result of an incorrect mass fractionation correction due to the use of a non-TE method. The laboratories D and H overestimated their measurement uncertainties. On the other hand, excellent performance was observed for the laboratories using TIMS in combination with the total evaporation technique.

Excellent performance was also observed for the n(235U)/n(238U) amount ratio in REIMEP-17B, where all laboratories that reported the results except one achieved satisfactory performance (zeta and z scores). As already observed with REIMEP-17A, the laboratory D overestimated its measurement uncertainty. Two laboratories did not submit their results.

The majority of the participants also reported the results for the minor uranium ratios. As there are no ITV2010 values defined for n(234U)/n(238U) and n(236U)/n(238U), only zeta scores were provided to the participants. Overall, good performance was observed for the n(234U)/n(238U) amount ratio. Satisfactory performance in terms of zeta scores was achieved by 60 % of the participants in REIMEP-17A and 88 % in REIMEP-17B, respectively. On the other hand, very few laboratories obtained satisfactory performance for the amount ratio n(236U)/n(238U). Some laboratories either reported only an upper value or did not submit the result for this ratio at all.

Excellent performance was observed for the amount ratio n(240Pu)/n(239Pu) in REIMEP-17. As an example, the results for the amount ratio n(240Pu)/n(239Pu) in REIMEP-17B are shown in Fig. 5.

Fig. 5
figure 5

Participant results of the amount ratio n(240Pu)/n(239Pu) in REIMEP-17B with measurement uncertainties (error bars) as reported by the participants (all results and the stability value were reported with expanded uncertainty, k = 2). The grey band represents the assigned value with the expanded uncertainty (k = 2). The dashed line represents the respective ITV2010 value. The dotted circles indicate the laboratories using the total evaporation technique

All the laboratories except laboratory B achieved satisfactory performance. From Fig. 5 it is obvious that laboratories C and D overestimated their measurement uncertainty. With REIMEP-17A, all the laboratories achieved satisfactory performance for the n(240Pu)/n(239Pu) amount ratio. Moreover, an excellent performance was observed for other plutonium amount ratios in REIMEP-17A and for the n(242Pu)/n(239Pu) amount ratio in REIMEP-17B. Regarding the amount ratio n(238Pu)/n(239Pu), in some cases the results exceeded the ITV2010 values [13].

In REIMEP campaigns, the participant measurement results are evaluated against independent reference values. In addition in this paper, we also applied the robust statistics as a comparison. The determination of the robust statistics allows the assigned values to be compared with the mean results reported by the participants (so-called the consensus value). This approach is usually applied when it is suspected that the population of the results includes a few extreme outliers. The robust mean (X rob) and the robust standard deviation (s rob) of the participants’ results are calculated by the Algorithm A specified in standards ISO 5725-5 [17] and ISO 13528:2005 [8]. The compatibility [16] between the robust mean and the assigned values is represented by Δ/u d, calculated according to the following equations:

$$\varDelta = X_{\text{rob}} - X_{\text{ref}}$$
(3)
$$u_{\text{d}} = \sqrt {u_{\text{rob}}^{2} + u_{\text{ref}}^{2} }$$
(4)

where

X rob :

is the robust mean of the participants’ results,

X ref :

is the assigned value,

u ref :

is the standard measurement uncertainty of the assigned value, and

u rob :

is the standard measurement uncertainty of the robust mean which is calculated according to the following equation:

$$u_{\text{rob}} = \frac{{1.25 s_{\text{rob}} }}{\sqrt p }$$
(5)

where

s rob :

is the robust standard deviation of the participants calculated using Algorithm A and

p :

is the number of reported results

The outcome of the robust statistics for the U amount content and the n(235U)/n(238U) amount ratio in REIMEP-17A and for the Pu amount content and the n(240Pu)/n(239Pu) amount ratio in REIMEP-17B is shown in Table 4.

Table 4 Results of the robust statistics for REIMEP-17A and REIMEP-17B

It can be concluded that the robust means and the assigned values are compatible (|Δ/u d| ≤ 2) for the above-mentioned measurands, except for the U amount content in REIMEP-17A. Nevertheless, the assigned value of the U amount content in REIMEP-17A mother solution was verified by IDMS at JRC-ITU (|Δ/u d| = 0.006).

Discussion

In REIMEP-17, two test samples with different concentration of uranium and plutonium were prepared to accommodate laboratories with different objectives. One test sample was representative of undiluted dissolved spent nuclear fuel solution; the other test sample was a diluted fraction of it. The mission of most of the participating laboratories was to carry out measurement for fissile material control or safeguards. Some laboratories were also involved in research and development or medical application.

It can be concluded that the participants performed well for the measurements of uranium and plutonium amount content in compliance with the respective values of ITV2010, except for the plutonium amount content in the REIMEP-17B sample where the performance was less satisfactory. In particular, the measurement performance for the plutonium isotope amount ratios and the uranium major ratio was very good for both REIMEP-17 samples. Excellent performance was observed among the laboratories using the TE method. Some larger spread of the results was observed for the minor uranium amount ratios.

With respect to measurement uncertainties, all the laboratories reported measurement uncertainties for the REIMEP-17A sample, and 60 % of the laboratories followed the ISO GUM methodology [14]. The measurement uncertainty of these laboratories were comparable and, in most cases, within the maximum acceptable range for the respective measurand. In one case, ITV values were reported as measurement uncertainties. Two laboratories stated that they do not routinely report measurement uncertainties. Some differences in the measurement uncertainty estimations were observed even for the same instrumental technique. The laboratories which utilized the TE method had comparable estimates of measurement uncertainties, whereas the laboratories using a non-TE (e.g. classical method) in many cases either overestimated or underestimated their measurement uncertainties. Laboratories applying alpha spectrometry demonstrated that this technique, within its larger uncertainties compared to TIMS, provides reliable results for the measurement of the n(238Pu)/n(239Pu) isotope amount ratio. A tendency was noticed towards reporting uncertainties overestimated for the REIMEP-17B and underestimated for the REIMEP-17A sample for the same measurand. This might be due to the fact that in some cases, measurement precision has been reported instead of a combined measurement uncertainty. At this point, it must be recalled that the main mission of the participating laboratories was to carry out measurements for fissile material control or safeguards, whereas a sample such as REIMEP-17B with a mass fraction of uranium and plutonium of about 500 µg g−1 and 5 μg g−1, respectively, might not be the type of sample usually analysed in these laboratories. Therefore, the routine analytical procedures of some of the participants in REIMEP-17 were probably not optimized for measuring samples with lower uranium and plutonium concentrations. All laboratories reported the use of certified reference materials for instrument calibration or method validation.

Conclusion

Nuclear safeguards in the European Union has the rank of European law (Euratom treaty, Chapter VII, Euratom regulation 302/2005). A prerequisite for any analytical method used to draw conclusions in nuclear safeguards is to deliver accurate measurement results. Measurement results for nuclear material accountancy and safeguards verification purposes have to be fit for intended use and within the required measurement uncertainties of the ITV2010 [9]. In recent ESARDA workshops and IAEA technical meetings, recommendations were given on quality control to ensure confidence in the measurement results, with emphasis on traceability of measurement results and reliable measurement uncertainty estimations under routine analytical conditions [3, 4]. The ITV2010 provides the most suitable fit-for-purpose quality goals to assess the measurement capability of EURATOM safeguards laboratories, the IAEA Network of Analytical Laboratories (NWALs) for nuclear material analysis and operators’ measurement systems. REIMEP-17 interlaboratory comparison demonstrated the measurement capabilities of the laboratories in the field of nuclear material analysis and at the same time served as a confirmation that the ITV2010 values are achievable and fit-for-purpose target values under state-of-practice conditions. Because of possible interferences from fission products in the chemical purification stages and also from transuranium elements in the mass-spectrometry measurements, it is to be expected that the measurement performance of laboratories on the real dissolved spent nuclear fuel sample solution would be less satisfactory. Due to limitations in the transport of radioactive samples, no such real samples could be prepared for this interlaboratory comparison. The provision of quality control tools for conformity assessment, such as REIMEP-17, thus, directly contributes to the effectiveness of the safeguards system.