Is the z score sufficient to assess participants’ performance in proficiency testing? The hidden corrective action

Cordeiro, Fernando; Emons, Hendrik; Robouch, Piotr

doi:10.1007/s00769-022-01496-w

Is the z score sufficient to assess participants’ performance in proficiency testing? The hidden corrective action

Practitioner's Report
Open access
Published: 18 March 2022

Volume 27, pages 145–153, (2022)
Cite this article

Download PDF

You have full access to this open access article

Accreditation and Quality Assurance Aims and scope Submit manuscript

Is the z score sufficient to assess participants’ performance in proficiency testing? The hidden corrective action

Download PDF

4241 Accesses
2 Citations
Explore all metrics

Abstract

Proficiency testing providers, accreditation bodies and testing laboratories should be aware that a laboratory participating in a proficiency testing round might have reported a biased result despite a satisfactory performance indicated by an assessment using uniquely the z score. A complementary performance evaluation, based on the ζ score and the assessment of the measurement uncertainty, is therefore highly recommended. This work presents an intuitive graphical tool (the Naji2 plot) that combines z and ζ scores together with the reported measurement uncertainties. This tool allows a comprehensive assessment of the laboratory performance and enables to identify the need for corrective actions. The concerned laboratory should then perform a root cause analysis and investigate their bias and/or their measurement uncertainty evaluation.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The international standard ISO/IEC 17025:2017 [1] requires from testing laboratories to use validated measurement procedures, certified reference materials (when available) or appropriate quality control materials and to participate in interlaboratory comparisons such as proficiency testing (PT) rounds, to ensure and/or demonstrate the validity of their results. In addition, every laboratory is expected to estimate the measurement uncertainty (MU) associated with the reported result in order to comply with the basic demand that “a measurement result is generally expressed as a single measured quantity value and a measurement uncertainty” [2]. Similarly, ISO/IEC 17043:2010 [3] is demanding from PT providers, among other requirements, to estimate the associated measurement uncertainties when determining the assigned values.

Most of the European Union official laboratories performing control activities in the Single Market, for customs or environmental surveillance, are accredited according to ISO/IEC 17025 [1]. During the accreditation audits, the technical assessors check the compliance of such testing against the requirements set in the standard. They are often relying on the laboratory performance demonstrated in the frame of regular participation in PT schemes.

While all PT providers require laboratories to report a value for each investigated measurand, only a few of them request participants to report the associated MU. Hence, the common proof of satisfactory performance relies only on the z score. This score may confirm that the method of analysis applied is fit for the intended purpose, but it will not allow concluding on the presence or absence of a potential bias of the value reported, or on the appropriateness of the estimated MU associated with that value. Hence, laboratories may miss the hidden corrective action needed.

Robouch et al. presented a simple graphical tool for the evaluation of PT results at an international workshop on “Data analysis of key-comparisons” in 2003 [4]. The original Naji plot allowed the simultaneous assessment of various performance criteria of PT participants. Two alternative visualisation tools have been subsequently developed, namely the PomPlot [5] and the Kiri plot [6].

This paper revisits the approach of the original Naji plot concept, combining several assessment criteria recommended by ISO/IEC 17043 and ISO 13528 [3, 7]. It introduces several modifications to obtain the “Naji2 plot”, a simpler and more intuitive tool. The main goal of this work is to raise the awareness of PT providers on the importance of requesting from the participants the analytical results together with the respective measurement uncertainties, in order to deliver a comprehensive assessment of the laboratory performance.

Performance scoring

ISO/IEC 17043:2010 [3] and ISO 13528:2015 [7] recommend, among others, the following three performance scores to assess the reported results in the frame of a proficiency testing round.

The relative deviation (D_%,i) used to estimate the deviation of the reported result (x_i) from the assigned value (x_pt) is calculated as (expressed as a percentage of x_pt):

$${D}_{\%,i}=100 ({x}_{i}-{x}_{pt})/{x}_{pt}$$

(1)

Similarly, the deviation may be normalised using the standard deviation for proficiency assessment (σ_pt) to calculate the z score:

$${z}_{i}=({x}_{i}-{x}_{pt})/{\sigma }_{pt}$$

(2)

Alternatively, the deviation can be normalised combining the uncertainty reported by the laboratory (u(x_i)) and the uncertainty associated with the assigned value (u(x_pt)) to compute the zeta (ζ) score:

$${\xi }_{i}=\frac{{x}_{i}-{x}_{pt}}{\sqrt{{{u\left({x}_{i}\right)}^{2}+u\left({x}_{pt}\right)}^{2}}}$$

(3)

In addition, ISO 13528 [7] suggests to assess the reported measurement uncertainties (MU). The systematic approach implemented by the European Union Reference Laboratories (EURLs) managed by the Joint Research Centre is described in the PT report to participants, see for instance [8, 9]. This was done by comparing the reported relative MU (u_rel(x_i) = u(x_i)/x_i) to the relative uncertainty of the assigned value (u_rel(x_pt) = u(x_pt)/x_pt) and to the relative standard deviation for proficiency assessment (σ_pt,rel = σ_pt/x_pt). Hence, MU is considered as realistic (case “a”), probably underestimated (case “b”) or likely to be overestimated (case “c”) when: “a” u_rel(x_pt) ≤ u_rel(x_i) ≤ σ_pt,rel; “b” u_rel(x_i) < u_rel(x_pt); and “c” u_rel(x_i) > σ_pt,rel, respectively.

Knowing that there are three performance evaluation categories (satisfactory, questionable and unsatisfactory) and three uncertainty evaluation cases (underestimated, realistic and overestimated) for each of the three assessments mentioned above (z score, ζ score and MU), a total of 27 theoretical possibilities could be identified as shown in Table 1. However, a few possibilities marked by an asterisk in Table 1 may be unrealistic. For example, it is not possible to have simultaneously a satisfactory performance according to z, an unsatisfactory performance according to ζ and an overestimated MU (see possibility “I” in Table 1).

Table 1 The 27 theoretical possibilities related to z score, ζ score and the measurement uncertainty evaluation (MU)

Full size table

The Naji2 Plot: a graphical solution

Assuming that the ζ score is smaller than a certain performance limit “P_L” (equal to 2 or 3, as set in [3, 7]), and combining Eqs. 2 and 3 one can derive the following relations:

$${P}_{L}\ge \zeta ={\sigma }_{pt}\cdot z /\sqrt{{{u\left({x}_{i}\right)}^{2}+u\left({x}_{pt}\right)}^{2}}$$

(4)

Equivalent to:

$${{P}_{L}}^{2}\left[{{u\left({x}_{i}\right)}^{2}+u\left({x}_{pt}\right)}^{2}\right]\ge {{{\sigma }_{pt}}^{2} z}^{2}$$

(5)

Further transformed into:

$${(u\left({x}_{i}\right)/{\sigma }_{pt})}^{2}\ge { z}^{2}/{{P}_{L}}^{2}- {(u\left({x}_{pt}\right)/{\sigma }_{pt})}^{2}$$

(6)

Equation 6 describes the original Naji plot parabolas presented by Robouch et al. [4], when plotting ${(u\left({x}_{i}\right)/{\sigma }_{pt})}^{2}$ (y-axis) as a function of the z score (x-axis). These graphs were used in several publications related to proficiency testing rounds organised in a broad variety of fields [10,11,12,13,14,15,16,17]. Furthermore, this graphical representation is currently implemented in the commercial software “PROLab™” developed by Quodata GmbH (Dresden, Germany) for the interpretation of results reported in the frame of interlaboratory comparisons [18].

An alternative transformation of Eq. 5 is further investigated:

$${u\left({x}_{i}\right)}^{2}\ge {\left({\sigma }_{pt}\cdot z/{P}_{L}\right)}^{2}- {{u(x}_{pt})}^{2}$$

(7)

Equivalent to:

$$u({x}_{i})\ge \sqrt{\left[\frac{{\sigma }_{pt}}{{P}_{L}}z-{u(x}_{pt})\right]\left[\frac{{\sigma }_{pt}}{{P}_{L}}z+{u(x}_{pt})\right]}$$

(8)

Equation 8 describes two sets of hyperbolas obtained when plotting u(x_i) versus the z score, for the two acceptance criteria (${P}_{L}$ = 2 or 3). This graphical representation constitutes the “Naji2 plot”. Each hyperbola presents the following characteristics: (i) it is symmetric around the y-axis; (ii) it is always positive or equal to zero when $z=\pm {P}_{L} {u(x}_{pt})/{\sigma }_{pt}$; and (iii) it increases steadily with increasing |z| to reach the linear asymptote $(y = \pm {\sigma }_{pt} z/{P}_{L}$). However, Eq. 8 is not valid for z values ranging between “$-{P}_{L} {u(x}_{pt})/{\sigma }_{pt}$” and “$+{P}_{L} {u(x}_{pt})/{\sigma }_{pt}$”, because they would lead to a negative value under the square root.

Assessment of measurement uncertainties

In order to include the assessment of measurement uncertainties (cases “a”, “b”, “c”) in the Naji2 plots, the relation between the relative standard deviation (u_rel(x_i)) and the z score is described hereafter:

$${u\left({x}_{i}\right)= u}_{rel}\left({x}_{i}\right) {x}_{i}= {u}_{rel}\left({x}_{i}\right) \left({\sigma }_{pt} z+{x}_{pt}\right)$$

(9)

Equation 9 is a linear relation of u(x_i) as a function of the z score. Two specific lines delimit the range of “realistic” measurement uncertainties (case “a”), when ${u}_{rel}\left({x}_{i}\right)$ is equal to ${u}_{rel}\left({x}_{pt}\right)$ or to ${\sigma }_{pt, rel}$. While each line crosses the y-axis (z = 0) at $u\left({x}_{i}\right)$ = $u\left({x}_{pt}\right)$ or at $u\left({x}_{i}\right)$ = ${\sigma }_{pt}$, they both cross the x-axis ($u\left({x}_{i}\right)$ = 0) at $z= -{x}_{pt}/{\sigma }_{pt}.$

The bias boundaries

No bias can be identified (null hypothesis H₀) when the distribution of a measurement result and the interval of the assigned value and its expanded uncertainty (both assumed to follow a normal or approximately normal distributions) overlap with a probability (α) of rejecting a true H₀ of 5%, and a probability (β) of accepting a false H₀ of 5% [19]. Consequently, a value (x_i) lower than the assigned value (x_pt) would be negatively biased (z < 0) if:

$${x}_{i}+1.64 u\left({x}_{i}\right)<{x}_{pt}-1.64 u({x}_{pt})$$

(10)

where 1.64 is the inverse of the standard normal cumulative distribution for a probability of 95% (using the built-in spreadsheet function “NORM.S.INV(0.95)”).

Combining Eqs. 2 and 10, one derives the following linear relation between u(x_i) and z:

$$u\left({x}_{i}\right)<(-{\sigma }_{pt} z/1.64)-u\left({x}_{pt}\right)$$

(11)

A similar linear relation is obtained in the case of “positive bias” (z > 0):

$$u\left({x}_{i}\right)<({\sigma }_{pt} z/1.64)-u\left({x}_{pt}\right)$$

(12)

The two lines defined by Eqs. 11 and 12 delimit the “upper range” of the bias boundaries. The ranges delimited by these lines are similar to those defined by the two green hyperbolas (|ζ|> 2) and comply with the criteria for bias set by ISO Guide 33 [20]. Such large ζ scores may be caused either by a too large numerator, indicating a significant deviation (bias) of the reported value from the assigned value ($\left|{x}_{i}-{x}_{pt}\right|$); or by a too small denominator ($\sqrt{{u({x}_{i})}^{2}+{u({x}_{pt})}^{2}}$) caused by an underestimated reported MU, assuming that the uncertainty associated with the assigned value is realistic. The concerned laboratory should then perform a root cause analysis and investigate their bias and/or their measurement uncertainty evaluation, in order to resolve the identified poor performance expressed as a ζ score.

Despite the fact that many PT results are indeed corresponding to a satisfactory performance when expressed uniquely by the z score (|z| ≤ 2), some of them are located below the bias boundaries (with |ζ| > 2) and may require corrective actions by the laboratory.

A hypothetical case study

In order to demonstrate the Naji2 plot concept, a hypothetical proficiency test round attended by 40 participants was constructed. A PT provider processed a commercially available food commodity to produce a test material with adequate homogeneity and stability, according to the recommendations of ISO/IEC 17043 [3]. The PT provider also determined the total mass fraction of a specific analyte (${x}_{pt}$) of 100 mg kg⁻¹ and the associated standard uncertainty (u(x_pt)) of 3 mg kg⁻¹. In addition, a ${\sigma }_{pt}$ of 10 mg kg⁻¹ was set to assess the measurement capabilities of the laboratories to comply with some legal requirements. Since u(x_pt) ≤ 0.3 ${\sigma }_{pt}$, the test item was considered fit-for-purpose and the z score could be applied for performance assessment.

Figure 1 presents the Naji2 plot generated based on the above-mentioned predefined criteria (${x}_{pt}$, u(x_pt), and ${\sigma }_{pt}$). This plot consists of:

(i)
u(x_i) (y-axis) as a function of the z score (x-axis) having;
(ii)
four vertical lines at |z| = 2 and 3;
(iii)
two sets of hyperbolas for |ζ| = 2 and 3 (Eq. 8);
(iv)
two lines representing u_rel(x_i) = u_rel(x_pt) and u_rel(x_i) = σ_pt,rel (Eq. 9, see dotted and dashed lines in Fig. 1); and
(v)
two bias boundaries (Eqs. 11 and 12, see double blue lines in Fig. 1).

The four vertical lines (|z| = 2 and 3), the four hyperbolas (|ζ| = 2 and 3) and the two straight lines (u_rel(x_i) = u_rel(x_pt) or u_rel(x_i) = σ_pt,rel) delimit all the possible Naji2 plot areas identified by the letters “A” to “AA” in Table 1 and Fig. 1. The areas denoted by the letters with an asterisk in Table 1 could not be represented in Fig. 1, since they represent unrealistic possibilities.

The spreadsheet function “NORMINV(RAND(),m,u)” was used to generate two sets of 40 values, where m is the mean value and u the standard uncertainty. The first set of data simulated the reported results (x_i) normally distributed around the assigned value (m = 100 mg kg⁻¹) with u = 25 mg kg⁻¹ (see Table 2, 2nd column). The second set simulated a broad range of reported standard measurement uncertainties (u(x_i)), with m = 6 mg kg⁻¹ and u = 4 mg kg⁻¹ (see Table 2, 4th column). The standard uncertainties were multiplied by a coverage factor (k = 2) to derive the expanded uncertainties shown in Fig. 2.

Table 2 Synthetic data set of 40 laboratories having reported measurement results (x_i) and expanded uncertainties (U(x_i)). The standard MU (u(x_i)) and the relative standard uncertainty (u_rel(x_i)) are calculated accordingly. The performance scores (D_%,_i,, z and ζ scores) and the uncertainty assessment are based on the criteria set by the PT provider (x_pt = 100 mg kg⁻¹; u(x_pt) = 3 mg kg⁻¹ and ${\sigma }_{pt}$ = 10 mg kg⁻¹). The grey and black cells represent “questionable” and “unsatisfactory” performance scores, respectively. The last column refers to the MU assessment (cf. Case “a” realistic, “b” underestimated or “c” overestimated)

Full size table

Table 2 summarises the synthetic data set, the relative standard uncertainty u_rel(x_i) (to be compared to u_rel(x_pt) and to σ_pt,rel) and the outcome of the four assessments (D_%,i, z score, ζ score, and MU). Similarly, Fig. 2 presents the data set in a particular order, sorted by the reported result in increasing order first, then by the performance categories (SP, QP and UP), expressed as z scores and ζ scores. This allows the identification of 11 laboratories (from L25 to L38, see x-axis of Fig. 2) having “satisfactory” performances expressed as z scores but “questionable” or even “unsatisfactory” performances when expressed as ζ scores.

Figure 3 presents the Naji2 plot applied to the synthetic data set (Table 2). This graphical presentation allows an intuitive and direct assessment of every data point, related to all the assessment criteria under investigation, namely the z score, the ζ score or the bias boundaries, and the appropriateness of the reported MU. However, this graphical presentation is only accessible to PT providers collecting measurand values including measurement uncertainties from their participants.

One can easily notice that: (i) most of the results are in the central part of the plot, representing satisfactory z and ζ scores; (ii) seven “overestimated” (case “c”) and (iii) seven seemingly “underestimated” (case “b”) MU were reported, while four laboratories did not report their MU (u(x_i) = 0 for L02, L04, L13 and L27). Seventeen results are below the “bias boundaries”, of which six are significantly biased with |z|> 2 and |ζ|> 3 (L03, L04, L09, L14, L27, L33). The remaining eleven laboratories have “satisfactory” z scores, and “questionable” or “unsatisfactory” ζ scores (see empty circles in Fig. 3). These laboratories may consider two types of improvement actions: either an increase in their MU (vertical translation in the Naji2 plot) or a bias correction (horizontal translation in the Naji2 plot).

Discussion

Measurement uncertainty evaluation

Since 2000, the PT providers of the Joint Research Centre in Geel are assessing the participants’ performance using the z and ζ scores for laboratories participating in various IMEP, REIMEP, NUSIMEP PTs or for PT schemes organised by JRC’s European Union Reference Laboratories for contaminants in food. The organisers have noticed that extremely large reported MU lead to |ζ|≤ 2 (satisfactory performance), even when |z|≥ 3 (unsatisfactory performance). Hence, an additional evaluation criterion was introduced related to the expected maximum and minimum measurement uncertainties. It was assumed that realistic uncertainty estimations should not be larger than the standard deviation for performance assessment (u(x_i)_max = σ_pt) or smaller than the uncertainty of the assigned value (u(x_i)_min = u(x_pt)). This concept was later adopted in the standard ISO 13528:2015 [7], where the example of the IMEP-111 is specifically mentioned.

Let us consider, for example, the results submitted by L14 and L19 (62.2 ± 9.0 mg kg⁻¹ /0.14/; and 127.6 ± 11.5 mg kg⁻¹ /0.09/, expressed as x_i ± u(x_i)/u_rel(x_i)/, see Table 2), in the frame of the hypothetical PT round (where x_pt = 100 mg kg⁻¹; u(x_pt) = 3 mg kg⁻¹; σ_pt = 10 mg kg⁻¹; u_rel(x_pt) = 0.03; and σ_pt,rel = 0.10). According to the above-mentioned criteria, the absolute MU reported by L14 would be considered as “realistic” (9 < 10, in mg kg⁻¹), while the one of L19 would be flagged as potentially “overestimated” (11.5 > 10, in mg kg⁻¹).

However, according to Ellison and Williams [21, 22], the relative uncertainty of chemical measurements can be taken as constant for a range of measurand values. Based on this, the alternative approach described in this paper had been implemented by several EURLs of the JRC. Instead of comparing u(x_i) to u(x_pt) and to σ_pt, the reported relative measurement uncertainty (u_rel(x_i) = u(x_i)/x_i) is compared to the relative standard uncertainty of the assigned value (u_rel(x_pt) = u(x_pt)/x_pt) and to the relative standard deviation for proficiency assessment (σ_pt,rel = σ_pt/x_pt) over the whole range of reported mass fractions (e.g. from 60 to 140 mg kg⁻¹ or |z| $\le 4$ in Figs. 1 and 3).

Knowing that a constant relative uncertainty u_rel(x_i) implies a proportional increase of u(x_i) with x_i, one may have for z > 0 (x_i > x_pt) a realistic u(x_i) larger than u(x_pt), or inversely, a realistic u(x_i) smaller than u(x_pt) for z < 0 (x_i < x_pt). Hence, according to the new criteria and unlike to the assessment presented in ISO 13528, the MU statement of L14 is flagged as potentially “overestimated” (0.14 > 0.10), while the one of L19 is considered as “realistic” (0.09 < 0.10).

These new criteria for the evaluation of reported measurement uncertainties may be taken into consideration to replace those prescribed in the current ISO 13528 standard [7].

Naji2 plot applied to real cases

In 2018, the European Union Reference Laboratory for Food Contact Materials (EURL-FCM) organised a PT round (FCM-18–02, [23]) for the determination of the mass fractions of total zinc and other metals in food simulant B (acetic acid, 3% w/v). The test items were prepared gravimetrically, which resulted in an accurate assigned value (x_pt = 5.024 mg kg⁻¹) with a small associated standard measurement uncertainty (u(x_pt) = 0.033 mg kg⁻¹). Based on the expert opinion, σ_pt,rel was set to 0.12. A total of 46 National Reference Laboratories (NRLs) and Official Control Laboratories (OCLs) reported results. The outcome of this PT is presented in Fig. 4a, where the Naji2 plot includes the “bias boundaries”. Most of the laboratories reported accurate values and realistic measurement uncertainty estimations (case “a”), while some of them reported overestimated MUs (case “c”). However, according to ISO Guide 33 [20] six laboratories have to investigate their significant bias (with |ζ|> 2), of which three have a |z|≤ 2 (see empty circles in Fig. 4a).

In 2019, the European Union Reference Laboratory for Genetically Modified Food and Feed (EURL GMFF) organised a PT round (GMFF-19/02, [24]) for the determination of GMOs in food and feed materials to support Regulation (EU) 2017/625 on official controls [25]. One of the test items consisted of a pig feed spiked with a genetically modified “40–3-2 soybean”. The EURL characterised the mass fraction of the GM event applying the EURL validated method and used the following values for the performance assessment of the participants: x_pt = 1.014 m/m %; u(x_pt) = 0.061 m/m %, and σ_pt,rel = 0.25. A total of 67 NRLs and OCLs reported results. Figure 4b shows the outcome of this PT round, where the Naji2 plot presents the set of hyperbolas. Most of the laboratories reported accurate values and realistic measurement uncertainty estimations, while some of them reported overestimated MUs (case “c”). However, according to ISO Guide 33 [20] sixteen laboratories have to investigate their significant bias (with |ζ| > 2), of which twelve have a |z| ≤ 2 (see empty circles in Fig. 4b). It is worth noting that, in this particular case, the realistic range of MU (defined by Eq. 9) goes to zero at z = − 4 (= 1/σ_pt,rel).

Conclusions

The Naji2 plot is an intuitive graphical representation that allows the simultaneous assessment of the three performance evaluations (z, ζ scores and the MU assessment) and the identification of potential biases. This comprehensive assessment may indicate to participants the need for an appropriate corrective action that, otherwise, would have been hidden by a satisfactory performance, if expressed uniquely as a z score. Similarly to the original Naji plot, the Naji2 plot can be applied to all analytes, matrices, concentration/content levels and proficiency testing schemes. It can be used to summarise the outcome of a PT round presenting the results of all laboratories for a given measurand, or to present a historical overview of a laboratory participation in different rounds of a specific PT scheme.

This tool is accessible to the PT providers that request from laboratories who participate in their PT schemes, to report a measurement result as stipulated by ISO/IEC 17025, i.e. including the associated measurement uncertainty. Unfortunately, this is still rarely the case. The situation may improve as soon as the next edition of ISO/IEC 17043 would require PT providers to assess laboratory performances based on the measured value and the corresponding measurement uncertainty.

Furthermore, the authors consider that the evaluation of the reported measurement uncertainties described in Sect. 9.8 of ISO 13528:2015 may be reviewed to take into account the relative uncertainties as an assessment criterion.

References

ISO 17025:2005. General requirements for the competence of testing and calibration laboratories. International Organization for Standardization, Geneva, Switzerland
BIPM (2012) International vocabulary of metrology—basic and general concepts and associated terms (VIM 3rd edition). JCGM 200. https://www.bipm.org/en/publications/guides/vim.html. Accessed 31 Jan 2021
ISO 17043:2010. Conformity assessment—general requirements for proficiency testing. International Organization for Standardization, Geneva, Switzerland
Robouch P, Younes N, Vermaercke P (2003) The "Naji Plot", a simple graphical tool for the evaluation of inter-laboratory comparisons. In: Proceedings of the international workshop: data analysis of interlaboratory comparisons; data analysis of key comparisons. pp 149–160. Edited by the Federal Institute for Material (PTB), Berlin, Germany. ISBN: 3897019333
Pommé S (2006) An intuitive visualisation of intercomparison results applied to the KCDB. Appl Radiat Isot 64:1158–1162. https://doi.org/10.1016/j.apradiso.2006.02.017
Article CAS PubMed Google Scholar
Harms AV (2009) Visualisation of proficiency test exercise results in Kiri plots. Accred Qual Assur 14:307–311. https://doi.org/10.1007/s00769-009-0512-0
Article Google Scholar
ISO 13528:2015. Statistical methods for use in proficiency testing by interlaboratory comparison. International Organization for Standardization. Geneva, Switzerland
Tsochatzis ED, Alberto Lopes J, Emteborg H, Robouch P, Hoekstra E (2019) Determination of PBT cyclic oligomers in and migrated from food contact materials—FCM-19/01 Proficiency Test Report. JRC 118572. https://europa.eu/!pD99nU. Accessed 30 Jan 2021
Tanaskovski B, Broothaerts W, Buttinger G, Corbisier P, Emteborg H, Robouch P, Emons H (2020) Determination of GM Maize MON88017 in bird feed and GM Maize GA21 in maize flour. EURL GMFF Proficiency Testing Report GMFF-20/01. JRC122118. https://europa.eu/!TV99gT. Accessed 30 Jan 2021
Aregbe Y, Harper C, Nørgaard J, De Smet M, Smeyers P, Van Nevel L, Taylor PDP (2004) The interlaboratory comparison “IMEP-19 trace elements in rice”—a new approach for measurement performance evaluation. Accred Qual Assur 9:323–332. https://doi.org/10.1007/s00769-003-0693-x
Article Google Scholar
Bednarova M, Aregbe Y, Harper C, Taylor PDP (2006) Evaluation of laboratory performance in IMEP water interlaboratory comparisons. Accred Qual Assur 10:617–626. https://doi.org/10.1007/s00769-005-0031-6
Article CAS Google Scholar
Serapinas P (2007) Approaching target uncertainty in proficiency testing schemes: experience in the field of water measurement. Accred Qual Assur 12:569–574. https://doi.org/10.1007/s00769-007-0310-5
Article CAS Google Scholar
Frazzoli C, Robouch P, Caroli S (2010) Analytical accuracy for trace elements in food: a graphical approach to support uncertainty analysis in assessing dietary exposure. Toxicol Environ Chem 92:641–654. https://doi.org/10.1080/02772240903257941
Article CAS Google Scholar
Koster-Ammerlaan M, Bode P (2009) Improved accuracy and robustness of NAA results in a large throughput laboratory by systematic evaluation of internal quality control data. J Radioanal Nucl Chem 280:445–449. https://doi.org/10.1007/s10967-009-7475-9
Article CAS Google Scholar
Venchiarutti C, Varga Z, Richter S, Jakopič R, Mayer K, Aregbe Y (2015) REIMEP-22 inter-laboratory comparison: “U age dating—determination of the production date of a uranium certified test sample.” Radiochim Acta 103:825–834. https://doi.org/10.1515/ract-2015-2437
Article CAS Google Scholar
Ferreiro López-Riobóo J, Crespo González N, López Mahía P, Muniategui Lorenzo S, Prada Rodríguez D (2019) Preparation of items for a textile proficiency testing scheme. Accred Qual Assur 24:73–78. https://doi.org/10.1007/s00769-018-1325-9
Article Google Scholar
Tsochatzis ED, Alberto Lopes J, Dehouck P, Robouch P, Hoekstra E (2020) Proficiency test on the determination of polyethylene and polybutylene terephthalate cyclic oligomers in a food simulant. Food Packag Shelf Life 23:100441. https://doi.org/10.1016/j.fpsl.2019.100441
Article PubMed PubMed Central Google Scholar
ProLab—Software for PT programs and collaborative studies according to ISO 17043. Edited by Quodata, Dresden, Germany. https://quodata.de/content/prolab-software-interlaboratory-studies-quodata-gmbh#0. Accessed 31 Jan 2021
ISO 3534–1:2006. Statistics—vocabulary and symbols—Part 1: general statistical terms and terms used in probability. International Organization for Standardization. Geneva, Switzerland
ISO Guide 33:2015. Reference materials—good practice in using reference materials. International Organization for Standardization. Geneva, Switzerland
Ellisson SLR, Williams A (2012) Eurachem/CITAC guide: quantifying Uncertainty in Analytical Measurement, Third edition, ISBN 978–0–948926–30–3. Available from https://www.eurachem.org/index.php/publications/guides/quam. Accessed 22 Feb 2021
Williams A (2020) Calculation of the expanded uncertainty for large uncertainties using the lognormal distribution. Accred Qual Assur 25:335–338. https://doi.org/10.1007/s00769-020-01445-5
Article CAS Google Scholar
Cordeiro F, Beldi G, Snell J, García-Ruiz S, Van Britsom G, Cizek-Stroh A, Robouch P, Hoekstra E (2018) Determination of the mass fractions of total aluminium, nickel, antimony and zinc in food simulant B. JRC report JRC113663. https://europa.eu/!uP37WY. Accessed 30 Jan 2021
Broothaerts W, Beaz Hidalgo R, Buttinger G, Corbisier P, Cordeiro F, Dimitrievska B, Emteborg H, Maretti M, Robouch P, Emons H (2019) Determination of GM Soybean 40–3–2 and MON87708 and GM Cotton GHB119 in Animal Feed and GM Soybean DAS-44406 in Soybean Flour - EURL GMFF Proficiency Testing Report GMFF-19/02. JRC report JRC11837. https://europa.eu/!FK99Pg. Accessed 30 Jan 2021
Commission Regulation (EU) No 2017/625. Regulation on official controls and other official activities performed to ensure the application of food and feed law, rules on animal health and welfare, plant health and plant protection products. Off. J. Eur. Union L 95: 1–142

Download references

Acknowledgements

The authors wish to acknowledge Alexander Bernreuther, Ioannis Fiamegkos, Emanuel Tzokatsis (former JRC colleagues) and Marzia Mancin (from the Istituto Zooprofilattico Sperimentale delle Venezie) for their valuable comments.

Author information

Authors and Affiliations

European Commission, Joint Research Centre (JRC), Geel, Belgium
Fernando Cordeiro, Hendrik Emons & Piotr Robouch

Authors

Fernando Cordeiro
View author publications
You can also search for this author in PubMed Google Scholar
Hendrik Emons
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Robouch
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fernando Cordeiro.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cordeiro, F., Emons, H. & Robouch, P. Is the z score sufficient to assess participants’ performance in proficiency testing? The hidden corrective action. Accred Qual Assur 27, 145–153 (2022). https://doi.org/10.1007/s00769-022-01496-w

Download citation

Received: 06 May 2021
Accepted: 03 February 2022
Published: 18 March 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s00769-022-01496-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Is the z score sufficient to assess participants’ performance in proficiency testing? The hidden corrective action

Abstract

Introduction

Performance scoring

The Naji2 Plot: a graphical solution

Assessment of measurement uncertainties

The bias boundaries

A hypothetical case study

Discussion

Measurement uncertainty evaluation

Naji2 plot applied to real cases

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation