A Bayesian approach to the evaluation of comparisons of individually value-assigned reference materials
- 106 Downloads
Several recent international comparison studies used a relatively novel experimental design to evaluate the measurement capabilities of participating organizations. These studies compared the values assigned by each participant to one or more qualitatively similar materials with measurements made on all of the materials by one laboratory under repeatability conditions. A statistical model was then established relating the values to the repeatability measurements; the extent of agreement between the assigned value(s) and the consensus model reflected the participants’ measurement capabilities. Since each participant used their own supplies, equipment, and methods to produce and value-assign their material(s), the agreement between the assigned value(s) and the model was a fairer reflection of their intrinsic capabilities than provided by studies that directly compared time- and material-constrained measurements on unknown samples prepared elsewhere. A new statistical procedure is presented for the analysis of such data. The procedure incorporates several novel concepts, most importantly a leave-one-out strategy for the estimation of the consensus value of the measurand, model fitting via Bayesian posterior probabilities, and posterior coverage probability calculation for the assigned 95% uncertainty intervals. The benefits of the new procedure are illustrated using data from the CCQM-K54 comparison of eight cylinders of n-hexane in methane.
KeywordsBayesian analysis Degrees of equivalence Generalized distance regression Leave-one-out analysis Posterior coverage probability
We thank the GAWG and its members for pioneering the comparison of multiple reference materials value-assigned by different organizations using measurements made under repeatability conditions by one organization.
- 5.The BIPM key comparison database, http://kcdb.bipm.org/
- 6.JCGM 200:2008. International vocabulary of metrology—basic and general concepts and associated terms (VIM). Joint Committee for Guides in Metrology (JCGM), Sèvres, France (2008) http://www.bipm.org/en/publications/guides/vim.html
- 7.ISO. ISO 6143:2001(E) Gas analysis—comparison methods for determining and checking the composition of calibration gas mixtures. International Organization for Standardization (ISO), Geneva (2001)Google Scholar
- 11.Fuller W (2006) Measurement error models. New York, NY: Wiley & SonsGoogle Scholar
- 12.Jeffreys H (1961) Theory of probability. New York, NY: Oxford University PressGoogle Scholar
- 13.JCGM 100:2008. Evaluation of measurement data—guide to the expression of uncertainty in measurement. Joint Committee for Guides in Metrology (JCGM), Sèvres, France (2008) http://www.bipm.org/en/publications/guides/gum.html
- 14.JCGM 101:2008. Evaluation of measurement data—supplement 1 to the “Guide to the expression of uncertainty in measurement”—propagation of distributions using a Monte Carlo method. Joint Committee for Guides in Metrology (JCGM), Sèvres, France (2008) http://www.bipm.org/en/publications/guides/gum.html
- 16.Gelman A, Meng XL, Stern H (1996) Posterior predictive assessment of model fitness via realized discrepancies (with discussion). Stat Sin 6:733–807Google Scholar