Skip to main content

Modeling measurement and its quality

  • Chapter
  • First Online:
Measurement across the Sciences

Abstract

This chapter aims to propose a general model of a measurement process, consistent with the ontological and epistemological commitments developed in the previous chapters. Starting from the premise that any measurement is grounded on an empirical process, we propose a characterization of measurement methods related to the complementary roles of empirical and informational components, by broadly distinguishing between direct and indirect methods of measurement, where indirect measurements necessarily include at least one direct measurement. The stages of direct measurement are then analyzed and exemplified in reference to both physical and psychosocial properties. This is the basis for discussing once again the quality of measurement, now described in terms of the complementary requirements of object relatedness (“objectivity”) and subject independence (“intersubjectivity”).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A basic reason for the complexity of this endeavor is the (usually unavoidable and in fact appropriate) specialization of the scientific and technical disciplines, which triggers the construction of specific terminologies. An interesting example of an attempt to overcome lexical hyper-specialization while maintaining scientific and technical correctness is Electropedia, “the world’s most comprehensive online terminology database on ‘electrotechnology’, containing more than 22,000 terminological entries [...] organized by subject area” (Electropedia makes the series of standards IEC 60050 freely accessible online at www.electropedia.org)

  2. 2.

    About direct and indirect methods of measurement, see also the discussion by Boumans (2007: Sect. 9.3).

  3. 3.

    This is related to what is referred to as the measurement model part of a structural equation model (SEM), or a measurement model for short in the context of structural equation modeling (SEM; see, e.g., Skrondal & Rabe-Hesketh, 2004). SEMs are a popular class of statistical models used in the human sciences, although such models have a number of different purposes.

  4. 4.

    This is related to what is referred to as the structural model part of the structural equation model (SEM), or structural model for short.

  5. 5.

    The distinction between direct measurements and processes of evaluation which are not direct is sometimes accepted as unproblematic. For example, consider this quotation from John Taylor (1997: p. 45): “Most physical quantities usually cannot be measured in a single direct measurement but are instead found in two distinct steps. First, we measure one or more quantities that can be measured directly and from which the quantity of interest can be calculated. Second, we use the measured values of these quantities to calculate the quantity of interest itself. For example, to find the area of a rectangle, you actually measure its length l and height h and then calculate its area A as A = lh. Similarly, the most obvious way to find the velocity v of an object is to measure the distance traveled, d, and the time taken, t, and then to calculate v as v = d/t” (all emphases added). Interestingly, the process of indirect evaluation is not even called a measurement here, as if only direct measurements were measurements, in opposition to Lira’s conclusion mentioned above (“no measurement can strictly be considered to be ‘direct’”).

  6. 6.

    The fact that in some cases the transduction is repeated and the measured value is computed as a statistic of the sample of indication values does not create a third method: from the structural point of view in which we are interested here, it remains unproblematically a case of direct measurement.

  7. 7.

    Thus, for example, when measuring the weight of foods at the supermarket, the measurand is accepted to be the weight of the object put on the scale with no further specifications, though of course the measuring instrument is expected to be appropriately calibrated, including all relevant corrections.

  8. 8.

    Given the VIM definition of <adjustment of a measuring system> as a “set of operations carried out on a measuring system so that it provides prescribed [indication values] corresponding to given values of a quantity to be measured” (JCGM, 2012: 3.11, adapted), this action could be also called “adjustment of the measurement environment”.

  9. 9.

    As already mentioned in Footnote 14 of Chap. 3, definitional uncertainty is sometimes understood simply as one of the components of measurement uncertainty, and as such can be combined with the other components. We follow here the other approach, and consider it as the lower bound of the result of such a combination.

  10. 10.

    The adjective “basic” refers to the simplification of not taking uncertainty into account in scale construction.

  11. 11.

    In the human sciences, readily transportable measurement standards are not very common. In some contexts, synthetic reference objects have been found to be useful, such as computerized chess players (Maul et al. 2019).

  12. 12.

    By presenting public scales first and then local scales we are following a conceptual sequence from what is outside the box, i.e., a measuring instrument, to what is inside the box. But in the historical development public scales may be the outcome of the previous development of multiple local scales and the assessment of their agreement. In the example of temperature, the first public thermometric scales (e.g., Celsius and Fahrenheit) were developed only after several thermometers were discovered to be in substantial agreement in their behavior (Chang, 2007).

  13. 13.

    For the sake of simplicity, in this initial presentation we do not distinguish between (i) the intended property (that is, the measurand), i.e., the property referred to in the Basic Evaluation Equation that reports the result of measurement, and (ii) the effective property, i.e., the property that interacts with the measuring instrument and produces an effect on it; we call both of them the property under measurement.

  14. 14.

    This assumption of causality is at the core of the very possibility of measurement by means of these kinds of devices, which are instances of what Nancy Cartwright calls a nomological machine, “a fixed (enough) arrangement of components, or factors, with stable (enough) capacities that in the right sort of stable (enough) environment will, with repeated operation, give rise to the kind of regular behaviour that we represent in our scientific laws” (1999: p. 50).

  15. 15.

    This is sometimes referred to as the “pilot” stage of instrument construction in the human sciences.

  16. 16.

    Under the hypothesis that the instrument behavior is modeled by an analytical function, and the underlying structure of the evaluation is sufficiently rich (see Sect. 6.5.3), this pointwise map may be interpolated, so as to obtain pairs (local value, public value) also for public values for which the instrument was not directly calibrated. For example the thermometer could be hypothesized to behave in a linear way between the freezing and the boiling point of water—i.e., a change of the measured temperature produces a proportional change of the position of the upper surface of alcohol in the tube—so that the midpoint of the upper surface would be mapped to 50 °C, and so on. Of course, the thermometer could be calibrated even if its behavior were not linear, but this would require the availability of other public reference temperatures, intermediate between the freezing and the boiling point of water.

  17. 17.

    We are using the contrast of direct and indirect in two distinct ways. Given the distinction between direct and indirect measurement, the reference is here to a difference about operational implementations of direct measurement, which may be performed by indirectly or directly comparing the measured property and the reference properties in the public scale. In summary, a direct measurement can be performed through an indirect (as usual) or a direct comparison.

  18. 18.

    This case is quite rare in the human sciences, due to the already mentioned rarity of reference objects.

  19. 19.

    As noted above, and illustrated in Sect. 7.3.3 and in particular in Fig. 7.12, significant parts of this figure and the next two are absent in the case of direct comparisons with reference objects.

  20. 20.

    There is in fact a mathematical condition here, i.e., the mappings involved in the process can be formalized as functions, but this does not constrain the measurability of nonquantitative properties. As discussed further in Sect. 7.4, this is to be reconsidered if uncertainties are included in the model.

  21. 21.

    This is typical of all passive instruments, which are activated by the object under measurement instead of by an external source. Also in this regard a RCA test is analogous to an alcohol thermometer, in which the upper surface of alcohol in the tube changes its position only because of the energy transferred from the object whose temperature is under measurement.

  22. 22.

    Note that this will result in a matrix of local references Xik*.

  23. 23.

    Complicating matters, some authors refer to error and uncertainty interchangeably (Kirkup & Frenkel, 2006), as noted by Taylor: “In science, the word error does not carry the usual connotations of the terms mistake or blunder. Error in a scientific measurement means the inevitable uncertainty that attends all measurements. [… Here] error is used exclusively in the sense of uncertainty, and the two words are used interchangeably” (1997: p. 3). More or less explicitly, this denies that there is anything new in what has happened on this matter in the last decades, as witnessed in particular by the publication in 1993 of the Guide to the expression of uncertainty in measurement (GUM) (JCGM, 2008).

  24. 24.

    Several cases of this phenomenon are proposed by Jerzy Muller in his book so explicitly titled The Tyranny of Metrics. An example: “In England, in an attempt to reduce wait times in emergency wards, the Department of Health adopted a policy that penalized hospitals with wait times longer than 4 h. The program succeeded—at least on the surface. In fact, some hospitals responded by keeping incoming patients in queues of ambulances, beyond the doors of the hospital, until the staff was confident that the patient could be seen within the allotted 4 h of being admitted” (2018: p. 5).

  25. 25.

    The measurand definition and its influence on the design and the operation of measurement are subjects that still require investigation—see, e.g., Baratto (2008) and Morawski (2013).

  26. 26.

    Sometimes the distinction between objectivity and intersubjectivity is not maintained, and they are conflated in a single concept of <non-subjective>. An explicit example is: “A highly disciplined discourse helps to produce knowledge independent of the particular people who make it. This last phrase points to my working definition of objectivity. It is, from the philosophical standpoint, a weak definition. It implies nothing about truth to nature. It has more to do with the exclusion of judgment, the struggle against subjectivity” (Porter, 1995: p. ix).

  27. 27.

    As noted above, the concept of validity in human science measurement is closely related to definitional uncertainty. However, it is an expansive concept, and hence, there are aspects of it that also overlap with other parts of objectivity and intersubjectivity.

  28. 28.

    Interestingly, these dimensions correspond to the two main stages of a measurement process of (a) transduction and matching and (b) calibration mapping, as connected by the local scale application. The idea that measurement is to be modeled on the basis of such two stages is not unusual, though the terms may be different. For example, Roman Morawski calls them “conversion” and “reconstruction” (Morawski, 2013) and Giovanni Battista Rossi and Francesco Crenna call them “observation” and “restitution” (Rossi & Crenna, 2018).

  29. 29.

    This implies that <measurement> cannot be defined in terms of objectivity and intersubjectivity, as instead sometimes suggested (e.g., “the result of measurement must meet the condition of objective truth”, Piotrowski, 1992: p. 1), also, mistakenly, by one of the present authors (Mari et al., 2012: p. 2109).

References

  • Baratto, A. C. (2008). Measurand: A cornerstone concept in metrology. Metrologia, 45, 299–307.

    Article  ADS  Google Scholar 

  • Bich, W. (2008). How to revise the GUM? Accreditation Quality and Assurance, 13, 271–255.

    Article  Google Scholar 

  • Boumans, M. (2007). Invariance and calibration. In M. Boumans (Ed.), Measurement in economics: A handbook (pp. 231–248). London: Academic Press.

    Google Scholar 

  • Campbell, N. R. (1920). Physics—The elements. Cambridge: Cambridge University Press.

    Google Scholar 

  • Cartwright, N. (1999). The dappled world—A study of the boundaries of science. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Chang, H. (2007). Inventing temperature—Measurement and scientific progress. Oxford: Oxford University Press.

    Google Scholar 

  • Cook, A. H. (1994). The observational foundations of physics. Cambridge: Cambridge University Press.

    Google Scholar 

  • de Forest Palmer, A. (1912). The theory of measurements. New York: McGraw-Hill.

    Google Scholar 

  • Dray, A. J., Brown, N. J. S., Diakow, R., Lee, Y., & Wilson, M. (2019). A construct modeling approach to the assessment of reading comprehension for adolescent readers. Reading Psychology, 40(2), 191–241.

    Article  Google Scholar 

  • Ellis, B. (1968). Basic concepts of measurement. Cambridge: Cambridge University Press.

    Google Scholar 

  • Flynn, J. R. (1987). Massive IQ gains in 14 nations: What IQ tests really measure. Psychological Bulletin, 101(2), 171–191.

    Article  Google Scholar 

  • Frigerio, A., Giordani, A., & Mari, L. (2010). Outline of a general model of measurement. Synthese, 175, 123–149.

    Article  Google Scholar 

  • Gertsbakh, I. (2003). Measurement theory for engineers. Berlin: Springer.

    Book  Google Scholar 

  • Giordani, A., & Mari, L. (2019). A structural model of direct measurement. Measurement, 145, 535–550.

    Article  ADS  Google Scholar 

  • Goodhart, C. (1981). Problems of monetary management: The U.K. experience. In A. S. Courakis (Ed.), Inflation, depression, and economic policy in the west (pp. 111–146). Totowa, NJ: Barnes and Noble.

    Google Scholar 

  • Hanson, N. R. (1958). Patterns of discovery: An inquiry into the conceptual foundations of science. Cambridge: Cambridge University Press.

    Google Scholar 

  • International Electrotechnical Commission (IEC). (various publication dates). International electrotechnical vocabulary (IEV). Geneva: IEC. Retrieved from www.electropedia.org

  • International Organization for Standardization (ISO) and Other Three International Organizations. (1984). International vocabulary of basic and general terms in metrology (VIM) (1st ed.). Geneva: International Bureau of Weights and Measures (BIPM), International Electrotechnical Commission (IEC), International Organization for Standardization (ISO), International Organization of Legal Metrology (OIML).

    Google Scholar 

  • Joint Committee for Guides in Metrology (JCGM). (2008). JCGM 100:2008, Evaluation of measurement data—Guide to the expression of uncertainty in measurement (GUM). Sèvres: JCGM. Retrieved from www.bipm.org/en/publications/guides/gum.html

  • Joint Committee for Guides in Metrology (JCGM). (2012). JCGM 200:2012, International vocabulary of metrology—Basic and general concepts and associated terms (VIM) (3rd ed.). Sèvres: JCGM (2008 version with minor corrections). Retrieved from www.bipm.org/en/publications/guides/vim.html

  • Kirkup, L., & Frenkel, R. B. (2006). An introduction to uncertainty in measurement using the GUM (Guide to the expression of uncertainty in measurement). Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Krantz, D. H., Luce, R. D., Suppes, P., & Tversky, A. (1971). Foundations of measurement (Vol. 1). New York: Academic Press.

    MATH  Google Scholar 

  • Kyburg, H. E. (1984). Theory and measurement. Cambridge: Cambridge University Press.

    Google Scholar 

  • Landsberger, H. A. (1958). Hawthorne revisited. Ithaca, NY: Cornell University Press.

    Google Scholar 

  • Lira, I. (2002). Evaluating the measurement uncertainty—Fundamentals and practical guidance. Bristol: IOP Publishing.

    Book  Google Scholar 

  • Mari, L. (2005). The problem of foundations of measurement. Measurement, 38, 259–266.

    Article  ADS  Google Scholar 

  • Mari, L., Carbone, P., & Petri, D. (2012). Measurement fundamentals: A pragmatic view. IEEE Transactions on Instrumentation and Measurement, 61, 2107–2115.

    Article  Google Scholar 

  • Mari, L., Maul, A., Torres Irribarra, D., & Wilson, M. (2017). Quantities, quantification, and the necessary and sufficient conditions for measurement. Measurement, 100, 115–121.

    Article  ADS  Google Scholar 

  • Masters, G., & Forster, M. (1997). Mapping literacy achievement: Results of the 1996 National School English Literacy Survey. Hawthorn: ACER Press.

    Google Scholar 

  • Maul, A., Mari, L., Torres Irribarra, D., & Wilson, M. (2018). The quality of measurement results in terms of the structural features of the measurement process. Measurement, 116, 611–620.

    Article  ADS  Google Scholar 

  • Maul, A., Mari, L., & Wilson, M. (2019). Intersubjectivity of measurement across the sciences. Measurement, 131, 764–770.

    Google Scholar 

  • Mislevy, R. J., Steinberg, L. S., & Almond, R. G. (2003). On the structure of educational assessments (CSE technical report). Washington, DC: University of California at Los Angeles, Center for the Study of Evaluation, National Center for Research on Evaluation, Standards, and Student Testing.

    Google Scholar 

  • Morawski, R. Z. (2013). An application-oriented mathematical meta-model of measurement. Measurement, 46, 3753–3765.

    Article  ADS  Google Scholar 

  • Muller, J. Z. (2018). The tyranny of metrics. Princeton, NJ: Princeton University Press.

    Book  Google Scholar 

  • Nitko, A. J. (1983). Educational tests and measurement: An introduction. New York: Harcourt Brace Jovanovich.

    Google Scholar 

  • Nunally, J. C., & Bernstein, I. H. (1994). Psychometric theory. New York: McGraw-Hill.

    Google Scholar 

  • Petri, D., Mari, L., & Carbone, P. (2015). A structured methodology for measurement development. IEEE Transactions on Instrumentation and Measurement, 64, 2367–2379.

    Article  Google Scholar 

  • Piotrowski, J. (1992). Theory of physical and technical measurement. Amsterdam: Elsevier.

    Google Scholar 

  • Porter, T. M. (1995). Trust in numbers—The pursuit of objectivity in science and public life. Princeton, NJ: Princeton University Press.

    Book  Google Scholar 

  • Possolo, A. (2015). Simple guide for evaluating and expressing the uncertainty of NIST measurement results. NIST Technical Note 1900. Retrieved from www.nist.gov/publications/simple-guide-evaluating-and-expressing-uncertainty-nist-measurement-results

  • Roberts, F. S. (1979). Measurement theory with applications to decision-making, utility and the social sciences. Reading, MA: Addison-Wesley.

    Google Scholar 

  • Rossi, G. B., & Crenna, F. (2018). A formal theory of the measurement system. Measurement, 116, 644–651.

    Article  ADS  Google Scholar 

  • Skrondal, A., & Rabe-Hesketh, S. (2004). Generalized latent variable modeling: Multilevel, longitudinal, and structural equation models. Boca Raton, FL: Chapman & Hall/CRC.

    Book  Google Scholar 

  • Taylor, J. R. (1997). An introduction to error analysis—The study of uncertainties in physical measurements. Sausalito: University Science Book.

    Google Scholar 

  • Wilson, M. (2005). Constructing measures: An item response modeling approach. Mahwah, NJ: Erlbaum. (Now published by Taylor and Francis, New York).

    Google Scholar 

  • Wilson, M., & Sloane, K. (2000). From principles to practice: An embedded assessment system. Applied Measurement in Education, 13(2), 181–208.

    Article  Google Scholar 

  • Wright, B. D., & Masters, G. N. (1981). Rating scale analysis. Chicago, IL: MESA Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mari, L., Wilson, M., Maul, A. (2021). Modeling measurement and its quality. In: Measurement across the Sciences. Springer Series in Measurement Science and Technology. Springer, Cham. https://doi.org/10.1007/978-3-030-65558-7_7

Download citation

Publish with us

Policies and ethics