Evaluation of the link between the Guttman errors and response shift at the individual level

Dubuy, Yseulys; Sébille, Véronique; Grall-Bronnec, Marie; Challet-Bouju, Gaëlle; Blanchin, Myriam; Hardouin, Jean-Benoit

doi:10.1007/s11136-021-03015-9

Evaluation of the link between the Guttman errors and response shift at the individual level

Special Section: Non-parametric IRT
Published: 17 October 2021

Volume 31, pages 61–73, (2022)
Cite this article

Quality of Life Research Aims and scope Submit manuscript

Yseulys Dubuy ORCID: orcid.org/0000-0001-8390-2285¹,
Véronique Sébille¹,
Marie Grall-Bronnec^1,2,
Gaëlle Challet-Bouju^1,2,
Myriam Blanchin ORCID: orcid.org/0000-0003-1318-7620¹ &
…
Jean-Benoit Hardouin¹

271 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Purpose

Methods for response shift (RS) detection at the individual level could be of great interest when analyzing changes in patient-reported outcome data. Guttman errors (GEs), which measure discrepancies in respondents’ answers compared to the average sample responses, might be useful for detecting RS at the individual level between two time points, as RS may induce an increase in the number of discrepancies over time. This study aims to establish the link between recalibration RS and the change in the number of GEs over time (denoted index $I$) via simulations and explores the discriminating ability of this index.

Methods

We simulated the responses of individuals affected or not affected by recalibration RS (defined as changes in the patients’ standard of measurement) to determine whether simulated individuals with recalibration had a greater change in the number of GEs over time than individuals without recalibration. The effects of factors related to the sample, the questionnaire structure and recalibration were investigated. As an illustrative example, the change in the number of GEs was computed in patients suffering from eating disorders.

Results

Within simulations, simulated individuals affected by recalibration had, on average, a greater change in the number of GEs over time than did individuals without RS. Some of the parameters related to the questionnaire structure and recalibration magnitude appeared to have substantial effects on the values of $I$. Discriminating abilities appeared, however, globally low.

Conclusion

Some evidence of the link between recalibration and the change in GEs was found in this study. GEs could be a valuable nonparametric tool for RS detection at a more individual level, but further investigation is needed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Practical challenges in mediation analysis: a guide for applied researchers

Article Open access 12 April 2024

Quantitative Research

Characterising and justifying sample size sufficiency in interview-based studies: systematic analysis of qualitative health research over a 15-year period

Article Open access 21 November 2018

Data availability

Modules, scripts and an extract of the simulated data used in the paper are available at the Open Science Framework via the link: https://osf.io/h9nyd/?view_only=b196db78f31c4e9fbb07013342a133a2

Notes

i.e., nonrandom errors in the latent variable estimates.

References

Basch, E. (2017). Patient-reported outcomes: Harnessing patients’ voices to improve clinical care. The New England Journal of Medicine, 376(2), 105–108. https://doi.org/10.1056/NEJMp1611252
Article PubMed Google Scholar
Schwartz, C. E., Finkelstein, J. A., & Rapkin, B. D. (2017). Appraisal assessment in patient-reported outcome research: Methods for uncovering the personal context and meaning of quality of life. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 26(3), 545–554. https://doi.org/10.1007/s11136-016-1476-2
Article Google Scholar
Sprangers, M. A. G., & Schwartz, C. E. (1999). Integrating response shift into health-related quality of life research: A theoretical model. Social Science & Medicine, 48(11), 1507–1515. https://doi.org/10.1016/S0277-9536(99)00045-3
Article CAS Google Scholar
Vanier, A., Falissard, B., Sébille, V., & Hardouin, J. B. (2017). The complexity of interpreting changes observed over time in health-related quality of life: A short overview of 15 years of research on response shift theory. In F. Guillemin, A. Leplege, S. Briancon, E. Spitz, & J. Coste (Eds.), Perceived health and adaptation in chronic disease (1st ed.). New York: Routledge.
Google Scholar
Schwartz, C. E., Sprangers, M. A., & Fayers, P. M. (2005). Response shift: You know it’s there, but how do you capture it? Challenges for the next phase of research. In Assessing quality of life in clinical trials (2nd ed.). Oxford University Press.
Google Scholar
Oort, F. J. (2005). Using structural equation modeling to detect response shifts and true change. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 14(3), 587–598.
Article Google Scholar
Schwartz, C. E. (2016). Introduction to special section on response shift at the item level. Quality of Life Research, 25(6), 1323–1325. https://doi.org/10.1007/s11136-016-1299-1
Article PubMed Google Scholar
Guilleux, A., Blanchin, M., Vanier, A., Guillemin, F., Falissard, B., Schwartz, C. E., Hardouin, J. B., & Sébille, V. (2015). RespOnse Shift ALgorithm in Item response theory (ROSALI) for response shift detection with missing data in longitudinal patient-reported outcome studies. Quality of Life Research, 24(3), 553–564. https://doi.org/10.1007/s11136-014-0876-4
Article PubMed Google Scholar
Blanchin, M., Guilleux, A., Hardouin, J.-B., & Sébille, V. (2020). Comparison of structural equation modelling, item response theory and Rasch measurement theory-based methods for response shift detection at item level: A simulation study. Statistical Methods in Medical Research, 29(4), 1015–1029. https://doi.org/10.1177/0962280219884574
Article PubMed Google Scholar
Vanier, A., Sébille, V., Blanchin, M., Guilleux, A., & Hardouin, J.-B. (2015). Overall performance of Oort’s procedure for response shift detection at item level: A pilot simulation study. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 24(8), 1799–1807. https://doi.org/10.1007/s11136-015-0938-2
Article Google Scholar
Nolte, S., Mierke, A., Fischer, H. F., & Rose, M. (2016). On the validity of measuring change over time in routine clinical assessment: A close examination of item-level response shifts in psychosomatic inpatients. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1339–1347. https://doi.org/10.1007/s11136-015-1123-3
Article CAS Google Scholar
Gandhi, P. K., Schwartz, C. E., Reeve, B. B., DeWalt, D. A., Gross, H. E., & Huang, I.-C. (2016). An item-level response shift study on the change of health state with the rating of asthma-specific quality of life: A report from the PROMIS(®) Pediatric Asthma Study. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1349–1359. https://doi.org/10.1007/s11136-016-1290-x
Article Google Scholar
Verdam, M. G. E., Oort, F. J., & Sprangers, M. A. G. (2016). Using structural equation modeling to detect response shifts and true change in discrete variables: An application to the items of the SF-36. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1361–1383. https://doi.org/10.1007/s11136-015-1195-0
Article Google Scholar
Ahmed, S., Sawatzky, R., Levesque, J.-F., Ehrmann-Feldman, D., & Schwartz, C. E. (2014). Minimal evidence of response shift in the absence of a catalyst. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 23(9), 2421–2430. https://doi.org/10.1007/s11136-014-0699-3
Article Google Scholar
Blanchin, M., Sébille, V., Guilleux, A., & Hardouin, J.-B. (2016). The Guttman errors as a tool for response shift detection at subgroup and item levels. Quality of Life Research, 25(6), 1385–1393. https://doi.org/10.1007/s11136-016-1268-8
Article PubMed Google Scholar
Meijer, R. R., Niessen, A. S. M., & Tendeiro, J. N. (2016). A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program. Assessment, 23(1), 52–62. https://doi.org/10.1177/1073191115577800
Article PubMed Google Scholar
Sijtsma, K., & Molenaar, I. W. (2002). Introduction to nonparametric item response theory. SAGE.
Book Google Scholar
Emons, W. H. M. (2008). Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247. https://doi.org/10.1177/0146621607302479
Article Google Scholar
Fischer, G. H., & Ponocny, I. (1994). An extension of the partial credit model with an application to the measurement of change. Psychometrika, 59(2), 177–192. https://doi.org/10.1007/BF02295182
Article Google Scholar
American Psychiatric Association, & American Psychiatric Association (eds.). (2009). Diagnostic and statistical manual of mental disorders: DSM-IV-TR (4. ed., text revision, 13. print.). Arlington, VA: American Psychiatric Assoc.
Lecrubier, Y., Sheehan, D., Weiller, E., Amorim, P., Bonora, I., Harnett Sheehan, K., & Dunbar, G. (1997). The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: Reliability and validity according to the CIDI. European Psychiatry, 12(5), 224–231. https://doi.org/10.1016/S0924-9338(97)83296-8
Article Google Scholar
Sheehan, D. V., Lecrubier, Y., Sheehan, K. H., Amorim, P., Janavs, J., Weiller, E., & Dunbar, G. C. (1998). The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. The Journal of Clinical Psychiatry, 59(Suppl 20), 22–33.
PubMed Google Scholar
Garner, D. M. (1991). Eating disorder inventory-2. Professional manual. Psychological Assessment Research.
Google Scholar
Archinard, M., Rouget, P., Painot, D., & Liengme, C. (2002). Inventaire des troubles alimentaires 2 [Eating Disorder Inventory 2]. In M. Bouvard & J. Cottraux (Eds.), Protocoles et échelles d’évaluation en psychiatrie et en psychologie [Protocols and evaluation scales in psychiatry and psychology] (3rd ed., pp. 249–251). Masson.
Google Scholar
Cloninger, C. R., Przybeck, T. R., & Svrakic, D. M. (1994). The temperament and character inventory (TCI) a guide to its development and use. Center for Psychobiology of Personality, Washington University.
Google Scholar
Pélissolo, A., & Lépine, J.-P. (1997). Traduction française et premières études de validation du questionnaire de personnalité TCI. [Validation study of the French version of the TCI.]. Annales Médico-Psychologiques, 155(8), 497–508.
Google Scholar
Chakroun-Vinciguerra, N., Faytout, M., Pélissolo, A., & Swendsen, J. (2005). Validation française de la version courte de l’Inventaire du Tempérament et du Caractère (TCI-125). Journal de Thérapie Comportementale et Cognitive, 15(1), 27–33. https://doi.org/10.1016/S1155-1704(05)81209-1
Article Google Scholar
Cooper, P. J., Taylor, M. J., Cooper, Z., & Fairbum, C. G. (1987). The development and validation of the body shape questionnaire. International Journal of Eating Disorders, 6(4), 485–494. https://doi.org/10.1002/1098-108X(198707)6:4%3c485::AID-EAT2260060405%3e3.0.CO;2-O
Article Google Scholar
Rousseau, A., Knotter, A., Barbe, P., Raich, R., & Chabrol, H. (2005). Validation of the French version of the Body Shape Questionnaire. L’Encephale, 31(2), 162–173. https://doi.org/10.1016/s0013-7006(05)82383-8
Article CAS PubMed Google Scholar
Ware, J. E., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483.
Article Google Scholar
Aaronson, N. K., Ahmedzai, S., Bergman, B., Bullinger, M., Cull, A., Duez, N. J., & de Haes, J. C. (1993). The European Organization for Research and Treatment of Cancer QLQ-C30: A quality-of-life instrument for use in international clinical trials in oncology. Journal of the National Cancer Institute, 85(5), 365–376. https://doi.org/10.1093/jnci/85.5.365
Article CAS PubMed Google Scholar
Holland, P. W., & Wainer, H. (Eds.). (1993). Differential item functioning. Differential Item Functioning, xv, 453–xv, 453.
Osterlind, S., & Everson, H. (2009). Differential item functioning. SAGE Publications.
Book Google Scholar
Marais, I., & Andrich, D. (2008). Formalizing dimension and response violations of local independence in the unidimensional Rasch model. Journal of Applied Measurement, 9(3), 200–215.
PubMed Google Scholar
Christensen, K. B., Kreiner, S., & Mesbah, M. (Eds.). (2013). Rasch models in health. ISTE.
Google Scholar
Andrich, D., & Kreiner, S. (2010). Quantifying response dependence between two dichotomous items using the rasch model. Applied Psychological Measurement, 34(3), 181–192. https://doi.org/10.1177/0146621609360202
Article Google Scholar
Andrich, D., Humphry, S. M., & Marais, I. (2012). Quantifying local, response dependence between two polytomous items using the Rasch model. Applied Psychological Measurement, 36(4), 309–324. https://doi.org/10.1177/0146621612441858
Article Google Scholar
Yen, W. M. (1984). Effects of local item dependence on the fit and equating performance of the three-parameter logistic model. Applied Psychological Measurement, 8(2), 125–145. https://doi.org/10.1177/014662168400800201
Article Google Scholar
Chen, W.-H., & Thissen, D. (1997). Local dependence indexes for item pairs using item response theory. Journal of Educational and Behavioral Statistics, 22(3), 265. https://doi.org/10.2307/1165285
Article Google Scholar
Hoskens, M., & De Boeck, P. (1997). A parametric model for local dependence among test items. Psychological Methods, 2(3), 261–277. https://doi.org/10.1037/1082-989X.2.3.261
Article Google Scholar
Douglas, J., Kim, H. R., Habing, B., & Gao, F. (1998). Investigating local dependence with conditional covariance functions. Journal of Educational and Behavioral Statistics, 23(2), 129–151. https://doi.org/10.2307/1165318
Article Google Scholar
Ip, E. H. (2001). Testing for local dependency in dichotomous and polytomous item response models. Psychometrika, 66(1), 109–132. https://doi.org/10.1007/BF02295736
Article Google Scholar
Ip, E. H. (2002). Locally dependent latent trait model and the dutch identity revisited. Psychometrika, 67(3), 367–386. https://doi.org/10.1007/BF02294990
Article Google Scholar
Edwards, M. C., Houts, C. R., & Cai, L. (2018). A diagnostic procedure to detect departures from local independence in item response theory models. Psychological Methods, 23(1), 138–149. https://doi.org/10.1037/met0000121
Article PubMed Google Scholar
Straat, J. H., van der Ark, L. A., & Sijtsma, K. (2016). Using conditional association to identify locally independent item sets. Methodology, 12(4), 117–123. https://doi.org/10.1027/1614-2241/a000115
Article Google Scholar
Olsbjerg, M., & Christensen, K. B. (2015). Modeling local dependence in longitudinal IRT models. Behavior Research Methods, 47(4), 1413–1424. https://doi.org/10.3758/s13428-014-0553-0
Article PubMed Google Scholar
Marais, I. (2009). Response dependence and the measurement of change. Journal of Applied Measurement, 10, 17–29.
PubMed Google Scholar
Olsbjerg, M., & Christensen, K. B. (n.d.). LIRT: SAS macros for longitudinal IRT models, 49.
Olsbjerg, M., & Christensen, K. B. (2015). %lrasch_mml : A SAS macro for marginal maximum likelihood estimation in longitudinal polytomous rasch models. Journal of Statistical Software. https://doi.org/10.18637/jss.v067.c02
Article Google Scholar

Download references

Acknowledgements

We would like to warmly thank all the staff members of the EVALADD cohort.

Funding

Y. Dubuy received a national grant from the French Ministry of Higher Education, Research and Innovation. The EVALADD cohort is sponsored by Nantes University Hospital (CHU Nantes).

Author information

Authors and Affiliations

INSERM U1246, SPHERE University of Nantes, University of Tours, Nantes, France
Yseulys Dubuy, Véronique Sébille, Marie Grall-Bronnec, Gaëlle Challet-Bouju, Myriam Blanchin & Jean-Benoit Hardouin
Addictive Medicine and Psychiatry Department, CHU Nantes, Nantes, France
Marie Grall-Bronnec & Gaëlle Challet-Bouju

Authors

Yseulys Dubuy
View author publications
You can also search for this author in PubMed Google Scholar
Véronique Sébille
View author publications
You can also search for this author in PubMed Google Scholar
Marie Grall-Bronnec
View author publications
You can also search for this author in PubMed Google Scholar
Gaëlle Challet-Bouju
View author publications
You can also search for this author in PubMed Google Scholar
Myriam Blanchin
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Benoit Hardouin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yseulys Dubuy.

Ethics declarations

Conflict of interest

Authors declare that they have no conflict of interest.

Ethical approval

The EVALADD cohort (Investigator: M. Grall-Bronnec) was approved by the local Research Ethics Committee (Groupe Nantais d’Ethique dans le Domaine de la Santé), by the CCTIRS (Comité Consultatif sur le Traitement de l'Information en matière de Recherche dans le domaine de la Santé) and by the CNIL (Commission Nationale de l'Informatique et des Libertés).

Informed consent

All participants provided written informed consent (for under 18-year-olds, a legal representative provided informed consent), in accordance with the Helsinki declaration.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 4778 kb)

Supplementary file2 (PDF 317 kb)

Appendix 1: simulation implementation

Longitudinal partial credit model

The longitudinal Partial Credit Model (LPCM) was chosen to generate data since it allowed modelling response categories probabilities of polytomous items forming a unidimensional scale across time, and provided a possibility to simulate RS for a changing proportion of patients. The probability of patient $n$ to answer $m~(=0,\dots , M-1)$ on item $j$ at time $t$ under the LPCM is given by:

$$P\left({X}_{nj}^{(t)}=m|{\theta }_{n}^{(t)}, {\delta }_{j1}^{(t)},\dots , {\delta }_{jM-1}^{\left(t\right)}\right)=\frac{\mathrm{exp}\left(m{. \theta }_{n}^{\left(t\right)}-\sum_{p=1}^{m}{\delta }_{jp}^{\left(t\right)}\right)}{\sum_{l=0}^{M-1}\mathrm{exp}\left(l.{\theta }_{n}^{\left(t\right)}-\sum_{p=1}^{l}{\delta }_{jp}^{\left(t\right)}\right)}$$

where ${X}_{nj}^{(t)}$ denotes the response to the item $j=1,\dots , J$ of the individual $n$ at time $t$

${\theta }_{n}^{\left(t\right)}$ stands for the latent variable level of the individual $n$ at $t$ (realization of the random variable $\Theta$).

$$\left(\begin{array}{c}{\Theta }^{{(t}_{1})}\\ {\Theta }^{{(t}_{2})}\end{array}\right)\sim N\left(\left[\begin{array}{c}{\mu }_{1}\\ {\mu }_{2}\end{array}\right],\Sigma =\left[\begin{array}{cc}{\sigma }_{1}^{2}& {\sigma }_{\mathrm{1,2}}\\ {\sigma }_{\mathrm{1,2}}& {\sigma }_{2}^{2}\end{array}\right]\right)$$

${\delta }_{jm}^{\left(t\right)}$ is the difficulty of the response category $m=1,\dots , M-1$ from item $j$ at the time point $t$. If ${\delta }_{jm}^{\left(t\right)}$ is low, the proportion of patients scoring $m$ or more to item $j$ will be high: $m$ is hence an easy response category (vice versa for difficult response categories). Null response categories do not have a difficulty parameter.

At the first measurement occasion, difficulty parameters were chosen to be spaced along the latent variable continuum (assumed normally distributed, with a zero mean and a standard deviation equaled to 1). For each item$j$, the difficulty parameter of the first positive response category (denoted ${\delta }_{j1}^{(t_1)}$) equaled the $\frac{j}{J+1}th$ quantile from a $N\left(\mathrm{0,1}\right)$. Difficulty parameters of the following response categories were then regularly shifted from the first one: ${\delta }_{jm}^{\left({t}_{1}\right)}={\delta }_{j1}^{({t}_{1})}+\left(m-1\right)\times \frac{2}{M-2}$. Finally, difficulty parameters of all items were centered on the mean $\overline{\updelta }=\frac{\sum_{j,m}{\delta }_{jm}^{\left({t}_{1}\right)}}{J(M-1)}$ so that difficulty parameters were centered on the mean of the latent variable distribution (i.e. 0). It hence corresponded to the situation where the questionnaire is suitable for a population with a latent variable following a standard normal distribution. At the first measurement occasion, the model is a rating scale model.

Recalibration operationalization

To simulate the responses of patients affected by UR at ${t}_{2}$, we choose to shift by − 1 all the difficulty parameters of the item(s) affected by recalibration, making all response categories easier. For patients affected by NUR, difficulty parameters were differentially shifted by values ranging 0 to 2 $2\eta ,\mathrm{ with~}\eta =1.8$: the first positive response category kept the same difficulty parameter over time, while other categories became more difficult. Finally, we kept the difficulty parameters constant over time to simulate the responses of patients not affected by RS.

$$\text{For all } m \,\mathrm{in }\left\{1,\dots ,M-1\right\},~ \delta_{jm}^{(t_2)} = \left\{ {\begin{array}{*{20}cl} \delta_{jm}^{(t_1)}+\eta_m & \text{for individuals affected by RS} \\ \delta_{jm}^{(t_1)} & \quad \,\text{for individuals not affected by RS} \end{array} } \right. $$

For UR, ${\eta }_{m}^{UR}=-1$ for all $m$ $\mathrm{in }\left\{1,\dots ,M-1\right\}$

For NUR, $\eta_{m}^{NUR}=\left\{\begin{array}{cll}\frac{(m-1) \eta}{m} & \quad\text { if } ~1 \leq m<\frac{M}{2}& \\ \eta & \quad\text { if } ~m=\frac{M}{2} & \text { where } \eta=1.8 \\ \frac{(M-m+1) \eta}{M-m} & \quad\text { if } ~\frac{M}{2}<m \leq M-1 & \end{array}\right.$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dubuy, Y., Sébille, V., Grall-Bronnec, M. et al. Evaluation of the link between the Guttman errors and response shift at the individual level. Qual Life Res 31, 61–73 (2022). https://doi.org/10.1007/s11136-021-03015-9

Download citation

Accepted: 04 October 2021
Published: 17 October 2021
Issue Date: January 2022
DOI: https://doi.org/10.1007/s11136-021-03015-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Evaluation of the link between the Guttman errors and response shift at the individual level