Abstract
Purpose
Methods for response shift (RS) detection at the individual level could be of great interest when analyzing changes in patient-reported outcome data. Guttman errors (GEs), which measure discrepancies in respondents’ answers compared to the average sample responses, might be useful for detecting RS at the individual level between two time points, as RS may induce an increase in the number of discrepancies over time. This study aims to establish the link between recalibration RS and the change in the number of GEs over time (denoted index \(I\)) via simulations and explores the discriminating ability of this index.
Methods
We simulated the responses of individuals affected or not affected by recalibration RS (defined as changes in the patients’ standard of measurement) to determine whether simulated individuals with recalibration had a greater change in the number of GEs over time than individuals without recalibration. The effects of factors related to the sample, the questionnaire structure and recalibration were investigated. As an illustrative example, the change in the number of GEs was computed in patients suffering from eating disorders.
Results
Within simulations, simulated individuals affected by recalibration had, on average, a greater change in the number of GEs over time than did individuals without RS. Some of the parameters related to the questionnaire structure and recalibration magnitude appeared to have substantial effects on the values of \(I\). Discriminating abilities appeared, however, globally low.
Conclusion
Some evidence of the link between recalibration and the change in GEs was found in this study. GEs could be a valuable nonparametric tool for RS detection at a more individual level, but further investigation is needed.
Similar content being viewed by others
Data availability
Modules, scripts and an extract of the simulated data used in the paper are available at the Open Science Framework via the link: https://osf.io/h9nyd/?view_only=b196db78f31c4e9fbb07013342a133a2
Notes
i.e., nonrandom errors in the latent variable estimates.
References
Basch, E. (2017). Patient-reported outcomes: Harnessing patients’ voices to improve clinical care. The New England Journal of Medicine, 376(2), 105–108. https://doi.org/10.1056/NEJMp1611252
Schwartz, C. E., Finkelstein, J. A., & Rapkin, B. D. (2017). Appraisal assessment in patient-reported outcome research: Methods for uncovering the personal context and meaning of quality of life. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 26(3), 545–554. https://doi.org/10.1007/s11136-016-1476-2
Sprangers, M. A. G., & Schwartz, C. E. (1999). Integrating response shift into health-related quality of life research: A theoretical model. Social Science & Medicine, 48(11), 1507–1515. https://doi.org/10.1016/S0277-9536(99)00045-3
Vanier, A., Falissard, B., Sébille, V., & Hardouin, J. B. (2017). The complexity of interpreting changes observed over time in health-related quality of life: A short overview of 15 years of research on response shift theory. In F. Guillemin, A. Leplege, S. Briancon, E. Spitz, & J. Coste (Eds.), Perceived health and adaptation in chronic disease (1st ed.). New York: Routledge.
Schwartz, C. E., Sprangers, M. A., & Fayers, P. M. (2005). Response shift: You know it’s there, but how do you capture it? Challenges for the next phase of research. In Assessing quality of life in clinical trials (2nd ed.). Oxford University Press.
Oort, F. J. (2005). Using structural equation modeling to detect response shifts and true change. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 14(3), 587–598.
Schwartz, C. E. (2016). Introduction to special section on response shift at the item level. Quality of Life Research, 25(6), 1323–1325. https://doi.org/10.1007/s11136-016-1299-1
Guilleux, A., Blanchin, M., Vanier, A., Guillemin, F., Falissard, B., Schwartz, C. E., Hardouin, J. B., & Sébille, V. (2015). RespOnse Shift ALgorithm in Item response theory (ROSALI) for response shift detection with missing data in longitudinal patient-reported outcome studies. Quality of Life Research, 24(3), 553–564. https://doi.org/10.1007/s11136-014-0876-4
Blanchin, M., Guilleux, A., Hardouin, J.-B., & Sébille, V. (2020). Comparison of structural equation modelling, item response theory and Rasch measurement theory-based methods for response shift detection at item level: A simulation study. Statistical Methods in Medical Research, 29(4), 1015–1029. https://doi.org/10.1177/0962280219884574
Vanier, A., Sébille, V., Blanchin, M., Guilleux, A., & Hardouin, J.-B. (2015). Overall performance of Oort’s procedure for response shift detection at item level: A pilot simulation study. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 24(8), 1799–1807. https://doi.org/10.1007/s11136-015-0938-2
Nolte, S., Mierke, A., Fischer, H. F., & Rose, M. (2016). On the validity of measuring change over time in routine clinical assessment: A close examination of item-level response shifts in psychosomatic inpatients. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1339–1347. https://doi.org/10.1007/s11136-015-1123-3
Gandhi, P. K., Schwartz, C. E., Reeve, B. B., DeWalt, D. A., Gross, H. E., & Huang, I.-C. (2016). An item-level response shift study on the change of health state with the rating of asthma-specific quality of life: A report from the PROMIS(®) Pediatric Asthma Study. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1349–1359. https://doi.org/10.1007/s11136-016-1290-x
Verdam, M. G. E., Oort, F. J., & Sprangers, M. A. G. (2016). Using structural equation modeling to detect response shifts and true change in discrete variables: An application to the items of the SF-36. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 25(6), 1361–1383. https://doi.org/10.1007/s11136-015-1195-0
Ahmed, S., Sawatzky, R., Levesque, J.-F., Ehrmann-Feldman, D., & Schwartz, C. E. (2014). Minimal evidence of response shift in the absence of a catalyst. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 23(9), 2421–2430. https://doi.org/10.1007/s11136-014-0699-3
Blanchin, M., Sébille, V., Guilleux, A., & Hardouin, J.-B. (2016). The Guttman errors as a tool for response shift detection at subgroup and item levels. Quality of Life Research, 25(6), 1385–1393. https://doi.org/10.1007/s11136-016-1268-8
Meijer, R. R., Niessen, A. S. M., & Tendeiro, J. N. (2016). A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program. Assessment, 23(1), 52–62. https://doi.org/10.1177/1073191115577800
Sijtsma, K., & Molenaar, I. W. (2002). Introduction to nonparametric item response theory. SAGE.
Emons, W. H. M. (2008). Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247. https://doi.org/10.1177/0146621607302479
Fischer, G. H., & Ponocny, I. (1994). An extension of the partial credit model with an application to the measurement of change. Psychometrika, 59(2), 177–192. https://doi.org/10.1007/BF02295182
American Psychiatric Association, & American Psychiatric Association (eds.). (2009). Diagnostic and statistical manual of mental disorders: DSM-IV-TR (4. ed., text revision, 13. print.). Arlington, VA: American Psychiatric Assoc.
Lecrubier, Y., Sheehan, D., Weiller, E., Amorim, P., Bonora, I., Harnett Sheehan, K., & Dunbar, G. (1997). The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: Reliability and validity according to the CIDI. European Psychiatry, 12(5), 224–231. https://doi.org/10.1016/S0924-9338(97)83296-8
Sheehan, D. V., Lecrubier, Y., Sheehan, K. H., Amorim, P., Janavs, J., Weiller, E., & Dunbar, G. C. (1998). The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. The Journal of Clinical Psychiatry, 59(Suppl 20), 22–33.
Garner, D. M. (1991). Eating disorder inventory-2. Professional manual. Psychological Assessment Research.
Archinard, M., Rouget, P., Painot, D., & Liengme, C. (2002). Inventaire des troubles alimentaires 2 [Eating Disorder Inventory 2]. In M. Bouvard & J. Cottraux (Eds.), Protocoles et échelles d’évaluation en psychiatrie et en psychologie [Protocols and evaluation scales in psychiatry and psychology] (3rd ed., pp. 249–251). Masson.
Cloninger, C. R., Przybeck, T. R., & Svrakic, D. M. (1994). The temperament and character inventory (TCI) a guide to its development and use. Center for Psychobiology of Personality, Washington University.
Pélissolo, A., & Lépine, J.-P. (1997). Traduction française et premières études de validation du questionnaire de personnalité TCI. [Validation study of the French version of the TCI.]. Annales Médico-Psychologiques, 155(8), 497–508.
Chakroun-Vinciguerra, N., Faytout, M., Pélissolo, A., & Swendsen, J. (2005). Validation française de la version courte de l’Inventaire du Tempérament et du Caractère (TCI-125). Journal de Thérapie Comportementale et Cognitive, 15(1), 27–33. https://doi.org/10.1016/S1155-1704(05)81209-1
Cooper, P. J., Taylor, M. J., Cooper, Z., & Fairbum, C. G. (1987). The development and validation of the body shape questionnaire. International Journal of Eating Disorders, 6(4), 485–494. https://doi.org/10.1002/1098-108X(198707)6:4%3c485::AID-EAT2260060405%3e3.0.CO;2-O
Rousseau, A., Knotter, A., Barbe, P., Raich, R., & Chabrol, H. (2005). Validation of the French version of the Body Shape Questionnaire. L’Encephale, 31(2), 162–173. https://doi.org/10.1016/s0013-7006(05)82383-8
Ware, J. E., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483.
Aaronson, N. K., Ahmedzai, S., Bergman, B., Bullinger, M., Cull, A., Duez, N. J., & de Haes, J. C. (1993). The European Organization for Research and Treatment of Cancer QLQ-C30: A quality-of-life instrument for use in international clinical trials in oncology. Journal of the National Cancer Institute, 85(5), 365–376. https://doi.org/10.1093/jnci/85.5.365
Holland, P. W., & Wainer, H. (Eds.). (1993). Differential item functioning. Differential Item Functioning, xv, 453–xv, 453.
Osterlind, S., & Everson, H. (2009). Differential item functioning. SAGE Publications.
Marais, I., & Andrich, D. (2008). Formalizing dimension and response violations of local independence in the unidimensional Rasch model. Journal of Applied Measurement, 9(3), 200–215.
Christensen, K. B., Kreiner, S., & Mesbah, M. (Eds.). (2013). Rasch models in health. ISTE.
Andrich, D., & Kreiner, S. (2010). Quantifying response dependence between two dichotomous items using the rasch model. Applied Psychological Measurement, 34(3), 181–192. https://doi.org/10.1177/0146621609360202
Andrich, D., Humphry, S. M., & Marais, I. (2012). Quantifying local, response dependence between two polytomous items using the Rasch model. Applied Psychological Measurement, 36(4), 309–324. https://doi.org/10.1177/0146621612441858
Yen, W. M. (1984). Effects of local item dependence on the fit and equating performance of the three-parameter logistic model. Applied Psychological Measurement, 8(2), 125–145. https://doi.org/10.1177/014662168400800201
Chen, W.-H., & Thissen, D. (1997). Local dependence indexes for item pairs using item response theory. Journal of Educational and Behavioral Statistics, 22(3), 265. https://doi.org/10.2307/1165285
Hoskens, M., & De Boeck, P. (1997). A parametric model for local dependence among test items. Psychological Methods, 2(3), 261–277. https://doi.org/10.1037/1082-989X.2.3.261
Douglas, J., Kim, H. R., Habing, B., & Gao, F. (1998). Investigating local dependence with conditional covariance functions. Journal of Educational and Behavioral Statistics, 23(2), 129–151. https://doi.org/10.2307/1165318
Ip, E. H. (2001). Testing for local dependency in dichotomous and polytomous item response models. Psychometrika, 66(1), 109–132. https://doi.org/10.1007/BF02295736
Ip, E. H. (2002). Locally dependent latent trait model and the dutch identity revisited. Psychometrika, 67(3), 367–386. https://doi.org/10.1007/BF02294990
Edwards, M. C., Houts, C. R., & Cai, L. (2018). A diagnostic procedure to detect departures from local independence in item response theory models. Psychological Methods, 23(1), 138–149. https://doi.org/10.1037/met0000121
Straat, J. H., van der Ark, L. A., & Sijtsma, K. (2016). Using conditional association to identify locally independent item sets. Methodology, 12(4), 117–123. https://doi.org/10.1027/1614-2241/a000115
Olsbjerg, M., & Christensen, K. B. (2015). Modeling local dependence in longitudinal IRT models. Behavior Research Methods, 47(4), 1413–1424. https://doi.org/10.3758/s13428-014-0553-0
Marais, I. (2009). Response dependence and the measurement of change. Journal of Applied Measurement, 10, 17–29.
Olsbjerg, M., & Christensen, K. B. (n.d.). LIRT: SAS macros for longitudinal IRT models, 49.
Olsbjerg, M., & Christensen, K. B. (2015). %lrasch_mml : A SAS macro for marginal maximum likelihood estimation in longitudinal polytomous rasch models. Journal of Statistical Software. https://doi.org/10.18637/jss.v067.c02
Acknowledgements
We would like to warmly thank all the staff members of the EVALADD cohort.
Funding
Y. Dubuy received a national grant from the French Ministry of Higher Education, Research and Innovation. The EVALADD cohort is sponsored by Nantes University Hospital (CHU Nantes).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors declare that they have no conflict of interest.
Ethical approval
The EVALADD cohort (Investigator: M. Grall-Bronnec) was approved by the local Research Ethics Committee (Groupe Nantais d’Ethique dans le Domaine de la Santé), by the CCTIRS (Comité Consultatif sur le Traitement de l'Information en matière de Recherche dans le domaine de la Santé) and by the CNIL (Commission Nationale de l'Informatique et des Libertés).
Informed consent
All participants provided written informed consent (for under 18-year-olds, a legal representative provided informed consent), in accordance with the Helsinki declaration.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendix 1: simulation implementation
Appendix 1: simulation implementation
Longitudinal partial credit model
The longitudinal Partial Credit Model (LPCM) was chosen to generate data since it allowed modelling response categories probabilities of polytomous items forming a unidimensional scale across time, and provided a possibility to simulate RS for a changing proportion of patients. The probability of patient \(n\) to answer \(m~(=0,\dots , M-1)\) on item \(j\) at time \(t\) under the LPCM is given by:
where \({X}_{nj}^{(t)}\) denotes the response to the item \(j=1,\dots , J\) of the individual \(n\) at time \(t\)
\({\theta }_{n}^{\left(t\right)}\) stands for the latent variable level of the individual \(n\) at \(t\) (realization of the random variable \(\Theta\)).
\({\delta }_{jm}^{\left(t\right)}\) is the difficulty of the response category \(m=1,\dots , M-1\) from item \(j\) at the time point \(t\). If \({\delta }_{jm}^{\left(t\right)}\) is low, the proportion of patients scoring \(m\) or more to item \(j\) will be high: \(m\) is hence an easy response category (vice versa for difficult response categories). Null response categories do not have a difficulty parameter.
At the first measurement occasion, difficulty parameters were chosen to be spaced along the latent variable continuum (assumed normally distributed, with a zero mean and a standard deviation equaled to 1). For each item\(j\), the difficulty parameter of the first positive response category (denoted \({\delta }_{j1}^{(t_1)}\)) equaled the \(\frac{j}{J+1}th\) quantile from a \(N\left(\mathrm{0,1}\right)\). Difficulty parameters of the following response categories were then regularly shifted from the first one: \({\delta }_{jm}^{\left({t}_{1}\right)}={\delta }_{j1}^{({t}_{1})}+\left(m-1\right)\times \frac{2}{M-2}\). Finally, difficulty parameters of all items were centered on the mean \(\overline{\updelta }=\frac{\sum_{j,m}{\delta }_{jm}^{\left({t}_{1}\right)}}{J(M-1)}\) so that difficulty parameters were centered on the mean of the latent variable distribution (i.e. 0). It hence corresponded to the situation where the questionnaire is suitable for a population with a latent variable following a standard normal distribution. At the first measurement occasion, the model is a rating scale model.
Recalibration operationalization
To simulate the responses of patients affected by UR at \({t}_{2}\), we choose to shift by − 1 all the difficulty parameters of the item(s) affected by recalibration, making all response categories easier. For patients affected by NUR, difficulty parameters were differentially shifted by values ranging 0 to 2 \(2\eta ,\mathrm{ with~}\eta =1.8\): the first positive response category kept the same difficulty parameter over time, while other categories became more difficult. Finally, we kept the difficulty parameters constant over time to simulate the responses of patients not affected by RS.
For UR, \({\eta }_{m}^{UR}=-1\) for all \(m\) \(\mathrm{in }\left\{1,\dots ,M-1\right\}\)
For NUR, \(\eta_{m}^{NUR}=\left\{\begin{array}{cll}\frac{(m-1) \eta}{m} & \quad\text { if } ~1 \leq m<\frac{M}{2}& \\ \eta & \quad\text { if } ~m=\frac{M}{2} & \text { where } \eta=1.8 \\ \frac{(M-m+1) \eta}{M-m} & \quad\text { if } ~\frac{M}{2}<m \leq M-1 & \end{array}\right.\)
Rights and permissions
About this article
Cite this article
Dubuy, Y., Sébille, V., Grall-Bronnec, M. et al. Evaluation of the link between the Guttman errors and response shift at the individual level. Qual Life Res 31, 61–73 (2022). https://doi.org/10.1007/s11136-021-03015-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11136-021-03015-9