Abstract
Purpose
It has been rarely studied whether observed disparity in health-related quality-of-life (HRQoL) scores between patients with diabetes and healthy individuals is due to differential item functioning (DIF) or a true difference in the underlying construct. This study aimed to examine DIF in the SF-36 questionnaire and its effect on comparing HRQoL scores between patients with diabetes and healthy people.
Methods
The sample consisted of 230 patients with type 2 diabetes and 642 healthy individuals who filled out the Persian version of the SF-36 questionnaire. To detect DIF across patients with diabetes and healthy individuals, multiple-group multiple-indicator multiple-causes model was used. In addition, item calibration strategy was used to determine whether the effect of item-level DIF was transferred to the scale level.
Results
Nine out of thirty-six (25 %) items were detected as DIF, of which one item (11 %) was flagged as uniform and eight items (89 %) as non-uniform DIF. Most of the DIF items were detected in the mental health component which includes vitality, perceived mental health and social functioning subscales rather than in physical health component. Moreover, nonsignificant latent mean differences for general health perception and social functioning subscales became significant after DIF calibration.
Conclusion
The findings of the present study show that patients with diabetes and healthy individuals perceived some items in the SF-36 questionnaire differently. More importantly, in some subscales, the effect of item-level DIF was transferred to the scale level. Consequently, considerable caution should be taken in comparing HRQoL scores between patients with diabetes and healthy individuals.
Similar content being viewed by others
References
WHO Organization. (2014). About diabetes. http://www.who.int/diabetes/action_online/basics/en.
International Diabetes Federation. (2014). Key findings. http://www.idf.org.
Al Hayek, A. A., Robert, A. A., Al Saeed, A., Alzaid, A. A., & Al Sabaan, F. S. (2014). Factors associated with health-related quality of life among Saudi patients with type 2 diabetes mellitus: A cross-sectional survey. Diabetes and Metabolism Journal, 38(3), 220–229.
Daniele, T. M., Bruin, V. M., Oliveira, D. S., Pompeu, C. M., & Forti, A. C. (2013). Associations among physical activity, comorbidities, depressive symptoms and health-related quality of life in type 2 diabetes. Arquivos Brasileiros de Endocrinologia e Metabologia, 57(1), 44–50.
Shah, B. M., Mezzio, D. J., Ho, J., & Ip, E. J. (2015). Association of ABC (HbA1c, blood pressure, LDL-cholesterol) goal attainment with depression and health-related quality of life among adults with type 2 diabetes. Journal of Diabetes and Its Complications, 29(6), 794–800.
Chasens, E. R., Sereika, S. M., Burke, L. E., Strollo, P. J., & Korytkowski, M. (2014). Sleep, health-related quality of life, and functional outcomes in adults with diabetes. Applied Nursing Research, 27(4), 237–241.
Santos, M. A., Ceretta, L. B., Reus, G. Z., Abelaira, H. M., Jornada, L. K., Schwalm, M. T., et al. (2014). Anxiety disorders are associated with quality of life impairment in patients with insulin-dependent type 2 diabetes: a case-control study. Revista Brasileira de Psiquiatria, 36(4), 298–304.
Moazen, M., Mazloom, Z., Ahmadi, A., Dabbaghmanesh, M. H., & Roosta, S. (2015). Effect of coenzyme Q10 on glycaemic control, oxidative stress and adiponectin in type 2 diabetes. Journal of the Pakistan Medical Association, 65(4), 404–408.
Sharif, F., Masoudi, M., Ghanizadeh, A., Dabbaghmanesh, M. H., Ghaem, H., & Masoumi, S. (2014). The effect of cognitive-behavioral group therapy on depressive symptoms in people with type 2 diabetes: A randomized controlled clinical trial. Iranian Journal of Nursing and Midwifery Research, 19(5), 529–536.
Eftekhari, M. H., Akbarzadeh, M., Dabbaghmanesh, M. H., & Hassanzadeh, J. (2014). The effect of calcitriol on lipid profile and oxidative stress in hyperlipidemic patients with type 2 diabetes mellitus. ARYA Atheroscler, 10(2), 82–88.
Zhan, P., Zhang, X., Brown, J., Vistisen, D., Sicree, R., Shaw, J., et al. (2010). Global healthcare expenditure on diabetes for 2010 and 2030. Diabetes Research and Clinical Practice, 87(3), 293–301.
Boyer, J. G., & Earp, J. A. (1997). The Development of an instrument for assessing the quality of life of people with diabetes. Medical Care, 35(5), 440–453.
Darvishpoor Kakhki, A., & Abed Saeedi, Z. (2013). Health-related quality of life of diabetic patients in Tehran. International Journal of Endocrinology and Metabolism, 11(4), e7945.
Weinberger, M., Kirkman, M. S., Samsa, G. P., Cowper, P. A., Shortliffe, E. A., Simel, D. L., et al. (1994). The relationship between glycemic control and health-related quality of life in patients with non-insulin-dependent diabetes mellitus. Medical Care, 32(12), 1173–1181.
Chung, J. O., Cho, D. H., Chung, D. J., & Chung, M. Y. (2014). An assessment of the impact of type 2 diabetes on the quality of life based on age at diabetes diagnosis. Acta Diabetologica, 51(6), 1065–1072.
Speight, J., Reaney, M. D., & Barnard, K. D. (2009). Not all roads lead to Rome: A review of quality of life measurement in adults with diabetes. Diabetic Medicine, 26(4), 315–327.
Tol, A., Alhani, F., Shojaeazadeh, D., Sharifirad, G., & Moazam, N. (2015). An empowering approach to promote the quality of life and self-management among type 2 diabetic patients. Journal of Education and Health Promotion, 4, 13.
O’Shea, M. P., Teeling, M., & Bennett, K. (2015). Comorbidity, health-related quality of life and self-care in type 2 diabetes: a cross-sectional study in an outpatient population. Irish Journal of Medical Science, 184(3), 623–630.
Bourdel-Marchasson, I., Druet, C., Helmer, C., Eschwege, E., Lecomte, P., Le-Goff, M., et al. (2013). Correlates of health-related quality of life in French people with type 2 diabetes. Diabetes Research and Clinical Practice, 101(2), 226–235.
Co, M. A., Tan, L. S., Tai, E. S., Griva, K., Amir, M., Chong, K. J., et al. (2015). Factors associated with psychological distress, behavioral impact and health-related quality of life among patients with type 2 diabetes mellitus. Journal of Diabetes and Its Complications, 29(3), 378–383.
Golicki, D., Dudzinska, M., Zwolak, A., & Tarach, J. S. (2015). Quality of life in patients with type 2 diabetes in Poland-comparison with the general population using the EQ-5D questionnaire. Advances in Clinical and Experimental Medicine, 24(1), 139–146.
Kiadaliri, A. A., Najafi, B., & Mirmalek-Sani, M. (2013). Quality of life in people with diabetes: A systematic review of studies in Iran. Journal of Diabetes and Metabolic Disorders, 12(1), 54.
Huang, I. C., Hwang, C. C., Wu, M. Y., Lin, W., Leite, W., & Wu, A. W. (2008). Diabetes-specific or generic measures for health-related quality of life? Evidence from psychometric validation of the D-39 and SF-36. Value in Health, 11(3), 450–461.
McHorney, C. A., Ware, J. E., Jr., & Raczek, A. E. (1993). The MOS 36-item short-form health survey (SF-36): II. Psychometric and clinical tests of validity in measuring physical and mental health constructs. Medical Care, 31(3), 247–263.
Ware, J. E., Jr., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483.
Teresi, J. A., & Fleishman, J. A. (2007). Differential item functioning and health assessment. Quality of Life Research, 16(1), 33–42.
Bjorner, J. B., Kreiner, S., Ware, J. E., Jr., Damsgaard, M. T., & Bech, P. (1998). Differential item functioning in the Danish translation of the SF-36. Journal of Clinical Epidemiology, 51(11), 1189–1202.
Dallmeijer, A. J., de Groot, V., Roorda, L. D., Schepers, V. P., Lindeman, E., van den Berg, L. H., et al. (2007). Cross-diagnostic validity of the SF-36 physical functioning scale in patients with stroke, multiple sclerosis and amyotrophic lateral sclerosis: A study using Rasch analysis. Journal of Rehabilitation Medicine, 39(2), 163–169.
Horner-Johnson, W., Krahn, G. L., Suzuki, R., Peterson, J. J., Roid, G., Hall, T., et al. (2010). Differential performance of SF-36 items in healthy adults with and without functional limitations. Archives of Physical Medicine and Rehabilitation, 91(4), 570–575.
Perkins, A. J., Stump, T. E., Monahan, P. O., & McHorney, C. A. (2006). Assessment of differential item functioning for demographic comparisons in the MOS SF-36 health survey. Quality of Life Research, 15(3), 331–348.
Pollard, B., Johnston, M., & Dixon, D. (2013). Exploring differential item functioning in the SF-36 by demographic, clinical, psychological and social factors in an osteoarthritis population. BMC Musculoskeletal Disorders, 14, 346.
Taylor, W. J., & McPherson, K. M. (2007). Using Rasch analysis to compare the psychometric properties of the short form 36 physical function score and the Health Assessment Questionnaire disability index in patients with psoriatic arthritis and rheumatoid arthritis. Arthritis and Rheumatism, 57(5), 723–729.
Wolfe, F., Hawley, D. J., Goldenberg, D. L., Russell, I. J., Buskila, D., & Neumann, L. (2000). The assessment of functional impairment in fibromyalgia (FM): Rasch analyses of 5 functional scales and the development of the FM Health Assessment Questionnaire. The Journal of Rheumatology, 27(8), 1989–1999.
Yu, Y. F., Yu, A. P., & Ahn, J. (2007). Investigating differential item functioning by chronic diseases in the SF-36 health survey: A latent trait analysis using MIMIC models. Medical Care, 45(9), 851–859.
Jafari, H., Lahsaeizadeh, S., Jafari, P., & Karimi, M. (2008). Quality of life in thalassemia major: Reliability and validity of the Persian version of the SF-36 questionnaire. Journal of Postgraduate Medicine, 54(4), 273–275.
Montazeri, A., Goshtasebi, A., Vahdaninia, M., & Gandek, B. (2005). The short form health survey (SF-36): Translation and validation study of the Iranian version. Quality of Life Research, 14(3), 875–882.
Ostini, R., & Nering, M. L. (2006). Polytomous item response theory models. Thousand Oaks, CA: Sage.
Huang, I. C., Leite, W. L., Shearer, P., Seid, M., Revicki, D. A., & Shenkman, E. A. (2011). Differential item functioning in quality of life measure between children with and without special health-care needs. Value in Health, 14(6), 872–883.
Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling, 9(2), 233.
Meredith, W. (1993). Measurment invariance, factor analysis and factorial invariance. Psychometrika, 58(4), 525–543.
Hancock, G. R. (1997). Structural equation modeling methods of hypothesis testing of latent variable means. Measurement and Evaluation in Counseling and Development, 30(2), 91–105.
Fonseca-Pedrero, E., Sierra-Baigrie, S., Lemos-Giráldez, S., Paino, M., & Muniz, J. (2012). Dimensional structure and measurement invariance of the youth self-report across gender and age. The Journal of Adolescent Health, 50(2), 148–153.
Schram, M. T., Baan, C. A., & Pouwer, F. (2009). Depression and quality of life in patients with diabetes: a systematic review from the European Depression in Diabetes (EDID) research consortium. Current Diabetes Reviews, 5(2), 112–119.
Ali, S., Stone, M. A., Peters, J. L., Davies, M. J., & Khunti, K. (2006). The prevalence of co-morbid depression in adults with type 2 diabetes: A systematic review and meta-analysis. Diabetic Medicine, 23(11), 1165–1173.
Li, C., Barker, L., Ford, E. S., Zhang, X., Strine, T. W., & Mokdad, A. H. (2008). Mokdad diabetes and anxiety in US adults: Findings from the 2006 behavioral risk factor surveillance system. Diabetic Medicine, 25(7), 878–881.
Li, C., Ford, E. S., Zhao, G., Balluz, L. S., Berry, J. T., & Mokdad, A. H. (2010). Undertreatment of mental health problems in adults with diagnosed diabetes and serious psychological distress: The behavioral risk factor surveillance system, 2007. Diabetes Care, 33(5), 1061–1064.
Peyrot, M., Rubin, R. R., Lauritzen, T., Snoek, F. J., Matthews, D. R., & Skovlund, S. E. (2005). Psychosocial problems and barriers to improved diabetes management: Results of the cross-national diabetes attitudes, wishes and needs (DAWN) study. Diabetes Medicine, 22(10), 1379–1385.
Bagheri, Z., Jafari, P., Faghih, M., Allahyari, E., & Dehesh, T. (2015). Testing measurement equivalence of the SF-36 questionnaire across patients on hemodialysis and healthy people. International Journal of Urology and Nephrology, 47(12), 2013–2021.
Ware, J. E., Jr., Gandek, B., Kosinski, M., Aaronson, N. K., Apolone, G., Brazier, J., et al. (1998). The equivalence of SF-36 summary health scores estimated using standard and country-specific algorithms in 10 countries: Results from the IQOLA project. Journal of Clinical Epidemiology, 51(11), 1167–1170.
Teresi, J. A., Ocepek-Welikson, K., Kleinman, M., Eimicke, J. P., Crane, P. K., Jones, R. N., et al. (2009). Analysis of differential item functioning in the depression item bank from the patient reported outcome measurement information system (PROMIS): An item response theory approach. Psychology Science Quarterly, 51(2), 148–180.
Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27(4), 361–370.
Choi, S. W., Gibbons, L. E., & Crane, P. K. (2011). Lordif: An R package for detecting differential item functioning using iterativehybrid ordinal logistic regression/item response theory and Monte Carlo simulations. Journal of Statistical Software, 39(8), 1–30.
Teresi, J. A. (2006). Different approaches to differential item functioning in health applications: Advantages, disadvantages and some neglected topics. Medical Care, 44(11), 152–170.
Woods, C. M., & Grimm, K. J. (2011). Testing for nonuniform differential item functioning with multiple indicator multiple cause models. Applied Psychological Measurement, 35(5), 339–361.
Vandenberg, R. J., & Lance, C. E. (2000). A review and synthesis of the measurement invariance literature: suggestions, practices, and recommendations for organizational research. Organizational Research Method, 3(1), 4–70.
Jafari, P., Sharafi, Z., Bagheri, Z., & Shalileh, S. (2014). Measurement equivalence of theKINDL questionnaire across child self-reports and parent proxy-reports: A comparison between item response theory and ordinal logistic regression. Child Psychiatry and Human Development, 45(3), 369–376.
Muthen, B. O. (1985). A method for studying the homogeneity of test items with respect to other relevant variables. Journal of Educational Statistics, 10(2), 121–132.
Muthén, B. O. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika, 49(1), 115–132.
Muthén, L. K., & Muthén, B. O. (2007). Mplus user’s guide. Los Angeles, CA: Muthén & Muthén.
Acknowledgments
This work was supported by a Grant No. (89-5315) from Shiraz University of Medical Sciences Research Council. The article was extracted from Marzieh Mahmoudi’s Master of Science thesis.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors hereby declare that they have no conflict of interest.
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the ethical and research committee of our institution, Shiraz University of Medical Sciences and also with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Zahra Bagheri and Peyman Jafari have contributed equally to this work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendices
Appendix 1
The MIMIC model consists of two main components, namely measurement and structural components. In the measurement component, the model relates test items to the underlying constructs of interest by a vector of regression coefficients (factor loadings) by the following equation:
In this equation, X ij is the response of ith individual to the jth item, λ j is the factor loading of jth item, ζ i is latent trait variable of ith individual (e.g., physical functioning), Z is a vector of background variables (e.g., age, gender or grouping variable), β j is the effect from the Z variable to X ij and δ i is random error with mean zero and is assumed to be normally distributed and independent of Z. If β j = 0, for the grouping variable, then jth item is homogeneous across groups or there is no DIF in this item.
In the structural component, the underlying construct of interest and the test items are related to background variables through matrices of regression coefficients by the following equation:
where \(\varvec{\gamma}{^\prime}\) is a vector of regression coefficient which models the relation between latent structure and background variables and ε i is the random disturbance of the latent trait variable with mean zero and is assumed to be normally distributed and independent of Z [57]. In the current study, the latent constructs of interest or latent subscales of the SF-36 regressed to age and gender as confounding variables in order to control the effect of these variables when assessing differential item functioning across healthy individuals and patients with diabetes.
Appendix 2
The WLSMV is an asymptotically distribution-free estimation method which has been specifically designed for categorical variables (binary and ordinal). This method is a simplified version of the weighted least square (WLS) approach. In WLS, it is assumed that a continuous, normal, latent response distribution X * underlies an observed ordinal variable X in the population:
where m (=1, 2,…, c) defines the observed value of an ordinal observed variable X and τ is the latent threshold parameter which partitions the continuous distribution of X * into c categories (c = 0, 1, …, C − 1). The lowest category of the threshold is predetermined to −∞ (τ 0 =−∞, i.e., c = 0) and highest one to +∞ (τ c =+∞, i.e., c = C − 1), whereas the other thresholds should be estimated from the data. In the first step of WLS technique, the thresholds and polychoric correlations are estimated using two-stage ML estimation method. Then, parameter estimates and their standard errors are obtained by the estimated asymptotic covariance matrix of the polychoric correlation and threshold estimates (denoted \({\tilde{\mathbf{V}}}\)) in a weight matrix W to minimize the following weighted least squares fit function:
In this function, θ is the vector of model parameters, W (=\({\tilde{\mathbf{V}}}\)) is the weight matrix, σ(θ) is the model-implied vector including then non-redundant, vectorized elements of Σ(θ), and s is the vector of sample statistics like threshold and polychoric correlation estimates. From mathematical point of view, LSMV is simpler than WLS estimator since it only considers diagonal elements of the full weight matrix in the above fit function. In addition, the diagonal weight matrix W D = diag(\({\tilde{\mathbf{V}}}\)) which does not need not be positive definite is more flexible than the full weight matrix W = \({\tilde{\mathbf{V}}}\). Therefore, computations and encountering numerical problems in the process of parameters estimation are not as extensive as WLS [58, 59].
Rights and permissions
About this article
Cite this article
Bagheri, Z., Jafari, P., Mahmoodi, M. et al. Testing whether patients with diabetes and healthy people perceive the meaning of the items in the Persian version of the SF-36 questionnaire similarly: a differential item functioning analysis. Qual Life Res 26, 835–845 (2017). https://doi.org/10.1007/s11136-016-1419-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11136-016-1419-y