Abstract
Quality of life assessment includes measurement of positive affect. Methods artifacts associated with positively and negatively worded items can manifest as negative items loading on a second factor, despite the conceptual view that the items are measuring one underlying latent construct. Negatively worded items may elicit biased responses. Additionally, item-level response bias across ethnically diverse groups may compromise group comparisons. The aim was to illustrate methodological approaches to examining method factors and measurement equivalence in an affect measure with 9 positively and 7 negatively worded items: The Feeling Tone Questionnaire (FTQ). The sample included 4960 non-Hispanic White, 1144 non-Hispanic Black, and 517 Hispanic community and institutional residents receiving long-term supportive services. The mean age was 82 (s.d. = 11.0); 73% were female. Two thirds were cognitively impaired. Methods effects were assessed using confirmatory factor analyses (CFA), and reliability with McDonald’s omega and item response theory (IRT) generated estimates. Measurement equivalence was examined using IRT-based Wald tests. Methods effects associated with negatively worded items were observed; these provided little IRT information, and as a composite evidenced lower reliability. Both 13 and 9 item positive affect scales performed well in terms of model fit, reliability, IRT information, and evidenced little differential item functioning of high magnitude or impact. Both CFA and IRT approaches provided complementary methodological information about scale performance. The 9-item affect scale based on the FTQ can be recommended as a brief quality-of-life measure among frail and cognitively impaired individuals in palliative and long-term care settings.
Similar content being viewed by others
References
Abbott, R. A., Ploubidis, G. B., Huppert, F. A., Kuh, D., Wadsworth, M. E., & Croudace, T. J. (2006). Psychometric evaluation and predictive validity of Ryff's psychological well-being items in a UK birth cohort sample of women. Health and Quality of Life Outcomes, 4, 76. doi:10.1186/1477-7525-4-76.
Albert, S. M., & Teresi, J. A. (2002). Quality of life, definition and measurement. The MacMillan Encyclopedia of Aging. New York: MacMillan References, U.S.A.
Asparouhov, T., & Muthén, B. (2009). Exploratory structural equation modeling. Structural Equation Modeling, 16, 397–438. doi:10.1080/10705510903008204.
Azocar, F., Areán, P., Miranda, J., & Muñoz, R. F. (2001). Differential item functioning in a Spanish translation of the Beck depression inventory. Journal of Clinical Psychology, 57(3), 355–365. doi:10.1002/jclp.1017.
Benjamini, Y., & Hochberg, Y. (1995). Controlling for the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57, 289–300.
Bentler, P. M. (1990). Comparative fit indexes in structural models. Psychological Bulletin, 107(2), 238–246. doi:10.1037/0033-2909.107.2.238.
Bentler, P. M. (2009). Alpha, dimension-free, and model-based internal consistency reliability. Psychometrika, 74, 137–143. doi:10.1007/s11336-008-9100-1.
Blanchflower, D. G., & Oswald, A. J. (2008). Is well-being U-shaped over the life cycle? Social Science Medicine, 66, 1733–1749. doi:10.1016/j.socscimed.2008.01.030.
Bolt, D. M., & Newton, J. R. (2011). Multiscale measurement of extreme response style. Educational and Psychological Measurement, 71(5), 814–833. doi:10.1177/0013164410388411.
Bonferroni, C. E. (1936). Teoria statistica delle classi e calcolo delle probabilità. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze, 8, 3–62.
Bonifay, W. E., Reise, S. P., Scheines, R., & Meijer, B. R. (2015). When are multidimensional data unidimensional enough for structural equation modeling? An evaluation of the DETECT multidimensionality index. Structural Equation Modeling, 22, 504–516. doi:10.1080/107005511.2014.938596.
Brod, M., Stewart, A. L., Sands, L., & Walton, P. (1999). Conceptualization and measurement of quality of life in dementia: The dementia quality of life instrument (DQoL). The Gerontologist, 39, 25–35. doi:10.1093/geront/39.1.25.
Buja, A., & Eyuboglu, N. (1992). Remarks on parallel analysis. Multivariate Behavioral Research, 27(4), 509–540. doi:10.1207/s15327906mbr2704_2.
Cai, L., Thissen, D., & du Toit, S. H. C. (2011). IRTPRO: Flexible, multidimensional, multiple categorical IRT modeling [computer software]. Chicago: Scientific Software International, Inc..
Camilli, G., & Shepard, L. A. (1994). Methods for identifying biased test items. Thousand Oaks: Sage Publications.
Cella, D., Yount, S., Rothrock, N., Gershon, R., Cook, K., Reeve, B., et al., on behalf of the PROMIS Cooperative Group. (2007). The Patient-Reported Outcomes Measurement Information System (PROMIS): Progress of an NIH roadmap cooperative group during its first two years. Medical Care , 45(5 Suppl 1), S3–S11. doi:10.1097/01.mlr.0000258615.42478.55.
Chan, K. S., Orlando, M., Ghosh-Dastidar, B., & Sherbourne, C. D. (2004). The interview mode effect on the Center of Epidemiological Studies Depression (CES-D) scale: An item response theory analysis. Medical Care, 42(3), 281–289. doi:10.1097/01.mlr.0000115632.78486.1f.
Chen, W. H., & Thissen, D. (1997). Local dependence indices for item pairs using item response theory. Journal of Educational and Behavioral Statistics, 22, 265–289. doi:10.2307/1165285.
Chen, F. F., West, S. W., & Soussa, K. H. (2006). A comparison of bifactor and second-order models of quality of life. Multivariate Behavioral Research, 41, 189–224.
Cheng, Y., Liu, C., & Behrens, J. (2015). Standard error of reliability estimates and the classification accuracy and consistency of binary decisions. Psychometrika, 80, 645–664. doi:10.1007/s11336-014-9407-z.
Choi, H., Fogg, L., Lee, E. E., & Choi Wu, M. (2009). Evaluating differential item functioning of the CES-D scale according to caregiver status and cultural context in Korean women. Journal of the American Psychiatric Nurses Association, 15(4), 240–248. doi:10.1177/1078390309343713.
Choi, S. W., Reise, S. P., Pilkonis, P. A., Hays, R. D., & Cella, D. (2010). Efficiency of static and computer adaptive short forms compared to full-length measures of depressive symptoms. Quality of Life Research, 19, 125–136. doi:10.1007/s11136-009-9560-5.
Choi, S. W., Gibbons, L. E., & Crane, P. K. (2011). lordif.: An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. Journal of Statistical Software, 39, 1–30. doi:10.18637/jss.v039.i08.
Cole, S. R., Kawachi, I., Maller, S. R., & Berkman, L. F. (2000). Test of item-response bias in the CES-D scale: Experience from the New Haven EPESE study. Journal of Clinical Epidemiology, 53, 285–289. doi:10.1016/S0895-4356(99)00151-1.
Cook, K. F., Kallen, M. A., & Amtmann, D. (2009). Having a fit: Impact of number of items and distribution of data on traditional criteria for assessing IRT’s unidimensionality assumption. Quality of Life Research, 18, 447–460. doi:10.1007/s11136-009-9464-4.
Cox, D. R., & Snell, E. J. (1989). The analysis of binary data (2nd ed.). London: Chapman and Hall.
Crane, P. K., van Belle, G., & Larson, E. B. (2004). Test bias in a cognitive test: Differential item functioning in the CASI. Statistics in Medicine, 23, 241–256. doi:10.1002/sim.1713.
Crane, P. K., Gibbons, L. E., Jolley, L., & van Belle, G. (2006). Differential item functioning analysis with ordinal logistic regression techniques. DIFdetect and difwithpar. Medical Care, 44(11 Suppl 3), S115–S123. doi:10.1097/01.mlr.0000245183.28384.ed.
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334. doi:10.1007/BF02310555.
Diener, E., Emmons, R. A., Larsen, R. J., & Griffin, S. (1985). The satisfaction with life scale. Journal of Personality Assessment, 49(1), 71–75. doi:10.1207/s15327752jpa4901_13.
Diener, E., Suh, E. M., Lucas, R. E., & Smith, H. L. (1999). Subjective well-being: Three decades of progress. Psychological Bulletin, 125(2), 276–302. doi:10.1037/0033-2909.125.2.276.
Dolan, P., Peasgood, T., & White, M. (2008). Do we really know what makes us happy? A review of the economic literature on the factors associated with subjective well-being. Journal of Economic Psychology, 29(1), 94–122. doi:10.1016/j.joep.2007.09.001.
Estabrook, R., Sadler, M. E., & McGue, M. (2015). Differential item functioning in the Cambridge Mental Disorders in the Elderly (CAMDEX) depression scale across middle age and late life. Psychological Assessment, 27(4), 1219–1233. doi:10.1037/pas0000114.
Fleer, P. F. (1993). A Monte Carlo assessment of a new measure of item and test bias. Illinois Institute of Technology. Dissertation Abstracts International, 54(04B), 2266.
Flowers, C. P., Oshima, T. C., & Raju, N. S. (1999). A description and demonstration of the polytomous DFIT framework. Applied Psychological Measurement, 23, 309–326. doi:10.1177/01466219922031437.
Garrido, L. E., Abad, F. J., & Ponsoda, V. (2012). A new look at Horn’s parallel analysis with ordinal variables. Psychological Methods, 18, 454–474. doi:10.1037/a0030005.
Grayson, D. A., Mackinnon, A., Jorm, A. F., Creasey, H., & Broe, G. A. (2000). Item bias in the Center for Epidemiologic Studies Depression Scale: Effects of physical disorders and disability in an elderly community sample. Journals of Gerontology: Psychological Sciences, 55B(5), 273–282. doi:10.1093/geronb/55.5.P273.
Green, S. B., Redell, N., Thompson, M. S., & Levy, R. (2016). Accuracy of revised and traditional parallel analyses for assessing dimensionality with binary data. Educational and Psychologial Measurement, 76, 5–21. doi:10.1177/0013164415581898.
Gurland, B. J., & Gurland, R. V. (2009a). The choices, choosing model of quality of life: Description and rationale. International Journal of Geriatric Psychiatry, 24(1), 90–95. doi:10.1002/gps.2110.
Gurland, B. J., & Gurland, R. V. (2009b). The choices, choosing model of quality of life: Linkages to a science base. International Journal of Geriatric Psychiatry, 24, 84–89. doi:10.1002/gps.2109.
Gurland, B. J., Gurland, R., Mitty, E., & Toner, J. (2009). The choices, choosing model of quality of life: Clinical evaluation and intervention. Journal of Interprofessional Care, Informal Healthcare, 23(2), 110–120. doi:10.1080/ 13561820802675657.
Gurland, B. J., Cheng, H., & Maurer, M. S. (2010). Health-related restrictions of choices and choosing: Implications for quality of life and clinical interventions. Patient Related Outcome Measures, 1, 73–80. doi:10.2147/PROM.S11842.
Gurland, B., Teresi, J. A., Eimicke, J. P., Maurer, M. S., & Reid, M. C. (2014). Quality of life impacts in the 16-year survival of an older ethnically diverse cohort. International Journal of Geriatric Psychiatry, 29, 533–545. doi:10.1002/gps.4038.
Hickey, A., Barker, M., McGee, H., & O’Boyle, C. (2005). Measuring health-related quality of life in older patient populations: A review of current approaches. PharmacoEconomics, 23(10), 971–993. doi:10.2165/00019053-200523100-00002.
Holmes, D., Ory, M., & Teresi, J. (Eds) (1994). Special dementia care: Research, policy, and practice issues. Alzheimer's Disease and Associated Disorders: An International Journal, 8(Suppl 1).
Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179–185. doi:10.1007/BF02289447.
Iwata, N., Turner, R. J., & Lloyd, D. A. (2002). Race/ethnicity and depressive symptoms in community-dwelling young adults: A differential item functioning analysis. Psychiatric Research, 110(3), 281–289. doi:10.1016/S0165-1781(02)00102-6.
Jensen, R. E., King-Kallimanis, B. L., Sexton, E., Reeve, B. B., Moinpour, C. M., Potosky, A. L., et al. (2016). Measurement properties of PROMIS® sleep disturbance short forms in a large, ethnically diverse cancer cohort. Psychological Test and Assessment Modeling, 58(2), 353-370.
Kahneman, D., & Krueger, A. B. (2006). Developments in the measurement of subjective well-being. The Journal of Economic Perspectives, 20(1), 23–24. doi:10.1257/089533006776526030.
Kahneman, D., Krueger, A. B., Schkade, D., Shwarz, N., & Stone, A. A. (2006). Would you be happier if you were richer? A focusing illusion. Science, 312, 1908–1910. doi:10.1126/science.1129688.
Kapteyn, A., Lee, J., Tasscot, C., Vonkova, H., & Zamarro, G. (2015). Dimensions of subjective well-being. Social Indicators Research, 123(3), 625–660. doi:10.1007/s11205-014-0753-0.
Kim, Y., Pilkonis, P. A., Frank, E., Thase, M. E., & Reynolds, C. F. (2002). Differential functioning of the Beck Depression Inventory in Late-life Patients: Use of item response theory. Psychology and Aging, 17(3), 379–391. doi:10.1037/0882-7974.17.3.379.
Kim, S., Cohen, A. S., Alagoz, C., & Kim, S. (2007). DIF detection and effect size measures for polytomously scored items. Journal of Educational Measurement, 44, 93–116. doi:10.1111/j.1745-3984.2007.00029.x.
Kim, G., Chiriboga, D. A., & Jang, Y. (2009). Cultural equivalence in depressive symptoms in older White, Black, and Mexican-American adults. Journal of the American Geriatrics Society, 75(5), 790–796. doi:10.1111/j.1532-5415.2009.02188.x.
Kleinman, M., & Teresi, J. A. (2016). Differential item functioning magnitude and impact measures from item response theory models. Psychological Test and Assessment Modeling, 58, 79–98.
Kopf, J., Zeileis, A., & Stobl, C. (2015). Anchor selection strategies for DIF analysis: Review, assessment and new approaches. Educational and Psychological Measurement, 75, 22–56. doi:10.1177/0013164414529792.
Lawton, M. P. (1983). Environment and other determinants of well-being in older people. The Gerontologist, 23(4), 349–357. doi:10.1093/geront/23.4.349.
Lawton, M. P. (1997). Assessing quality of life in Alzheimer disease research. Alzheimer Disease and Associated Disorders, 11(Suppl 6), 91–99.
Lawton, M. P., Moss, M., Hoffman, C., Grant, R., Ten Have, T., & Kleban, M. H. (1999). Health, valuation of life, and the wish to live. The Gerontologist, 39(4), 406–416. doi:10.1093/geront/39.4.406.
Lindwall, M., Barkoukis, V., Grano, C., Lucidi, G., Raudsapp, L., Liukkonen, J., et al. (2012). Method effects: The problem with negatively versus positively keyed items. Journal of Personality Assessment, 94(2), 196–204. doi:10.1080/00223891.2011.645936.
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale: Lawrence Erlbaum.
Lord, F. M. & Novick, M. R. (with contributions by A. Birnbaum) (1968). Statistical theories of mental test scores. Reading: Addison-Wesley Publishing Company, Inc.
Maydeu-Olivares, A., & Coffman, D. L. (2006). Random intercept item factor analysis. Psychological Methods, 11, 344–362. doi:10.1037/1082-989X.11.4.344.
McDonald, R. P. (1999). Test theory: A unified treatment. Mahwah: L. Erlbaum Associates.
McDonald, R. P. (2000). A basis for multidimensional item response theory. Applied Psychological Measurement, 24, 99–114. doi:10.1177/01466210022031552.
McFadden, D. (1974). Conditional logit analysis of qualitative choice behavior. In P. Zarembka (Ed.), Frontiers in econometrics (pp. 105–142). New York: Academic Press.
McHorney, C. A., & Fleishman, J. A. (2006). Assessing and understanding measurement equivalence in health outcomes measures: Issues for further quantitative and qualitative inquiry. Medical Care, 44(Suppl 3), S205–S210. doi:10.1097/01.mlr.0000245451.67862.57.
Meade, A. W., Johnson, E. C., & Bradley, P. W. (2008). Power and sensitivity of alternative fit indices in tests of measurement invariance. Journal of Applied Psychology, 93, 568–592. doi:10.1037/0021-9010.93.3.568.
Meredith, W. (1993). Measurement invariance, factor analysis and factorial invariance. Psychometrika, 58, 525–543. doi:10.1007/BF02294825.
Meredith, W., & Teresi, J. A. (2006). An essay on measurement and factorial invariance. Medical Care, 44(Suppl 3), S69–S77. doi:10.1097/01.mlr.0000245438.73837.89.
Molloy, D. W., & Standish, T. I. (1997). A guide to the Standardized Mini-mental Status Examination. International Psychogeriatrics, 9(Suppl 1), 87–94. doi:10.1017/S1041610297004754.
Mukherjee, S., Gibbons, L. E., Kristiansson, E., & Crane, P. K. (2013). Extension of an iterative hybrid ordinal logistic regression/item response theory approach to detect and account for differential item functioning in longitudinal data. Psychological Test and Assessment Modeling, 55, 127–147.
Muthén, L. K., & Muthén, B. O. (2011). M-PLUS users guide (6th ed.). Los Angeles: Muthén and Muthén.
Nagelkerke, N. J. D. (1991). A note on a general definition of the coefficient of determination. Biometrika, 78, 691–692. doi:10.1093/biomet/78.3.691.
Orlando-Edelen, M., Thissen, D., Teresi, J. A., Kleinman, M., & Ocepek-Welikson, K. (2006). Identification of differential item functioning using item response theory and the likelihood-based model comparison approach: Applications to the Mini-mental State Examination. Medical Care, 44(11 Suppl 3), S134–S142. doi:10.1097/01.mlr.0000245251.83359.8c.
Oshima, T. C., Kushubar, S., Scott, J. C., & Raju, N. S. (2009). DFIT for window user’s manual: Differential functioning of items and tests. St. Paul: Assessment Systems Corporation.
Perkins, A. J., Stump, T. E., Monahan, P. O., & McHorney, C. A. (2006). Assessment of differential item functioning for demographic comparisons in the MOS SF-36 health survey. Quality of Life Research, 15, 331–348.
Pickard, A. S., Dalal, M. R., & Bushnell, D. M. (2006). A comparison of depressive symptoms in stroke and primary care: Applying Rasch models to evaluate the Center for Epidemiologic Studies-Depression scale. Value in Health, 9(1), 59–64. doi:10.1111/j.1524-4733.2006.00082.x.
Pilkonis, P. A., Choi, S. W., Reise, S. P., Stover, A. M., Riley, W. T., & Cella, D. (2011). Item banks for measuring emotional distress from the Patient-reported Outcomes Measurement Information System (PROMIS): Depression, anxiety, and anger. Assessment, 18, 263–283. doi:10.1177/1073191111411667.
R Development Core Team (2008). R: A language and environment for statistical computing.Vienna: R Foundation for Statistical Computing. (ISBN 3–900051–07-0).
Radloff, L. S. (1977). The CES-D scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1, 385–401. doi:10.1177/014662167700100306.
Raju, N. S. (1999). DFITP5: A Fortran program for calculating dichotomous DIF/DTF [computer program]. Chicago: Illinois Institute of Technology.
Raju, N. S., van der Linden, W. J., & Fleer, P. F. (1995). IRT-based internal measures of differential functioning of items and tests. Applied Psychological Measurement, 19, 353–368. doi:10.1177/014662169501900405.
Raju, N. S., Fortmann-Johnson, K. A., Kim, W., Morris, S. B., Nering, M. L., & Oshima, T. C. (2009). The item parameter replication method for detecting differential functioning in the DFIT framework. Applied Measurement in Education, 33, 133–147. doi:10.1177/0146621608319514.
Reeve, B. B., Hays, R. D., Bjorner, J. B., Cook, K. F., Crane, P. K., Teresi, J. A., et al. (2007). Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-reported Outcome Measurement Information System (PROMIS). Medical Care, 45(5 Suppl 1), S22–S31. doi:10.1097/01.mlr.0000250483.85507.04.
Reeve, B. B., Pinheiro, L. C., Jensen, R. E., Teresi, J. A., Potosky, A. L., McFatrich, M. K., et al. (2016). Psychometric evaluation of the PROMIS® fatigue measure in an ethnically and racially diverse population-based sample of cancer patients. Psychological Test and Assessment Modeling, 58(1), 119–139.
Reise, S. P. (2012). The rediscovery of bifactor measurement models. Multivariate Behavioral Research, 47, 667–696. doi:10.1080/00273171.2012.715555.
Reise, S. P., & Haviland, M. G. (2005). Item response theory and the measurement of clinical change. Journal of Personality Assessment, 84, 228–238. doi:10.1207/s15327752jpa8403_02.
Reise, S. P., Widaman, K. F., & Pugh, R. H. (1993). Confirmatory factor analysis and item response theory: Two approaches for exploring measurement invariance. Psychological Bulletin, 114, 552–566. doi:10.1037/0033-2909.114.3.552.
Reise, S. P., Morizot, J., & Hays, R. D. (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research, 16(Suppl 1), 19–31. doi:10.1007/s11136-007-9183-7.
Reise, S. P., Moore, T. M., & Haviland, M. G. (2010). Bi-factor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores. Journal of Personality Assessment, 92, 544–559. doi:10.1080/00223891.2010.496477.
Revelle, W. (2015). Psych: Package psych. http://CRAN.R-project.org/package=PSYCH.
Revelle, W., & Zinbarg, R. E. (2009). Coefficient alpha, beta, omega, and the GLB: Comments on Sijtsma. Psychometrika, 74, 145–154. doi:10.1007/s11336-008-9102-z.
Rizopoulus, D. (2009). Ltm: Latent Trait Models under IRT. https://cran.r-project.org/web/packages/ltm/index.html.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 34, 100–114. doi:10.1007/BF02290599.
Saris, W. E., Revilla, M., Krosnick, J. A., & Shaeffer, E. M. (2010). Comparing questions with agree/disagree response options to questions with item-specific response options. Survey Research Methods, 4, 61–79. doi:10.18148/srm/2010.v4i1.2682.
Sass, D. A., Schmitt, T. A., & Marsh, H. W. (2014). Evaluating model fit with ordered categorical data within a measurement invariance framework: A comparison of estimators. Structural Equation Modeling, 21, 167–180.
Schmid, L., & Leiman, J. (1957). The development of hierarchical factor solutions. Psychometrika, 22, 53–61. doi:10.1007/BF02289209.
Seligman, M. E. P., & Csikszentmihalyi, M. (2000). Positive psychology: An introduction. American Psychologist, 55(1), 5–14. doi:10.1037/0003-066X.55.1.5.
Setodji, C. M., Reise, S. P., Morales, L. S., Fongwam, N., & Hays, R. D. (2011). Differential item functioning by survey language among older Hispanics enrolled in Medicare managed care a new method for anchor item selection. Medical Care, 49, 461–468. doi:10.1097/MLR.0b013e318207edb5.
Sijtsma, K. (2009). On the use, the misuse, and the very limited usefulness of Cronbach's alpha. Psychometrika, 74, 107–120. doi:10.1007/s11336-008-9101-0.
van Sonderen, E., Sanderman, R., & Coyne, J. C. (2013). Ineffectiveness of reverse wording of questionnaire items: Let's learn from cows in the rain. PloS One, 8(7), e68967. doi:10.1371/journal.pone.0068967.
Steptoe, A., Demakakos, P., de Oliveira, C., & Wardle, J. (2012). Distinctive biological correlates of positive psychological well-being in older men and women. Psychosomatic Medicine, 74, 501–508. doi:10.1097/PSY.0b013e31824f82c8.
Steptoe, A., Deaton, A., & Stone, A. A. (2015). Subjective wellbeing, health and ageing. The Lancet, 385(9968), 640–648. doi:10.1016/S0140-6736(13)61489-0.
Stone, A. A., Schwartz, J. E., Broderick, J. E., & Deaton, A. (2010). A snapshot of the age distribution of psychological well-being in the United States. Proceedings of the National Academy of Sciences of the United States of America, 107, 19949–19952. doi:10.1073/pnas.1003744107.
Stout, W. F. (1990). A new item response theory modeling approach with applications to unidimensional assessment and ability estimation. Psychometrika, 55, 293–325. doi:10.1007/BF02295289.
Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27, 361–370. doi:10.1111/j.1745-3984.1990.tb00754.x.
Teresi, J. A. & Jones, R. N. (2016). Methodological issues in examining measurement equivalence in patient reported outcomes measures: Methods overview to the two-part series, “Measurement equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) short form measures”. Psychological Test and Assessment Modeling, 58, 37–78.
Teresi, J., Abrams, R., & Holmes, D. (2000a). Measurement of depression and depression recognition individuals with cognitive impairment. In S. Albert & R. Logsdon (Eds.), Assessing quality of life in Alzheimer’s disease (pp. 121–151). New York: Springer.
Teresi, J. A., Kleinman, M., & Ocepek-Welikson, K. (2000b). Modern psychometric methods for detection of differential item functioning: Application to cognitive assessment measures. Statistics in Medicine, 19, 1651–1683. doi:10.1002/(SICI)1097-0258(20000615/30)19:11/12<1651::AID-SIM453>3.0.CO;2-H.
Teresi, J. A., Ramirez, M., Lai, J. -S., & Silver, S. (2008). Occurrences and sources of differential item functioning (DIF) in patient-reported outcome measures: Description of DIF methods, and review of measures of depression, quality of life and general health. Psychology Science Quarterly, 50, 538–612.
Teresi, J., Ocepek-Welikson, K., Kleinman, M., Eimicke, J. E., Crane, P. K., Jones, R. N., et al. (2009). Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach. Psychology Science Quarterly, 51, 148–180.
Teresi, J. A., Ocepek-Welikson, K., Kleinman, M., Ramirez, M., & Kim, G. (2016). Psychometric properties and performance of the Patient Reported Outcomes Measurement Information System (PROMIS®) depression short forms in ethnically diverse groups. Psychological Test and Assessment Modeling, 58, 141–181.
Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 123–135). Hillsdale: Lawrence Erlbaum, Inc..
Thissen, D., Steinberg, L., & Kuang, D. (2002). Quick and easy implementation of the Benjamini-Hochberg procedure for controlling the false discovery rate in multiple comparisons. Journal of Educational and Behavioral Statistics, 27, 77–83.
Toner, J. A., Teresi, J. A., Gurland, B., & Tirumalasetti, F. (1999). The Feeling-Tone Questionnaire: Reliability and validity of a direct patient assessment screening instrument for detection of depressive symptoms in cases of dementia. Journal of Clinical Geropsychiatry, 5, 63–78. doi:10.1023/A:1022994930394.
Uebersax, J. S. (2000). Polycorr. A program for estimation of the standard and extended polychoric correlation coefficient. Computer program documentation manual.
Vecchione, M., Allesandri, G., Vittorio Caprara, G., & Tisak, J. (2014). Are methods effects permanent or ephemeral in nature? The case of the revised life orientation test. Structural Equation Modeling, 21, 117–130. doi:10.1080/10705511.2014.859511.
Wainer, H. (1993). Model-based standardization measurement of an item's differential impact. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 123–135). Hillsdale: Lawrence Erlbaum, Inc..
Wang, W.-C., Shih, C.-L., & Sun, G.-W. (2012). The DIF-free-then-DIF strategy for the assessment of differential item functioning. Educational and Psychological Measurement, 72, 687–708. doi:10.1177/0013164411426157.
Watson, D., Clark, L. A., & Tellegan, A. (1988). Development and validation of brief measures of positive and negative affect: The PANAS scales. Journal of Personality and Social Psychology, 54(6), 1063–1070. doi:10.1037//0022-3514.54.6.1063.
Wood, A. M., Taylor, P. J., & Joseph, S. (2010). Does the CES-D measure a continuum from depression to happiness? Comparing substantive and artifactual models. Psychiatry Research, 177(1–2), 120–123. doi:10.1016/j.psychres.2010.02.003.
Woods, C. M. (2009). Empirical selection of anchors for tests of differential item functioning. Applied Psychological Measurement, 33, 42–57. doi:10.1177/0146621607314044.
Woods, C. M., Cai, L., & Wang, M. (2013). The Langer-improved Wald test for DIF testing with multiple groups: Evaluation and comparison to two-group IRT. Educational and Psychological Measurement, 73, 532–547. doi:10.1177/0013164412464875.
Yang, F. M., & Jones, R. N. (2007). Center of Epidemiologic Studies-Depression scale (CES-D) item response bias found with Mantel-Haenszel method was successfully replicated using latent variable modeling. Journal of Clinical Epidemiology, 60, 1195–1200. doi:10.1016/j.jclinepi.2007.02.008.
Yin, L., Muramatsu, N., & Gordon, R. (2015). Evaluating differential item functioning of a CES-D short form in Chinese older men and women: A rasch analysis. The Gerontologist, 55(Suppl 2), 708. doi:10.1093/geront/gnv355.12.
Zinbarg, R. E., Revelle, W., Yovel, I., & Li, W. (2005). Cronbach’s α, Revelle’s β and McDonald’s ω H : Their relations with each other and two alternative conceptualizations of reliability. Psychometrika, 70(1), 123–133. doi:10.1007/s11336–003–0974-7 http://personality-project.org/revelle/publications/zinbarg.revelle.pmet.05.pdf.
Zumbo, B. D. (1999). A handbook on the theory and methods of differential item functioning (DIF): Logistic regression modeling as a unitary framework for binary and Likert-type (ordinal) item scores. Ottawa: Directorate of Human Resources Research and Evaluation, Department of National Defense http://www.educ.ubc.ca/faculty/zumbo/DIF/index.html.
Zumbo, B. D., Gadermann, A. M., & Zeisser, C. (2007). Ordinal versions of coefficient alpha and theta for Likert rating scales. Journal of Modern Applied Statistical Methods, 6, 21–29.
Acknowledgements
Partial funding for the analyses was provided by the National Institute on Aging (NIA)-funded Mt. Sinai Pepper Center, P30AG028741 (PI: Siu), and the NIA Edward R. Roybal Center, P30AG022845 (PI: Reid, Pillemer, Wethington). The authors thank Stephanie Silver, MPH for editorial assistance in the preparation of this manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflicts of interest.
Additional information
This paper was based on a presentation at a preconference: Subjective well-being assessment in minority aging research, delivered on November 18, 2015 at the Gerontological Society of America meetings in Orlando, sponsored by the NIA Resource Centers for Minority Aging Research Coordinating Center (2P30AG021684).
Electronic Supplementary Material
ESM 1
(DOCX 76 kb)
Rights and permissions
About this article
Cite this article
Teresi, J.A., Ocepek-Welikson, K., Toner, J.A. et al. Methodological Issues in Measuring Subjective Well-Being and Quality-of-Life: Applications to Assessment of Affect in Older, Chronically and Cognitively Impaired, Ethnically Diverse Groups Using the Feeling Tone Questionnaire. Applied Research Quality Life 12, 251–288 (2017). https://doi.org/10.1007/s11482-017-9516-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11482-017-9516-9