Abstract
Differential item functioning (DIF) analyses of the Beck Depression Inventory-II (BDI-II) were conducted on samples of 267 women with breast cancer and 294 women with clinical depression. Patterns of items showing significant and nonsignificant DIF were identified using statistical tests and measures of DIF effect size. At the most general level, 15 of the 21 BDI-II items showed nontrivial DIF, suggesting that the item responses of these two samples do not reflect the same underlying construct. Factor analyses of the BDI-II, conducted with a psychometrically defensible method for item-level factor analysis, supported the conclusions from the DIF analyses. These findings suggest that researchers and practitioners should exercise caution when interpreting self-reported depression symptoms in breast cancer patients.
Cite this article
Waller, N.G., Compas, B.E., Hollon, S.D. et al. Measurement of Depressive Symptoms in Women With Breast Cancer and Women With Clinical Depression: A Differential Item Functioning Analysis. J Clin Psychol Med Settings 12, 127–141 (2005). https://doi.org/10.1007/s10880-005-3273-x
DOI: https://doi.org/10.1007/s10880-005-3273-x