The reliability of a two-item scale: Pearson, Cronbach, or Spearman-Brown?
- 8k Downloads
To obtain reliable measures researchers prefer multiple-item questionnaires rather than single-item tests. Multiple-item questionnaires may be costly however and time-consuming for participants to complete. They therefore frequently administer two-item measures, the reliability of which is commonly assessed by computing a reliability coefficient. There is some disagreement, however, what the most appropriate indicator of scale reliability is when a measure is composed of two items. The most frequently reported reliability statistic for multiple-item scales is Cronbach’s coefficient alpha and many researchers report this coefficient for their two-item measure (Cuijpers et al. 2009; Löwe et al. 2005; Michal et al. 2010; Young et al. 2009). Others however claim that coefficient alpha is inappropriate and meaningless for two-item scales (Sainfort and Booske 2000; Verhoef 2003; Cramer et al. 2006; O’Brien et al. 2008). Instead, they recommend using the Pearson correlation...
KeywordsCoefficient Alpha True Score Local Dependence Classical Test Theory True Reliability
The authors are grateful to William Revelle and an anonymous reviewer for helpful comments on a previous version of this manuscript and suggestions for improvements.
- Bollen KA (1989) Structural equations with latent variables. Wiley, New YorkGoogle Scholar
- Embretson SE, Reise SP (2000) Item response theory for psychologists. Lawrence Erlbaum Associates, MahwahGoogle Scholar
- Hancock GR, Mueller RO (2001) Rethinking construct reliability within latent variable systems. In Cudeck R, Du Toit S, Sörbom D (Eds), Structural equation modeling: present and future. A festschrift in honor of Karl Jöreskog. Scientific Software International, Lincolnwood, pp 195–216Google Scholar
- Lord FM, Novick MR (1968) Statistical theories of mental test scores. Reading, Addison-WesleyGoogle Scholar