Skip to main content
Log in

Sample size determinations for the two rater kappa statistic

  • Published:
Psychometrika Aims and scope Submit manuscript


This paper gives a method for determining a sample size that will achieve a prespecified bound on confidence interval width for the interrater agreement measure,κ. The same results can be used when a prespecified power is desired for testing hypotheses about the value of kappa. An example from the literature is used to illustrate the methods proposed here.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others


  • Cohen, J. (1960). A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20, 37–46.

    Google Scholar 

  • Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit.Psychological Bulletin, 70, 213–220.

    Google Scholar 

  • Dixon, W. J., & Massey, F. J., Jr. (1983).Introduction to statistical analysis (4th ed.). New York: McGraw-Hill.

    Google Scholar 

  • Flack, V. F. (1987). Confidence intervals for the two rater kappa.Communications in Statistics: Theory and Methods, 16, 953–968.

    Google Scholar 

  • Fleiss, J. L. (1981).Statistical methods for rates and proportions (2nd ed.). New York: Wiley.

    Google Scholar 

  • Fleiss, J. L., Cohen, J., & Everitt, B. S. (1969). Large sample standard errors of kappa and weighted kappa.Psychological Bulletin, 72, 323–327.

    Google Scholar 

  • Landis, J. R., & Koch, G. G. (1977). The measurement of interrater agreement for categorical data.Biometrics, 33, 159–174.

    Google Scholar 

Download references

Author information

Authors and Affiliations


Rights and permissions

Reprints and permissions

About this article

Cite this article

Flack, V.F., Afifi, A.A., Lachenbruch, P.A. et al. Sample size determinations for the two rater kappa statistic. Psychometrika 53, 321–325 (1988).

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI:

Key words