European Radiology

, 18:1134

Observer variability in screen-film mammography versus full-field digital mammography with soft-copy reading

  • Per Skaane
  • Felix Diekmann
  • Corinne Balleyguier
  • Susanne Diekmann
  • Jean-Charles Piguet
  • Kari Young
  • Michael Abdelnoor
  • Loren Niklason


Full-field digital mammography (FFDM) with soft-copy reading is more complex than screen-film mammography (SFM) with hard-copy reading. The aim of this study was to compare inter- and intraobserver variability in SFM versus FFDM of paired mammograms from a breast cancer screening program. Six radiologists interpreted mammograms of 232 cases obtained with both techniques, including 46 cancers, 88 benign lesions, and 98 normals. Image interpretation included BI-RADS categories. A case consisted of standard two-view mammograms of one breast. Images were scored in two sessions separated by 5 weeks. Observer variability was substantial for SFM as well as for FFDM, but overall there was no significant difference between the observer variability at SFM and FFDM. Mean kappa values were lower, indicating less agreement, for microcalcifications compared with masses. The lower observer agreement for microcalcifications, and especially the low intraobserver concordance between the two imaging techniques for three readers, was noticeable. The level of observer agreement might be an indicator of radiologist performance and could confound studies designed to separate diagnostic differences between the two imaging techniques. The results of our study confirm the need for proper training for radiologists starting FFDM with soft-copy reading in breast cancer screening.


Breast neoplasms Radiography Breast radiography Comparative studies Cancer screening Full-field digital mammography Interobserver variation 


  1. 1.
    Bick U, Diekmann F (2007) Digital mammography: what do we and what don’t we know? Eur Radiol 17:1931–1942PubMedCrossRefGoogle Scholar
  2. 2.
    Diekmann S, Bick U, von Heyden H, Diekmann F (2003) Visualization of microcalcifications on mammographies obtained by digital full-field mammography in comparison to conventional film-screen mammography (in German). Rofo 175:775–779PubMedGoogle Scholar
  3. 3.
    Fischer U, Baum F, Obenauer S, Luftner-Nagel S, von Heyden D, Vosshenrich R, Grabbe E (2002) Comparative study in patients with microcalcifications: full-field digital mammography vs screen-film mammography. Eur Radiol 12:2679–2683PubMedGoogle Scholar
  4. 4.
    Obenauer S, Luftner-Nagel S, von Heyden D, Munzel U, Baum F, Grabbe E (2002) Screen film vs full-field digital mammography: image quality, detectability and characterization of lesions. Eur Radiol 12:1697–1702PubMedCrossRefGoogle Scholar
  5. 5.
    Obenauer S, Hermann KP, Marten K, Luftner-Nagel S, von Heyden D, Skaane P, Grabbe E (2003) Soft copy versus hard copy reading in digital mammography. J Digit Imaging 16:341–344PubMedCrossRefGoogle Scholar
  6. 6.
    Kim HH, Pisano ED, Cole EB, Jiroutek MR, Muller KE, Zheng Y, Kuzmiak CM, Koomen MA (2006) Comparison of calcification specificity in digital mammography using soft-copy display versus screen-film mammography. AJR Am J Roentgenol 187:47–50PubMedCrossRefGoogle Scholar
  7. 7.
    Beam CA, Layde PM, Sullivan DC (1996) Variability in the interpretation of screening mammograms by US radiologists. Arch Intern Med 156:209–213PubMedCrossRefGoogle Scholar
  8. 8.
    Elmore JG, Wells CK, Lee CH, Howard DH, Feinstein AR (1994) Variability in radiologists’ interpretations of mammograms. N Engl J Med 331:1493–1499PubMedCrossRefGoogle Scholar
  9. 9.
    Elmore JG, Nakano CY, Koepsell TD, Desnick LM, D’Orsi CJ, Ransohoff DF (2003) International variation in screening mammography interpretations in community-based programs. J Natl Cancer Inst 95:1384–1393PubMedGoogle Scholar
  10. 10.
    Skaane P, Engedal K, Skjennald A (1997) Interobserver variation in the interpretation of breast imaging. Comparison of mammography, ultrasonography, and both combined in the interpretation of palpable noncalcified breast masses. Acta Radiol 38:497–502PubMedCrossRefGoogle Scholar
  11. 11.
    Berg WA, Campassi C, Langenberg P, Sexton MJ (2000) Breast imaging reporting and data system: inter- and intraobserver variability in feature analysis and final assessment. AJR Am J Roentgenol 174:1769–1777PubMedGoogle Scholar
  12. 12.
    Skaane P, Young K, Skjennald A (2003) Population-based mammography screening: comparison of screen-film and full-field digital mammography with soft-copy reading—Oslo I study. Radiology 229:877–884PubMedCrossRefGoogle Scholar
  13. 13.
    Skaane P, Skjennald A (2004) Screen-film mammography versus full-field digital mammography with soft-copy reading: randomized trial in a population-based screening program—The Oslo II study. Radiology 232:197–204PubMedCrossRefGoogle Scholar
  14. 14.
    Skaane P, Balleyguier C, Diekmann F, Diekmann S, Piguet JC, Young K, Niklason LT (2005) Breast lesion detection and classification: comparison of screen-film mammography and full-field digital mammography with soft-copy reading—observer performance study. Radiology 237:37–44PubMedCrossRefGoogle Scholar
  15. 15.
    Pedersen K, Nordanger J (2002) Quality control of the physical and technical aspects of mammography in the Norwegian breast-screening programme. Eur Radiol 12:463–470PubMedCrossRefGoogle Scholar
  16. 16.
    Cohen J (1968) Weighted kappa. Nominal scale agreement with provision for scaled disagreement or partial credit. Psychol Bull 70:213–220CrossRefPubMedGoogle Scholar
  17. 17.
    Donner A, Eliasziw M (1992) A goodness-of-fit approach to inference procedures for the kappa statistic: confidence interval construction, significance-testing and sample size estimation. Stat Med 11:1511–1519PubMedCrossRefGoogle Scholar
  18. 18.
    Cicchetti DV (1976) Assessing inter-rater reliability for rating scales: resolving some basic issues. Br J Psychiatr 129:452–456CrossRefGoogle Scholar
  19. 19.
    Venta LA, Hendrick RE, Adler YT, DeLeon P, Mengoni PM, Scharl AM, Comstock CE, Hansen L, Kay N, Coveler A, Cutter G (2001) Rates and causes of disagreement in interpretation of full-field digital mammography and film-screen mammography in a diagnostic setting. AJR Am J Roentgenol 176:1241–1248PubMedGoogle Scholar
  20. 20.
    Lewin JM, Hendrick RE, D’Orsi CJ, Isaacs PK, Moss LJ, Karellas A, Sisney GA, Kuni CC, Cutter GR (2001) Comparison of full-field digital mammography with screen-film mammography for cancer detection: results of 4,945 paired examinations. Radiology 218:873–880PubMedGoogle Scholar
  21. 21.
    Pisano ED, Gatsonis C, Hendrick E, Yaffe M, Baum JK, Acharyya S, Conant EF, Fajardo LL, Bassett L, D’Orsi C, Jong R, Rebner M (2005) Diagnostic performance of digital versus film mammography for breast-cancer screening. N Engl J Med 353:1773–1783PubMedCrossRefGoogle Scholar
  22. 22.
    Glueck DH, Lamb MM, Lewin JM, Pisano ED (2007) Two-modality mammography may confer an advantage over either full-field digital mammography or screen-film mammography. Acad Radiol 14:670–676PubMedCrossRefGoogle Scholar
  23. 23.
    Vigeland E, Klaasen H, Klingen TA, Hofvind S, Skaane P (2008) Full-field digital mammography with flat-panel selenium detectors in a population-based screening programme: The Vestfold County Study. Eur Radiol 18:183-191Google Scholar
  24. 24.
    Kamitani T, Yabuuchi H, Soeda H, Matsuo Y, Okafuji T, Sakai S, Furuya A, Hatakenaka M, Ishii N, Hona H (2007) Detection of masses and microcalcifications of breast cancer on digital mammograms: comparison among hard-copy film, 3-megapixel liquid crystal display (LCD) monitors and 5-megapixel LCD monitors: an observer performance study. Eur Radiol 17:1365–1371PubMedCrossRefGoogle Scholar
  25. 25.
    Fischer U, Baum F, Obenauer S, Funke M, Hermann KP, Grabbe E (2002) Full-field digital mammography (FFDM): intraindividual comparison of direct magnification versus monitor zooming in patients with microcalcifications (in German). Radiologe 42:261–264PubMedCrossRefGoogle Scholar
  26. 26.
    Lazarus E, Mainiero MB, Schepps B, Koelliker SL, Livingston LS (2006) BI-RADS lexicon for US and mammography: interobserver variability and positive predictive value. Radiology 239:385–391PubMedCrossRefGoogle Scholar
  27. 27.
    Berg WA, D’Orsi CJ, Jackson VP, Bassett LW, Beam CA, Lewis RS, Crewson PE (2002) Does training in the breast imaging reporting and data system (BI-RADS) improve biopsy recommendations or feature analysis agreement with experienced breast imagers at mammography? Radiology 224:871–880PubMedCrossRefGoogle Scholar
  28. 28.
    Skaane P, Skjennald A, Young K, Egge E, Jebsen I, Sager EM, Scheel B, Søvik E, Ertzaas AK, Hofvind S, Abdelnoor M (2005) Follow-up and final results of the Oslo I study comparing screen-film mammography and full-field digital mammography with soft-copy reading. Acta Radiol 46:679–689PubMedCrossRefGoogle Scholar

Copyright information

© European Society of Radiology 2008

Authors and Affiliations

  • Per Skaane
    • 1
  • Felix Diekmann
    • 2
  • Corinne Balleyguier
    • 3
  • Susanne Diekmann
    • 2
  • Jean-Charles Piguet
    • 4
  • Kari Young
    • 1
  • Michael Abdelnoor
    • 5
  • Loren Niklason
    • 6
  1. 1.Department of Radiology, Breast Imaging CenterUllevaal University HospitalOsloNorway
  2. 2.Department of Diagnostic RadiologyUniversity ChariteBerlinGermany
  3. 3.Institut Gustave RoussyVillejuifFrance
  4. 4.Institut ImageriveGenevaSwitzerland
  5. 5.Center for Clinical Research, Section of Epidemiology and BiostatisticsUllevaal University HospitalOsloNorway
  6. 6.Hologic Inc.HillsboroughUSA

Personalised recommendations