Breast density effect on the sensitivity of digital screening mammography in a UK cohort

Payne, Nicholas R.; Hickman, Sarah E.; Black, Richard; Priest, Andrew N.; Hudson, Sue; Gilbert, Fiona J.

doi:10.1007/s00330-024-10951-w

Breast density effect on the sensitivity of digital screening mammography in a UK cohort

Breast
Open access
Published: 17 July 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

Breast density effect on the sensitivity of digital screening mammography in a UK cohort

Download PDF

Nicholas R. Payne¹,
Sarah E. Hickman^1,2,
Richard Black³,
Andrew N. Priest^1,3,
Sue Hudson⁴ &
…
Fiona J. Gilbert ORCID: orcid.org/0000-0002-0124-9962^1,3

304 Accesses
Explore all metrics

Abstract

Objectives

To assess the performance of breast cancer screening by category of breast density and age in a UK screening cohort.

Methods

Raw full-field digital mammography data from a single site in the UK, forming a consecutive 3-year cohort of women aged 50 to 70 years from 2016 to 2018, were obtained retrospectively. Breast density was assessed using Volpara software. Examinations were grouped by density category and age group (50–60 and 61–70 years) to analyse screening performance. Statistical analysis was performed to determine the association between density categories and age groups. Volumetric breast density was assessed as a binary classifier of interval cancers (ICs) to find an optimal density threshold.

Results

Forty-nine thousand nine-hundred forty-eight screening examinations (409 screen-detected cancers (SDCs) and 205 ICs) were included in the analysis. Mammographic sensitivity, SDC/(SDC + IC), decreased with increasing breast density from 75.0% for density a (p = 0.839, comparisons made to category b), to 73.5%, 59.8% (p = 0.001), and 51.3% (p < 0.001) in categories b, c, and d, respectively. IC rates were highest in the densest categories with rates of 1.8 (p = 0.039), 3.2, 5.7 (p < 0.001), and 7.9 (p < 0.001) per thousand for categories a, b, c, and d, respectively. The recall rate increased with breast density, leading to more false positive recalls, especially in the younger age group. There was no significant difference between the optimal density threshold found, 6.85, and that Volpara defined as the b/c boundary, 7.5.

Conclusions

The performance of screening is significantly reduced with increasing density with IC rates in the densest category four times higher than in women with fatty breasts. False positives are a particular issue for the younger subgroup without prior examinations.

Clinical relevance statement

In women attending screening there is significant underdiagnosis of breast cancer in those with dense breasts, most marked in the highest density category but still three times higher than in women with fatty breasts in the second highest category.

Key Points

Breast density can mask cancers leading to underdiagnosis on mammography.
Interval cancer rate increased with breast density categories ‘a’ to ‘d’; 1.8 to 7.9 per thousand.
Recall rates increased with increasing breast density, leading to more false positive recalls.

Volumetric breast density affects performance of digital screening mammography

Article Open access 23 December 2016

The effect of volumetric breast density on the risk of screen-detected and interval breast cancers: a cohort study

Article Open access 05 June 2017

Differential detection by breast density for digital breast tomosynthesis versus digital mammography population screening: a systematic review and meta-analysis

Article Open access 28 March 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Breast cancer is one of the most common forms of cancer and early-stage diagnosis leads to better survival [1]. Population screening programmes aim to detect cancer early and reduce mortality. The UK National Health Service Breast Screening Programme (NHSBSP) invites women aged 50 to 70 years every 3 years for full-field digital mammography (FFDM). The longer screening interval results in more cancers diagnosed after a normal screening episode compared to many countries with a 2-yearly frequency of screening.

Breast density is important as it can mask or hide small cancers lowering the sensitivity of mammography—these cancers are often regarded as “underdiagnosed” cancers. Extremely dense breast tissue confers a fourfold relative risk of developing breast cancer compared to the lowest-density tissue [2, 3]. Breast density can be measured by readers on a visual analogue scale (VAS), categorised on the American College of Radiologists (ACR) Breast Imaging Reporting and Data System (BI-RADS) Atlas Fifth Edition four-point scale [4] or it can be variously assessed by automated tools.

In this study, we aimed to objectively assess the impact of breast density on the sensitivity and specificity of screening mammography and determine the rate of interval cancers (ICs) in each category of breast density using an automated tool. A secondary aim was to determine the optimal threshold when using volumetric breast density (VBD) as a binary classifier, to yield the highest ability to discriminate IC cases within the full cohort and within different age bands, 50–60 and 61–70 years.

Materials and methods

Data were obtained from a single site in the UK where a four-view mammographic imaging protocol is used with cranial-caudal (CC) and mediolateral oblique (MLO) views of each breast. Double-reading with arbitration is undertaken within NHS Breast Screening Programme (NHSBSP) guidelines [5]. Ethical approval (Health Research Authority (HRA) Research Ethics Committee 20/LO/0104, HRA Confidentially Advisory Group (CAG) 20/CAG/0009, and Public Health England Research Advisory Committee BSPRAC_090) was obtained to retrospectively collect data from women who took part in breast screening during the years 2011 to 2020 without obtaining explicit consent from the individuals but with an ability for them to opt-out of their data being used.

The inclusion criteria for this study were mammograms collected as part of breast cancer screening at the local site between 2016 and 2018. Examinations were excluded from analysis if the images could not be obtained from the local picture archiving and communication system (PACS) or if raw DICOM data were not available; if the woman was outside of the 50–70 years age range of routine screening; if the woman had a history of breast cancer with a mastectomy or was undergoing annual screening; or if it was not the first examination of the same woman within the period. Examinations were also excluded if the automated density software was not able to score them and breast cancer cases were excluded if they were not the primary cancer.

Data collection

Consecutive mammographic data from a 3-year cohort from 1st January 2016 to 31st December 2018 was selected to reflect the triennial screening cycle, when raw mammographic data was routinely stored at the site, and provide follow-up to identify all ICs. The FFDM images were acquired on Philips (Philips Healthcare) L30 scanners (~98%) and a small number on GE (GE HealthCare Technologies Inc.) machines. National Breast Screening Service (NBSS) records were used to query systems for imaging and clinical data. Imaging data and related metadata were obtained from the local PACS. The data was pseudonymised and stored in a research database.

All cancer cases were confirmed from histopathology reports. For normal (non-cancer) cases, ground truth was taken as no cancer diagnosis recorded in the National Cancer Registry before the next round of screening they attended or within 40 months if they did not re-attend. Data was collected up to April 2022 giving a minimum follow-up period of 40 months for all women. Ethnicity data, where available, was captured from a combination of NBSS and hospital records.

Screening exams are classified as either screen-detected cancer (SDC), for those diagnosed at screening, or IC if diagnosed between screening episodes and within 40 months of a normal screen. The case is classified as normal if there was no cancer diagnosis within the follow-up period. IC cases are identified by NBSS using a variety of methods, including through breast cancer data recorded by the National Cancer Registry, and then checked by individual screening centres. Histopathology (type, grade, and size) was also obtained from these sources where available. All ICs were symptomatic and none were detected by other screening methods.

Breast density assessment

Automatic assessment of radiographic breast density was performed using Volpara Imaging Software’s density measure (v.3.2.0, Volpara Health Technologies Ltd) [6]. The software requires raw, uncompressed mammograms with DICOM headers containing exposure parameters for its physics-based model to compute the percentage VBD per image. The tool takes the maximum VBD at breast level and converts it to a Volpara Density Grade (VDG) for the case, which has been calibrated to correlate with BI-RADS 5th Edition breast density (a. almost entirely fat (VBD < 3.5%) b. scattered fibroglandular densities (3.5% ≤ VBD < 7.5%) c. heterogeneously dense, (7.5% ≤ VBD < 15.5%) and d. extremely dense (15.5% ≤ VBD)). This model has been validated in several previous studies [7,8,9,10,11].

Exclusions

Between 2016 and 2018, 57,877 women attended screening at the site. Six exams originally denoted as ICs were excluded as they were not primary breast cancers. There were 99 (0.17%) screening examinations not obtainable through the local PACS. A further 5028 (9.53%) were excluded as outside of the 50–70 age range. No raw data was available for 17 (0.03%) examinations. Women with a personal history of breast cancer with a mastectomy 757 (1.46%) or undergoing annual screening 93 (0.18%) were removed. Only the first screening episode of each woman within the cohort was used within the analysis, this led to the exclusion of 1176 (2.32%) of the remaining examinations as, in practice, the 3-year round length is variable and can be under 36 months. A total of 50,701 examinations were processed by the Volpara software of which 753 (1.51%) were not able to be scored—some exams received multiple errors and of those given; 79% were due to the presence of breast implants, 9% were parameter issues in data taken or missing from the DICOM header, 7% were mosaic/extended views (taking multiple images per view to fully cover the breast), and 5% were missing a view or side label. The final analysis was based on the remaining 49,948 examinations (Fig. 1).

Statistical analysis

In the context of breast cancer screening, a true positive case is an SDC, a false positive is a recall with no cancer diagnosis before the next screen (or after 40 months of follow-up if not screened again), a false negative is an IC, and a true negative is a normal examination which was not recalled. For this study, we use the definition of the sensitivity of screening as the number of SDCs divided by the sum of screen-detected and ICs. The specificity of screening is the number of true negatives divided by the sum of true negatives and false positives. The positive predictive value (PPV) is given as the number of SDCs divided by the sum of SDCs and false positives.

Exams were grouped by Volpara Density Grade to measure the effect of breast density on screening performance and the distribution of density within the cohort. The cohort was then subdivided into two groups by age, those aged 60 and younger and those older than 60 years, to assess if either group was disproportionally affected by the impact of density. A two-tailed, two-sample t-test was used to determine the statistical significance between rates of SDC and IC of each density category when compared to that of the most populous/common density category. The same test was used to compare rates for the same density classification between the two age groups.

To investigate the impact of prevalent (baseline) round versus incident round on recall rates, the full cohort, the two age subgroups, and the four density subgroups were subdivided by round status. In each case, a two-tailed, two-sample t-test was used to determine the statistical significance between the recall rates of prevalent round examinations and incident round examinations.

VBD, volumetric density measured on a continuous scale (0 to 100%), was analysed as a binary classifier for identifying IC cases—investigating its potential as a metric to be used to determine which woman may benefit from enhanced screening (supplemental imaging or more frequent screening). Receiver operating characteristic (ROC) curves were computed with R (v. 4.3.0, R Foundation for Statistical Computing) with the pROC package [12] to find the area under the ROC curve (AUC) and optimal operating point/threshold—taken as the closest point to the top left of the chart with no case weighting. We calculated an overall threshold as well as separate thresholds for each age subgroup.

Results

Density distribution

A total of 49,948 women with screening examinations were included in the final analysis, comprising of 409 SDCs, 205 ICs, and 49,336 normals with a mean age of 59.0 years (±6.1 years). The overall density distribution was 17.3% a, 46.4% b, 26.6% c, and 9.7% d with women aged ≤ 60 and > 60 have the following proportions: 15.6% and 19.9% a; 43.1% and 51.5% b; 28.9% and 23.0% c; and 12.4% and 5.6% d, respectively.

Sensitivity of screening

The 3 yearly sensitivity of mammography was highest in density a (75.0%), which was not found to be statistically significantly different from that of category b (p = 0.839). However, compared to category b, the sensitivity was worse in categories c and d which were found to be 59.8% (p = 0.001) and 51.3% (p < 0.001), respectively.

A full breakdown of screening performance by density is given in Table 1 and a further breakdown by density and age is given in Table 2. The lowest sensitivity of 48.1% was found for women aged 60 and under with category d density. Women with fatty breasts had significantly lower SDC rates (p = 0.002) than any other category with twice the rate for the older woman compared to those aged 60 and under (p = 0.009).

Table 1 Number of screening examinations, breast cancer cases, false positives, and true negatives for breast cancer screening at a single UK breast screening site 2016–2018 categorised by density

Full size table

Table 2 Number of screening examinations, breast cancer cases, false positives, and true negatives for breast cancer screening at a single UK breast screening site 2016–2018 categorised by VDG and age

Full size table

Interval cancer, screen-detected cancer, and recall rates

Of the 205 IC cases analysed, 15.1% (31/205) were diagnosed within 12 months of the negative screen, 34.6% (71/205) were diagnosed between 12–24 months, and 50.2% (103/205) were diagnosed after 24 months. The IC rate increased from 1.8/1000 in category a to 7.9/1000 in category d with a large number of ICs in category c (5.7/1000). The IC rates for categories c and d were found to be significantly different to that of category b (p < 0.001 in both cases). The highest rate of cancers ((SDC + IC)/1000) was found in d category density at 16.1/1000 with 14.2, 12.2, and 7.4 per thousand in density categories c, b, and a, respectively.

The recall rate was significantly higher in categories c and d (5.5% (p < 0.001) and 5.3% (p = 0.023), respectively) and lower in category a (3.0% (p < 0.001)) when compared to the most populous category, b (4.5%). When dichotomised by age the recall was consistently and significantly higher in the younger women in density categories b (p = 0.006), c (p = 0.004), and d (p = 0.043) as well as the overall (p < 0.001). The effect of prevalent (baseline) versus incident status is shown in Table 3 with a recall rate of 8.6% found for prevalent rounds and 4.2% for subsequent rounds (p < 0.001). Although this is mirrored in the two age subgroups, it should be noted that 8.3% of examinations in the analysis are from prevalent rounds and 92.5% of those are women in the younger cohort.

Table 3 Recall rates by screening round status (“P” = prevalent (baseline) or “I” = incident) for the full cohort (i), age subgroups (i), and density subgroups (ii)

Full size table

Volumetric breast density

When using VBD as a binary classifier to identify IC cases, the AUC for the full cohort (Fig. 2a) was found to be 64.2 (CI: 60.5–67.9) with an optimal threshold of 6.85 (CI: 4.65–9.85) corresponding to a sensitivity and specificity of 63.6% and 59.4%, respectively. This threshold defines 40.7% (20331/49948) as dense. For the younger subgroup alone (Fig. 2b) the AUC was 64.7 (CI: 60.0–69.4) with an optimal threshold of 8.15 (CI: 6.25–10.35), defining 35.1% (10538/30029) as dense. For the older subgroup alone (Fig. 2c) the AUC was 63.4 (CI: 57.6–69.3) with an optimal threshold of 5.15 (4.45–7.15), defining 52.2% (10397/19919) as dense. Using the optimal thresholds of the two subgroups gives a sensitivity and specificity of 65.0% and 58.2%, respectively—with a combined total of 41.9% (20,935/49,948) defined as dense. Using the thresholds of corresponding to VDG c and d, the sensitivities and specificity for each were found to be 55.8% and 63.8% (c and d) and 18.4% and 90.3% (d only), respectively.

Additional cohort data

Ethnicity information was recorded in only 59.1% of the cohort of which 95.3% were listed as “White—British”, “White—Irish”, or “White—Any other White background”. The next highest ethnicities represented were “Other ethnic groups—Chinese” and “Asian or Asian British—Any other Asian background” each of which was 0.5% of the cohort. This is tabulated in the supplemental material.

For the 205 IC cases within the cohort, the number, median size, and interquartile range (IQR) of size are given by grade, interval, and VDG. This data can be found in the supplemental material. An example case of IC is shown in Fig. 3.

Discussion

This study demonstrates that mammographic sensitivity drops dramatically with increasing breast density and in the 9.7% of the population with extremely dense breasts the sensitivity is only 51.3%. The results are similar to the Dutch biennial breast screening programme which reported a sensitivity of 61.0% in the 8.0% of women with extremely dense breasts [13]. The UK data appears to reflect the longer screening interval, with higher rates of both screen-detected and ICs as well as lower sensitives of screening. These similar findings between countries, with biennial and triennial breast screening programmes, provide evidence that increasing breast density is associated with a decrease in the screening performance of FFDM.

Recall rate increased with increasing breast density and was very similar at 5.5% and 5.3% in the densest two categories. False positive recalls are harmful for women causing short-term distress and costly to the screening programme. Digital breast tomosynthesis (DBT) has been shown to reduce false positive recall rates in women with BI-RADS c and d [14] as this reduces confusing overlapping shadows.

There are a number of ways to measure breast density. Reader assessment is subject to bias and inter-reader variability is marked particularly when the density category is borderline [15,16,17]. Automated tools are more self-consistent [18] but there is still concern about variability in performance and utility when comparing one with another [7]. Only a single-density tool, Volpara, was used in this study, but other studies have shown broad agreement when comparing automated tools [19, 20]. Calibration to particular populations is important [21] and Volpara is a well-validated [7,8,9,10,11], widely used tool with a number of publications reporting this density measure [22,23,24].

The increased number of ICs in the densest categories is of major concern. While many of these cancers have developed during the 3-year interval others will have been “underdiagnosed”, i.e., not seen at the time of screening. The European Society of Breast Imaging (EUSOBI) published new recommendations in 2022 calling for women undergoing screening to be informed of their breast density and for those with extremely dense breasts to be offered MRI every 2–4 years [25]. The Dutch DENSE trial found supplemental MRI for women with extremely dense breast tissue, as scored by Volpara software, halved the IC rate compared with the standard of care [26]. The American and German trials of Abbreviated MRI compared to DBT in dense breasts showed a sensitivity of 95.7% and 39.1%, respectively, but with poor specificity for MRI [27]. In the UK, the BRAID trial is recruiting women from screening with BI-RADS density c or d and randomising them to receive standard-of-care or supplemental imaging with either MRI, automated breast ultrasound, or contrast-enhanced mammography [28]. Full protocol MRI in the Dutch trial was only deemed cost-effective at 4-yearly intervals [29] and so a trial of contrast-enhanced mammography and abbreviated MRI is now planned.

If a form of supplemental imaging were to be offered to the highest category (d) then 9.7% of women would be eligible—which includes 18.6% of IC cases—whereas also including the second highest category (c) would lead to over one third (36.2%) of the screening population being invited for supplemental imaging, including over half (54.4%) of ICs. However, not all of these ICs have been overlooked and not all would be detected using a supplemental technique. Such a high level of eligibility may make arguments for supplemental imaging challenging on health and economic grounds. It may be possible to instead define the proportion of women for whom supplemental imaging could be affordable and set a density threshold on that basis.

More recently AI tools have been developed which assess breast texture as well as density and these have been used to predict who might develop breast cancer within the next 2 to 5 years. The Mirai tool, for example, has been found to have an AUC slightly higher than traditional risk prediction methods such as Tyrer-Cuzick [30]. The Swedish Karma tool has been tuned for 2-year risk and identified those women at high risk of being diagnosed with breast cancer with an AUC of 0.73 for its image-based model [31]. This tool is being tested in the ScreenTrust MRI [32] prospective intervention study.

Where assessment of breast density and textural analysis may help to identify women who would benefit from supplemental imaging to overcome masking due to breast composition, risk prediction could identify those who would benefit from a higher frequency of screening. Indeed, with half of the IC cases being diagnosed over 24 months after their negative screening outcome in the cohort undergoing triennial screening, a change to a biennial programme would reduce the number of ICs markedly. Furthermore, a Canadian study [33] showed biennial programs which additionally offered annual screening to women with dense breasts had an improved annualised IC rate (0.89/1000) compared to those that did not (1.45/1000).

Limitations

This study was conducted at a single site in the UK with only one density tool. However, the similarities between tools [20, 34] are such that a single, well-evaluated tool can be used to demonstrate that mammography underperforms with increased breast density. The software tested requires the raw mammographic data although developments are in hand to use it for presentation images. Raw data, which undergoes vendor and software-specific post-processing to generate “for presentation” images, are not routinely stored by many institutions and therefore may not be available for density tools. However, using the tool prospectively is less problematic as a measurement can be made and raw data is not stored.

The definition of an IC within a triennial screening programme and the inclusion in the calculation of screening sensitivity is also a limitation as it is not possible to know if all ICs have been identified—some women may have moved abroad prior to diagnosis, for example—and, more importantly, it is likely many were not present at the time of the screening examination. Alternatively, some women who received a normal outcome at the round of screening included in the study and were not an IC case went on to be diagnosed with SDC at their next round of screening. Data on these “next round” cancers was not presented or included in sensitivity or specificity calculations despite the possibility of some of those cancers being present on the screening round included in the study.

The FFDM images were predominantly obtained on machines from a single vendor. It may be that the performance will differ on other manufacturer’s machines although this has not been reported.

Ethnicity was collected on just over half the population but the predominantly white British population may have introduced racial bias to both density distribution and screening performance results.

Conclusion

This study has shown in a consecutive screening cohort that mammographic sensitivity and specificity decrease with increasing breast density as measured by an automated tool. The IC rate of 7.9/1000 indicates that a significant proportion of cases are underdiagnosed and consideration should be given to offer supplemental imaging in category d density. The 3-yearly round length exacerbates the problem.

Abbreviations

AUC:: Area under the receiver operating characteristic curve
BI-RADS:: Breast Imaging Reporting and Data System
FFDM:: Full-field digital mammography
IC:: Interval cancer
MRI:: Magnetic resonance imaging
NBSS:: National Breast Screening Service
NHSBSP:: National Health Service Breast Screening Programme
PACS:: Picture Archiving and Communication System
ROC:: Receiver operating characteristic
SDC:: Screen-detected cancer
VBD:: Volumetric breast density
VDG:: Volpara density grade

References

Clarke M, Collins R, Darby S et al (2005) Effects of radiotherapy and of differences in the extent of surgery for early breast cancer on local recurrence and 15-year survival: an overview of the randomised trials. Lancet 366:2087–2106. https://doi.org/10.1016/S0140-6736(05)67887-7
Article CAS PubMed Google Scholar
McCormack VA, dos Santos Silva I (2006) Breast density and parenchymal patterns as markers of breast cancer risk: a meta-analysis. Cancer Epidemiol Biomarkers Prev 15:1159–1169. https://doi.org/10.1158/1055-9965.EPI-06-0034
Article PubMed Google Scholar
Lee A, Mavaddat N, Wilcox AN et al (2019) BOADICEA: a comprehensive breast cancer risk prediction model incorporating genetic and nongenetic risk factors. Genet Med 21:1708–1718. https://doi.org/10.1038/s41436-018-0406-9
Article PubMed PubMed Central Google Scholar
Sickles E, D’Orsi C, Bassett L (2013) ACR BI-RADS® Mammography. American College of Radiology, Reston, VA
GOV.UK (2024) Breast screening: guidance for image reading. In: GOV.UK. https://www.gov.uk/government/publications/breast-screening-guidance-for-image-reading/breast-screening-guidance-for-image-reading. Accessed 12 Apr 2024
Highnam R, Brady SM, Yaffe MJ, et al (2010) Robust breast composition measurement—VolparaTM. In: Martí J, Oliver A, Freixenet J, Martí R (eds) Digital mammography. Springer, Berlin, Heidelberg. pp. 342–349
Brandt KR, Scott CG, Ma L et al (2016) Comparison of clinical and automated breast density measurements: implications for risk prediction and supplemental screening. Radiology 279:710–719. https://doi.org/10.1148/radiol.2015151261
Article PubMed Google Scholar
Gubern-Mérida A, Kallenberg M, Platel B et al (2014) Volumetric breast density estimation from full-field digital mammograms: a validation study. PLoS One 9:e85952. https://doi.org/10.1371/journal.pone.0085952
Article CAS PubMed PubMed Central Google Scholar
Seo JM, Ko ES, Han B-K et al (2013) Automated volumetric breast density estimation: a comparison with visual assessment. Clin Radiol 68:690–695. https://doi.org/10.1016/j.crad.2013.01.011
Article CAS PubMed Google Scholar
Lee HN, Sohn Y-M, Han KH (2015) Comparison of mammographic density estimation by Volpara software with radiologists’ visual assessment: analysis of clinical–radiologic factors affecting discrepancy between them. Acta Radiol 56:1061–1068. https://doi.org/10.1177/0284185114554674
Article PubMed Google Scholar
Lau S, Ng KH, Abdul Aziz YF (2016) Volumetric breast density measurement: sensitivity analysis of a relative physics approach. Br J Radiol 89:20160258. https://doi.org/10.1259/bjr.20160258
Article PubMed PubMed Central Google Scholar
Robin X, Turck N, Hainard A et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77. https://doi.org/10.1186/1471-2105-12-77
Article PubMed PubMed Central Google Scholar
Wanders JOP, Holland K, Veldhuis WB et al (2017) Volumetric breast density affects performance of digital screening mammography. Breast Cancer Res Treat 162:95–103. https://doi.org/10.1007/s10549-016-4090-7
Article PubMed Google Scholar
Gilbert FJ, Tucker L, Gillan MG et al (2015) TOMMY trial: a comparison of TOMosynthesis with digital MammographY in the UK NHS Breast Screening Programme. Health Technol Assess 19:1–136. https://doi.org/10.3310/hta19040
Article PubMed PubMed Central Google Scholar
Redondo A, Comas M, Macià F et al (2012) Inter- and intraradiologist variability in the BI-RADS assessment and breast density categories for screening mammograms. Br J Radiol 85:1465–1470. https://doi.org/10.1259/bjr/21256379
Article CAS PubMed PubMed Central Google Scholar
Pesce K, Tajerian M, Chico MJ et al (2020) Interobserver and intraobserver variability in determining breast density according to the fifth edition of the BI-RADS® Atlas. Radiologia 62:481–486. https://doi.org/10.1016/j.rx.2020.04.006
Article CAS PubMed Google Scholar
Portnow LH, Georgian-Smith D, Haider I et al (2022) Persistent inter-observer variability of breast density assessment using BI-RADS® 5th edition guidelines. Clin Imaging 83:21–27. https://doi.org/10.1016/j.clinimag.2021.11.034
Article PubMed Google Scholar
Alonzo-Proulx O, Mawdsley GE, Patrie JT et al (2015) Reliability of automated breast density measurements. Radiology 275:366–376. https://doi.org/10.1148/radiol.15141686
Article PubMed Google Scholar
Astley SM, Harkness EF, Sergeant JC et al (2018) A comparison of five methods of measuring mammographic density: a case-control study. Breast Cancer Res 20:10. https://doi.org/10.1186/s13058-018-0932-z
Article PubMed PubMed Central Google Scholar
Morrish OWE, Tucker L, Black R et al (2015) Mammographic breast density: comparison of methods for quantitative evaluation. Radiology 275:356–365. https://doi.org/10.1148/radiol.14141508
Article PubMed Google Scholar
Portnow LH, Choridah L, Kardinah K et al (2023) International interobserver variability of breast density assessment. J Am Coll Radiol 20:671–684. https://doi.org/10.1016/j.jacr.2023.03.010
Article PubMed Google Scholar
Moshina N, Aase HS, Danielsen AS et al (2020) Comparing screening outcomes for digital breast tomosynthesis and digital mammography by automated breast density in a randomized controlled trial: results from the to-be trial. Radiology 297:522–553. https://doi.org/10.1148/radiol.2020201150
Article PubMed Google Scholar
Han Y, Moore JX, Colditz GA, Toriola AT (2022) Family history of breast cancer and mammographic breast density in premenopausal women. JAMA Network Open 5:e2148983. https://doi.org/10.1001/jamanetworkopen.2021.48983
Article PubMed PubMed Central Google Scholar
Mariapun S, Ho WK, Eriksson M et al (2023) Evaluation of SNPs associated with mammographic density in European women with mammographic density in Asian women from South-East Asia. Breast Cancer Res Treat 201:237–245. https://doi.org/10.1007/s10549-023-06984-2
Article CAS PubMed Google Scholar
Mann RM, Athanasiou A, Baltzer PAT et al (2022) Breast cancer screening in women with extremely dense breasts recommendations of the European Society of Breast Imaging (EUSOBI). Eur Radiol 32:4036–4045. https://doi.org/10.1007/s00330-022-08617-6
Article PubMed PubMed Central Google Scholar
Bakker MF, de Lange SV, Pijnappel RM et al (2019) Supplemental MRI screening for women with extremely dense breast tissue. N Engl J Med 381:2091–2102. https://doi.org/10.1056/NEJMoa1903986
Article PubMed Google Scholar
Comstock CE, Gatsonis C, Newstead GM et al (2020) Comparison of abbreviated breast MRI vs digital breast tomosynthesis for breast cancer detection among women with dense breasts undergoing screening. JAMA 323:746–756. https://doi.org/10.1001/jama.2020.0572
Article PubMed PubMed Central Google Scholar
Gilbert FJ (2022) Breast screening—risk adaptive imaging for density. clinicaltrials.gov
Geuzinge HA, Bakker MF, Heijnsdijk EAM et al (2021) Cost-effectiveness of magnetic resonance imaging screening for women with extremely dense breast tissue. J Natl Cancer Inst 113:1476–1483. https://doi.org/10.1093/jnci/djab119
Article PubMed PubMed Central Google Scholar
Yala A, Mikhael PG, Strand F et al (2022) Multi-institutional validation of a mammography-based breast cancer risk model. J Clin Oncol 40:1732–1740. https://doi.org/10.1200/JCO.21.01337
Article PubMed Google Scholar
Eriksson M, Czene K, Strand F et al (2020) Identification of women at high risk of breast cancer who need supplemental screening. Radiology 297:327–333. https://doi.org/10.1148/radiol.2020201620
Article PubMed Google Scholar
Strand F (2023) Image analysis with artificial intelligence to increase precision in breast cancer screening—the ScreenTrust MRI substudy: a prospective trial of AI to select women for supplemental screening MRI. clinicaltrials.gov
Seely JM, Peddle SE, Yang H et al (2022) Breast density and risk of interval cancers: the effect of annual versus biennial screening mammography policies in Canada. Can Assoc Radiol J 73:90–100. https://doi.org/10.1177/08465371211027958
Article PubMed Google Scholar
Wang J, Azziz A, Fan B et al (2013) Agreement of mammographic measures of volumetric breast density to MRI. PLoS One 8:e81653. https://doi.org/10.1371/journal.pone.0081653
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

This study has received funding from a Cancer Research UK grant [C543/A26884] and NIHR Cambridge Biomedical Research Centre.

Author information

Authors and Affiliations

Department of Radiology, University of Cambridge School of Clinical Medicine, Box 218, Level 5, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
Nicholas R. Payne, Sarah E. Hickman, Andrew N. Priest & Fiona J. Gilbert
Department of Radiology, Barts Health NHS Trust, The Royal London Hospital, 80 Newark Street, London, E1 2ES, UK
Sarah E. Hickman
Department of Radiology, Addenbrookes Hospital, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
Richard Black, Andrew N. Priest & Fiona J. Gilbert
Peel and Schriek Consulting Limited, London, UK
Sue Hudson

Authors

Nicholas R. Payne
View author publications
You can also search for this author in PubMed Google Scholar
Sarah E. Hickman
View author publications
You can also search for this author in PubMed Google Scholar
Richard Black
View author publications
You can also search for this author in PubMed Google Scholar
Andrew N. Priest
View author publications
You can also search for this author in PubMed Google Scholar
Sue Hudson
View author publications
You can also search for this author in PubMed Google Scholar
Fiona J. Gilbert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fiona J. Gilbert.

Ethics declarations

Guarantor

The scientific guarantor of this publication is Prof Fiona Gilbert.

Conflict of interest

The authors of this manuscript declare relationships with the following companies: research support from GE Healthcare and Bayer Healthcare. Additionally, FJG, NRP, and SEH have research agreements with iCAD, Lunit, Merantix, ScreenPoint Medical, Therapixel, and Volpara.

Statistics and biometry

One of the authors has significant statistical expertise.

Informed consent

Written informed consent was waived by the Institutional Review Board, however efforts were made to publicise the use of the data including signposting to the ability to opt-out.

Ethical approval

Health Research Authority (HRA) Research Ethics Committee 20/LO/0104, HRA Confidentially Advisory Group (CAG) 20/CAG/0009, and Public Health England Research Advisory Committee BSPRAC_090.

Study subjects or cohorts overlap

No overlap exists for studies on breast density. However, a subset of the cohort has been used in a separate study of breast cancer AI tools which is in press (“Artificial intelligence system prompt accuracy for interval breast cancer detection in screening mammography,” forthcoming in RADIOLOGY).

Methodology

Retrospective
Cross-sectional study
Performed at one institution

Additional information

Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Payne, N.R., Hickman, S.E., Black, R. et al. Breast density effect on the sensitivity of digital screening mammography in a UK cohort. Eur Radiol (2024). https://doi.org/10.1007/s00330-024-10951-w

Download citation

Received: 21 November 2023
Revised: 02 May 2024
Accepted: 26 June 2024
Published: 17 July 2024
DOI: https://doi.org/10.1007/s00330-024-10951-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Breast density effect on the sensitivity of digital screening mammography in a UK cohort

Abstract

Objectives

Methods

Results

Conclusions

Clinical relevance statement

Key Points

Similar content being viewed by others

Volumetric breast density affects performance of digital screening mammography

The effect of volumetric breast density on the risk of screen-detected and interval breast cancers: a cohort study

Differential detection by breast density for digital breast tomosynthesis versus digital mammography population screening: a systematic review and meta-analysis

Introduction

Materials and methods

Data collection

Breast density assessment

Exclusions

Statistical analysis

Results

Density distribution

Sensitivity of screening

Interval cancer, screen-detected cancer, and recall rates

Volumetric breast density

Additional cohort data

Discussion

Limitations

Conclusion

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Guarantor

Conflict of interest

Statistics and biometry

Informed consent

Ethical approval

Study subjects or cohorts overlap

Methodology

Additional information

Supplementary information

Supplementary Material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation