Introduction

More than 2 million women are diagnosed with breast cancer worldwide every year, and the estimated 5-year prevalence is almost 8 million cases. In Europe, breast cancer is the most frequently occurring cancer, accounting for about 15% of all new cancer cases and for about 30% of all new cancer cases in women. More than five hundred thousand breast cancers are diagnosed every year in Europe, and about 13% of women will be diagnosed with breast cancer during their lifetime. Breast cancer more frequently affects women older than 50 years [1]. Nevertheless, the global incidence has constantly increased in the last years in all age groups, with some regional and age group differences [2]. Although not completely clear, this trend could be explained by the introduction of organized screening programs as well as modification in exposure to risk factors, such as hormonal and reproductive factors or smoking. Improvements in diagnosis and treatment have significantly reduced breast cancer mortality in the last three decades. Nevertheless, breast cancer mortality remains an important issue, and about 3% of women will eventually die from breast cancer; the estimated number of deaths for breast cancer in Europe in 2020 was about 140,000 women [3].

Risk stratification

Multiple factors influence the individual risk to develop breast cancer, including age at menarche and menopause, reproductive history, obesity, previous biopsy with atypia, previous thoracic radiation therapy, and family cancer history [4]. Breast tissue composition is also a determining factor. Women with higher mammographic breast density, defined as the proportion of radiopaque fibroglandular tissue compared to radiolucent adipose tissue in the breast, have a 2.9-6-fold increased risk of developing breast cancer compared to women with predominantly fatty breast [5]. Several risk prediction models are available to estimate the individual risk of developing breast cancer [6,7,8]. These models take into account a number of personal factors and can include genetic information, such as expression of high- (e.g., BRCA1, BRCA2, TP53) and low-penetrance (e.g., single-nucleotide polymorphisms, SNPs) genes, as well as the mammographic breast density [7, 9]. Recent studies have shown that the integration of deep learning models applied to mammographic images can further improve risk assessment [10].

Women with a lifetime risk of developing breast cancer of less than 15% are considered at average risk [11]. Women with a previous biopsy showing atypical ductal hyperplasia or lobular carcinoma in situ, as well as women with a previous history of breast cancer, are included in the intermediate risk category with an estimated lifetime risk of 15–20%. The high-risk category includes all women with a lifetime risk to develop breast cancer of more than 20%. However, this is usually split into an intermediate high and a very high category. The intermediate high category includes women with a highly positive family history but no known mutations, as well as those with lower penetrance genetic mutations such as CHEK2 or BARD1. Very high risk—with a lifetime risk of over 50%—is usually due to hereditary mutations in high penetrance genes and is also present in women who underwent chest radiation therapy between the ages of 10–30 years. A genetic predisposition is observed in about 5–10% of breast cancers, and BRCA1 and BRCA2 are the most recognized mutations associated with a lifetime risk for breast cancer of 50–85% and 45–69%, respectively [12]. Dedicated screening recommendations are available for high-risk women, but sometimes differ between women at very high risk and those with an intermediate high risk [13]. Since women at very high risk should start screening from the age of 25, early risk assessment is important. Ongoing clinical trials, e.g., the MyPeBS (My Personal Breast Cancer Screening) trial in Europe and the WISDOM (Women Informed to Screen Depending on Measures of Risk) trial in the United States, are investigating the implementation of different screening modalities and schedules based on personal breast cancer risk estimation for women that are not known to be at a very high risk [14, 15].

The idea behind screening: downsizing and downstaging

The goal of breast cancer screening is to find relevant cancers when they are so small that they do not cause symptoms and are less likely to have spread beyond the location where they originated [16]. In the past, breast self-examination has been recommended for cancer prevention as an inexpensive and noninvasive mean to perform routine screening. Nevertheless, studies have shown that its regular practice is not effective at reducing breast cancer mortality [16, 17]. Mammography meets the characteristics of a screening test: it can be rapidly and easily applied, it is broadly available, and it was shown that the test allows for early detection of breast cancers in several randomized controlled clinical trials that ultimately also demonstrated an estimated breast cancer-related mortality reduction of at least 20–25% [18]. As such, these studies show that downstaging of breast cancer through screening improves survival. However, many factors are involved to determine breast cancer prognosis, including cancer biological features (e.g., tumor subtype and grade, hormone receptor, and human epidermal growth factor receptor 2 status) and the availability of effective therapies. Indeed, the parallel spread of screening programs, the evolution of breast surgical approaches, and the implementation of more effective systemic treatments make the determination of the relative contribution of each factor on the achieved mortality reduction more challenging. Regardless, treatments remain less effective in the case of advanced-stage disease, and mortality reduction is still significantly related to the tumor size and nodal status at diagnosis, which supports screening for breast cancer to pursue an early diagnosis [19, 20].

To assess whether alternative screening methods are effective, pursuing randomized controlled trials with mortality as the endpoint is nowadays regarded as impossible and unethical. Due to the earlier detection on the one hand and the long-lasting effects of palliative therapies in women with (incurable) metastatic disease, on the other hand, the timespan between diagnosis and eventual death is too long to assess the ongoing rapid technological developments in screening methodology. Therefore, surrogate endpoints need to be evaluated that are prognostic for the eventual survival. These include the observation of: 1) a stage shift (i.e., cancers detected through a new screening modality are smaller and less often node positive than cancers detected by the standard modality or through symptoms); 2) a reduction in the frequency of interval cancers (i.e., a decrease in the fraction of cancers that present through symptoms between two screening rounds); and 3) a reduction in the relative frequency of advanced cancers (classically regarded as a reduction in stage III and IV cancers). The achieved mortality reduction and associated cost-effectiveness of screening can subsequently be modeled from either of these outcomes.

Screening methods

Mammography

The most common method of screening women for breast cancer is mammography, either offered by an organized program or in the form of opportunistic screening. In most European countries, mammographic screening programs include all women, with age being the only determinant: screening is offered starting at the age of 40–50 years until the age of 64–74 years or even illimitable if women are in good health (e.g., in Austria) at intervals of 1–2 years (3 years in the UK). Population-based effectiveness of mammography screening programs depend on high participation levels, and the European Guidelines for Quality Assurance in Breast Cancer Screening and Diagnosis recommend that at least 70–75% of a population participates in regular mammography screening [21]. Despite the overall survival advantages achieved with the use of mammographic screening, this “one-size-fits-all” approach is associated with several drawbacks, not least that mammography alone is not sufficient to achieve early diagnosis in the high-risk population. It has been reported that program sensitivities of 86–89% in women with largely fatty breasts drop to 62–68% in the case of women with dense breasts (Fig. 13) [22]. Considering that about 40–50% of women have dense breasts (heterogeneously and extremely dense breast), this represents a non-negligible issue. Breast density category after performing mammography screening should always be reported since this has important implications for the performance of supplemental and/or alternative imaging methods.

Fig. 1
figure 1

Breast density categories according to the Breast Imaging Reporting and Data System (BI-RADS) from the American College of Radiology [57]. Depending on the breast composition, four different categories are identified: a entirely fatty; b scattered areas of fibroglandular density; c heterogeneously dense, which may obscure masses; and (d) extremely dense

Fig. 2
figure 2

45-year-old woman without significant family/personal risk factors for breast cancer undergoing first breast cancer screening examination. In the left mammogram (a cranio-caudal and mediolateral oblique projections), extremely dense breast tissue can be observed, and no suspicious findings could be identified, only a typical benign calcification can be recognized in the retromamillary region. At supplemental ultrasound (b) performed on the same day, a suspicious mass up to 3.3 cm could be identified at 6 o’clock in the left breast. Ultrasound-guided biopsy was performed, and the lesion histologically corresponded to a NST moderately differentiated invasive ductal carcinoma

Fig. 3
figure 3

55-year-old woman without significant family/personal risk factors for breast cancer undergoing routine breast cancer screening examination. In the right mammogram (a cranio-caudal and mediolateral oblique projections) scattered areas of fibroglandular density can be observed. A suspicious irregular shaped and spiculated mass can be seen at 12 o’clock. At ultrasound a corresponding irregular shaped, spiculated and hypoechoic mass up to 0.8 cm can be seen (b). Ultrasound-guided biopsy was performed and the mass histologically corresponded to a NST moderately differentiated invasive ductal carcinoma

Digital breast tomosynthesis

Digital breast tomosynthesis (DBT) is a pseudo-3D X-ray-based imaging technique that involves multiple low-dose projections acquired across an arc over the breast that are subsequently reconstructed into a series of stacked images, as well as a synthetic mammogram. The masking effect is reduced by decreasing superimposition from overlying breast tissue, thus increasing cancer detection rates (up to 4 additional cancers detected per 1000 screens) compared to mammography. Decreased recall rates compared to mammography have also been reported in high recall settings (i.e., most studies in the United States), whereas no differences or even slightly higher recall rates with DBT have been reported in most European studies [23, 24]. Based on these results, DBT is increasingly used for breast cancer screening in Europe, as well as in the United States. Nevertheless, several studies have shown that the positive effects are limited in women with very high breast density and also that the reported increased detection of slow-growing, probably indolent tumors generate concerns related to overdiagnosis (see below). Moreover, the studies investigating the role of DBT on the interval cancer rate still show mixed results [25,26,27,28]. When interpreting DBT images, a 2D image of the same projection is required e.g. for comparison with prior mammogram and assessment of calcifications. For this purpose, mammography projections were often  acquired at the same time. Studies have demonstrated that synthetic mammography, corresponding to 2D reconstructions of the DBT datasets, can be used as an alternative to additional mammography projection without impairing screening performances and reducing radiation exposure as well as image acquisition time [29].

Breast ultrasound

Several studies have shown that supplemental breast ultrasound screening in women with dense breasts increases the detection of 0.9–7.7 cancers per 1000 screening examinations and reduces the interval cancer rate compared to mammography alone. These detected cancers are usually small, invasive, and node-negative. In the last decade, automated breast ultrasound systems have also been introduced on the market to overcome some of the limitations of handheld ultrasound, such as operator dependence, long acquisition time when performed by a physician, and limited reproducibility. The main drawbacks of supplemental breast ultrasound screening remain the increase in recalls and biopsy rates, leading to a reduction of the positive predictive value of screening to about 5% [30]. Mostly due to the long examination times, the cost-effectiveness of ultrasound for screening is questionable. EUSOBI (European Society of Breast Imaging) guidelines suggest the possible usage of supplemental ultrasound screening in women at average or intermediate risk with dense breast and negative mammography, but the balance between advantages and disadvantages was not judged to be clear enough to adopt additional ultrasound as a general policy [31].

MRI

Breast MRI is the most sensitive imaging method for breast cancer detection, mostly because of its ability to detect neovascularity. Moreover, as a functional imaging modality, it is optimized to detect more aggressive tumor subtypes [32,33,34,35]. In addition, the sensitivity of breast MRI is not limited by breast density. Screening with breast MRI is recommended yearly for women with genetics-based increased risk, with a calculated lifetime risk equal or superior to 20% and in case of a history of chest or mantle radiation therapy (Fig. 4). From the age of 35–40 years, supplemental mammography may also be considered. Recently, based on the Dutch DENSE trial and the international EA1411 ECOG-ACRIN study, the EUSOBI has published the recommendations for breast cancer screening in women with extremely dense breast (category D according to the ACR BI-RADS atlas) [36,37,38,39,40]. For this group of women, level I evidence is now available that MRI screening cost-effectively reduces mortality. Based on this evidence, healthcare systems should focus on building programs that can offer MRI from age 50 to 70, at least every 4 years, in women with very dense breasts. Major drawbacks of breast MRI remain higher costs of the screening program, relatively limited availability that currently requires expansion, and long examination time, although the latter seems to be solved by abbreviated protocols that are equally effective and enable imaging in a similar timespan as mammography.

Fig. 4
figure 4

46-year-old patient with a prior history of chest irradiation for Ewing sarcoma. Screening examinations. MLO views (a) from mammography examination that was deeemed normal. Contrast-enhanced breast MRI (b) reveals bilateral breast abnormalities. In the right breast, an irregular spiculated mass is visible, corresponding to a 13 mm NST carcinoma. In the left breast, segmental heterogeneous nonmass enhancement is seen, corresponding to extensive DCIS with microinvasion

Contrast-enhanced mammography

Contrast-enhanced Mammography (CEM) is a recently introduced breast imaging technique that like breast MRI uses contrast medium with the purpose to depict vascularity potentially associated with breast cancer, thus providing anatomic and functional information at the same time. In the screening setting, few studies so far have investigated the use of CEM, particularly among women with intermediate/high lifetime risk [41, 42]. CEM shows superior diagnostic performance compared with digital mammography, DBT, and mammography with supplemental ultrasound and seems to show equal to a bit higher specificity [16] but lower sensitivity compared with breast MRI. Thus, CEM seems a promising alternative to breast MRI when MRI is not available. Nevertheless, most studies published so far are retrospective and/or include a limited number of women. Further investigations on a larger scale are necessary to compare CEM with standard imaging, and particularly MRI, in terms of achieved stage shift and frequency of interval and advanced cancers [43].

Figure 5 summarizes the suggested clinical pathways for breast cancer screening in different women subgroups.

Fig. 5
figure 5

Flowchart summarizing the clinical pathways for breast cancer screening in different women subgroups

The downsides of screening

Despite the evidence that mammography screening programs reduce mortality from breast cancer, the net benefit is counterbalanced by some recognized factors, namely underdiagnosis, false positive cases, and overdiagnosis.

Underdiagnosis

Underdiagnosis is the situation where a clinically relevant breast cancer is present in the breast but is not detected at the time of screening. This leads to cancer detection at a later time when the cancer has progressed and is less easily—and sometimes also less successfully—treatable. Classically, interval cancers are counted as a false negative or underdiagnosed cases. This accounts for approximately 15–40% of cancers detected within a population screened with mammography, mostly depending on breast density. However, underdiagnosis is more frequent as approximately 30% of cancers detected through mammography screening already have spread to the lymph nodes, and 25% are over 2 cm in size (i.e., T2 or more). Retrospective evaluation of screening mammograms shows that 30–50% of these cancers were already visible on the preceding mammogram, but not recognized at that time. Studies with other screening modalities, particularly breast MRI, also show that some cancers could be found earlier when they are smaller and lymph node-negative. Whether this has implications for treatment and prognosis depends also on the biological profile of the cancer. Unfortunately, thus far, it is not possible to predict what type of cancer a woman will develop, and consequently, the only remedy against underdiagnosis is to try to find all cancers as early as possible [8, 44].

False positive

Up to 10–12% of women undergoing screening mammography are recalled for additional workup, which may consist of additional imaging, tissue biopsy, or short-term follow-up. Among these recalls, only about one-third of cases will have a breast cancer diagnosis, whereas the others may have a benign lesion or a finding caused by tissue superimposition. These are referred to as false positive cases [8, 44]. The probability of a false positive result is higher in the case of a baseline mammogram. [45]. The European quality assurance guidelines for breast cancer screening recommend double reading, with both readers initially performing independent mammogram evaluation, followed by resolving discordant results by third reader arbitration or consensus between the two readers [46]. Compared to single reading, double reading reduces the recall rate without reducing the cancer detection rate [47]. False positive cases are considered a harm of screening because of medical costs and discomfort experienced by the patient. Moreover, the rate of participation to future screening rounds may decrease after experiencing a false positive result [48]. To reduce anxiety to a minimum, it is of paramount importance to quickly analyze recalled women.

Overdiagnosis

Overdiagnosis refers to the detection of breast cancers that would never become clinically evident and would not cause any harm during the individual’s lifetime if left untreated. It is, unfortunately, never possible to directly tell which cancers have been “overdiagnosed”. Slower-growing and lower-grade cancers have a higher probability of being overdiagnosed, causing a length bias. Moreover, these cancers are more often associated with microcalcifications and with a stromal reaction (spiculation), which make their identification in mammography easier [36], and, which makes mammography screening prone to overdiagnosis. Overdiagnosis leads to overtreatment and possibly causes morbidity or even mortality without any certain benefit. It is difficult to estimate the extent of overdiagnosis related to mammography screening, as multiple factors are involved, such as age, cancer risk of the screened population, and lead time. Previous published works reported percentages of overestimation varying between 0 and 50% of all detected cancer: this variation also depended on different definitions of overdiagnosis, study settings, and estimation methods [36]. In a more recent study, accounting for the detection of nonprogressive cancer, about 15% of screen-detected cancers were estimated to be overdiagnosed [36]. The impact of overdiagnosis may be reduced by identifying women with “low-risk disease” who could undergo active monitoring instead of surgery or even who do not necessitate further investigations at the point of screening, thus reducing overtreatment. Currently, several ongoing trials are evaluating the use of active monitoring instead of conventional surgical treatment in women with a diagnosis of low-risk ductal carcinoma in situ [49,50,51].

Economic considerations

Although the cost-effectiveness of breast cancer screening is dependent on local costs of imaging techniques and labor as well as the value attributed to increased life expectancy and overall health, economic models suggest that breast cancer screening in the European healthcare setting improves life and/or quality-adjusted life expectancy at acceptable costs. Risk-adapted screening strategies should guarantee further optimization of the resource utilization by assigning each risk category the most advantageous and cost-effective screening method, thereby reducing unnecessary harm and costs [52, 53].

In countries with limited resource availability, screening tools need to be chosen wisely depending on the cost-effectiveness and performance expected in the population. In situations where financial constraints limit the implementation of large-scale screening, priority should be given to raising breast cancer awareness and ensuring timely presentation of breast complaints to radiologists, potentially by education on physical breast examination and modifiable breast cancer risk factors [54].

Women preferences

Women should be properly informed about the advantages as well as disadvantages of a screening test to be able to make informed choices. To minimize anxiety, prompt provision of screening results is also important [55]. Shared decision-making is characterized by the participation of physicians and patients on taking the best decision, based not only on the scientific evidence but also on expectations and wishes of each patient. Breast radiologists should embrace an active role in this, as knowledge about the benefits and harms of different screening scenarios is not common among other physicians. Several studies on population-based mammography screening have shown that women value more the possibility of early diagnosis over the risk of overdiagnosis or a false positive exam, although concerns remain because the meaning of overdiagnosis may not be fully understood [56]. Moving to the scenario of risk-based cancer screening, potentially encompassing a more intensive screening for women at higher risk and less intensive screening for women at lower risk, informed decision-making could become even more important to increase acceptability among those women who are candidates for reduced screening intensity [57].

General recommendations

Screening mammography is an effective tool to reduce breast cancer mortality (evidence level I). Existing screening programs should be continued, and efforts should be made to increase adherence. Digital breast tomosynthesis can be performed as an alternative to mammography.

High-risk women should start screening as early as 25 years of age with yearly breast MRI (evidence level I), supplemented with mammography from age 35–40 years.

Women at intermediate risk of breast cancer may benefit from supplemental screening, including digital breast tomosynthesis, breast ultrasound, breast MRI, and possibly CEM. The most appropriate imaging modalities should be adjusted to patient characteristics. The current evidence supports the use of breast MRI screening in women with extremely dense breast tissue, preferably every 2–3 years (evidence level I). If MRI is not available, supplemental ultrasound can be performed as an alternative, although the added value of supplemental ultrasound regarding cancer detection remains more limited.

Several risk prediction models are available to estimate the individual risk of developing breast cancer. If possible, risk assessment should be performed at a young age (approximately 25 years) to effectively tailor screening recommendations. Individual screening recommendations should be made through a shared decision-making process between women and physicians.

Summary statement

Tumor stage at diagnosis remains a crucial prognostic factor to determine breast cancer survival. By means of mammographic screening, early cancer diagnosis can be achieved thereby reducing breast cancer mortality and the need for aggressive treatments. Nevertheless, breast cancer remains the leading cause of cancer death among women worldwide, and although the benefits of mammographic screening outweigh the harms in the general population and the “one-size-fit-all” approach remains easier to implement, further improvements are sought.

Moving to screening programs adjusted to personal risk level instead of age-based population screening could improve the performance of a screening program by reducing underdiagnosis, false positives, and overdiagnosis as well as improving cost-effectiveness.

Other effective imaging modalities are available and, depending on the characteristics of the screened population, could be implemented as an alternative to mammography (e.g., breast tomosynthesis and breast MRI) or as a supplemental tool (e.g., breast ultrasound) (Table 1). Although these additional examinations may be susceptible to higher false positive rates compared to mammography, considering the potentially dramatical consequences of a late cancer diagnosis, priority should be given to avoid false negative examinations rather than to reduce false positive.

Table 1 Summary recommendations on breast cancer screening

Patient summary

Breast cancer screening can detect small cancers before they become symptomatic, leading to, in most cases, less aggressive treatment and prolonged survival. So far, breast cancer screening has mainly been performed through mammography, which still represents a very effective imaging modality but has recognized weaknesses. For example, the performance of a screening exam can vary depending on personal characteristics such as the family history and the breast density. There is now evidence that considering these and some other factors to provide a more personalized screening and incorporating other screening modalities may help to further improve the early diagnosis of breast cancer. Several current studies are investigating these aspects.