Background

Breast cancer (BrCa) is the most commonly diagnosed cancer worldwide, with an estimated 2.3 million cases and 685,000 deaths in 2020 [1]. Incidence and death rates of female BrCa remain high in developed countries [1] and rapidly increase in transitioning ones (countries with lower Human Development Index). The latter could be attributed to the fact that countries with growing economies have been experiencing significant changes of lifestyle and sociocultural patterns, which, along with the increasing involvement of women in the industrial workforce, have resulted in changes of the prevalence of BrCa risk factors [1, 2].

Approximately 10% of all female BrCa cases are familial and linked to specific highly penetrant gene mutations (e.g., BRCA1, BRCA2) [3]. However, the highest proportion of cases are attributed to both low penetrant genetic and non-genetic factors [3]. For example, menopausal status is an important non-genetic factor that determines BrCa risk [4]. Variations in premenopausal and postmenopausal BrCa incidence and mortality across different countries are associated with income differences as well as with the differential distribution of distinct molecular features and risk factors in each of the two menopausal statuses [4]. In addition, BrCa is classified into molecular subtypes based on whether BrCa cells grow in response to female hormones (i.e., estrogen, progesterone) or growth factors [5]. Stratification of women based on non-genetic risk factors for BrCa is of paramount importance for developing more effective risk reduction strategies as well as for targeted risk- stratified BrCa screening programmes [6].

There is a large number of systematic reviews and meta-analyses on non-genetic factors (including obesity, hormone levels, alcohol consumption, and smoking) and their association with BrCa risk and mortality [7,8,9,10]. However, the results are often contradictory and subject to biases. A few umbrella reviews (i.e., reviews of systematic reviews and meta-analyses), which examined certain types of non-genetic exposures, also included BrCa as one of the studied outcomes [11,12,13,14,15,16]. However, to our knowledge, there has been no systematic effort to summarise and evaluate the robustness of evidence on non-genetic risk factors for BrCa.

Therefore, in view of the large, and often contradictory, amount of published evidence on non-genetic risk factors for BrCa incidence and mortality, we aimed to summarise and evaluate the findings of systematic reviews and meta-analyses in this field, following an umbrella review methodology. The added value of the present umbrella review is that it offers a comprehensive and deep understanding of the aetiology of BrCa by integrating findings from various systematic reviews and meta-analyses, thereby providing a thorough and reliable assessment of the evidence regarding non-genetic factors and risk of BrCa.

Methods

A standardised methodology based on a predefined internal protocol was registered in PROSPERO (CRD42022370675). The findings are reported according to the PRIOR [17] (Preferred Reporting Items for Overviews of Reviews) recommendations (Additional file 1 – PRIOR Checklist).

Search strategy

We identified relevant systematic reviews and meta-analyses investigating the association of any non-genetic factor and BrCa incidence and/or mortality. We searched Medline (via PubMed), Scopus, and the Cochrane database for systematic reviews from inception to October 31st, 2022. The following search algorithm was used: ((Breast OR mammary) AND (cancer* OR neoplasm* OR malignant* OR tumour* OR tumor* OR carcinoma* OR adenocarcinoma*)) AND (meta-analysis OR "systematic review" OR systematic review). The full strategy can be found in the supplement (Additional file 1 – Search strategy).

Eligibility criteria

We included systematic reviews and meta-analyses published in English that studied the association of any non-genetic exposure with female BrCa incidence or mortality due to BrCa as the primary cause of death (when mortality was reported as proxy for incidence in primary studies) among healthy individuals at risk for BrCa. Studies involving women with pre-existing breast cancer investigating survival outcomes following cancer diagnosis were excluded. There were no restrictions depending on publication status such as preprints. However, certain types of publication (e.g., books, commentary, letters) were not evaluated as they were considered unlikely to provide sufficient data for inclusion in our analysis. We only included papers that had performed a systematic literature search; meta-analysis papers without a systematic search of the literature were excluded. We considered meta-analyses if they included at least two independent primary studies. Sub-analyses in a meta-analysis that included only one study were excluded. Finally, we excluded any (otherwise eligible) publications when they did not provide effect estimates and their corresponding confidence intervals (CIs) or some other measure, such as standard errors or P-values, for the individual studies in the meta-analyses, or enough data to reproduce them. Systematic reviews focusing on the association between genes or genetic markers and BrCa risk or on the survival of BrCa cases were not considered. The exclusion criteria are presented in the supplement (Additional file 1 – Exclusion criteria).

Title, abstract, and full text screening was performed in duplicate by 9 authors (AP, AG, AH, ME, KL, EK, CK, MC, MT). Conflicts were resolved by discussion with other team members (AY, KP, GM, GKN) until consensus was reached. In case there were multiple overlapping meta-analyses, we chose only one for our umbrella review, based on the following algorithm: First, we selected the most recent systematic review and meta-analysis. If another meta-analysis had been conducted within 5 years from the date of publication of the most recent one, we chose the one with the largest number of individual studies and largest number of participants, and the most comprehensive one (i.e., the one evaluating the largest number of different comparisons for the risk factor in question). Quality was assessed using the AMSTAR tool [18], which also served as an additional selection criterion if the preceding criteria were comparable.

Data extraction

Data extraction was performed by 7 authors (AP, CK, EK, KL, KP, MC, MT) using a predefined extraction form in Excel. The validity of data extraction was evaluated by another 4 independent authors (AG, AH, GM, ME). The extracted information from each eligible publication included the first author’s last name, year of publication, BrCa types with respect to hormone receptors and human epidermal growth factor receptor 2 (HER2), BrCa stage, examined risk factors, number of studies and estimates included in meta-analyses, characteristics of the study populations (e.g., origin, menopausal status, other characteristics), meta-analysis metric (odds ratio, risk ratio, hazard ratio, etc.; if the meta-analysis metric was not clear from the original publication we used the summary metric as reported in the meta-analysis), meta-analysis method (fixed- or random-effects), summary effect estimates and 95% CIs, and the level of control for potential confounders performed in the studies included in the meta-analysis (adjusted, not adjusted). Our umbrella review described in detail and graded only meta-analyses that were based on studies with adjusted estimates. Meta-analyses of primary studies with crude summary effect estimates (either in totality, dubbed as unadjusted, or partially, dubbed as mixed) are included in the Additional files 1 and 2 to allow for a comprehensive review of the non-genetic risks examined in the literature. However, they were not graded to avoid grading associations at high risk of bias. Within each of the studied associations we extracted data on the first author of the primary study included in the meta-analysis along with the year of publication, study design, and effect estimate with corresponding 95% CI (or any other measure of variation of the effect estimate reported), number of cases and population size (in cohort studies) or the numbers of cases and controls (in case–control studies).

Statistical analysis

This study adopted the methodological approach used in umbrella reviews [19, 20]. Briefly, for each association included in this umbrella review we calculated the summary effect estimate and the corresponding 95% CI using the inverse variance weighted random-effects model [21] due to the expected clinical and methodological heterogeneity across primary studies included in the meta-analytical associations and for consistency in the application of the evidence grading criteria. We assessed the proportion of total variability in effect estimates due to between-study heterogeneity of each meta-analysis using the I2 metric of inconsistency [22] and we also calculated the 95% prediction intervals, which show the range in which the effect estimate of a new study in the future is expected to lie [23]. The possibility of small study effects was assessed using the Egger’s regression asymmetry test [24] (with a significance threshold of 0.10), and based on whether the summary estimate was larger in magnitude than the effect estimate of the largest (i.e., most precise; smallest standard error) primary study included in that meta-analysis. Finally, we used the excess significance test to evaluate whether the observed number of studies in the meta-analysis that presented a nominally significant result (P-value < 0.05) was different from the expected number of studies with significant results [22]. The expected number of statistically significant studies was estimated based on the sum of the statistical power of each individual study, which is a function of the number cases and the total sample size. For meta-analyses in which this information was missing for at least 20% of the primary studies, the excess significance test was not performed.

Quality assessment

The quality of the eligible systematic reviews/meta-analyses was evaluated using the AMSTAR tool [18]. AMSTAR critically appraises the quality of systematic reviews and meta-analyses using 11 items and focusing on key methodological issues. Due to the large number of meta-analyses and primary studies included in this umbrella review, the risk of bias was not assessed individually for each primary study considered in each meta-analysis.

Grading of the evidence

The certainty of the evidence (i.e., the confidence in the effect estimate) was graded in a four-point scale (i.e., weak, suggestive, highly suggestive, and convincing evidence) using certain statistical criteria [19, 20] (Table 1) in accordance with previous umbrella reviews [12, 25, 26]. Associations that did not present at least a statistically significant result (P-value < 0.05) in the random-effects model were “non-significant” and, thus, they were not graded.

Table 1 Statistical criteria used for grading the evidence in the umbrella review of meta-analyses on non-genetic risk factors and breast cancer

Results

Literature search

The search algorithm yielded a total of 20,646 unique citations across the three databases (Fig. 1), of which 1,278 were deemed potentially eligible. After excluding 998 publications in the full-text screening phase (Additional file 2—Table S1), 280 publications [7,8,9,10, 27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,207,208,209,210,211,212,213,214,215,216,217,218,219,220,221,222,223,224,225,226,227,228,229,230,231,232,233,234,235,236,237,238,239,240,241,242,243,244,245,246,247,248,249,250,251,252,253,254,255,256,257,258,259,260,261,262,263,264,265,266,267,268,269,270,271,272,273,274,275,276,277,278,279,280,281,282,283,284,285,286,287,288,289,290,291,292,293,294,295,296,297,298,299,300,301,302] were included in our review, presenting a total of 895 meta-analytic associations of non-genetic factors with BrCa (Additional file 2—Tables S2 and S3). Of these, 781 were meta-analyses of studies with adjusted estimates while 114 meta-analyses included (either in totality or partially) primary studies with crude summary effect estimates. The publication dates ranged between 1995 and 2022.

Fig. 1
figure 1

Flowchart of the literature search and study selection process in the umbrella review of meta-analyses on non-genetic risk factors and breast cancer

Quality assessment

Methodological quality, as assessed using AMSTAR, varied across the 280 publications considered in our umbrella review (Additional file 2—Table S4). The median score was 8 (interquartile range: 6 to 9). Common flaws were the absence of reference to a published protocol (n = 178, 63.6%), the use of publication status as an inclusion criterion (n = 213, 76.1%), and the use of methodological quality in formulating conclusions and recommendations (n = 165, 5.89%). In about 25% (n = 69) of the publications there was no reference to a comprehensive literature search.

Description of the results

In the following sections, only the 781 meta-analyses with adjusted estimates are considered. A brief description of the 114 meta-analyses including (either in totality or partially) primary studies with crude summary effect estimates is presented in the Additional file 1 and the Additional file 2—Table S3.

The median number of included studies in the meta-analyses was 7 (range 2 to 80). Six-hundred-and-thirty-nine (81.8%) meta-analytic estimates pertained to overall BrCa incidence or mortality (with 7 estimates being specific to BrCa ­related mortality), while 131 (16.8%) focused on BrCa molecular subtypes, i.e., estrogen (ER), progesterone (PR), HER2, luminal A and B, and triple-negative, and 11 (1.4%) specifically to the locoregional spread, i.e., in-situ, invasive, localised, non-localised. Most associations (n = 568, 72.7%) pertained to the general population, while 176 (22.5%) associations pertained to menopausal status and 37 (4.8%) to specific populations (i.e., country-, race-, mutation-, parity-, or hormone replacement therapy-specific).

Overview of the available evidence

The identified non-genetic factors were classified in 11 categories (anthropometric measurements, biomarkers, breast characteristics, diet and dietary supplements, environment, exogenous hormones, lifestyle and social factors, medical history, medication, reproductive history, and pregnancy; Fig. 2). All meta-analyses in the family history–consanguinity category were based on unadjusted estimates; thus, this category was not further considered in the evidence assessment.

Fig. 2
figure 2

Summary of the classification of non-genetic factors examined in the 781 meta-analyses and the distribution of the grading for their association with breast cancer risk

Most of the 781 meta-analyses with adjusted estimates (Additional file 2—Table S2) that examined the association of non-genetic factors with BrCa risk were classified in the diet and supplements category (n = 240, 30.7%; Fig. 2). Biomarkers were examined by 18.7% (n = 146) of the meta-analyses with adjusted estimates, and lifestyle and social factors by 10.4% (n = 81) of them. A large number of meta-analyses were found for aspirin use (n = 22, 2.8%), body mass index (BMI) in adulthood or childhood (n = 17, 2.2%), night shift work (n = 16, 2.1%), weight gain (n = 16, 2.1%), Mediterranean dietary pattern (n = 13, 1.7%), body weight (n = 12, 1.5%), and breastfeeding¸ bisphosphonates use, and oral contraceptives (OC) use (each n = 11, 1.4%).

About half (n = 382; 49%) of the 781 meta-analyses were statistically significant (random-effects P-value < 0.05). Of these, 178 (46.6%) associations indicated a decreased risk of BrCa, and 204 (53.4%) an increased risk of BrCa. At a P-value threshold of 10–3, 166 (21.3%) meta-analyses were significant (of these, 103 [62%] indicated an increased risk), whereas for a P-value threshold of 10–6, 81 (10.4%) meta-analyses remained significant (n = 59, 72.8% indicated an increased risk).

High heterogeneity (I2 ≥ 50%) was found in 346 (44.3%) meta-analyses, and in 181 (52.3%) among those with statistically significant results (P-value < 0.05). The 95% prediction intervals excluded the null value (i.e., 1 for binary outcomes) in 69 (8.9%) associations. Evidence of small study effects was observed in 121 (15.5%) meta-analyses. Evidence of excess significance bias was observed in 83 (10.6%) meta-analyses. However, for almost half of the meta-analyses (n = 370; 47.4%), excess significance bias was not estimated, as in these meta-analyses, information was missing for at least 20% of the primary studies.

Evidence for non-genetic factors and BrCa risk

A comprehensive description of the evidence for the association between the non-genetic factors and BrCa risk from meta-analyses with adjusted estimates is shown in the Additional file 1.

Strength of epidemiological evidence

Figure 2 illustrates the classification of non-genetic factors examined in the 781 meta-analyses and the distribution of the grading for their association with BrCa. Tables 2 and 3 summarise the confidence in the effect estimates for protective and harmful non-genetic factors for BrCa (Table S2) and for BrCa receptor-related outcomes [estrogen receptor positive/negative (ER ±), progesterone receptor positive/negative (PR ±), human epidermal growth factor receptor 2 (HER2), luminal, triple negative], reaching at least weak evidence (Table S3).

Table 2 Strength of epidemiologic evidence for protective and harmful non-genetic factors for breast cancer (risk, mortality, invasive, non-invasive, in situ, and localized) in the umbrella review of relevant meta-analyses
Table 3 Strength of epidemiologic evidence for protective and harmful non-genetic factors for breast cancer receptor-related outcomes (ER ± , PR ± , HER2, luminal, triple negative) in the umbrella review of relevant meta-analyses

Seventeen associations (4.4% of 382 meta-analyses with significant results; 2.2% of 781 meta-analyses with adjusted estimates) were graded as convincing. These included three protective associations [older age at menarche (BrCa, ER + /PR + BrCa) [60], higher sex hormone binding globulin (SHBG) [182], and higher total fiber [134]] and 15 associations supporting an increased risk of BrCa [alcohol consumption [64], higher BMI (PR + BrCa) [7], BMI gain [7] and weight gain in postmenopausal women [286], Breast Imaging Reporting and Data System (BIRADS) classification for breast density (D versus B) [215], breast density (25%-49% and 50%-74% vs < 25%; ER- BrCa) [247], higher levels of sex hormones including androstenedione, estradiol, estrone, and testosterone in the general population and in postmenopausal women [182], and oral contraceptive (OC) use in premenopausal women [71].

Highly suggestive epidemiological evidence was found for 26 associations (6.8% of 382 meta-analyses with significant results; 3.3% of 781 meta-analyses with adjusted estimates). Of these, an increased risk of BrCa was found for higher BMI in postmenopausal women (BrCa, ER + and ER + /PR + BrCa) [7], body weight in postmenopausal women (ER + /PR + BrCa) [137], height [300], weight gain in postmenopausal women [7, 179], estradiol levels [182], Wolfe grade (P1, P2, Dy versus N1) [49], breast density (≥ 75% vs < 25%; ER + BrCa) [247], estrogen-progestin therapy [85] and digoxin use (BrCa, ER + BrCa) [65], ever active smoking [8], higher educational level [116], and diabetes mellitus [242]. In contrast, higher early adult BMI in postmenopausal women [7], 25 hydroxy vitamin D [25(OH)D] levels [123], adherence to the World Cancer Research Fund/American Institute for Cancer Research Recommendations (WCRF/AICR) score [130], and moderate-vigorous recreational physical activity (PA) [54] had a protective role.

Sixty-eight associations (17.8% of 382 meta-analyses with significant results; 8.7% of 781 meta-analyses with adjusted estimates) were graded as suggestive, while 224 (58.6% of the 382 significant; 28.7% of the 781 total) statistically significant meta-analyses were graded as weak. Finally, 47 (12.3% of the 382 significant; 6% of the 781 total) nominally significant meta-analyses that did not provide the necessary data for grading (number of cases and excess significance bias) were not considered (Additional file 2—Table S2).

Discussion

Principal findings

This large umbrella review of meta-analyses systematically summarised and critically appraised the epidemiological evidence for the association between non-genetic risk factors and female BrCa. Overall, 895 associations (781 meta-analyses of studies with adjusted estimates) were considered, reporting exposures related to anthropometric measurements, biomarkers, breast characteristics and diseases, diet and supplements, environmental parameters, exogenous hormones, factors associated with pregnancy or birth, lifestyle and social factors, medical history, medication, and reproductive history. The highest number of examined associations was found for the category of diet and supplements and for exposures such as aspirin use and active smoking.

Most of the examined associations were either non-significant (51%) or were supported by weak evidence (28.7%). Only about 5.5% of the associations (11.3% of those with statistically significant results) were graded as convincing or highly suggestive. These meta-analyses supported that alcohol consumption, high BMI (BrCa and ER + , PR + , ER + /PR + BrCa), high body weight (BrCa, ER + /PR + BrCa) and body weight gain in postmenopausal women, high height, P1/ P2/ DY Wolfe grade and high BIRADS/ Breast density classification (BrCa and ER-, ER + BrCa), OC use in premenopausal women, ever active smoking, high androstenedione, estradiol, estrone, and testosterone levels, estrogen-progestin therapy use, high educational level, diabetes mellitus, and digoxin use (BrCa, ER + BrCa) were associated with increased BrCa risk. On the other hand, high BMI at ages 18–30 years in premenopausal women, adherence to the WCRF/AICR score, high moderate-vigorous recreational physical activity in postmenopausal women, menarche at an older age (BrCa, ER + /PR + BrCa), increased total fiber intake, increased blood levels of 25(OH)D, and high levels of SHBG were found to prevent from BrCa. Of note, the associations of body weight and breast density with BrCa, despite reaching high levels of evidence, had a low score in the AMSTAR quality assessment.

Strengths and weaknesses in relation to other studies

In the current era of abundant scientific research, umbrella reviews have emerged as a crucial tool to consolidate and synthesize evidence across entire research domains. It is expected that a few associations covered in our extensive analysis might have already been partially addressed in existing umbrella reviews [12, 14, 303,304,305,306,307]. Nonetheless, our review stands out as the most comprehensive to date, offering a thorough mapping and assessment of all non-genetic risk factors for BrCa. Of note, the association between human papillomavirus infection and BrCa that was graded as convincing in a recent umbrella [305] review was not assessed in ours because it was based on unadjusted estimates. We considered this type of meta-analysis to have a high likelihood of bias.

Our study findings align significantly with existing evidence, reinforcing associations previously acknowledged as robust or reaching high evidence levels, such as alcohol [14], BMI [12], physical activity [11], dietary uptake of fiber [308], diabetes [13], sex hormones, and age of menarche [309]. Additionally, our research highlights new associations, including those for digoxin [65], 25(OH)D, breast density, and healthy lifestyle measured as a WCRF/AICR score. However, we did observe a limited number of associations for which our evidence level conflicted with that from previous studies. For example, coffee consumption, one of the most studied exposures in other umbrella reviews [14, 310, 311], reached “probable” levels of evidence in one of them [310]. However, there was no statistically significant association in our umbrella review similarly to the rest of the reviews in this topic [14, 311]. That review, which found “probable” levels of evidence for coffee consumption, followed a grading approach that allowed for higher levels of evidence to be reached although the included meta-analytical association was not statistically significant [310].

Biological plausibility

The biological mechanisms of the association between BrCa risk and height, obesity, physical activity, diabetes, and sex hormones are related. Height is related to the onset of puberty, which is affected by endogenous estrogens, whose role on BrCa has been very well documented [312,313,314]. On the other hand, there might be a causal association between height and BrCa, in which various genetic and non-genetic factors affect height and, subsequently, BrCa risk through a shared biological pathway [300, 315]. As an example, insulin-like growth factor 1 (IGF-I) has been proven to play a pivotal role in cell proliferation enhancement and apoptosis suppression, while it is also considered to be a major determinant of growth and height [214, 316]. In postmenopausal women, synthesis of estrogens takes place in the adipose tissue, whereas in premenopausal women the major source of estrogens are the ovaries. Obesity in postmenopausal women leads to increased conversion of androgens to estrogens, and, as result, to the promotion of cell proliferation and the inhibition of apoptosis. Furthermore, obesity has been associated with insulin resistance and hyperinsulinemia, which downregulates sex hormone binding globulin production, and, thus, results in increased levels of circulating estradiol. On the other hand, it has been reported that more frequent anovulatory cycles among obese premenopausal women [317, 318], and faster clearance rate of free estrogen in the liver among obese compared to lean women [319] may lead to lower levels of both estrogen and progesterone [320]. The protective effect of high moderate-vigorous physical activity against BrCa in postmenopausal women is most probably explained by the fact that exercise helps to prevent obesity.

The association of BrCa with diabetes could be explained through similar pathways such as the activation of the insulin pathway, the activation of insulin- like growth factor pathway, as well as the regulation of sex hormones. Moreover, hyperglycemia has been associated with increased levels of IGF-I and inflammatory cytokines, resulting in direct and indirect effects on cancer cells proliferation, apoptosis, and metastasis. Insulin promotes the expression of insulin receptors in BrCa cells and thus leads to the malignant transformation of breast epithelial cells. Increased insulin resistance on the other hand could cause higher levels of insulin and, as a result, increased androgen synthesis and decreased estrogen production. High SHBG levels have been shown to have a protective role against BrCa. Apart from the apparent function of the regulation of free sex hormones levels, SHBG seems to act as a direct mediator for cell-surface signaling, cellular delivery, and the biologic action of sex hormones, which results in the regulation of the bioavailable fraction of circulating estradiol [321, 322]. Through these unique features, SHBG reduces BrCa cell growth and proliferation [323, 324].

Alcohol is classified as a Group 1 human carcinogen by the International Agency for Research on Cancer [325] and is acknowledged by the World Health Organization as one of the major modifiable risk factors for breast cancer [326]. Alcohol consumption may contribute to BrCa development through various pathways, including hormonal modulation, DNA damage, oxidative stress, immune system impairment, disruption of normal liver function, folate and other nutrients malabsorption, and induction of inflammation [327].

Vitamin D is a steroid hormone with an established role in mammary gland development through the actions of its main mediator, vitamin D receptor (VDR). Through VDR, vitamin D is known to exhibit an anti-proliferative, pro-differentiating, and pro-apoptotic effect. The active form of vitamin D, 1,25(OH)2D, is responsible for the activation of VDR, therefore, circulating 25(OH)D could potentially have an inverse association with breast cancer risk [102]. Nevertheless, these results should be cautiously interpreted in the light of the consistently null associations of genetically predicted circulating 25(OH)D and breast cancer observed in Mendelian randomisation studies [328,329,330].

There are multiple mechanisms through which fiber uptake could have a protective role against breast cancer development [235]. It has been suggested that fiber delays gastric emptying and increases small intestine transit time, which result in reduced glucose absorption and insulin secretion. Furthermore, fiber could reduce circulating estrogens by promoting their fecal excretion during the enterohepatic circulation. Moreover, fiber seems to reduce reabsorption of estrogens through a reverse effect in intestinal β-glucuronidase activity, which is an essential step for the absorption of hydrolysed conjugated estrogens [235].

The mechanisms explaining the association of breast density with increased BrCa risk have not been clearly determined [331]. Increased breast density reflects an increased proportion of fibroglandular tissue, which could also depict an increased number of epithelial cells more susceptible to carcinogenesis and proliferation. Moreover, known determinants of breast density, such as late menopause, low parity, and use of estrogens, have been found to have a clear role on BrCa risk. Dense breast tissue is believed to exhibit a greater aromatase activity, thus resulting in hormonal sensitive tumors [49].

OC use has been found to be carcinogenic particularly when used before first childbirth. A full-term pregnancy contributes to a natural mature process of the breast epithelial cells in 2 stages, an early growth phase and a later phase of lobular differentiation. The nulliparous breast with its undifferentiated structures is more prone to the carcinogenic effects of OC use [71]. Considering estrogen-progestin therapy use, the progestin upregulates the expression of epidermal growth factor (EGF) and IGF receptors [332]. Progesterone and EGF significantly increase cell proliferation [333]. Non-steroidal anti-inflammatory drugs (NSAIDS) use appears to be protective through their effect on prostaglandin E2, which has been shown to up-regulate aromatase expression in adipose tissue fibroblasts by promoting binding of various transcription factors to aromatase promoters I.3 and II [334]. The structural similarity of digoxin and other cardiac glycosides to digitalis compounds like estradiol could explain the observed positive association with BrCa [65].

The mechanisms by which higher education level was associated with increased risk of BrCa remains unclear, although it is likely that this association is driven by other factors. One theory could be that women of a higher educational level usually have their first childbirth at a later age and, also, have fewer children. Other explanation might be that higher educational level has been associated with later menopause, higher alcohol use, and higher prevalence of hormonal treatments. Menarche at an older age shows a protective role particularly among Luminal tumors. Although there seems to be a hormonal mechanism supporting this association, evidence shows that when estrogen receptor positive (ER +) progenitor cells are exposed to estrogen, they produce paracrine signals that cause neighboring populations of ER- cells to proliferate [335].

Strengths and weaknesses of the study

Certain limitations should be considered with respect to the findings of this umbrella review. The analysis focused on meta-analyses of observational studies missing probably the latest evidence of primary observational studies not considered yet in any evidence synthesis. However, given the large amount of included evidence it seems unlikely that single primary studies would affect the evidence grading to a modest degree. The methodological quality of the included publications was moderate as several publications failed to report or apply critical items of the AMSTAR tool, such as the comprehensive literature search, the use of publication status as an inclusion criterion, and the use of the scientific quality of the included studies for drawing conclusions. A relatively high number of meta-analyses included less than 10 primary studies; hence, the excess significance and small study effects tests could be underpowered. Furthermore, the necessary information for the calculation of the excess statistical significance test was often absent in several meta-analyses, resulting in about one-eighth of the included meta-analyses being non-evaluable, thus likely underestimating the number of convincing associations. Although we graded the certainty of evidence according to prespecified criteria, association does not equal causation, which is difficult to demonstrate in non-randomised studies. While we prioritised meta-analyses of prospective cohort studies providing adjusted estimates, most meta-analyses also included case–control designs further accommodating our evidence interpretation. While case–control studies, especially those with suboptimal designs, are more likely to be subject to epidemiological biases, we did not restrict our analyses to cohort designs to ensure maximal comprehensiveness in the included studies. While we focused our grading only on analyses of adjusted estimates, residual or unmeasured confounding may be present. Furthermore, reverse causation cannot be excluded and the retrospective studies included in certain meta-analyses may be vulnerable to recall bias. Therefore, considering these weaknesses, we advise caution in any interpretation of the results presented in our review. Nevertheless, this umbrella review provides the most comprehensive assessment of the published epidemiological literature of non-genetic factors and BrCa risk. A vast amount of data was considered, and robust methodological approaches were used to assess the evidence. Overall, although the constraints of this umbrella review probably would make our assessment somewhat more lenient, their effect on the associations supported by convincing evidence is expected to be trivial.

Implications for future research

Our findings suggest some highly modifiable protective factors for BrCa. Interestingly, while diet was the most studied exposure category, associations failed to reach higher levels of evidence, indicating the methodological limitations in the field. To improve the validity of these associations future research should focus: i) on more robust study designs, such as high quality randomised controlled trials or Mendelian randomisation studies that have the potential to minimise biases common in observational epidemiological designs, and ii) on better exposure assessment techniques including objective measures and uitilising large scale omics technology to bolster our understanding of the mechanistic evidence underlying these associations. Overall, our study provides knowledge that supports the development of BrCa prevention recommendations and guidance, both at an individual level and for public health initiatives.

Conclusions

As the incidence of BrCa increases in many countries worldwide, the identification of modifiable risk factors is imperative for health care professionals to provide both individualised and public health guidance for BrCa prevention. Our study summarised a large number of publications describing associations between non-genetic factors and BrCa risk with varying methodological quality and varying strength and validity of the associated evidence. The validity of several well-established risk factors was reaffirmed and several risk factors with potentially higher levels of evidence strength were highlighted. These results reinforce the existing guidelines and recommendations advocating for women to maintain a healthy weight, engage in regular physical activity, and adopt a nutritious, high-fiber diet to mitigate the risk of developing BrCa. Moreover, our findings underscore the importance of regular screening, particularly for high-risk groups, i.e., women over 50 years old with increased breast density, poor lifestyle, and prior use of OC, further emphasising the proactive measures that can significantly contribute to breast cancer prevention. However, it is important to note that many associations did not reach higher levels of evidence. Studies following consistent standardisation definitions and procedures could improve the quality of publications and the level of the evidence.