Introduction

Breast cancer is the most prevalent cancer in women worldwide and the second most common cause of cancer mortality [1, 2]. With the growing incidence of breast cancer around the world [3], it is crucial to identify lifestyle risk factors that may help reduce the risk of breast cancer, including physical activity, breastfeeding, and consumption of healthy food.

Population-based studies have reported several dietary patterns that are associated with the risk of breast cancer [4], including Healthy/Mediterranean patterns or based on dietary guidelines (e.g., the Healthy Eating Score) [5], while specific foods such as oily fish, fruits, and vegetables may help reduce the risk of developing breast cancer [6]. However, previous evidence regarding the associations between different types of meat consumption and breast cancer risk has been inconclusive [7, 8], which could be attributed to differences in study design or the use of specific subgroups of women. Moreover, despite the known association between alcohol intake and breast cancer [9, 10], no study to date has assessed their synergistic effect with food on the risk of breast cancer, which could result in targeted interventions for women and increase the cost-effectiveness of interventions.

Besides the interaction with alcohol intake, the association between dietary factors and breast cancer could also be influenced by genetic factors. Recent GWAS studies have revealed 313 SNPs associated with breast cancer risk that could be used as a tool for risk stratification [11]. However, it is still unclear whether the associations with dietary factors differ according to different genetic predispositions to breast cancer, which is important for personalized prevention.

The aim of this study was to evaluate the association between dietary factors and the risk of breast cancer and to investigate whether the association was influenced by alcohol consumption, particularly when taken with meals. We also examined the association between dietary factors and breast cancer risk taking into account genetic susceptibility measured by polygenic risk score.

Methods

Study population, exposure, and outcome

Between 2006 and 2010, a total of 503,317 participants agreed to participate in the baseline assessment of the UK Biobank study, of which 273,382 were women [12]. Participants were followed up from the date of enrollment until the date of diagnosis of breast cancer, withdrawal from the study, death, loss of follow-up, or the end of follow-up (December 31, 2019), whichever occurred first. Information on breast cancer diagnosis was obtained by linking the cohort to the National Health Service (NHS) Digital for England and Wales and NHS Scotland, using unique personal identification numbers. The ICD-10 code C50 and ICD-9 code 174 were used to identify breast cancer diagnoses in the cancer register. Women with breast cancer before participating in the UK Biobank were excluded from the analysis. The date of death was retrieved from death certificates held by the NHS Information Center and the NHS Central Register. The study was approved by The National Information Governance Board for Health and Social Care and the NHS North West Multicentre Research Ethics Committee (06/MRE08/65). The participants gave informed consent at the baseline and agreed to be followed up via data linkage.

Diet group classification

Dietary intake data were collected at recruitment using a self-reported touchscreen questionnaire (http://biobank.ctsu.ox.ac.uk/showcase/showcase/docs/Touchscreen QuestionsMainFinal.pdf). The frequency of processed meat, beef, lamb, pork, poultry, fish (oily/non-oily), and cheese consumption was coded into three categories: never, less than once a week, and more than once a week. In addition, for vegetables (cooked/raw) and fruit (fresh/dried), consumption was coded into four categories (< 2 servings/d, 2.0–2.9 servings/d, 3.0–3.9 servings/d, ≥ 4 servings/d) as suggested by previous studies [13]. Participants who had missing information or responded with "prefer not to answer" or "do not know" were categorized as "missing". A healthy diet was defined according to the Healthy Eating Score (HDS), which was calculated based on: consuming at least four tablespoons of vegetables each day; consuming at least three pieces of fruit each day; consuming fish at least twice each week; consuming red meat (beef, lamb, and pork) no more than twice each week; consuming processed meat no more than twice each week. One point was awarded for each advantageous dietary factor, with the total diet score ranging from 0 to 5 [14]. Participants were grouped into poor dietary patterns (score 0 or 1), medium dietary patterns (score 2 or 3), and ideal dietary patterns (score 4 or 5) [15].

The frequency of alcohol consumption was collected through a questionnaire, and we dichotomized it into whether or not had daily alcohol drinking. Participants reported different types of alcohol, including beer/cider, white wine/champagne, fortified wine, red wine, spirits, and “other”. According to official UKB statistics, a pint or can of beer/lager/cider = 2 units, a 25 ml single shot of spirits = 1 unit, and a standard glass of wine (175 ml) = 2 units were used to estimate alcohol content. When the alcohol consumption was reported monthly, we divided the intake by 4.3. Then, the weekly intake divided by 7 is the daily unit of consumption. The World Health Organization recommends no more than two "standard intakes of alcohol" per day, and alcohol consumption was therefore dichotomized accordingly to investigate their potential interaction with dietary factors. In the questionnaire, the participants also provided information on whether the alcohol was usually taken together with meals.

Polygenic risk score

Blood samples were collected from the participants upon enrollment, and genotyped using the UK Biobank Axiom array. A brief description of the procedures for genotype calling, array design, sample handling, quality control, and imputation of the UK Biobank samples has been provided elsewhere [16]. To determine whether the influence of dietary factors varied based on genetic susceptibility to breast cancer, significant SNPs from a recent meta-analysis of breast cancer GWAS were selected to create polygenic risk scores for breast cancer overall and by estrogen receptor (ER) status [11]. For all individuals, the weighted polygenic risk score (PRS) can be used to calculate the PRS, which is the sum of the products of the logarithmic odds ratio (OR) per allele and the allele dose for each SNP associated with breast cancer. Overall, ER+, and ER− PRS were categorized into quartiles, respectively. Detailed information on PRS score generation is provided in Supplementary Table 1.

Statistical analysis

Cox proportional hazards models were used to assess the associations between dietary factors and breast cancer risk, adjusting for various factors, including smoking status (never, previous, current), ethnicity (grouped into five categories where possible: White, Mixed other, Asian or Asian British, Black or Black British, and unknown), physical activity level (measured in metabolic equivalents task units and categorized into quartiles), Townsend deprivation index (categorized into quintiles), frequency of alcohol intake (≥ once/day or < once/day), employment status (in paid employment, pension, not in paid employment), educational qualifications (college or university degree/vocational qualification; national examination at ages 17–18 years; national examination at age 16 years; other qualifications were treated as missing), body mass index (BMI, categorized as < 18.5, 18.5 to < 25, 25 to < 30, or ≥ 30 kg/m2), 22 UKB centers, number of births (categorized as 0, 1, 2, or ≥ 3), age at menarche (categorized as < 13, 13–15, 16 to < 30 years), menopausal status (categorized as no, yes, not sure—had a hysterectomy, or not sure—other reason), age at first birth (categorized as < 23, 23–27, > 27 years, nulliparous/ missing), ever use of oral contraceptive pill (categorized as no or yes), ever use of hormone replacement therapy (categorized as no or yes), and family history of breast cancer (categorized as no or yes). Missingness in the covariates was categorized as a separate category.

To test the multiplicative interaction between the dietary factors and alcohol consumption, as well as between the dietary factors and PRS, an interaction term was included in the regression models and tested using the likelihood ratio (LR) test. Stratified analyses were also conducted according to alcohol intake frequency, whether alcohol was consumed with a meal and quartiles of PRS. We further estimated interaction in the additive scale for dietary factors and PRS using relative excess risk due to interaction (RERI), and a bootstrap approach was used to estimate the confidence interval and the p values.

Besides, sensitivity analyses were also performed by stratifying the analyses by menopausal status, Townsend deprivation index, and educational level. Likelihood ratio (LR) tests were used to examine the potential interaction by these variables. In this sensitivity analysis, menopausal status was divided into pre- and post-menopause. The Townsend deprivation index was dichotomized by median, and education was divided into a college degree and others.

All statistical analyses were performed using Stata 15.1. All P values were two-sided, and a P value of less than 0.05 was considered statistically significant.

Results

Baseline characteristics

Among all 273,382 women in the UK Biobank, 58 withdrew their consent and were dropped, and 11,471 women were excluded due to a breast cancer diagnosis before baseline, leaving 261,853 participants in our study with 9069 incident breast cancer cases (Supplementary Fig. 1). The median follow-up time was 10.8 years and the incidence rate was 327.457/100000 person-year in the cohort.

Table 1 shows the characteristics of all participants. Breast cancer cases were more likely to be White ethnicity, less physically active, living in more affluent areas (measured by Townsend score), paid employed, holding a university/college degree/NVQ, consuming more alcohol, smoking, having two or more children, experiencing menarche from 13 to 15 years old, being postmenopausal, having their first birth at an age over 23 years old, taking oral contraceptives or hormone replacement therapy, having no family history of breast cancer, and having a higher BMI.

Table 1 Baseline characteristics in the UK Biobank, by frequency of breast cancer (N = 261,853)

Diet, alcohol consumption, and risk of breast cancer

Figure 1 shows the HRs and 95% CIs for breast cancer by consumption level of each food item, using the lowest consumption category as the reference. It is noteworthy that an increase in the risk of breast cancer was observed with the frequent consumption of processed and red meat (HR for processed meat ≥ once/week = 1.10, 95% CI 1.03, 1.18, P-trend = 0.016; HR for lamb ≥ once/week = 1.09, 95% CI 1.02–1.16, P-trend = 0.010). However, the intake of beef, pork, poultry, cooked vegetables, and cheese was not significantly associated with the risk of breast cancer. Conversely, women with high levels of raw vegetables, fresh fruits, and a healthy dietary pattern may have a reduced risk of breast cancer, with corresponding HRs (95% CIs) of 0.93 (0.88, 0.99), 0.87 (0.81, 0.93), and 0.93 (0.86, 1.00), P-trend of 0.025, < 0.001, and 0.041, respectively. Among women who consumed alcohol ≥ once/day, processed meat intake was positively associated with the risk of breast cancer [HR (95% CI)1.20 (1.01, 1.29), p-trend = 0.014], while fresh fruit intake was negatively associated with the risk [HR (95% CI) 0.76 (0.64, 0.90), p-trend = 0.003] (Fig. 2). Moreover, a borderline significant interaction with alcohol consumption was also observed for processed meat (P for interaction = 0.065). To further investigate the synergistic effect between food and alcohol, we stratified the analysis by timing of alcohol consumption (Table 2 and Supplementary Table 2). In women who usually took alcohol together with the meal, the p for interaction between processed meat and alcohol consumption was 0.18. In these women, fresh fruit intake was negatively associated with the risk of breast cancer [HR (95% CI):0.73 (0.59–0.91), p-trend < 0.01], although the interaction with alcohol consumption was not statistically significant.

Fig. 1
figure 1

HRs (95% CIs) for the associations between dietary and breast cancer in UK Biobank participants, by frequency of alcohol consumption. Multivariable Cox regression model adjusted for age at recruitment, smoking, ethnicity, physical activity level, Townsend deprivation index, alcohol intake frequency, employment status, educational qualifications, BMI, 22 UKB centers, number of births, age at menarche, menopausal status, age at first birth, ever use of oral contraceptive pill, ever use of hormone replacement therapy, family history of breast cancer, stratified by alcohol intake frequency. The age at the end of the follow-up (i.e., attained age) was used as the underlying time scale. REF, reference

Fig. 2
figure 2

Associations between diet and any breast cancer, by frequency of alcohol consumption (in women took alcohol < 1/d or in women took alcohol ≥ 1/d). Multivariable Cox regression model adjusted for age at recruitment, smoking, ethnicity, physical activity level, Townsend deprivation index, alcohol intake frequency, employment status, educational qualifications, BMI, 22 UKB centers, number of births, age at menarche, menopausal status, age at first birth, ever use of oral contraceptive pill, ever use of hormone replacement therapy, family history of breast cancer, stratified by alcohol intake frequency. The age at the end of the follow-up (i.e., attained age) was used as the underlying time scale. REF,reference

Table 2 The interaction between meat and alcohol consumption on breast cancer risk, by the timing of alcohol consumption

Stronger associations between fresh fruit, processed meat, and the risk of breast cancer were also observed in women who consumed alcohol ≥ 2 units per day. However, their interactions with a dosage of alcohol consumption were not statistically significant (Supplementary Table 3).

For sensitivity analysis, when stratifying the analysis by menopausal status, processed meat, beef, and lamb were associated with the risk of breast cancer in postmenopausal women, while statistically significant association only persisted in pre-menopausal women who took processed meat ≥ once/week and alcohol ≥ once/day (Supplementary Table 4). Stratification by Townsend deprivation index and education levels suggested no interaction between socio-economic status and those dietary factors identified to be associated with breast cancer in the main analysis, although the point estimates changed slightly (Supplementary Tables 5, 6).

Diet, genetic predisposition, and risk of breast cancer

The association between dietary factors and breast cancer risk was additionally compared among women with the highest and the lowest quartile of PRS (Fig. 3). In women with the highest quartile of PRS, the HRs (95% CIs) for breast cancer were 1.13 (1.02–1.26) for the highest levels of beef consumption compared with the lowest levels. Although multiplicative interaction was not observed, the RERI of beef consumption in women with the highest quartile of PRS was 0.46, and the p value of RERI was < 0.01, which was statistically significant. This significant interaction was still observed when stratified by genetic susceptibility to breast cancer using ER+ PRS and ER− PRS in quartiles (Supplementary Table 7). No interaction was observed for other food.

Fig. 3
figure 3

Associations between diet and breast cancer for individuals with different breast cancer PRSs. Multivariable Cox regression model adjusted for age at recruitment, smoking, ethnicity, physical activity level, Townsend deprivation index, alcohol intake frequency, employment status, educational qualifications, BMI, 22 UKB centers, number of births, age at menarche, menopausal status, age at first birth, ever use of oral contraceptive pill, ever use of hormone replacement therapy, family history of breast cancer, stratified by alcohol intake frequency. The age at the end of the follow-up (i.e., attained age) was used as the underlying time scale. REF, reference

Discussion

Key results

We found a significant association between processed meat, vegetables, fruits, and HDS and the risk of breast cancer, particularly in women with a high frequency of alcohol consumption. Only a borderline significant interaction between processed meat and alcohol consumption was observed among women. We have also discovered a stronger association between beef consumption and breast cancer in women who have a strong genetic predisposition to the disease. No interaction was observed for other dietary factors.

The significant association between processed meat intake and breast cancer risk in our study is supported by several previous findings [8, 17]. Processed meat contains carcinogenic components that can directly damage DNA, such as heterocyclic aromatic amines (HAA) and polycyclic aromatic hydrocarbons (PAHs) resulting from meat processing or preparation, including high-temperature cooking. Nitrites, used as additives, can also induce the formation of n-nitroso compounds (NOCs) in the digestive tract, as confirmed in both animal and human biomonitoring studies [18]. These compounds can also play a carcinogenic role through other mechanisms, such as the estrogenic properties of pseudo-hot isostatic pressing (PhIP) [19, 20].

Furthermore, we found a penitential interaction between intake of processed meat and alcohol consumption on the risk of developing breast cancer. Recent large-scale prospective studies have shown that alcohol intake may increase the risk of breast cancer [21,22,23], likely due to hormonal influences [24,25,26]. Meanwhile, alcohol has also been suggested to have toxic effects and these effects were mediated by DNA damage and carcinogenic effects of alcohol and through mutagenesis by acetaldehyde and by induction of oxidative damage [27,28,29]. One plausible mechanism for the synergistic effect between processed meat and alcohol is that alcohol may enhance the penetration of carcinogenic compounds in processed meat as a solvent [30]. For instance, the concurrent consumption of processed meat and alcohol may result in an increased expression of CYP2E1, leading to elevated levels of oxidative stress and DNA damage. Consequently, CYP2E1 reinforced the activation of Reactive Oxygen Species (ROS) that cause DNA damage through ethanol and ROS production. This activation further stimulated PhIP through an oxidation process, which could trigger or sustain tumor growth [26]. Despite these, the synergic effect between processed meat and alcohol still requires more studies to verify.

Consistent with previous studies [6, 9], our findings support the view that an overall healthy dietary pattern was negatively associated with the risk of breast cancer, and this appears to be due to high consumption of vegetables and fruits [31]. Vegetables and fruits are abundant in potentially anti-carcinogenic nutrients including fiber, vitamins C and E, carotenoids, and other bioactive substances [32,33,34], which may lessen the risk of developing cancer.

In the current study, we observed an association between beef consumption and the risk of breast cancer in women who had the highest quartile of breast cancer PRS. Several studies have also observed additive interactions between lifestyle factors [35], and the genetic predisposition to breast cancer risk [36,37,38]. Moreover, the slightly higher risk of breast cancer among women with high beef consumption and ER+ PRS compared to the overall PRS may suggest a stronger association with ER-positive disease [39, 40]. Our finding of the additive interaction between ER+ PRS and beef further suggested the role of genetic testing in individualized dietary intervention for breast cancer.

The main strength of our study is its large sample size and population-based cohort design. Other strengths include the UK Biobank cohort’s abundant lifestyle and genetic data, which allowed us to explore the gene and lifestyle interactions for the associations investigated. However, our study has several limitations. On one hand, there is no real measurement of alcohol consumption, but only frequency of alcohol consumption. On the other hand, the data from the touchscreen dietary questionnaire might be affected by recall bias. Besides, as we have performed a lot of interaction tests and subgroup analyses, findings with nominal P values for these analyses should be interpreted with caution, considering the multiple testing issues. Further studies are, therefore, needed to validate our findings. Finally, given the observational nature of this study, it is possible that there are still unmeasured confounding factors or residual confounding in our analysis.

In conclusion, our findings support the view that processed meat, vegetable, fresh fruits, and HDS can affect the risk of breast cancer, suggesting that a combined intervention consisting of a well-balanced diet that includes lower processed meat consumption and an increase in vegetable and fresh fruit intake may contribute to preventing breast cancer in women at high risk. However, no strong interaction of dietary factors with alcohol consumption and genetic predisposition was observed for the risk of breast cancer.