1 Introduction

Strong evidence links obesity (defined by high body mass index (BMI) ≥ 25 kg/m2); or indicators of body fat distribution, such as waist circumference (WC : men > 102 cm, women > 88 cm), hip circumference (HC), and waist-to-hip ratio (WHR : men > 0.90, women > 0.80)) with the risk of postmenopausal breast cancer (BC) [1,2,3,4], the most frequent cancer which represents an important public health problem in women [5,6,7,8,9].

Most studies investigated single anthropometric traits in relation to BC risk, which might not adequately capture the complexity of body morphology, specifically among women who are similar in one trait but differ in others [10]. To address this issue, Ried et al. were the first to apply a principal component analysis (PCA)-based approach to estimate principal components (PC) representing body shapes derived from BMI, height, weight, WC, HC, and WHR [11]. The four derived body shape phenotypes explained over 99% of the total variation in these anthropometric traits and were differently associated with several indicators of metabolic health (e.g., hormonal, metabolic, and inflammatory biomarkers) [11]. This PCA-based approach has been subsequently applied by our team to reflect associations of these body shapes with the risk of cancer in the European Prospective Investigation into Cancer and Nutrition cohort (EPIC) [10]. A generally obese body shape and a tall, lean body shape were both positively associated with postmenopausal breast cancer risk, while the other two body shapes were not associated with such risk [10].

The underlying biological and metabolic mechanisms linking obesity to BC are multiple and complex. Obesity has been strongly associated with several metabolic alterations, including deregulation of sex hormones, overexpression of pro-inflammatory cytokines, insulin resistance, hyperactivation of insulin-like growth factor (IGF) pathways, hypercholesterolemia, as well as excessive oxidative stress [5, 12,13,14]. Several of these biomarkers (such as serum sex hormone-binding globulin (SHBG), IGF-1, testosterone, C-reactive protein (CRP)) have also been associated with BC risk [15,16,17]. However, whether these biomarkers mediate the body shape-BC relationship is unknown. Such knowledge could help understand the impact of body shapes on BC risk and possibly identify biological pathways.

The main objective of the present study was to investigate to what extent the presumed associations between body shape phenotypes and postmenopausal BC risk are mediated by biomarkers of metabolic health. The candidate biomarkers were selected based on their implication in the development of BC, as well as their associations with obesity.

2 Materials and Methods

2.1 Study Population

UK Biobank (http://www.ukbiobank.ac.uk/) is a prospective cohort study that recruited a total of 502,418 men and women, aged between 39 and 71 years at enrollment between 2006 and 2010. Study design and methodology have been described elsewhere [9, 18]. At the initial assessment center visit, participants completed a self-administered touchscreen questionnaire that included information on health, demographic, anthropometric, lifestyle, and medical history data, collected in 22 centers across England, Wales, and Scotland. Biological samples including blood, saliva, and urine were also collected at enrollment. The UK Biobank study was approved by the North West Multi-Center Research Ethics Committee, the National Information Governance Board for Health and Social Care in England and Wales, and the Community Health Index Advisory Group in Scotland (http://www.ukbiobank.ac.uk/ethics/). All participants provided written informed consent.

For the present study, we only included women, who were postmenopausal at the time of enrollment. Women were categorized as postmenopausal if they reported “yes” to the question “Have you had your menopause (periods stopped at least one year before enrollment)”, if they were older than 55 years [19] or reported a bilateral oophorectomy. Among these, we excluded women with prevalent cancer, those with missing or implausible anthropometry data, and with missing biomarker data. The study participants flowchart is given in Supplementary Fig. 1. After exclusions, the analysis involved 176,686 postmenopausal women.

2.2 Ascertainment of Breast Cancer Cases

Data on cancer diagnoses were provided by National Health Service (NHS) Digital and Public Health England for participants from England and Wales and by NHS Central Register (NHSCR) for participants residing in Scotland, and BC cases were ascertained through cancer registries [20]. For the present study, complete follow-up data were available up to 29 February 2020 for England and Wales; and 31 January 2021 for Scotland. All registrations coded as C50 using the 10th Revision of the International Classification of Diseases (ICD-10) were considered as invasive BC cases.

2.3 Assessment of Anthropometric Measures

Height, weight, WC, and HC were assessed by trained personnel during the baseline assessment center visit [21]. Body weight (kilograms, kg) was measured using a Tanita BC418MA body composition analyzer. Height was measured using a Seca 240 cm height measure, while HC and WC measurements (cm) were assessed using a Seca 200 cm tape measure. BMI was calculated as body weight (kg) divided by height in meters squared (kg/m2), and WHR was calculated as WC divided by HC.

2.4 Biomarker Assays

UK Biobank measures a wide range of biochemical markers from biological samples collected at baseline in all participants [22]. The biomarkers selected for the assay have been chosen because they are established risk factors for several diseases [22]. The present study examined biomarkers of metabolic health comprising markers of glucose (glucose, glycated hemoglobin, HbA1c, mmol/mol), insulin metabolism (IGF-1, nmol/L), inflammation (CRP, mg/L), sex hormones (testosterone and SHBG, nmol/L), blood lipids (triglycerides, HDL-cholesterol and cholesterol, mmol/L), as well as total protein (g/L). These biomarkers were selected based on their potential links with overweight/obesity, and BC risk [1, 12, 23]. We further explored other biomarkers of metabolic health that were moderately correlated to body shape phenotypes, to identify novel biomarkers that could influence their association with BC risk. These biomarkers included albumin (g/L), glucose (mmol/L), alanine amino-transferase (U/L), apolipoproteins A and B (g/L), cystatin C (mg/L), Gamma glutamyltransferase (U/L), total bilirubin (umol/L), and urate (umol/L).

The following biomarkers were measured with Beckman Coulter AU5800 analyzer (Beckman Coulter (UK), Ltd.) as part of the UK Biobank biomarker project [16, 17]. Triglycerides were quantified by Group Purchasing Organisation-Physician Owned Distributor (GPO-POD) analysis, cholesterol by cholesterol oxidase-peroxidase (CHOD-POD) method, HDL-cholesterol by enzyme immune-inhibition analysis, CRP by immunoturbidimetric-high sensitivity analysis, and total protein by Biuret analysis. Albumin was measured spectrophotometrically using bromocresol green, alanine amino-transferase and gamma glutamyltransferase by enzymatic rate, and glucose by hexokinase. Apolipoproteins A and B, and cystatin were quantified by immuno-turbidimetry, total bilirubin by colorimetric assay, and urate by uricase-PAP. Serum levels of HbA1c were measured by high-performance liquid chromatography analysis on a Bio-Rad, VARIANT II Turbo, and IGF-1 was quantified by chemiluminescence immunoassay (CLIA) technique (DiaSorin Ltd LIASON XL). SHBG was measured using the two-step competitive analysis method (Beckman Coulter, Unicel DxI 800), while testosterone was measured with a one-step competitive analysis (Beckman Coulter, Unicel DxI 800).

2.5 Statistical Analysis

PCA was applied to the standardized residuals of height, weight, BMI, WC, HC, and WHR. BMI and WHR were included in the PCA, because these composite variables usually still show some correlation with weight and height, and WC and HC, respectively. The residuals were predicted from a separate regression of the six anthropometric traits with age and study center. From the PCA, we retained the first four PCs that explained 99% of the variation and represented orthogonal linear combinations of the six anthropometric traits [11]. Each component represented a weighted sum of the six transformed anthropometric traits and is independent of the other components. The weights of each trait per PC are referred to as loadings. We used “https://bodyvisualizer.com/” to visualize the four body shapes by computing the mean values of the six anthropometric traits among participants in the 95% and 5% percentiles of each PC. Pearson correlation coefficients were used to assess the correlations between the six anthropometric traits and PCs.

Cox proportional hazard regression was used to estimate the hazard ratios (HR) and corresponding 95% confidence intervals (CI) of the associations between each body shape PC (continuous and quintiles), and each biomarker (continuous) with BC risk. Continuous models for an increment of one standard deviation (SD) of each PC and biomarker were estimated. Age at entry was age at recruitment, and exit time was considered one of following: age at diagnosis of first incident BC, age of diagnosis of another cancer except non-melanoma skin cancer, age at end of follow-up, age at loss-to-follow-up, or age at time of death, whichever occurred first. The proportional hazards assumptions were tested using scaled Schoenfeld residuals. The shape of the exposure-response curve between each PC and BC risk was estimated using restricted cubic splines [24], with five knots placed at the 5th, 27.5th, 50th, 72.5th and 95th percentiles, as recommended by Harrell et al. for larger datasets [25]. Linear regression was performed to assess the associations between each PC and distinct biomarkers of metabolic health.

We employed med4way mediation analysis [26] to investigate whether metabolic biomarkers can act as individual mediators on the pathway between body shapes and postmenopausal BC risk. Med4way uses parametric regression models to estimate the components of the four-way decomposition of the total effect of the exposure (here: PC) on the outcome (BC) in the presence of the mediator (each biomarker of metabolic health) with which the exposure may interact. The total effect (TE) is decomposed into four components, i.e. the controlled direct effect (CDE, i.e. the effect of PC on BC neither due to mediation nor to interaction), the reference interaction effect (INTref, i.e. the effect due to interaction only), the mediated interaction effect (INTmed, i.e. due to both mediation and interaction) and the pure indirect effect (PIE, i.e. only due to mediation, but not interaction) [26, 27]. The CDE was estimated at a fixed level of the mediator (here: median). Two regression models were fitted: a Cox model for the outcome, and a linear regression model for the mediator. The variable for the interaction between the exposure and the mediator was automatically generated and added to the model for the outcome. In addition to the four components of the TE, we further estimated the proportions of the effect due to each component, including the proportion due to the CDE, the proportion due to the PIE, the proportion due to the INTref, the proportion due to the INTmed, as well as the overall proportion mediated (PIE + INTmed).

The crude models were stratified by age at recruitment in 5-year categories, and study center. All multivariable models were adjusted for the following potential confounders, identified by a directed acyclic graph (Supplementary Fig. 2): age at recruitment, study center, healthy diet score, alcohol intake, smoking status, ethnicity, use of oral contraceptives, use of menopausal hormone treatment (MHT), physical activity, qualifications, Townsend deprivation index, and sedentary behavior. Healthy diet score was calculated based on consumption of these commonly food groups (fruits, vegetables, fish, processed meats, unprocessed red meats, whole grains, and refined grains) [28]. Sedentary behavior is the sum of time spent watching TV, time spent using the computer and time spent driving. Covariates, except for physical activity (missing values = 24.3%) and sedentary behavior (missing values = 4.2%), had less than 2% missing data. The multivariable analyses were thus conducted in the complete-case dataset, excluding all women with a missing value (n = 47,319) for any of the adjusted covariates, which resulted in a final sample size of 129,367 participants. In the mediation analysis, additional mutual adjustment for each biomarker was performed, by adjusting each mediator model for all other biomarkers.

The following sensitivity analyses were performed to assess the robustness of the main results: First, we excluded participants with less than two years of follow-up to control for potential reverse causation. Second, to account for the uncertainty of missing data in the adjustment variables, we performed multivariate imputation using chained equations (MICE) (‘mi impute’ in STATA) [29]. We used 10 iterations to impute multiple variables iteratively with fully conditional specification of prediction equations [29]. Variables with missing values were physical activity, alcohol intake, smoking status, use of oral contraceptives, use of MHT, qualifications, Townsend deprivation index and sedentary behavior. Third, we restricted the analyses to postmenopausal women, who answered having their periods stopped at least one year before enrollment, after exclusion of women, who were 55 years or older at enrollment, or reported a bilateral oophorectomy (n = 108,754). Finally, we investigated associations of single anthropometry measures (BMI, WC, WHR, and height) with BC risk. As sensitivity mediation analyses, additional models without mutual biomarkers adjustment were conducted. Additional mediation analyses were also conducted for BMI, WC, WHR, and height.

Body shape phenotypes from PC analysis was done with the package “FactoMineR” using R version 4.2.3, all other statistical analyses were performed using STATA 14.

3 Results

3.1 Characteristics of the Study Population

After a median follow-up of 10.9 years (interquartile range = 10.1–11.7), 6,396 incident BC events were diagnosed among the 176,686 postmenopausal women. The characteristics of the study participants by cases/non-cases status are shown in Supplementary Table1. The average age at recruitment (± SD) was 61.0 (± 5.3) years for women with BC and 60.4 (± 5.7) years for women without BC. Compared to women without BC, participants with BC were more likely to have higher anthropometric measures, to be less physically active, to have a higher level of alcohol intake, and to be MHT users. The distribution of other characteristics was generally similar. Concentration levels of CRP, testosterone, and urate were slightly higher among women with BC compared to those without, while SHBG was lower. All other biomarkers’ concentration levels were comparable (Supplementary Table2). Overall, except between HDL-cholesterol and apolipoprotein A (correlation coefficient = 0.9), there were no strong correlations between biomarkers (Fig. 1).

Fig. 1
figure 1

Pearson’s correlation matrix between the biomarkers. C coefficients of correlation, HDL cholesterol high-density lipoprotein cholesterol, IGF-1 insulin-like growth factor, SHBG sex hormone-binding globulin

3.2 Body Shape Phenotypes

Loadings and explained variance of the six PCs for women are presented in Supplementary Table3. PC1 (65.7% of the total variation) described individuals with general obesity vs. a lean body shape (Supplementary Fig. 3). PC2 (19.1% of the total variation) characterized tall individuals with low WHR vs. short individuals with large WHR (Supplementary Fig. 4). PC3 (13.5% of the total variation) characterized tall individuals with high WHR vs. short individuals with low WHR (Supplementary Fig. 5). PC4 (1.6% of the total variation) showed high loadings for BMI and weight, and low loadings for HC and WC (Supplementary Fig. 6). Pearson ’s correlation coefficients between the six anthropometric measures and PCs were comparable to results with the loadings for the individual PCs (Supplementary Fig. 7). Baseline characteristics of the study participants are further presented by quintiles of PC1 scores (Table 1). As compared to women in the lower quintile (Q1), women in the upper quintile (Q5) had higher anthropometric measures, except for height. Women in the two lowest quintiles of PC1 had a healthier diet, a higher educational level, a higher level of alcohol intake, a lower Townsend deprivation index, were more physically active, less sedentary, and more likely to be never smokers compared with those in the upper quintile of PC1.

Table 1 Characteristics of participants according to quintiles of scores of principal component 1 (general obesity) in the UK Biobank cohort study

3.3 Body Shape Phenotypes and Breast Cancer Risk

Table 2 shows crude and multivariable-adjusted associations between the four body shapes and BC risk. After adjusting for measured confounders, each 1 SD increment in PC1 (i.e., more of a generally obese body shape) was associated with a 12% (95% CI: 9–16%) higher relative risk of BC. Similarly, each 1 SD increment in PC2 (i.e., more of a tall and lean body shape) was associated with an 8% (95% CI: 5–11%) higher relative risk of BC. The linearity of these associations was confirmed in the analyses by quintiles (Table 2) and visually using restricted cubic splines (Supplementary Figs. 8 and 9). In contrast, neither PC3 (i.e. more of a tall and centrally overweight body shape) nor PC4 (i.e., an “athletic” body shape) were associated with BC risk (Table 2, Supplementary Figs. 10 and 11).

Table 2 Associations between principal components (PC) of body shape and postmenopausal breast cancer risk

Results after excluding participants with less than two years of follow-up remained comparable to the main findings (Supplementary Table4). Results after multiple imputation of missing values in covariates yielded similar results but the 95% CIs were narrower; for PC3, the HR slightly increased to 1.05 (95% CI: 1.02–1.08). Sensitivity analyses based exclusively on postmenopausal women, who answered having their periods stopped for at least one year at the time of recruitment showed similar results (Supplementary Table4). Finally, further sensitivity analyses for associations between single anthropometry measures and BC risk were presented in Supplementary Table5. Each 1 SD increment in BMI (5.1 kg/m2), WC (12.4 cm), WHR (0.1), and height (6.2 cm) was associated with 10% (95% CI: 7–14%), 12% (95% CI: 8–15%), 6% (95% CI: 3–9%), and 7% (95% CI: 5–9%) higher relative risk of BC, respectively, in the fully adjusted model.

3.4 Biomarkers and Breast Cancer Risk

The multivariable-adjusted HRs for the associations between the biomarkers of interest and the risk of BC are presented in Supplementary Table6. Concentration levels of CRP, IGF-1, testosterone, gamma-glutamyltransferase, and urate were positively associated with BC risk, whereas HDL-cholesterol, SHBG, albumin, and apolipoprotein A were inversely related to BC risk. Triglycerides, alanine aminotransferase, cystatin C, and total bilirubin were not associated with BC risk.

3.5 Four-Way Decomposition Mediation Analysis

As the association between PC and BC risk is a necessary condition for mediation analyses, these analyses were restricted to PC1 and PC2. The potential mediators were metabolic biomarkers which were associated with BC risk (Supplementary Table6). Furthermore, apart from total protein, cystatin C, and gamma-glutamyltransferase, all biomarkers of interest were either positively or inversely associated with PC1 or PC2 (Supplementary Table7). We considered causal effects for a change in PC1 and PC2 from the 25th to the 75th percentile, and each mediator fixed at its median level, after mutual adjustment for all other biomarkers.

3.6 General Obesity Body Shape (PC1)

The results from the four-way decomposition of each potential mediator of the associations between PC1 and postmenopausal BC risk are shown in Supplementary Table8 (effect estimates) and Table 3 (attributable proportions). Overall, the CDE, the effect due to neither mediation nor interaction, showed strong positive associations between PC1 and BC risk across all 13 investigated biomarkers (i.e., mediators), with CDEs between 88.9% (95% CI: 82.2–95.6%), when holding testosterone levels fixed, to 101.9% (95% CI: 93.4–109.9%), when holding IGF-1 fixed at its median (Table 3 and Supplementary Fig. 12).

Table 3 Proportions attributable for the four-way decomposition of each mediator of the associations between a principal component 1 (PC1) of body shape (a general obese body shape) and postmenopausal breast cancer risk

There was a PIE (the effect only due to mediation, but not interaction) through IGF-1 and testosterone with mediated proportions equal to -4.1% (95% CI: -11.9 to 3.8%) and 10.4% (95% CI: 4 to 16%), respectively (Table 3 and Supplementary Fig. 12). The overall proportion mediated (sum of PIE and mediation that was activated because of an interaction of PC1 with each of the two biomarkers) was -12.3% (95% CI: -20.5% to -4.0%) and 11.4% (95% CI: 5.1 to 17.8%.) for IGF-1 and testosterone, respectively. There was no evidence for mediation by the other investigated biomarkers (Table 3 and Supplementary Table8).

3.7 Tall/Lean Body Shape (PC 2)

There was little variation in the proportion of CDE after fixing the mediators (i.e., 13 biomarkers) at their median values (Tables 4 and Supplementary Table9). Minor proportions of the association between PC2 and BC were mediated by IGF-1 (PIE: 2.8%, 95% CI: 0.6 to 4.9%), and SHBG (PIE: -6.1%, 95% CI: -10.9% to -1.3%). There was no evidence of mediated interaction for IGF-1 (P = 0.242), while there was some indication of mediated interaction for SHBG (proportion mediated interaction: 3%, 95% CI: 0.7 to 6.0%) (Table 4 and Supplementary Fig. 13).

Table 4 Proportion attributable for the four-way decomposition of each mediator of the associations between principal component 2 (PC2) of body shape and postmenopausal breast cancer risk

3.8 Sensitivity Analyses

Overall, sensitivity mediation analyses without mutual biomarkers adjustment showed no substantial differences as compared to mutual adjustment for PC1 (Supplementary Tables10 and 11). Regarding PC2, there was suggestive mediation of small proportions through CRP, HDL-cholesterol, SHBG, albumin, and urate, which however, are likely due to mediator-mediator confounding (Supplementary Tables12 and 13). Mediation analyses of the associations of BMI and WC with BC risk are shown in Supplementary Figs. 14 and 15. Results were generally in line with results for PC1. Minor proportions of the association between height and BC were mediated by IGF-1, SHBG, testosterone, and albumin (Supplementary Fig. 16).

4 Discussion

Among 176,686 postmenopausal women enrolled in UK Biobank, a generally obese compared to a lean body shape (PC1), and a tall/lean compared to a short/centrally overweight body shape (PC2) were both associated with an increased risk of BC. The controlled direct effects (i.e., associations due to neither mediation nor interaction) were large, suggesting that most of the excess BC risk was due to other pathways than investigated here. Nevertheless, there was evidence that a pathway through testosterone mediated about 10% (i.e., pure indirect effect) of the positive association between body shape PC1 and the risk of BC. In contrast, IGF-1 and body shape PC1 were jointly associated with an increased risk of BC (i.e., reference interaction), and in addition, there was a mediated interaction, whereby body shape PC1 ‘causes’ IGF-1 and both were jointly associated with BC risk. This suggests that there are antagonistic associations between the joint association of PC1 and IGF-1 on BC risk (positive association), and the effect of PC1 on IGF-1 (inverse association) leading to a negative overall mediated proportion (-12%). For the association between body shape PC2 and BC risk, a small proportion (i.e., 2.8%) was mediated by IGF-1 (i.e., pure indirect effect). For SHBG, there was evidence for mediated interaction, but in opposite direction of the pure indirect effect, meaning that the overall proportion mediated was negligible. Sensitivity mediation analyses using BMI yielded comparable mediating effects to those of PC1.

In agreement with our study, a study conducted in EPIC reported an increased risk for BC in relation to both body shape PC1 (general adiposity) and body shape PC2 (tall; low WHR) [10]. These results are also congruent with previous studies investigating the association between obesity and risk of BC, but using single-trait anthropometric indicators such as BMI [1, 6, 13].

Using multi-trait body shapes has the advantage of removing redundant correlation between single anthropometric indicators (e.g., BMI is strongly correlated with WC), and thus potential confounding between them, which facilitates their use in statistical modelling. Furthermore, body shapes may capture phenotypic information that goes beyond single traits due to the way they combine. This is exemplified by a genome wide association study (GWAS) of our study group, where, for example, out of 678 genetic variants robustly associated with PC1 (p-value < 5 × 10-8), 21 variants were not previously linked to any of the six anthropometric traits [30]. Such additional genetic variation may translate into phenotypic differences that could also uncover novel mechanistic pathways explaining adiposity-cancer associations. However, the interpretation of the body shapes is less straightforward. To facilitate interpretation of the PCs of the body shapes, we provided the arithmetic means of each anthropometric trait among participants in the top and bottom 5% percentiles for the four body shape phenotypes, as well as the variation of these traits in the population (1 SD) (Supplementary Figs. 36). For PC1, for example, the difference in BMI between the top and bottom percentiles was 20 kg/m2, corresponding to 4 times the SD (5.1 kg/m2) in the study population. Similarly, there were significant differences in weight (4 times the SD), WC (3.9 times the SD), HC (3.9 times the SD), and WHR (2 times the SD). By contrast, the difference in height was 3 cm, corresponding to half of the SD in the study population. For PC2, the difference between the bottom and the top corresponded to 3.7 SD increment in height, 1.1 SD increment in weight and 1 SD increment in HC, while BMI, WC and WHR decreased by ≤ 1 SD.

Altered sex hormone metabolism is a main biological mechanism that could link excess adiposity with postmenopausal BC risk through increased aromatase enzyme activity in peripheral adipose tissue known as aromatization [31, 32]. Aromatization leads to increased levels of bioavailable sex hormones including testosterone, which may induce breast carcinogenesis [31]. Several epidemiological studies, including in UK Biobank, reported an increased BC risk associated with elevated blood levels of testosterone [15, 17]. Our study is in accordance with these findings and in addition, our mediation analysis supports the hypothesis that testosterone links general adiposity (i.e., body shape PC1) with postmenopausal BC risk.

There is strong evidence that higher IGF-I levels are associated with a greater risk of BC [33, 34]. Our findings for postmenopausal BC risk are congruent with this evidence (Supplementary Table 5). However, whether IGF-I links adiposity to BC is debated [31]. Levels of IGF-I increase only to a BMI of approximately 27 kg/m2, thereafter declining with increasing weight, and in individuals with overweight, who intentionally lose weight, IGF-1 levels tend to increase [31]. Our mediation analysis may shed some light on this uncertainty. While we did not observe a significant pure indirect effect through IGF-1, there was a reference interaction between PC1 and IGF-1, i.e., jointly increasing the risk of BC (‘joint effect’) beyond their individual risk associations. In addition, there was a mediated interaction between PC1 and IGF-1 leading to a negative overall proportion mediated of 12% (Table 3).

In contrast, we estimated a small but clearcut mediation of 2.8% (i.e., proportion of pure indirect effect) through IGF-1 for the association between PC2 (tall, low WHR) and BC risk. IGF-1 signaling is well known to induce expression of several oncogenes and high concentrations of IGF-1 are associated with increased risk of BC [35]. Because several genetic variants related to the IGF signaling pathway are related to height, height might be a crude anthropometric marker of early-life IGF-1 exposure [35]. Our results confirm that a proportion, albeit small, of the height-BC relationship is mediated through IGF-1.

We identified a second molecular pathway linking PC2 with BC risk, which was through SHBG. Since the height component of this body shape was positively associated with SHBG, and in turn SHBG was inversely associated with the risk of BC, the mediated proportion was negative (‘pure indirect effect’: -6%). This is in line with evidence that higher levels of SHBG lead to lower levels of bioavailable testosterone and estrogens, and thus a lower BC risk via this pathway [15, 16]. A meta-analysis of 26 prospective studies showed that high SHBG levels were significantly associated with decreased risk of BC in postmenopausal women, the pooled RR for BC comparing the highest vs. lowest categories of SHBG was 0.64 (95% CI: 0.57–0.72) [36]. In a recent study from UK Biobank, SHBG was inversely associated with BC risk in postmenopausal women [16]. The mediating effect of SHBG in the association between adiposity and BC development has not been investigated in other studies yet. However, SHBG has been reported to mediate a small proportion of the relationship between BMI and endometrial cancer risk (7%) [37], and the alcohol-BC association (13%) [38].

Our findings suggest differential pathways linking these body shapes with BC risk. Indeed, PC1 may have similar pathways as obesity measured by BMI, while PC2 may have different biological/metabolic pathways involved in carcinogenesis of BC, more comparable to height.

There have been many advances in mediation analysis methodology over the years, with various methods of mediation analysis reported in the literature. One of the novelties of this study is that we considered both the mediation and interaction pathways simultaneously using one of the most recent approaches to causal mediation analysis: “the four-way decomposition”. This approach allows us to estimate both the pure indirect and interaction effect, as compared to previous studies using conventional approaches for mediation analysis which may have missed mediated interaction effects. The present study is also the first to investigate the potential mediator role of several biomarkers of metabolic health in the association between body shape phenotypes and BC development, as compared to previous studies investigating classical anthropometric parameters (mainly BMI or WHR). Finally, we accounted for confounding by other biomarkers. Limitations of this study included the lack of data on hormone receptor status since the relationship between obesity and BC risk may differ according to estrogen or progesterone receptor. We lacked sufficient individuals with oestradiol measurements to investigate this sex-hormone as potential mediator which has been both related to obesity and BC development. Also, we could not rule out the possibility of unmeasured confounding. Lack of representativeness of the study sample for the general population is acknowledged.

In the context of two or more mediators explaining an exposure-outcome association, as for example in the current study, where levels of testosterone and IGF-1 explained each about 10% of the body shape PC1-BC association, it is conceivable that mediators also interact with each other. This requires that the 4-way decomposition is extended to accommodate such a potential mediator-mediator interaction [39] which, however, was beyond the scope of the current study. Alternatively, future studies could consider dimension reduction approaches, such as PCA, to derive one or more patterns from multiple correlated biomarkers, and subsequently use these in a mediation framework.

5 Conclusion

In summary, although the direct effects of body shape phenotypes on BC risk are high, our findings are consistent with a possible mediated effect by testosterone and IGF-1 for the body shape PC1 and BC development association, while IGF-1 and SHBG may have a role in the association between body shape PC2 and BC development. As the mediating biomarkers may interact with each other, future studies should also consider a decomposition of the total effect in the presence of multiple mediators, particularly by family of biomarkers.