figure b

Introduction

Cardiometabolic diseases (CMDs) are among the top ten causes of death and are associated with increased healthcare costs globally, making their relationship with adiposity a major public health concern [1,2,3,4]. Excess adiposity is associated with increased risk of CMDs, as well as increased risk of all-cause mortality [5,6,7,8]. The BMI category at lowest risk of early death is 20–25 kg/m2 in populations of European ancestry, with average health worsening significantly within the ‘overweight’ category, and deteriorating further as BMI increases [9]. Both observational studies and some causal inference studies suggest that BMI has a J-shaped relationship with all-cause and cardiovascular mortality [5, 10].

Observational studies often suffer from residual confounding and reverse causality, as do the relational shapes they describe. For example, the high mortality rate observed in some people with lower BMI (J-shaped relationship) is probably caused by the chronic disease cachexia [11]. While causal relationships between adiposity and CMDs have been determined previously, most studies assume these relationships are linear [12,13,14,15]. In addition, observational studies have shown that sex confers differential CMD risk profiles in men and women, but extensive investigation of such differences within a causal framework is lacking [16,17,18,19]. Therefore, understanding the nature of causal relationships between excess adiposity, CMDs and any sex differences therein may help to refine public health interventions [20].

Patterns of causal associations between excess adiposity and cardiometabolic outcomes remain understudied; given the shapes reported in observational studies, we hypothesised that adiposity has non-linear causal effects on cardiometabolic outcomes, with sex differences within this causal framework. The purpose of this study was to elucidate the nature of causal effects and explore the sex differences in the effects of BMI on CMDs (coronary artery disease [CAD], type 2 diabetes, chronic kidney disease [CKD], stroke and hypertension). We further extended these investigations to risk factor biomarkers: glycaemic markers (glucose, HbA1c), lipids (triacylglycerols, total cholesterol, LDL-cholesterol and HDL-cholesterol), lipoprotein A (LPA), urea and BP.

Methods

Population

We used individual-level data from the UK Biobank, a cohort of approximately 500,000 participants of mixed ancestries assessed across 22 centres in the UK. For this study, we selected individuals of white European descent only (n=409,584). In summary, participants aged 40–69 years were enrolled between 2006 and 2010, and standard anthropometric measurements were taken, in addition to biological samples (urine, blood and saliva); socio-demographic, lifestyle and other health determining factors were recorded. The UK Biobank study received approval from the Multi-centre Research Ethics Committee (reference 16/NW/0274), and all participants gave informed consent [21]. Information about recruitment and data collection has been provided elsewhere [22]. The current analysis is based on application number 57232 to the UK Biobank resource. Use of UK Biobank data for the analysis described here was approved by the Swedish Ethics Approval Authority (application number 2021-03174).

Outcome variables

Disease outcomes

For each of the disease outcomes (type 2 diabetes, hypertension, stroke, CAD and CKD), information was obtained from ICD-9 (http://www.icd9data.com/2007/Volume1/default.htm) and ICD-10 (http://apps.who.int/classifications/icd10/browse/2016/en) diagnosis codes for both prevalent and incident disease, self-reported diagnoses, enrolment interview reports and self-reported medication data. We excluded participants whose reported age of type 2 diabetes diagnosis was 20 years old or less, as it was deemed to be probable type 1 diabetes. For cardiovascular disease (CAD, stroke and hypertension) and CKD, we additionally used information regarding surgical operations, plus interventional procedures related to each disease from OPCS4 codes (https://classbrowser.nhs.uk/#/), self-reported surgical procedures, and details of vascular diseases diagnosed by a doctor, which contained specific coding for each disease outcome. Additional information was obtained from medication data for both men and women, based on self-reported data collected at enrolment. Details of codes and data fields used for each disease are provided in electronic supplementary material (ESM) Table 1.

Disease-risk biomarkers

The disease-risk biomarkers that we included were glucose, HbA1c, triacylglycerols, cholesterol (total, HDL and LDL), urea and LPA, plus systolic BP (SBP) and diastolic BP (DBP). This information was obtained from the blood biochemistry categories of the UK Biobank, details of which are provided in ESM Table 2. For BP, we added 15 and 10 mmHg, respectively, to the values for SBP and DBP in participants taking BP medication [23].

Genetic data

Details of enrolment and genetic data handling have been extensively explained by Bycroft et al [22]. For this project, we used version 3 of the imputed genotypes data from the UK Biobank. We excluded SNPs and individuals with a genotype call rate <99%, SNPs with a Hardy–Weinberg equilibrium p value <1×10−10, those with an imputation score <80%, any duplicated SNPs, and SNPs with a minor allele frequency <0.01. Using quality control results provided by UK Biobank, we further excluded individuals deemed outliers for heterozygosity (indicating poor sample quality or contamination), those with sex ambiguity and aneuploidy, and one of any pair of related individuals (up to third-degree relatedness, kinship coefficient 0.0442–0.0882). After further exclusion of participants with missing anthropometric measurements or HbA1c beyond detectable ranges (>184 mmol/mol or 19%), our final sample comprised 333,582 individuals (ESM Fig. 1).

Computing the BMI polygenic risk score

We used genome-wide association study (GWAS) summary statistics from the latest GIANT meta-analysis of BMI GWASs (excluding participants from the UK Biobank), and selected only genetic variants that were associated with BMI at a genome-wide significance level (p5×10−8): n=1560 SNPs [24]. Individual genetic data were obtained from the UK Biobank. A BMI polygenic risk score (PRS; PRSBMI) was calculated by weighting each SNP by its effect size from GWAS summary data and then summing these values for all SNPs for each individual in our sample. Prior to PRSBMI calculation, clumping restricted to r2=0.2 and a 250 kb window was performed to ensure that only SNPs that are not in linkage disequilibrium were used. After this quality control step, there were 89 uncorrelated BMI SNPs available for use in generating the PRSBMI. All PRSBMI calculations were performed using PRSice-2 software [25]. To reduce the chances of horizontal pleiotropy between PRSBMI and the various diseases and risk factors, we selected BMI SNPs specific to each trait. This was done by excluding any SNPs that were associated with the respective trait at genome-wide significance from the BMI SNPs by comparing with GWAS summary data for the trait. We then computed a trait-specific PRSBMI (for instance, a PRSBMI for CAD analysis that used BMI SNPs that were not associated with CAD) for use in downstream analyses involving that specific trait.

Statistical analysis

Causal effect assessment

We used two-stage least-squares (2SLS) Mendelian randomisation (MR), with PRSBMI as the genetic instrumental variable, to estimate causal effects of BMI on cardiometabolic traits. Prior to analysis, BMI was transformed in the same way as in the discovery GWAS by Locke et al [24]. Specifically, the effects of age, age squared, smoking status, alcohol consumption, UK Biobank assessment centre and the Townsend Deprivation Index were regressed out separately for men and women. Residuals from each of the models, men and women, were then inverse normal-transformed to create a main exposure variable representing BMI.

In the first stage, the exposure was regressed on the PRSBMI in a linear model, adjusting for genotyping array and the first ten genetic principal components characterising the population substructure. Thereafter, fitted values were generated and used in the second stage of 2SLS, where logistic and linear regression models were used for binary and continuous traits respectively, with the fitted values as the exposure, adjusting for the same covariates as in the first stage. The regression coefficients of these fitted values in the second stage represent an estimate of the causal effect of BMI on the outcome [26]. We ran 2SLS models for each disease outcome and each biomarker, and also performed sex-stratified analyses.

Continuous outcomes were scaled so that the results represent a change in SD units of outcome per unit change in BMI. Cochran’s Q test was used to assess sex differences in the sex-stratified analysis. To estimate the causal effect of BMI on any CMD, we used both fixed and random effects meta-analysis, and considered the combined outcome as the likelihood of any CMD. We performed 15 main hypothesis tests (for the five disease outcomes and ten disease-risk biomarkers) and 30 sex-stratified tests; therefore, the Bonferroni-corrected significance level was set at p=0.001 (0.05/45).

Determining the shape of the causal relationships

To describe the shape of the causal relationships between BMI and each of the traits, we used a non-linear MR fractional polynomials method [27]. The method involves calculating the local average causal effect (LACE) in quantiles based on the instrumental variable-free distribution of the exposure. These LACE estimates are then meta-regressed against the means of the exposure in each quantile, and tests of non-linearity are applied to test the null hypothesis that the resultant non-linear model is no different from a linear model. We used 100 quantiles in each assessment, as a high number of quantiles provides greater resolution in fractional polynomials [27]. To obtain a deeper understanding of causal shapes, we also performed piecewise MR to investigate whether any of the relationships had a threshold effect and to confirm the results of the non-linearity tests. Unlike fractional polynomial MR, this method does not smooth over the different quantiles. Instead, it fits a linear model in each quantile, with the slope representing the LACE. For each trait, we also conducted sex-stratified analysis. All analyses were performed using R software version 3.6.2 (https://www.R-project.org/).

Sensitivity analyses

To address potential bias due to extreme values, varying incompleteness of phenotype data (e.g. LPA) and effects of factors such as menopause and waist–hip ratio, we performed several sensitivity analyses as follows: (1) using complete cases only; (2) excluding outliers of BMI, defined using Tukey’s lower and upper fences [28]; (3) including residuals from the first stage in the second stage (two-stage residual inclusion, 2SRI); (4) adjusting for lipid-lowering medication and waist–hip ratio; (5) excluding premenopausal or postmenopausal women, stratifying women by menopause status (self-reported or by age cut-point of 55 years), and stratifying both men and women by age; and (6) using a G-estimator method [29] to calculate causal estimates. All sensitivity analyses were also sex-stratified where applicable.

In 2SRI, the residuals are included as a control function to minimise bias of the standard 2SLS, especially when the effect measure is non-linear. The G-estimator gives a consistent estimate of the causal effect that varies the least. The causal effect estimates obtained using these methods should therefore not differ substantially from each other. We finally used two-sample MR to assess bidirectional causation.

Results

Participants’ characteristics are shown in Table 1. The dataset included slightly more women (n=179,522, 53.8%) than men (n=154,060, 46.2%). On average, women had slightly lower BMI (27.0±5.1 kg/m2) compared with men (27.8±4.2 kg/m2). There was no difference in the mean age, men 57.1±8.1 years, women 56.7±7.9 years. Men had higher baseline mean BP (SBP=145±19.4 mmHg; DBP=86.6±11.0 mmHg) compared with women (SBP=138±21.2 mmHg; DBP=82.4±11.1 mmHg), and a higher prevalence of CMDs (e.g., 67.4% in men vs 32.6% in women for CAD). Differences in anthropometric measures and disease prevalence persisted across age groups (ESM Figs 2 and 3).

Table 1 Participant characteristics (n=333,582)

In the 2SLS analyses, BMI was associated with type 2 diabetes (OR 3.10; 95% CI 2.73, 3.53; p=1.38×10−67), hypertension (OR 1.53; 95% CI 1.44, 1.62; p=8.92×10−44) and CAD (OR 1.20; 95% CI 1.08, 1.33; p=6.86×10−4), but not CKD (OR 1.08; 95% CI 0.67, 1.72; p=0.76) or stroke (OR=1.08; 95% CI 0.92, 1.28; p=0.34) (Table 2).

Table 2 Estimates of causal relationships between BMI and cardiometabolic outcomes using 2SLS MR in the UKB

For disease-risk biomarkers (coefficients expressed in SD units), urea (β=0.05; 95% CI 0.01, 0.08; p=0.01) and LPA levels (β=0.02; 95% CI −0.02, 0.05, p=0.31) were not significantly associated with BMI, after correcting for multiple testing (pBonferroni=0.001). A positive causal effect of BMI was observed for glucose (β=0.16; 95% CI 0.13, 0.20; p=4.90×10−24), HbA1c (β=0.22; 95% CI 0.19, 0.26, p=2.30×10−34), and triacylglycerol levels (β=0.13; 95% CI 0.09, 0.16, p=2.38×10−13). BMI had an inverse causal relationship with total cholesterol (β=−0.18; 95% CI −0.21, −0.14, p=1.37×10−24), LDL-cholesterol (β=−0.10; 95% CI −0.14, −0.07, p=9.59×10−10) and HDL-cholesterol (β=−0.26; 95% CI −0.30, −0.22, p=4.36×10−35). The effect of BMI on DBP variation (β=0.15; 95% CI 0.12, 0.19, p=1.30×10−18) was almost twice the effect on SBP variation (β=0.09; 95% CI 0.06, 0.12, p=2.31×10−7) (Table 2).

Sex-stratified analyses

As shown in Table 2, the causal effect of BMI on CAD in women was not statistically significant (OR=0.97; 95% CI 0.81, 1.18, p=0.78), but it was in men (OR=1.30; 95% CI 1.15, 1.47, p=2.55×10−5) (p value for sex difference=0.01; however, this was not significant after accounting for multiple testing, p<0.001, Table 3). No significant differences between sexes were observed for the causal effects of BMI on type 2 diabetes, stroke, hypertension or CKD.

Table 3 Cochran’s Q test of the difference between men and women for causal effects of BMI on cardiometabolic traits

Of the biomarkers, LDL-cholesterol was not significantly associated with BMI in women (β=−0.05; 95% CI −0.09, 0.00, p=0.05), but it was in men (β=−0.17; 95% CI −0.21, −0.12, p=4.79×10−11), and this sex difference was significant even after adjusting for multiple testing (pBonferroni=0.001). The causal effect of BMI on total cholesterol in men (β=−0.23; 95% CI −0.28, −0.18, p=3.96×10−19), was almost double the effect seen in women (β=−0.13, 95% CI −0.18, −0.08, p=5.13×10−8), but the sex difference did not persist after correcting for multiple testing (pBonferroni=0.001). In men, urea was not significantly associated with BMI (β=0.02; 95% CI −0.03, 0.07, p=0.37); however, a positive association was observed in women (β=0.06; 95% CI 0.02, 0.11, p=8.70×10−3), although this was not significant after Bonferroni correction. BMI was associated with DBP in both men and women, but was associated with SBP in men only (Tables 2 and 3).

Effect of BMI on any cardiometabolic disease outcome

In combined meta-analysis of the causal effect sizes of BMI on CMD, BMI was significantly associated with increased causal odds of any CMD (fixed effects OR 1.55; 95% CI 1.48, 1.62; p=8.23×10−78; random effects OR 1.48; 95% CI 1.00, 2.19; p=0.05). In men, BMI was also causally linked to any CMD using both fixed and random effects, but in women this association was only significant when considering fixed effects (Fig. 1).

Fig. 1
figure 1

Forest plots of a summary meta-analysis combining the causal effect estimates of BMI on CMDs in (a) men, (b) women, and (c) all participants. The common outcome in both fixed and random effect lines represents any CMD. T2D, type 2 diabetes

2SLS sensitivity analyses

In the combined analyses, results did not differ across the three methods used to estimate causal effects (2SLS, 2SRI and G-estimator) for any trait except BP, HbA1c and LPA levels, where use of the G-estimator gave larger effect sizes with wider 95% CIs (ESM Tables 3 and 4). Adjusting for lipid-lowering medication or waist–hip ratio did not materially change the results in either the main analysis or when excluding outliers for BMI (ESM Figs 4 and 5; ESM Tables 5 and 6).

A sex difference in effects of BMI on hypertension was observed when comparing men to premenopausal women, but this was not significant after accounting for multiple testing (pBonferroni=0.001). Significant sex differences were observed for the relationship between BMI and LDL-cholesterol after multiple testing correction, but not when comparing men to postmenopausal women (ESM Table 7). In the age-stratified analyses (i.e., <55 years or 55 years and above), BMI was associated with CAD across all groups in men and in premenopausal (self-reported) women only. The causal effect of BMI was statistically significant in all groups for hypertension and type 2 diabetes, but not stroke or CKD (ESM Fig. 6 and ESM Table 8). Analyses performed to assess bidirectional causation did not yield results supporting such relationships. The association between SBP and BMI had a null effect size, while that between DBP and BMI suffered from horizontal pleiotropy (ESM Table 9).

Shapes of causal relationships

From the non-linear MR fractional polynomials (FP), there was evidence to support a non-linear causal effect of BMI on CAD and type 2 diabetes. In both cases, a quadratic model was better than a linear one (pQuadratic=0.02 and 0.004, respectively). For type 2 diabetes, a fractional polynomial of one degree was a better fit than a linear model (pFP=0.01). CAD and type 2 diabetes showed LACE heterogeneity across quantiles (PHet=0.03 and 0.003, respectively). In men, the causal effect of BMI was non-linear in type 2 diabetes, with a quadratic model being better than a linear model (pQuadratic=0.02). In women, non-linear models were a better fit of the data for CAD, type 2 diabetes and hypertension (pCochranQ=0.002, 0.02 and 0.02, respectively). This evidence did not pass the multiple testing threshold (pBonferroni=0.001). However, in the stratified analyses, triangulated evidence suggested BMI had a non-linear causal association with CKD in both men and women (see Fig. 2).

Fig. 2
figure 2

Plots showing the estimated shapes of the causal relationships between BMI and CMDs in combined analyses (ae), men (fj) and women (ko). Shape estimates are derived from the function of fractional polynomials that best fits the data. The solid black line represents the function curve, the blue band represents 95% CI, the red dot represents the reference BMI of 25 kg/m2, and the dashed line represents the null effect size. The plots have been cropped to depict estimated causal associations for BMI up to 40 kg/m2 and OR up to 3.0 for ease of comparison. The uncropped plot is presented in ESM Fig. 7. HTN, hypertension; T2D, type 2 diabetes

There was no statistically significant evidence to support a non-linear causal relationship between BMI and LPA, DBP, SBP and urea after correcting for multiple testing (pBonferroni=0.001), but the results were significant for glycaemic and lipid biomarkers. In men, there was statistically significant evidence, accounting for multiple testing, to support a non-linear causal association between BMI and HbA1c, HDL-cholesterol, LDL-cholesterol, triacylglycerols and total cholesterol, a pattern that was also observed in women (see Tables 4 and 5, Figs 3 and 4, and ESM Figs 79). The piecewise MR results were consistent with these results; however, interpretation of the plots can be difficult, especially at the tails of the effect estimate distribution, where the linear segments are unrestricted and thus extrapolate to the most extreme values (ESM Fig. 10).

Table 4 Tests for shapes of causal relationships between BMI and cardiometabolic phenotypes
Table 5 Tests for sex-stratified shapes of causal relationships between BMI and cardiometabolic phenotypes
Fig. 3
figure 3

Plots showing estimated shape of the causal relationships between BMI and selected cardiometabolic biomarkers in combined analyses (ae), men (fj) and women (ko). Shape estimates are derived from the function of fractional polynomials that best fits the data. The solid black line represents the function curve, the green band represents 95% CI, the red dot represents the reference BMI of 25 kg/m2, and the dashed line represents the null effect size. The plots have been cropped to depict estimated causal associations for BMI up to 40 kg/m2 and β between −2 and 2 for ease of comparison. The uncropped plot is presented in ESM Fig. 8. GLU, glucose; HDL, HDL-cholesterol; LDL, LDL-cholesterol; T_CHOL, total cholesterol

Fig. 4
figure 4

Plots showing estimated shape of the causal relationships between BMI and selected cardiometabolic biomarkers in combined analyses (ae), men (fj) and women (ko). Shape estimates are derived from the function of fractional polynomials that best fits the data. The solid black line represents the function curve, the green band represents 95% CI, the red dot represents the reference BMI of 25 kg/m2, and the dashed line represents the null effect size. The plots have been cropped to depict estimated causal associations for BMI up to 40 kg/m2 and β between −2 and 2 for ease of comparison. The uncropped plot is presented in ESM Fig. 9. TG, triacylglycerol

Discussion

In this study, we investigated the shapes of causal relationships between BMI, CMDs and biomarkers of disease risk. We further investigated sex differences within the causal framework, and estimated the causal effect of BMI on each CMD studied. The estimates from combined analyses showed that BMI is significantly associated with type 2 diabetes, CAD and hypertension, but not CKD or stroke; it is also associated with all assessed biomarkers except LPA and urea levels after controlling for multiple testing. In men, BMI associations mirrored those of the unstratified analyses, but BMI was not causally associated with CAD, LDL-cholesterol or SBP in women. BMI was causally associated with increased odds of any CMD in both sex-combined and sex-stratified analyses, when assuming fixed effects. When assuming random effects, the association in women was no longer significant. Sex differences persisted for causal effects of BMI on LDL-cholesterol only (with threefold attenuation of effect towards the null in women) after correcting for multiple testing. In investigations of non-linearity, after triangulation, the data support non-linear causal relationships between BMI and glucose, HbA1c, HDL-cholesterol, LDL-cholesterol, triacylglycerols and total cholesterol (corrected for multiple testing). In sex-stratified analyses, triangulated evidence supported a non-linear association between BMI and CKD, HbA1c, HDL-cholesterol, LDL-cholesterol, triacylglycerols and total cholesterol in both men and women.

Causal associations between excess adiposity and cardiometabolic health have been reported previously, with results that are largely consistent with ours [13, 14, 30]. In our analysis, BMI was inversely associated with all cholesterol types and directly associated with triacylglycerol levels. This may reflect dyslipidaemic obesity, characterised by high levels of triacylglycerols and NEFAs, decreased HDL-cholesterol with HDL dysfunction (a shift towards proinflammation and altered reverse cholesterol transport), and normal or slightly increased LDL-cholesterol, attributed to altered metabolism favouring hypertriglyceridaemia [31]. One study assessed sex differences for causal effects of BMI in leading causes of death including cardiometabolic diseases such as type 2 diabetes, CAD and stroke [15]. In that study, BMI was causally related to the three diseases in men and women; the relationship with type 2 diabetes, but not CAD or stroke, varied by sex. In our study, BMI was not associated with stroke, and sex differences in type 2 diabetes were not replicated; the inconsistent findings between these studies may reflect our decision not to use sex-specific SNP effect estimates.

We found that BMI was associated with CAD in men but not all women. While some reasons for such findings may include weak instruments or violations of MR assumptions (conditional restriction), the ‘weak instrument test’ was not suggestive of weak instruments in our case (statistic=3015.08, p<2×10−16), and the Durbin–Wu–Hausman test supported the instrumental variable analysis as more consistent (p=1.04×10−6) than the ordinary least-squares regression. It is possible that a BMI PRS that is weighted using effect sizes from combined GWASs may not fully capture general adiposity in women, or that general adiposity itself is a poor predictor of CAD in all women. However, when women were grouped by menopause status, BMI was found to be significantly associated with CAD in premenopausal women only. This possibly points to detrimental effects of excess adiposity, which nullify the ‘protective effects’ of sex hormones [32]. In this cohort, premenopausal women with obesity had more than twice the prevalence of CAD compared with their non-obese counterparts (ESM Table 10). The null association observed in postmenopausal women may reflect dampened effects of general adiposity (or the presence of other competing/stronger risk factors) on CAD risk in this group. In two prospective studies of postmenopausal women, central or truncal obesity was associated with CAD/cardiovascular disease risk, but not general adiposity [33, 34]. We did not investigate different adiposity phenotypes, and the relationship between such phenotypes and CAD warrants further investigation within a causal framework.

Sexual dimorphism in lipid metabolism and the pathophysiology of CMDs is well-established [35,36,37]. For example, obesity tends to peak about 10 years earlier in men (50–54 years) compared with women (60–64 years). Even at the same BMI and age or fitness levels, men have a worse cardiometabolic health profile despite women having higher fat mass and lower skeletal muscle mass [9, 35, 37, 38]. Women tend to store excess lipids in subcutaneous adipose tissue (which is considered to be protective against CMDs), especially in the gluteal–femoral region, while in men excess fat is more centrally distributed in the visceral adipose tissue (which increases risk of CMDs). These differences are diminished when perturbations in oestrogen levels occur, as in the menopause (low levels) or when taking oral contraceptives (supraphysiological levels) [37, 39].

While the observed sex differences do not stand after correcting for multiple testing, except for LDL-cholesterol, they are worth considering given the documented role of sexual dimorphism in energy homeostasis and cardiometabolic health. Further, despite mixed results from studies, sexual dimorphism may have implications for weight loss interventions in men and women with different levels of metabolic health [40,41,42,43,44]. Complications during pregnancy, such as gestational diabetes and pre-eclampsia, confer additional risk for CMDs in women. Furthermore, from our results, excess adiposity appears to be detrimental to women both pre- and post menopause, while men have a higher burden of CMD at an earlier age compared with women of similar BMI. Such differences may have clinical implications. For men, screening for CMD at an earlier age and at a lower BMI threshold could identify people predisposed to CMD earlier, who would benefit from timely interventions. In women, targeted screening for CMD should take into consideration obesity in premenopausal women.

Non-linear MR has been previously used to assess causal relationships (e.g., the effect of alcohol on cardiovascular disease [45] or BMI on socioeconomic status [46]), but there is a dearth of literature on the nature of the causal effects of BMI on cardiometabolic health between the sexes. In one study focused on CKD, BMI was found to be causally associated with CKD using summary data MR, with evidence of non-linearity in the UK Biobank [47]. We found such an association in sex-stratified analyses only. This may be partly explained by our selection of CKD cases, which was more detailed than the previous study (ESM Table 1). Determining the shapes of causal associations may help estimate the relative benefits of interventions at different levels of exposure. For instance, lowering BMI from 40 to 25 kg/m2 would result in an approximately twofold decrease in the causal risk of type 2 diabetes or a 50% decrease in the causal risk of CAD, respectively (Fig. 2). Use of causal estimates could therefore provide a powerful tool for public health decision-making.

Causal inference studies using MR attempt to give an unbiased estimate of a causal effect of a given exposure on an outcome of interest, provided that the assumptions of MR are not violated, and the instruments explain sufficient variance in the exposures and/or outcomes of interest. To mitigate potential bias, we specifically used SNPs generated from GWASs that did not overlap with the UK Biobank, as the latter dataset was used in our primary analyses. We also performed sensitivity analyses to assess whether the results would change, and chose SNPs unrelated to each specific outcome to mitigate chances of horizontal pleiotropy. Although other problems of MR, such as canalisation, cannot be formally assessed, we believe that the estimates provided in this study offer a glimpse into the differences in causal effects of BMI on CMD between men and women, supporting further investigation.

Strengths

In this study, we used MR, which offers a powerful alternative to assess causal relationships between exposures and outcomes of interest [48]. Conventional MR methods assume a linear relationship to estimate the population-averaged causal effect; however, we tested those linear assumptions to offer better insights for formulating public health policies and interventions [49]. Further, we used both fractional polynomials and piecewise linear MR to triangulate the evidence. We also had the advantage of a large sample size from the UK Biobank.

Weaknesses

We used BMI as our sole measure of adiposity. BMI does not account for differential adiposity, nor is it a reliable measure of relative adiposity across different populations or ethnicities, making it hard to generalise the findings. However, BMI has been shown to be a reliable population-level measure for assessing general adiposity. MR faces challenges of horizontal pleiotropy and canalisation. While there is no formal method to test the latter, we selected BMI SNPs that were not associated with each respective outcome assessed, hence reducing the chances of horizontal pleiotropy. We also could not rule out methodological limitations, in that there may be other shapes, unavailable to us, that better fit these data.

Conclusion

In this analysis, BMI was found to be causally associated with increased risk of type 2 diabetes, CAD and hypertension, but not stroke or CKD, and was also associated with variation in disease-risk biomarkers, except LPA and urea. Further, BMI was causally associated with any CMD when considering fixed effects, in combined and sex-stratified analyses. We found evidence in support of a non-linear causal relationship between BMI and glycaemic and lipid biomarkers, except LPA. The adverse consequences of BMI on CAD risk are similar in men and premenopausal women. However, although BMI continues to confer increased CAD risk in men, it seems to be no longer a strong risk factor in postmenopausal women. These results further our understanding of the complex nature of the causal relationships between BMI and CMD. It also highlights the role of sex in CAD and lipid and glucose homeostasis in the context of causal risk conferred by excess adiposity, and underscores the need for consideration of sex in the management of excess adiposity. Finally, reducing excess adiposity remains highly beneficial in improving energy and lipid metabolism, as well as reducing the risk of CMD.