Background

Polycystic ovary syndrome (PCOS) is the most common endocrinopathy in women of childbearing age, characterized by hyperandrogenism, menstrual disorders, and polycystic ovarian changes [1, 2]. It was estimated that in 2019, approximately 66.0 million individuals were affected by PCOS worldwide and the overall prevalence rate was 829.6 per 100,000 [3]. Patients with PCOS may not only have a higher rate of infertility but also have an increased risk of developing diabetes and cardiovascular diseases, which poses a great challenge to public health management [2].

While the exact cause and development of PCOS remain unclear, evidence suggests that a combination of genetic and environmental factors significantly contribute to its etiology and pathogenesis. Established risk factors for PCOS encompass genetic susceptibility, obesity, and insulin resistance [4,5,6,7]. Sex hormone binding globulin (SHBG) acts as a carrier for sex hormones, binding with both testosterone and estrogen, and holds a crucial role in various physiological and pathological contexts [8]. Observational studies have consistently shown lower circulating SHBG levels in PCOS patients when compared to healthy controls. For instance, in a study involving 200 PCOS patients and 200 controls, peripheral SHBG levels were significantly higher in the control group than in the PCOS group [9]. Another study, which included 585 women with PCOS and 171 controls, similarly observed lower serum SHBG levels in PCOS patients compared to controls [10]. However, since findings from traditional observational epidemiological studies are susceptible to bias such as confounding and reverse causation, it remains unclear whether the observed association was causal or not.

Mendelian randomization (MR) is a genetic epidemiological approach that utilizes single nucleotide polymorphisms (SNPs) strongly linked to the exposure as instrumental variables (IVs) to assess potential causal relationships between the exposure and the outcome [11]. Since genotypes are assumed to be randomly distributed during gamete formation, the application of instrumental variable models effectively addresses the issue of confounding in observational studies, particularly the potential bias caused by unmeasured confounding variables in causal inference [12, 13]. Furthermore, as genotypes precede the onset of disease, MR studies can effectively mitigate the issue of reverse causation. Consequently, in this study, we conducted a two-sample MR investigation to explore the potential link between circulating SHBG levels and the risk of PCOS.

Methods

Selection of genetic variants

The overall design of this study is shown in Fig. 1. The IVs for serum SHBG were obtained from a recent genome-wide association study (GWAS) involving 425,097 individuals of European descent [14]. A total of 305 independent SNPs (r2 threshold < 0.001, kb = 10,000) associated with circulating SHBG levels at genome-wide significance level (P < 5 × 10−8) were identified. Of these, 16 were not present in the primary outcome dataset. Consequently, we ultimately utilized 289 SNPs associated with circulating SHBG concentrations as instrumental variables (IVs) in the subsequent MR analyses. The details of GWAS studies and datasets used in the present study are listed in Supplementary Table 1, and the detailed information of the selected SNPs used as IVs is displayed in Supplementary Table 2.

Fig. 1
figure 1

An overall design of the present study. Abbreviations: IVW, inverse-variance weighted; GWAS, Genome-wide association study; MR, Mendelian randomization; MR-PRESSO, MR pleiotropy residual sum and outlier test; PCOS, polycystic ovary syndrome; SHBG, sex hormone binding globulin; SNP, single nucleotide polymorphism

Furthermore, as body mass index (BMI) has been shown to be correlated with both circulating SHBG levels and PCOS risk, we selected 462 independent SHBG-associated SNPs (with an r2 threshold < 0.001 and kb = 10,000) from the same GWAS meta-analysis, after adjusting for BMI (P < 5 × 10−8). Among them, 22 SNPs were not available in the outcome dataset. Finally, 440 SNPs associated with circulating SHBG concentrations adjusted by BMI were used as IVs in the primary analyses. The detailed information of the selected SNPs used as IVs is displayed in Supplementary Table 3.

Outcome dataset

The genetic association data for PCOS were sourced from a GWAS meta-analysis, which encompassed 10,074 cases and 103,164 controls of European ancestry [15]. This represents the most extensive GWAS meta-analysis of PCOS to date, incorporating participants from seven cohorts, including Rotterdam, EGCUT, deCODE, UK (London/Oxford), Chicago, Boston, and 23andMe [15]. However, due to unavailability of data from 23andMe, the final study population for this investigation comprised 4,138 PCOS patients and 20,129 controls [15]. Comprehensive details of these studies have been previously documented in published articles, and all participating studies received approval from their respective ethical committees. Furthermore, we selected another summary data associated with PCOS involving 1,424 cases and 200,581 controls from the Finland consortium (https://www.finngen.fi/fi), which collected and analyzed genetic and health related data from about 500,000 participants to replicate our findings.

Statistical analysis

We first calculated F-statistics to quantify the strength of the IVs, with the equation of F = beta2 /se2 [16]. A F-statistic greater than 10 suggests the IVs are unlikely to suffer from weak instrument bias.

Our primary analysis employed the inverse-variance weighted (IVW) method to investigate the potential causal association between circulating SHBG concentrations and PCOS risk. This method initially derives causal effect estimates from individual genetic instrumental variables, namely, the effect estimates of each SNP on both SHBG and PCOS. These estimates are then aggregated through meta-analysis to yield a consolidated causal effect estimate [17]. The estimated value from the IVW method is essentially equivalent to the regression coefficient in weighted regression with zero intercept term [18, 19]. In addition, we conducted a series of alternative MR methods to analyze the effect of potential pleiotropy on causal estimation. For instance, MR-Egger regression was used to evaluate influence of potential directional pleiotropy. In MR-Egger regression analysis, the intercept term signifies the average pleiotropic effect of genetic variation [20]. If the intercept is different from zero, there is evidence of directed pleiotropy [20]. Under the assumption that the correlation between genetic variation and exposure is unrelated to the direct impact of genetic variation on the outcomes, the slope coefficient in MR-Egger regression provides a consistent estimate of the causal effect [20]. Furthermore, we employed both the simple-median and weighted-median methods, which involve combining either unweighted or weighted estimates using the median. As long as the weight of the causal effect calculated by the effective instrumental variable reaches 50%, a consistent estimation of the causal effect can be obtained [21]. Moreover, the likelihood-based method was used to evaluate the linear relationship between the exposure and the outcome, and the likelihood-based estimator expresses the causal increase in the outcome per unit change in the risk factor assuming a linear association between the risk factor and the outcome variables [22]. Lastly, we employed the Mendelian randomization pleiotropy residual sum and outlier (MR-PRESSO) method to identify and address horizontal pleiotropic outliers. This method conducts a global heterogeneity test by regressing the SNP-outcome (PCOS) associations against the SNP-exposure (SHBG) associations and then comparing the actual distance of each SNP from the regression line with the expected distance under the null hypothesis of no pleiotropy [23].

All statistical analyses were performed using R (version 3.6.3) with packages “MendelianRandomization” and “MR-PRESSO”, unless otherwise noted. An observed P-value < 0.013(0.05/2/2) was considered as statistically significant evidence for a causal association by using Bonferroni correction. A P-value ranging from 0.013 to 0.05 was considered as suggestive evidence.

Results

In the present study, a total of 289 and 440 independent SNPs associated with SHBG and SHBG adjusted BMI achieving genome-wide significance were used as IVs in the primary analysis. The F-statistic for SHBG and SHBG adjusted BMI was 21.77–1857.13, satisfying the threshold of > 10.

As shown in Table 1, genetically predicted higher levels of circulating SHBG were significant associated with a decreased risk of PCOS. For one standard deviation (SD) (approximately 30.3 nmol/L) increment of SHBG levels, the odds ratio (OR) of PCOS was 0.56 [95% confidence interval (CI): 0.39–0.78, P = 8.30 × 10–4]. The causal effect estimate was consistent in the sensitivity analyses using the simple-median (OR = 0.51, 95% CI: 0.30–0.87, P = 0.013), weighted-median (OR = 0.60, 95% CI: 0.35–1.03, P = 0.066) and Maximum-likelihood (OR = 0.55, 95% CI: 0.39–0.78, P = 7.28 × 10–4) methods. One outlier SNPs were detected using MR-PRESSO test, and the causal effect estimate between SHBG and PCOS was similar (OR = 0.54, 95% CI: 0.39–0.76, P = 4.07 × 10–4). MR-Egger regression did not suggest evidence of potential directional pleiotropy (P for intercept = 0.626) (Fig. 2). Since BMI has been reported to be associated with both risk of PCOS and levels of circulating SHBG, we repeated our MR analysis using SHBG associated SNPs with adjustments for BMI as IVs to further evaluate the robustness of our main analyses. The causal effect of circulating SHBG adjusted for BMI with risk of PCOS remained consistent (OR = 0.53, 95% CI: 0.38–0.73, P = 4.07 × 10–4, by IVW method) (Fig. 2).

Table 1 Genetically predicted circulating SHBG levels and risk of PCOS
Fig. 2
figure 2

Association between genetically predicted SHBG levels, SHBG adjusted BMI levels and risk of PCOS based on different MR methods. Abbreviations: BMI, Body mass index; IVW, inverse-variance weighted; MR, Mendelian randomization; SHBG, sex hormone binding globulin; SNP, single nucleotide polymorphism

When replicating the findings in the Finnegan consortium dataset, we observed consistent results for the association between SHBG and SHBG-adjusted BMI using the IVW method (OR = 0.45, 95% CI: 0.30–0.68, P = 1.83 × 10–4 for SHBG and OR = 0.50, 95% CI: 0.33–0.75, P = 8.87 × 10–4 for SHBG-adjusted BMI). The results of the other sensitivity analyses are listed in Supplementary Table 4.

Discussion

In this study, a two-sample MR approach was employed to assess the potential causal link between circulating SHBG levels and PCOS risk. The analysis revealed that genetically predicted high SHBG levels were linked to a reduced PCOS risk. Our findings remained consistent across various sensitivity analyses using different MR methods and alternative instrumental variable sets, underscoring their robustness.

SHBG, a serum protein linked to sex hormones, plays a critical role in facilitating the action and transport of these hormones [24]. Its levels have been documented to correlate with PCOS development, as well as with complications and long-term outcomes in PCOS patients. A reduced serum SHBG level in PCOS patients serves as a significant risk factor for hyperandrogenemia and serves as a crucial predictor of insulin resistance, as well as disturbances in glucose and lipid metabolism [25]. Observational epidemiological studies have provided evidence of an association between SHBG and PCOS risk. Cross-sectional studies have further indicated that SHBG levels are lower in the PCOS group compared to the control group [26, 27]. In a case–control study, the mean SHBG concentration was notably lower in the PCOS group compared to the control group, with statistically significant differences observed (P = 0.004) [28]. In addition, in a recent meta-analysis comprising 5,121 cases and 5,059 controls, it was reported that circulating SHBG levels were lower in PCOS patients compared to controls (SMD = -0.83, 95%CI = -1.01 to -0.64) [29]. Consistently, our findings support the notion of a protective effect of circulating SHBG levels against the development of PCOS.

While the precise molecular mechanism by which SHBG contributes to PCOS development remains unclear, various studies have indicated potential biological pathways involved in the pathogenesis of PCOS. For instance, the reduction in SHBG levels among PCOS patients results in elevated free and bioactive androgen levels, increased luteinizing hormone secretion, and reduced follicle-stimulating hormone secretion. These factors collectively lead to a high rate of follicle atresia and ultimately contribute to ovulation disorders [30, 31]. Moreover, insulin resistance (IR) stands out as a prominent characteristic of PCOS [32]. In a study by Fu Chen et al., SHBG was identified as an independent influential factor for HOMA-IR and could serve as a valuable predictive marker for IR in PCOS patients [33]. Feng C et al. conducted in vitro studies using a human insulin-resistant cell model, demonstrating that SHBG may down-regulate the Phosphatidylinositol 3 kinase/protein kinase B (PI3K/AKT) pathway. This pathway is known to be associated with the development of both local and systemic insulin resistance, which is considered to play a role in the connection between metabolic disorders and reproductive dysfunction in PCOS [34]. While these explanations hold biological plausibility, further studies are needed to clarify the precise role of SHBG in PCOS development.

Prior investigations have also indicated an association between BMI and both SHBG levels and PCOS risk. For instance, in a population-based cohort study, it was observed that women who were overweight or obese at both 14 and 31 years of age had a heightened risk of PCOS development [relative risk (RR) = 1.71, 95% CI = 1.30–2.24] [35]. Another study reported that SHBG correlated negatively with BMI in both PCOS (P < 0.0001) and non-PCOS groups (P = 0.001) [36]. Hence, in this study, we conducted a sensitivity analysis by employing IVs adjusted for BMI and reran the MR analysis. The causal association between SHBG and PCOS risk remained consistent, bolstering confidence in the potential causal role of SHBG in PCOS development.

There are some strengths in our study when compared with previous studies [14, 37]. To ensure three fundamental assumptions, we selected SNPs from independent loci related to circulating SHBG levels with more rigorous criterion (P < 5 × 10−8, r2 threshold < 0.001, kb = 10,000) from the largest GWAS to date as our IVs [13]. Second, MR analyses employing genetic variants as IVs largely address the issue of confounding inherent in traditional observational studies because genotypes are randomly allocated during gamete formation. We also performed different MR methods to test for potential pleiotropy. We did not observe evidence of directional pleiotropy for the causal association between SHBG and risk of PCOS in any of the above MR approaches. Lastly, we replicated our findings using another set of summary data, yielding consistent results. Nonetheless, this study also encounters limitations. While the majority of participants in the GWAS meta-analysis were of European descent, it's important to acknowledge the potential confounding effects arising from population stratification. Consequently, the results of this study may not be entirely applicable to individuals of non-European descent. Furthermore, we could not assess the potential nonlinear effects of serum SHBG on the risk PCOS by using the MR method. While we try our best to avoid pleiotropy, we cannot totally rule out the possibility of pleiotropy. Canalization plays a crucial role in complex organisms and should be considered when investigating genetic causality through two-sample MR analysis. However, the association between SHBG and the risk of PCOS needs further validation through additional studies due to the absence of individual data.

Conclusions

In conclusion, our study revealed a genetic association between high SHBG levels and reduced PCOS risk, implying a potential causal role for SHBG in PCOS development. Further investigations are needed to elucidate the underlying mechanisms by which SHBG influences the development of PCOS.