Background

The utilization of frozen-thawed embryo transfer (FET) has witnessed a significant increase, due to the diminished necessity for clinical visits and enhanced scheduling flexibility, hormone replacement therapy (HRT) has found extensive application in FET. Early investigations proposed that employing ultrasound to evaluate endometrial thickness could function as an alternative to invasive methods, such as endometrial biopsy, for gauging endometrial receptivity in the context of FET [1]. Several studies have shown that optimal endometrial thickness is crucial for successful reproductive outcomes in FET cycles. An endometrial thickness of 7–14 mm is generally considered ideal, with thicknesses outside this range being associated with lower pregnancy rates and live birth rates [2,3,4,5]. Additionally, reproductive endocrinology is one of the most important regulatory mechanisms in the endometrium, and it is crucial to improve endometrial receptivity by adjusting the application and timing of E2 and progesterone. E2 fosters the proliferation of the endometrium and triggers the expression of progesterone receptors within the endometrial tissue, a prerequisite for the successful implantation of a transferred embryo [6]. A positive correlation exists between serum E2 levels in HRT-FET cycles and the dosage of exogenous E2 medication administered. However, the evaluation of exogenous E2 dosage is influenced by various factors, including the type of drug, administration route, and individual metabolic variations, resulting in significant heterogeneity in the data, posing analytical challenges. Therefore, we opted for serum E2 levels as a research indicator to provide a more precise reflection of the biological utilisation of E2.

Some studies have suggested that supraphysiological levels of serum E2 can disrupt the synchronisation of embryo-endometrial interactions and/or promote pathological endometrial development, thereby impairing optimal endometrial receptivity and embryo implantation [7,8,9]. However, other research have indicated that elevated supraphysiological concentrations of serum E2 do not impact the physiological levels of progesterone to induce secretory transition [10]. Several investigations have additionally conveyed findings that the serum E2 levels during the mid-cycle phase exhibit no correlation with outcomes in assisted reproductive technology (ART) [11,12,13]. In fresh embryo transfer cycles following ovarian stimulation, elevated serum E2 levels have been frequently associated with adverse reproductive outcomes, including lower birth weights and preterm births. In contrast, such adverse outcomes are less commonly observed in artificial FET cycles [14, 15]. Elevated E2 levels in fresh cycles are believed to impair endometrial receptivity, potentially due to endometrial overstimulation. This overstimulation can create suboptimal conditions for implantation, leading to poorer pregnancy outcomes. Conversely, in FET cycles, the endometrium is prepared in a more controlled and regulated manner, which may mitigate some of the adverse effects associated with high E2 levels in fresh cycles. However, the need to measure serum E2 levels during artificial cycling remains controversial. Its effect on the final assisted reproductive outcome also remains a subject of debate. Based on the above findings, we aimed to further explore this contentious issue by analyzing a large-scale, single-centre, 6-year retrospective cohort comprising HRT-FET cycles that were uniformly prepared.

Methods

Study design and population

In a single-center retrospective cohort study, we collected data from the Centre for Reproductive Medicine’s database for all HRT-FET procedures conducted between January 2017 and December 2022. Throughout this period, embryo cryopreservation exclusively utilized vitrification, and serum E2 levels were assessed using a single hormone assay. Patients were followed up for at least 1 year. Patients with uterine malformations (unicornuate uterus, septate uterus, or double uterus), untreated hydrosalpinx, unaddressed endometrial lesions (endometritis, endometrial polyps, or intrauterine adhesions), those undergoing pre-implantation genetic testing cycles for chromosomal abnormalities, those treated with gonadotropin-releasing hormone agonists, those with embryos derived from in vitro maturation, those who underwent oocyte-donation cycles, and patients who canceled the embryo transfer on the planned transfer day were excluded from the study. Finally, 26,194 patients were included in the study. Patients were classified into three groups based on tertiles of serum E2 level before progesterone administration: group 1 (≤ p33.3, E2 ranging from 10.00 to 118.00 pg/mL, n = 8784), group 2 (p33.4–p66.6, E2 ranging from 118.44 to 231.00 pg/mL, n = 8662), and group 3 (≥ p66.7, E2 ranging from 232.00 to 3325.00 pg/mL, n = 8748). The retrospective study received approval from the Institutional Review Board and Ethics Committee, with waived informed consent due to its retrospective design.

Artificial FET protocol

All patients underwent HRT for endometrial preparation. Hormonal therapy was initiated on the second or third day of the menstrual cycle after confirming baseline serum hormone levels (E2 < 50 pg/mL, and progesterone < 1 ng/mL). Patients received oral administration of 2–5 mg of E2 (Progynova, Bayer-Schering Pharma AG, Berlin, Germany) twice daily. Transvaginal ultrasound measured endometrial thickness and shape on the seventh day of administration, and E2 dosage adjustments were made based on endometrial thickness, following the physician’s preference and experience. After 12–14 days of treatment, the endometrium underwent re-evaluation using transvaginal ultrasonography, accompanied by measurements of serum E2 and progesterone levels. In instances where endometrial thickness was either < 7 mm or > 15 mm, patients opted for embryo transfer after being informed of associated risks. When serum progesterone levels were < 1 ng/mL, vaginal micronized progesterone (Utrogestan, 200 mg, three times daily; Besins Healthcare, UK) was administered on the second day after completing E2 and progesterone measurements to induce the luteal phase. Embryo transfer for cleavage-stage embryos occurred on the fourth day of progesterone administration, and blastocyst transfer was conducted on the sixth day (the first day of progesterone administration was recorded as D1, cleavage-stage embryo transfer as D4, and blastocyst transfer as D6). An additional daily oral dose of 20 mg dydrogesterone (Duphaston, Abbott, Netherlands) was administered on the day of embryo transfer. Luteal phase support continued until the Human chorionic gonadotropin pregnancy test. If the test result was positive, support extended until the 12th week of pregnancy.

Definition of clinical outcomes

The primary outcome measures were the clinical pregnancy rate (CPR) and live birth rate (LBR), defined according to the criteria established by the American Society for Reproductive Medicine in 2017 [16]. Clinical pregnancy was ascertained by the presence of one or more gestational sacs detected through ultrasonography. Live birth was defined as the delivery of at least one viable baby after 22 weeks of gestation. The calculation for CPR (%) involved dividing the total number of clinical pregnancy cycles by the number of embryo transfer cycles and multiplying by 100. Similarly, LBR (%) was determined by dividing the total number of live birth cycles by the number of embryo transfer cycles and multiplying by 100.

Statistical analysis

Participants were initially assigned to distinct groups according to their baseline parameters, and the chi-square test was employed to compare both CPR and LBR. Subsequently, a multivariate logistic regression analysis was conducted to assess the correlation between CPR and LBR. All statistical analyses were carried out utilizing IBM SPSS Statistics 21 (IBM Corp., USA), with all P values being two-sided, and statistical significance set at P < 0.05.

Results

Characteristics of the patients

The analysis incorporated 26,194 cycles of FET-ART, exhibiting a CPR of 54.4% (14,300/26,194) and a LBR of 44.1% (11,571/26,194). Table 1 summarizes the baseline clinical characteristics and cycle parameters of the participants. Stratifying the cycles into three groups based on tertiles of serum E2 levels revealed a progressive increase in female age, the proportion of patients with secondary infertility, the proportion of cleavage-stage embryo transfers, serum E2 levels both before estrogen administration and before progesterone administration, and the proportion of previous ET or FET cycles with failure. Conversely, body mass index (BMI), the proportion of patients with primary infertility, the proportion of blastocyst transfers, and the proportion of first IVF-FET cycles exhibited a decreasing trend (all P < 0.001). In addition, significant differences were observed in the indications for infertility [tubal, endometriosis (EMT), polycystic ovary syndrome (PCOS), and male factor], the number of embryos transferred, and endometrial thickness among the three groups. Significant statistical differences were observed in both CPR and LBR among the three groups (p < 0.000 for both), with CPR and LBR decreasing gradually with increasing serum E2 levels (refer to Tables 2 and 3). In addition, Supplemental Table 1 presents detailed descriptive statistics of serum E2 levels before progesterone administration in the three groups, including the mean, standard deviation, range, median, and interquartile range.

Table 1 Patients’ baseline characteristics
Table 2 CPR in HRT-FET cycles based on different parameters
Table 3 LBR in HRT-FET cycles according to different parameters

Comparison of CPR differences among the E2 groups

The CPR in the three E2 groups were 59.7% (5,242/8,784), 56.2% (4,869/8,662), and 47.9% (4,189/8,748) for groups 1, 2, and 3, respectively (Table 2). For the intra-group analysis of the three groups, CPR was statistically different across different age groups of women (< 35, 35–39, or ≥ 40 years), infertility diagnosis (primary or secondary infertility), cycle category (first IVF-FET cycle or previous ET or FET cycles with failure), stage of embryo transfer (cleavage-stage embryo or blastocyst transfer), number of embryos transferred (one, two, or three), indication for infertility (tubal, EMT, PCOS, male factor, or multiple factors), insemination method-fresh cycle [In vitro fertilization(IVF) or Intracytoplasmic sperm injection(ICSI)], and endometrial thickness (< 7 mm or ≥ 7 mm). For groups 2 and 3, CPR varied among patients based on BMI (≤ 18.5, 18.6–24.9 or ≥ 25 kg/m²) and indications for infertility-EMT.

In the inter-group analysis, after adjusting for confounding factors stratified by age, BMI, infertility diagnosis, cycle category, stage of embryo transfer, number of embryos transferred (one or two), indication for infertility, insemination method, fresh cycle, and endometrial thickness, CPR gradually decreased with increasing E2 levels among the three groups. However, no difference was observed only in the subgroup of patients who underwent transfer of three embryos (Table 2).

Comparison of LBR differences among the E2 groups

The LBR in the three E2 groups were 49.4% (4,337/8,784), 46.0% (3,981/8,662), and 37.2% (3,253/8,748) for groups 1, 2, and 3, respectively (Table 3). For the intra-group analysis of the three groups, LBR was statistically different across different age groups of women, BMI, infertility diagnosis, cycle category, stage of embryo transfer, number of embryos transferred, indication for infertility, insemination method-fresh cycle, and endometrial thickness. For groups 2 and 3, LBR varied among patients based on the indication for infertility-EMT.

In the inter-group analysis, after adjusting for confounding factors stratified by age, BMI, infertility diagnosis, cycle category, stage of embryo transfer, number of embryos transferred (one or two), indication for infertility, insemination method, fresh cycle, and endometrial thickness, LBR gradually decreased with increasing E2 levels among the three groups. However, no difference in women aged ≥ 40 years and in those who underwent transfer of three embryos (Table 3).

Influencing factors of CPR and LBR during the FET cycle

Univariate logistic analysis (Supplemental Table 2) was used to evaluate the effect of each variable on pregnancy outcomes. Generally, insemination method-fresh cycle-ICSI, blastocyst transfer, increased number of transferred embryos, tubal disorder, PCOS, male factor infertility, and multiple factors infertility were positively correlated with the CPR and LBR. In contrast, elevated E2 levels (E2 group 2 and 3), advanced age, BMI, secondary infertility, previous ET or FET cycles with failure, endometrial thickness less than 7 mm, and EMT were negatively correlated with the CPR and LBR.

All CPR- and LBR-related factors were reanalyzed simultaneously using a multivariate logistic regression analysis model with adjusted data (Table 4). Elevated E2 levels (third tertile of E2 level group 3), advanced age, endometrial thickness less than 7 mm, and the indication for infertility-tubal were identified as negative predictors for both CPR and LBR. Conversely, positive predictors for CPR and LBR included insemination method-fresh cycle-ICSI, blastocyst transfer, increased number of transferred embryos, the indication for infertility-PCOS, and male factor infertility. Notably, BMI and previous ET or FET cycles with failure were negative predictors for LBR but not for CPR. Serum E2 levels before estrogen administration, infertility diagnosis (secondary infertility/primary infertility), and indications for infertility—EMT and multiple factors—were not included in the final multivariate logistic regression model due to lack of statistical significance.

Table 4 Logistic regression analysis of factors associated with CPR and LBR in HRT-FET cycles

Discussion

In this study of 26,194 FET-ART cycles, we observed a CPR of 54.4% and an LBR of 44.1%. The stratification of cycles based on E2 tertiles demonstrated a significant association between elevated serum E2 levels and diminished CPR and LBR. These findings suggest that consideration of E2 levels during FET cycles is essential for optimizing reproductive outcomes.

Prior investigations have suggested that increased serum E2 levels during the ovulation stimulation cycle could adversely impact pregnancy outcomes [17,18,19]. Fritz’s study revealed a substantial decrease in ongoing pregnancy and LBR within the high E2 level group (692–1713 pg/mL) in contrast to the low E2 level group (135–214 pg/mL) [20]. Animal model studies have shown that elevated doses of exogenous E2 may facilitate the shift of the endometrium from a receptive to a non-receptive state. This process is mediated through the rapid closure of the endometrial implantation window, leading to a diminished pregnancy rate [21]. Patel et al. reported a concentration-dependent effect of E2 on protein expression. Specifically, pro-apoptotic caspases 3, 8, and 9 exhibited an increase in expression levels when E2 concentrations reached ≥ 25 nM. Concurrently, the anti-apoptotic protein B-cell lymphoma 2-alpha demonstrated a decline in expression at E2 concentrations exceeding 10 nM [22]. These findings suggest that elevated E2 levels induce trophoblast cell death or impede proliferation. Our study aligns with these precedents, suggesting that increased serum E2 levels before progesterone conversion during HRT cycles may detrimentally affect endometrial receptivity, leading to decreased CPR and LBR. However, in 2020, a retrospective analysis indicated that pre-progesterone E2 levels had no effect on LBR during HRT-FET-assisted reproduction [23]. It has also been shown that pregnancy outcomes are not correlated with serum E2 levels on the day of progesterone conversion in HRT-FET cycles [12, 13, 24]. We identified variations in the definition of elevated E2 levels during HRT cycles across different studies. Serum oestrogen levels are influenced by factors, such as the type of oestrogen medication, administration route (oral, transdermal, or vaginal), and individual metabolic variations, leading to significant interindividual differences and diverse study outcomes. To address this, our study, which was characterised by a substantial sample size, classified serum E2 levels into tertiles and investigated the correlation between serum E2 level trends and assisted reproduction outcomes. This approach effectively addressed potential biases arising from differences in E2 cut-off values and inter-group sample volumes, thereby enhancing the reliability of our results.

In a retrospective study, it is imperative to acknowledge that variations in demographic characteristics among diverse populations can potentially impact research outcomes. Influencing factors in this study include female age, BMI, infertility diagnosis, cycle category, embryo transfer stage, number of embryos transferred, indications for infertility and endometrial thickness, among others. To mitigate the influence of these confounding variables, we conducted multifactorial stratified and multivariate regression analyses. Detailed intra- and inter-group analyses revealed the impact of various clinical parameters on CPR and LBR within each E2 tertile group, revealing multifaceted associations with demographic and clinical factors. Age, infertility diagnosis, stage of embryo transfer, number of embryos transferred, indication for infertility, and insemination method (fresh cycle and progesterone-induced endometrial thickness) emerged as critical factors influencing CPR and LBR, consistent with findings from previous studies [25,26,27,28]. A meta-analysis revealed a significant reduction in the probability of achieving clinical pregnancy when the endometrial thickness was ≤ 7 mm, as opposed to cases with endometrial thickness of > 7 mm (23.3% vs. 48.1%, odds ratio: 0.42) [29]. This finding aligns with the outcomes observed at our centre. Notably, only 5.80% (1520/26,188) of patients with endometrial thickness of less than 7 mm underwent transfer, primarily because most patients chose to cancel the procedure when the endometrial thickness fell below 7 mm. In addition, we observed that overweight women (BMI ≥ 25 kg/m²) had lower CPR and LBR than women with normal weight, and multivariate regression analysis also revealed BMI as a negative factor for LBR, with substantial support from existing research supporting our findings [30,31,32]. Compared to the first IVF-FET cycle, previous ET or FET cycle failures were negative predictors for LBR but not for CPR. In patients with prior embryo implantation failures, there may be underlying abnormalities in endometrial receptivity [33]. Additionally, complex multifactorial causes could contribute to the reduced LBR. Notably, the association between E2 level, CPR, and LBR persisted even after adjusting for these confounding factors, emphasising the independent role of E2 in predicting FET outcomes. These findings also underscore the need for personalised approaches tailored to patient characteristics.

Notably, an interesting observation was made from the subgroup analysis. In most cases, as the E2 levels increased, both CPR and LBR gradually decreased. However, no notable distinctions were detected in the subgroups that underwent the transfer of three embryos. In the subgroup of older women (≥ 40 years), the differences in LBR among women with different E2 levels were also not statistically significant. Several factors may have contributed to this finding. First, patients undergoing a single cycle with the transfer of three embryos in our centre often exhibited poor embryo quality, history of repeated implantation failures, and advanced maternal age. These factors could have individually or collectively influenced the impact of serum E2 level on reproductive outcomes. Additionally, the negative impact of advanced age, especially in women aged ≥ 40 years, may be more pronounced on live birth rates, rendering the positive impact of low E2 levels less significant. Therefore, it is necessary to expand the sample size for this population in future studies.

In addition, the conclusions were further supported by multivariate logistic regression analysis, which identified elevated E2 levels (second and third tertiles) as independent negative predictors for CPR and LBR compared to low E2 levels (first tertile). Supraphysiological E2 levels could potentially disrupt the normal estrogen-progesterone equilibrium within the endometrium, resulting in failed endometrial compaction [34]. Elevated oestrogen levels may also lead to a decreased sensitivity of the endometrium to progesterone, resulting in progesterone resistance. Based on our research findings and reasonable speculation, we consider this to be a promising field for future exploration.

This study has several strengths. Notably, a substantial sample size of 26,194 FET-HRT cycles bolstered the study’s statistical power and enhanced the reliability and generalisability of the findings. In addition to oestrogen, we performed subgroup analyses encompassing various clinical parameters, conducted intra- and inter-group analyses, and employed multivariate regression analyses to optimise the control of confounding factors. Even after adjusting for multiple confounders, our study consistently demonstrated that elevated E2 levels (second and third tertiles) have a potential negative impact on both CPR and LBR compared to low E2 levels (first tertile), offering pertinent guidance for clinical practitioners.

Despite these strengths, this study has several limitations that must be acknowledged. First, its retrospective design introduces an inherent bias. Second, while our study is based on data from a single center, potential unaccounted confounding factors may still influence the final outcomes. For instance, discrepancies in the duration of oestrogen action and measurement time, as well as the specific time interval between ovarian stimulation and FET cycles, may also have affected the results. Although we regret the absence of these data, we assumed that any potential measurement and duration biases were evenly distributed across all study patients, enabling us to analyse the results. Third, the study’s exclusive focus on patients undergoing HRT–FET cycles necessitates caution when generalising the findings to other populations. The outcomes may not be extrapolated to patients undergoing natural or induced ovulation cycles with FET, or to those who opted for IVF with fresh embryo transfer. Therefore, further investigation of these distinct patient cohorts is required.

Conclusions

This large retrospective study indicates that elevated serum E2 levels before progesterone administration during HRT-FET cycles are linked to reduced CPR and LBR post-embryo transfer. Therefore, it is advisable to monitor serum E2 levels and consider adjusting treatment strategies based on individual patient’s serum E2 levels to maximize patient outcomes.