Introduction

In the field of in vitro fertilization (IVF), which has been in development for over forty years, physicians frequently use β-hCG to forecast the likelihood of successful pregnancy outcomes in women who have conceived naturally or with the use of assisted reproductive technology (ART) [1, 2]. However, there remains ongoing debate regarding the most effective methods for predicting adverse pregnancy outcomes, such as biochemical pregnancies, in order to facilitate a smooth transition from clinical pregnancy to term and minimize the occurrence of biochemical pregnancies.

The role of estradiol (E2) is well documented in IVF cycles, particularly in fertilization and implantation. Many researches in hints that successful IVF and a healthy pregnancy is associated with the levels of estrogen. Currently, a number of studies suggested that levels of hormones such as E2 may play a role in predicting successful pregnancy in IVF cycles [3, 4], but some results are controversial [5, 6]. What’s more, most previous studies paid attention to the association of clinical outcomes with E2 on the day of β-hCG administration, or E2 related with fresh embryo transplantation. However, rare study investigated earlier serum E2 levels on day 12 after frozen-thawed embryo transfer (FET) and lately clinical pregnancy outcomes. Hormonal replacement treatment cycles (HRTs) and natural endometrial preparation cycles (NCs) are two popular protocols for endometrial preparation, and this retrospective study aimed to assess the association between earlier serum E2 and β-hCG levels on day 12 following FET with NCs and clinical pregnancy outcome. Furthermore, our study aimed to identify the optimal E2 and β-hCG threshold values for predicting pregnancy outcomes in FET with NCs.

Materials and methods

All procedures in this study were approved by the Ethics Committee of Affiliated Women’s Hospital of Jiangnan University (2023-06-1213-65).

Cohort selection

In this retrospective study, infertile women undergoing FET with NCs were included based on electronic medical records from 2012 to 2022.

Clinical characteristics such as maternal age at diagnosis, nationality, the years of infertility, body mass index (BMI), basal estradiol (bE2), basal follicle-stimulating hormone (bFSH) and basal luteinizing hormone (bLH), anti-Müllerian hormone (AMH), basal prolactin (bPRL), β-hCG levels, etc., were collected. Weight and height at time of diagnosis were measured by BMI calculated and classified according to the ethnicity-specific WHO classification for Asian/Indian women: underweight (< 18.4 kg/m2), normal weight (18.5–22.99 kg/m2), overweight (23–27.49 kg/m2), and obese (> 27.50 kg/m2) [7].

The study included the patients aged up to 35 years old and without major system disease. Patients with uterine malformation, uterine fibroids or adenomyosis, severely damaged endometrium and chromosomal abnormalities of either part of the couple were excluded. Patients with incomplete basic endocrine parameters and untimely β-hCG and E2 data were also excluded from the study. Additionally, patients with an endometrial thickness less than 8 mm were not included in the analysis. Those who required exogenous estrogen supplements were also excluded due to their slow endometrial growth without supplementation. Finally, a total of 1521 cycles with positive β-hCG values were collected, forming two groups: CE Group (n = 595) and B Group (n = 926).

Blood samples were collected on day 12 post-embryo transfer, followed by quantification of serum E2 and β-hCG levels. The CE Group was further divided into clinical pregnancy group (CE-CP Group, with gestational sac) and biochemical pregnancy group (CE-BP Group, without gestational sac), based on the cutoff value of β-hCG ≥ 5 mIU/mL [8]. Similarly, the B Group was divided into blastocyst-clinical pregnancy group (B-CP Group) and blastocyst-biochemical pregnancy group (B-BP Group). A flow diagram of the patient-selection process is presented in Fig. 1.

Fig. 1
figure 1

Detail criteria and selection of the cohort selection. FET: frozen-warmed embryo transfer; NCs: natural endometrial preparation cycles; CE: cleavage embryo; B: blastocyst; CE-CP: cleavage embryo to clinical pregnancy; CE-BP: cleavage embryo to biochemical pregnancy; B-CP: blastocyst to clinical pregnancy; B-BP: blastocyst to biochemical pregnancy

Treatment protocols and endometrial preparation

In the natural FET cycles, endometrial maturation is determined by endogenous estradiol and progesterone, which are generated by stimulating the growth of the endometrium and the development of follicles. Regularly menstruating women can use NCs to plan FET without the need for high dose exogenous hormone supplements. However, oral low-dose estrogen (Estradiol Valerate, Bayer, Germany) will be administered to certain patients whose endometrium thickness is consistently thin, particularly when the dominant follicle is up to 14 mm, in order to promote endometrial growth. Finally, FET will be carried out if the endometrial thickness is ≥ 8 mm. Otherwise, the cycles will be cancelled. Furthermore, for those who might not make enough progesterone during embryo transfer, progesterone treatment is carried out during natural cycles.

Following 12–14 days of menstruation, an ultrasound evaluation was done to track follicle growth and measure follicle size. When the follicle reaches a diameter of 16 mm, daily transvaginal ultrasound becomes necessary, and serum levels of LH, E2, and P can be detected until spontaneous ovulation. It is routine to do NCs-FET at the cleavage embryo and blastocyst on Days 0 + 3 and 0 + 5 days, respectively, taking into account that the day of ovulation is Day 0. For luteal phase support, oral progesterone tablets (Duphaston, Abbott, 20 mg) and vaginal progesterone gels (Crinone, Merck Serono, Germany, 90 mg qd) were used. Support for the luteal phase was continued until week 10, at when the β-hCG tests turned positive.

Morphological assessment of cleavage embryos and blastocysts

D3 cleavage embryos were divided into four groups based on their cell number, symmetry of blastomeres, and degree of fragmentation. Grade I included embryos with 7–9 cells, symmetrical blastomeres, and < 5% fragmentation; Grade II included embryos with 6 cells, symmetrical blastomeres, and < 5% fragmentation; Grade III included embryos with 4–5 cells, symmetrical blastomeres, and < 5% fragmentation; Grade IV included embryos with < 4 cells. We classified Grade I and Grade II as high-quality embryos.

Blastocyst morphology was assessed based on the Inner cell mass (ICM) and trophectoderm (TE) which were categorized according to the number and structure of cells. The appearance of tightly packed ICM/TE cells was defined as “A”; several grouped cells as “B”; a few loose cells as “C”. High-grade blastocysts were defined as AA, AB, BA or BB expanded or hatched blastocysts with high ICM/TE grading.

Hormone assay

The levels of serum E2 and β-hCG were quantified using commercially available automated electrochemiluminescence immunoassays (DxI 800 Immunoassay System, Beckman Coulter, USA). All measurements were performed by skilled technicians in strict adherence to the manufacturer’s instructions.

Outcomes

Pregnancy outcomes were as follows: biochemical pregnancy, clinical pregnancy, ongoing pregnancy, and live birth [9]. Biochemical pregnancy [10] was defined as a serum β-hCG level ≥ 5 mIU/mL 12 days after embryo transfer, and that fails to progress to the point of ultrasound confirmation of gestational sac 28–35 days after embryo transfer. Clinical pregnancy was defined as an intrauterine/extra uterine gestational sac that was detected 28–35 days after embryo transfer by ultrasound.

Statistical analysis

IBM SPSS Statistics for Windows Version 27.0 (Armonk, NY: IBM Corp) software was used to analyze all the data. For analysis, we performed Kolmogorov-Smirnov test to determine whether variables were normally distributed or not, and then parametric continuous variables were expressed as mean values and standard deviation (̄x ± s), the difference was assessed using T-test. Where required, non-parametric variables were analyzed with the median (1st and 3rd quartiles) and compared using the Mann Whitney U test. A Chi-squared test was used to compare rates.

The receiver-operating characteristic (ROC) analysis was used to assess the predictive accuracy of the serum E2 and β-hCG concentrations. AUC measures the diagnostic accuracy of the test, with a greater area corresponding to a greater ability to discriminate between patients with clinical pregnancy or not. AUC < 0.5 indicated no predictive value, 0.5 ≤ AUC < 0.7 indicated a low predictive value, 0.7 ≤ AUC < 0.9 indicated a moderate prediction value and 0.9 ≤ AUC < 1 indicated a high predictive value.

Spearman’s rank correlation (rs) was used to examine the relationship between pregnancy outcome and early serum E2 and β-hCG levels. The Pearson’s test was used to assess the difference.

Logistic regression analyses were used to study the association between variables (day 12 serum E2 and β-hCG concentration) and pregnancy outcome. Hosmer-Lemeshow test was done to assess the model’s goodness-of-fit.

Statistical significance was set at a two-sided P < 0.05.

Results

General characteristics of the included patients

Table 1 General characteristics of the included patients

The general characteristics of the included patients are presented in Table 1. No statistically significant differences were observed between the CE-CP Group and CE-BP Group, as well as within the B-CP Group when compared to the B-BP Group.

Serum E2 and β-hCG were significantly higher in clinical pregnancy

Serum E2 levels were compared among all groups on day 12 following FET with NCs revealing significantly higher serum E2 levels in the CE-CP Group compared to the CE-BP Group (212.04 ± 151.27 vs. 112.18 ± 43.18 pg/mL, P < 0.001). Similarly, there was also a significant difference observed between B-CP Group and B-BP Group (215.06 ± 96.80 vs. 135.87 ± 55.20, P < 0.001). Likewise, there was significant difference observed regarding β-hCG level on day 12 following FET with NCs between CE-CP Group versus CE-BP Group (474.73 ± 351.88 vs. 69.18 ± 43.41 mIU/mL, P < 0.001) and similarly within B Group (832.88 ± 694.36 vs. 107.49 ± 88.44 mIU/mL, P < 0.001).

No significant differences were observed in terms of the average number of embryos transferred, the number of high-quality embryos transferred, and endometrial thickness between B-CP Group and B-BP Group, as well as between CE-CP Group and CE-BP Group (P > 0.05, Table 2).

Table 2 Cycle parameters and serum E2 and β-hCG levels on day 12 following FET with NCs

AUC analysis hormone profile of serum E 2 and β-hCG levels in predicting clinical pregnancy following FET with NCs

The ability of serum E2 and β-hCG levels to predict transplant outcomes was evaluated using AUC analysis (Fig. 2). For the CE Group, serum E2 levels demonstrated moderate predictive ability for clinical pregnancy (AUC = 0.801; sensitivity = 74.45%; specificity = 71.28%; cutoff value = 129.25 mIU/mL). In terms of β-hCG levels, a cutoff value of 156.60 mIU/mL was identified to predict clinical pregnancy with high sensitivity (84.20%), specificity (95.74%) and predictive value.

For the Group B, the cutoff values for E2 and β-hCG concentrations were determined as predictors of clinical pregnancy probability: 174.45 pg/mL for E2 and 217.70 mIU/mL for β-hCG respectively. These values yielded AUCs of 0.762 and 0.924 respectively along with sensitivities of 63.77% and 81.56% respectively, specificities of 77.84% and 94.01% respectively. The differences were all significant.

Fig. 2
figure 2

ROC curves for predicting pregnancy outcomes by serum E2 and β-hCG levels 12 days following FET with NCs. (a) ROC curve analysis of day 12 serum E2 and β-hCG levels following FET with NCs in Group CE; (b) ROC curve analysis of day 12 serum E2 and β-hCG levels following FET with NCs in Group B

Significant relations between day 12 E2 and β-hCG concentrations

Significant correlations between serum day 12 E2 and β-hCG concentrations were found in the CE Group (rs=0.117; P = 0.004) and B Group (rs=0.170; P < 0.001).

Association between variables and clinical outcome by logistic analyses

Logistic regression analysis revealed that serum day 12 E2 and β-hCG concentrations were the significant predictors of clinical pregnancy (Table 3). The analysis revealed that serum E2 and β-hCG on day 12 were significantly associated with clinical pregnancy in the CE Group [E2 day 12: odds ratio (OR) of 0.978 (95% CI 0.972–0.984); β-hCG day 12: OR of 0.979 (95% CI 0.973–0.984)] and B Group [E2 day 12: OR of 0.986 (95% CI 0.983–0.989); β-hCG day 12: OR of 0.988 (95% CI 0.986–0.991)].

Table 3 Predictive accuracy of serum E2 and β-hCG concentrations on day 12 following FET with NCs

Discussion

In recent years, IVF has successfully addressed various causes of infertility. However, accurately predicting of IVF cycle outcomes remains a critical concern. Previous research has indicated a correlation between β-hCG levels at different time points after fresh embryo transfer or on the day of ovulatory dose of hCG administration along with E2 levels and pregnancy outcomes. Nevertheless, the predictive value of E2 is still controversial. Therefore, this study aims to assess the predictive value of serum E2 and β-hCG concentrations on day 12 in determining early reproductive outcomes for women undergoing FET with natural cycles using natural cycles. We observed significant differences in serum E2 levels between biochemical pregnancy group and clinical pregnancy group, and proposed cutoff values for both E2and β-hCG. Moreover, a significant correlation between serum E2 and β-hCG was found in CE Group and B Group.

Previous research has suggested that E2 levels during the luteal phase may serve as a predictive factor for ongoing pregnancy post-clinical pregnancy [4, 11, 12]. However, conflicting results have been reported [5, 6]. Goldman RH et al. [5] found that elevated serum E2 levels at the initiation of progesterone treatment may adversely affect implantation, pregnancy, and live birth outcomes following FET. Excessive serum E2 levels could negatively impact the chances of conception and live birth after IVF. Huang’s study [13] demonstrated that E2 influence during the peri-implantation period (day 11) does not affect clinical outcomes in fresh embryo transplantation. A meta-analysis by Karatasiou GI et al. [14] revealed insufficient evidence to support or refute an association between serum E2 levels on the day of triggering final oocyte maturation with hCG and the probability of pregnancy. A study using mouse models showed that variations in E2 levels play an crucial role in determining the window of opportunity for embryo attachment to the uterine wall and subsequent development [15]. Additionally, another study indicated a potential positive correlation between higher E2 levels and improved pregnancy outcomes [16]. Furthermore, recent research has linked decreased E2 levels on cycle day 28 (22 days post fresh embryo transfer) to poorer pregnancy outcomes [17]. There are some methodological differences as well as variations in patient populations between these studies and our research. Our focus is specifically on infertile women who underwent FET with NCs, aiming to evaluate the association between E2 level and clinical outcomes.

Current clinical methods for predicting pregnancy outcomes often utilize a combination of β-hCG and P levels, particularly in women undergoing ART or conceiving naturally [1, 18]. Normally, serum β-hCG and P are lower in nonpregnant women, and their levels could rise significantly after pregnancy. Previous studies have demonstrated that β-hCG levels at 7 or 14 days after transplantation can be used as predictors of pregnancy outcomes [19, 20]. Additionally, serum β-hCG levels on postovulatory day 12 and 14 can also serve as predictors of pregnancy outcome following an IVF-ET cycle [21]. However, the association between serum E2 and β-hCG levels on day 12 following FET with natural cycles has not been reported in previous studies. This gap is addressed by our research.

In contrast to current clinical methods for predicting pregnancy outcomes, our study distinguishes itself in several key ways. Firstly, we specifically focus on FET cycles to eliminate the confounding effects of high-dose hormones typically associated with fresh embryo transfer during ovulation induction cycles. Secondly, we concentrate on endometrial preparation for transfer regimens in natural cycles, thereby excluding the influence of exogenous hormones on predicted outcomes. Lastly, our study focuses on FET with natural cycles and proposed the cutoff values of E2 (129.25 pg/mL for CE Group and 174.45 pg/mL for B Group) for predicting clinical pregnancy following FET with NCs, which is a relatively new perspective on predicting pregnancy outcomes.

Uterine decidualization supports embryo implantation and placentation, as well as subsequent events that maintain a successful pregnancy together [22]. Recent research suggests that endometrial receptivity rather than embryo quality is a key factor restricting successful pregnancy after FET. HRTs or NCs which were usually used for the endometrial preparation [23] can be observed as an important factor affecting the pregnancy rate of FET. In order to eliminate the impact of hormone supplementation, our study focuses on infertile women who underwent FET with natural cycles and aims to evaluate the association between E2 levels and clinical outcomes.

To date, limited research has explored the relationship between pregnancy outcomes and E2 levels on day 12 following FET with NCs. In this retrospective cohort study, we examined the correlation between E2 levels and clinical outcomes in infertile women undergoing FET with NCs. Additionally, we established E2 cutoff values that, when combined with β-hCG levels on day 12 following FET with NCs, can be used to assess both positive and negative pregnancy outcomes. Our findings indicate that E2 levels were significantly higher in the clinical pregnancy group compared to the biochemical pregnancy group on day 12 following FET with NCs, suggesting a potential correlation between E2 levels and clinical outcomes. Lower E2 levels may potentially affect endometrial gland development and maturation, leading to a decrease in clinical pregnancy rates. By evaluating serum E2 levels alongside β-hCG levels on day 12 post-FET, healthcare providers can enhance outcome predictions and customize treatment plans for patients. This approach may also lead to a reduction in patient visits and clinical serum hormone tests, ultimately decreasing psychological stress.

Although our study effectively establishes a correlation between E2 levels and clinical pregnancy outcomes in infertile women undergoing FET with NCs, there are deficiencies in our research. There are limitations inherent in its retrospective design which weaken its statistical power. Additionally, the specific thresholds for elevated E2 levels affecting endometrial receptivity remain uncertain.

Conclusion

Our findings suggest that there is a significant difference in serum E2 levels between biochemical and clinical pregnancies following FET with NCs, and we have proposed E2 cutoff values for predicting clinical pregnancy in this context. The results of our retrospective cohort study indicate that the cutoff value of E2 for predicting clinical pregnancy with acceptable sensitivity and specificity is 129.25 pg/mL for CE Group and 174.45 pg/mL for B Group.

In conclusion, incorporating both early E2 and β-hCG levels on day 12 post-FET may serve as a predictive tool for assessing both positive and adverse pregnancy outcomes in FET with natural cycles. These findings suggest that in addition to serum β-hCG concentrations on day 12, serum E2 levels on the same day can also be indicative of clinical pregnancy following FET with NCs. Furthermore, both serum E2 and β-hCG levels on day 12 are equally valuable in predicting clinical pregnancies, showing a significant correlation between them. Therefore, integrating day 12 E2 levels into routine follow-up protocols may be beneficial.