Introduction

During in vitro fertilization (IVF) cycles, controlled ovarian hyperstimulation (COH) treatment using exogenous gonadotropin (Gn) to stimulate follicle development is a critical step in ensuring the acquisition of mature eggs and a satisfactory pregnancy rate. In clinical practice, assisted reproductive technology (ART) outcomes are often monitored according to the size and number of follicles and serum estradiol (E2) levels after Gn stimulation. However, no accurate indicator to predict pregnancy outcomes has been identified to date, and it has not been determined how to choose subsequent drugs and doses based on the ovarian response after stimulation.

Additionally, the need for combined monitoring (using transvaginal ultrasound and serum estradiol) during ovarian stimulation is controversial. Some people argued that vaginal ultrasound alone should be considered to simplify treatment, as combined monitoring is costly, time-consuming, and inconvenient [1]. Neither E2 on the day of human chorionic gonadotropin (hCG) administration nor other stages were linked to pregnancy rates in women undergoing ART cycles [2,3,4,5]. Additionally, E2 level was found to be a poor predictor of treatment success [6]. However, the evidence has a low overall quality [7]. Additionally, other researchers have suggested that E2 levels can be used to predict pregnancy outcomes in combination with FSH, age, inhibin B, and other factors [8,9,10,11]. Some researchers have suggested that a poor ovarian response can be characterized by peak E2 levels [12]. Phelps et al. and Kahyaoglu et al. [13, 14] explored the relationship between E2 levels on the fourth day of Gn and IVF outcomes and believed that estradiol levels on the fourth day of COH cycle could predict the response of early follicles to ovarian stimulation. When the serum estradiol level on the fourth day of Gn was low, the current treatment cycle was abandoned. Other groups concluded that a low E2 concentration after five days of Gn stimulation predicted a high cycle cancellation and lower pregnancy outcome, even with similar numbers of oocytes and fertilization rates [15]. Lower E2 levels on the sixth day of Gn have also been associated with a lower pregnancy rate and the likelihood of live births [8, 16, 17]. It has been reported that an appropriate range for E2 exists, and a higher range is not beneficial [18, 19]. Older women (> 35 years) appeared to be more vulnerable to the harmful effects of high E2 levels than younger women (≤ 35 years). Valbuena et al. [20, 21] found that high E2 concentration affects embryonic adhesion. Additionally, high E2 concentration affects endometrial receptivity [22,23,24,25]. However, Blazar et al. [26,27,28] discovered that higher E2 levels on hCG days predicted a greater number of oocytes, and any adverse impact on endometrial can be conquered in IVF-ET. Only the estradiol level on the hCG day was a significant predictive variable for clinical pregnancy [29, 30]. Additionally, according to percentile curves, Papageorgiou et al. [31] did not identify any deleterious effects of high E2 levels. Super physiological estradiol levels also do not affect oocyte and embryo quality [32].

Everyone had different opinions on determining E2 in COH cycle, but it was undeniable that determining E2 during follicular phase has become a part of routine clinical practice over the last decade. In previous studies, most of them only analyzed the estrogen level, and there was no study on the ratio of estrogen increase, although different people had different antral follicle counts and different average estrogen levels of a single follicle. Estrogen levels represent the static condition of the ovarian response and do not reflect the dynamic state of exogenous control of ovarian stimulation on follicular growth. Thus, this study aimed to determine whether serum E2 levels and E2 increase ratios during Gn ovarian stimulation correlated with IVF and pregnancy outcomes in 335 patients on antagonist regimens. Additionally, this study aimed to determine the link between E2 levels and ratios in the whole Gn stimulation period and the outcomes, and the number of embryos was investigated to distinguish E2 effects on the endometrium and embryos. If this hypothesis is substantiated, it may be time to seek a new method to evaluate the ovarian response during Gn stimulation, adjust the dose, and ensure treatment outcome of COH.

Materials and methods

Study subjects and protocol

From April 2017 to July 2020, our center conducted a retrospective analysis of infertile patients who underwent IVF cycles. The women were monitored until live birth or until the embryos in the current cycle had been completed. The study protocol was approved by the institutional human ethics committee.

Inclusion criteria were as follows: 1) age ≤ 38 years, 2) basal follicle-stimulating hormone (bFSH) ≤ 10 IU/L, basal antral follicle count (bAFC) > 3, and 3) body mass index (BMI) < 30 kg/m2.

Exclusion criteria were as follows: 1) patients with chromosomal abnormalities, reproductive malformation, and a history of recurrent spontaneous abortion; 2) patients undergoing coasting to prevent ovarian hyper stimulation syndrome; 3) insufficient information; 4) uterine pathologies that might compromise pregnancy potential; and 5) cycles canceled due to failure of embryo thawing and survival.

Ovarian stimulation protocol

Patients received IVF treatment according to the Fixed GnRH antagonist protocol [33]. On the second day of the menstrual cycle, recombinant human follicle-stimulating hormone 150–300 U (Gonal-F; Merck, Lyon, France; Puregon, MSD, Boulogne, France) was injected as Gn. Additionally, Gn doses were determined based on patient age, body mass index (BMI), bFSH, and bAFC. GnRH-ant (Cetrotide, Merck, Lyon, France) was administered from day 5 onward. Oocytes were then collected by follicular aspiration under ultrasound 34–36 h after triggering with GnRH-a (Triptoreline, Decapeptyl, Ipsen, France) or recombinant hCG (rhCG, Ovitrelle, Merck, Lyon, France). Eighteen hours after fertilization, embryo development was monitored daily and graded based on the number and size of blastomeres, fragmentation rate, multinucleation, and early densification. Notably, on the third day following oocyte retrieval, an embryo with seven–ten blastomeres was defined as high quality [34].

On day 3, one or two embryos in the best shape were selected and transferred using a soft Wallace catheter, or whole embryo freezing for vitrification was chosen to avoid ovarian hyperstimulation syndrome (OHSS), embryo–endometrium asynchrony, and other reasons.

For luteal support in advance of fresh embryo transfer (ET), we used an injection of progesterone (20 mg/branch, Zhejiang Xianju Pharmaceutical Co., Ltd.), 40 mg daily, and oral dydrogesterone tablets (10 mg/tablet, Abbott Healthcare Products B.V.), 30 mg per day, or progesterone vaginal sustained-release gel (90 mg/dose, Crinone VR 8%, Merck, Sherano, Switzerland). In addition, two bags of Chinese medicine, the Gushen Antai pills, were used daily. For the frozen embryo transfer (FET), the endometrial preparation protocols used before the FET were natural cycles, hormone replacement cycles, and stimulated cycles. Embryo transfer was performed after three days of progesterone supplementation. For natural cycles and hormone replacement cycles, progesterone supplementation was added, similar to the cycle of fresh embryo transfer. Stimulated cycle protocols for endometrial preparation were not supplemented with progesterone.

All patients completed an IVF cycle and then performed ET or FET until live birth or until the embryos in the current cycle were used up.

Calculation of serum E2

Venous blood samples were collected on the day of Gn (Gn1), on five days of Gn (Gn5), on eight days of Gn (Gn8), and the day of trigger (HCG). In addition, Gn5/Gn1, Gn8/Gn5, HCG/Gn1, HCG/Gn5, and HCG/Gn8 represented the ratios of serum E2 increase. The ratios were calculated as follows: Gn5 divided by Gn1, Gn8 divided by Gn5, HCG divided by Gn1, HCG divided by Gn5, and HCG divided by Gn8.

Pregnancy outcomes

Cumulative live birth rate (CLBR) was the primary outcome. Secondary outcomes included cumulative clinical pregnancy rate (CCPR), number of oocytes, number of fertilization, number of blastomeres, number of embryos, and number of high-quality embryos.

CLBR was defined as the first live birth after using fresh or frozen embryos derived from a single ovarian stimulation cycle.

Clinical pregnancy was defined as an intrauterine gestational sac with a fetal heartbeat detected on transvaginal ultrasonography after six weeks of gestation. CCPR was calculated as the number of first clinical pregnancies generated from a single IVF cycle, including all fresh and frozen embryo transfers generated from the IVF cycle.

Statistical analysis

Statistical analysis was conducted using SPSS version 26.0 (SPSS Inc., Chicago, IL, USA). The Shapiro–Wilk test was used to assess data normality. Due to skewed distributions, quantitative variables were expressed as medians (interquartile range, range between the 25th and 75th percentiles), and Mann–Whitney U and Kruskal–Wallis tests were performed. Qualitative variables were expressed as frequencies and analyzed using chi-square test. P ≤ 0.05 was considered statistically significant.

Groups A and B were defined according to the 25th, 50th, and 75th percentiles of each ratio of E2 increase. According to the ratio of estradiol increase, the patients were divided into four groups: A1 (Gn5/Gn1 ≤ 6.44), A2 (6.44 < Gn5/Gn1 ≤ 10.62), A3 (10.62 < Gn5/Gn1 ≤ 21.33), and A4 (Gn5/Gn1 > 21.33); B1 (Gn8/Gn5 ≤ 2.39), B2 (2.39 < Gn8/Gn5 ≤ 3.03), B3 (3.03 < Gn8/Gn5 ≤ 3.84), and B4 (Gn8/Gn5 > 3.84).

Spearman’s correlations were used to determine the correlation between the quantitative parameters and the increase in E2 levels. The propensity scores were calculated using binary logistic regression analysis based on the following patient characteristics: female age, infertility duration, body mass index (BMI), infertility factors, Gn usage time, and Gn dosage. We calculated crude odds ratios (OR) and adjusted OR with 95% confidence intervals (CI).

Results

Study population

From April 2017 to July 2020, we retrospectively analyzed 335 patients who received in vitro fertilization (IVF) with antagonist regimens at the Affiliated Hospital of Shandong University of Traditional Chinese Medicine. Table 1 summarizes the characteristics of the study population for positive and negative CCPR and CLBR results. CCPR positives included 160 patients, and negative s included 175 patients. A total of 124 women had viable CLBR, whereas 211 did not.

Table 1 Baseline characteristics for CCPR and CLBR

Baseline characteristics

Positive and negative results were similar for BMI, infertility duration, Gn days, number of embryos transferred, endometrial thickness on transplantation day, and baseline hormone levels.

However, patients with CCPR and CLBR positive results were younger than those with negative results (P = 0.001 and P = 0.006, respectively) and had a lower Gn dosage (P = 0.001 and P = 0.005)). The patients with CCPR and CLBR positives had more antral follicle counts (P < 0.001 and P = 0.001). The patients with CCPR and CLBR positives achieved better IVF-ET outcomes (both P < 0.001), such as the number of oocytes, fertilization, blastomere, and embryos.

Serum estradiol levels and ratios of CCPR and CLBR

Table 2 compares the outcomes based on E2 levels and ratios. Gn1 did not impact IVF outcomes (P = 0.134; P = 0.122). However, elevated E2 levels following gonadotropin stimulation correlated with higher CCPR and CLBR (both P < 0.05), particularly in the late follicular phase, E2 of Gn8 (P < 0.001; P = 0.001), and HCG (P < 0.001; P = 0.002). However, in early follicular growth, the estrogen increase ratios were more statistically significant with the outcomes. Following gonadotropin stimulation, the higher the serum estradiol ratios of Gn5/Gn1 (group A) and Gn8/Gn5 (group B), the higher the CCPR (P = 0.004; P = 0.001) and CLBR (P = 0.006; P = 0.002, respectively). In contrast, estrogen ratios of HCG/Gn5 and HCG/Gn8 in late follicle growth were similar across groups without reaching statistical significance.

Table 2 The level and the ratio of serum E2 for CCPR and CLBR

Parameters of different serum estradiol ratios

According to Table 3, data were classified into four groups based on the quartile of serum E2 ratio. As displayed in Table 3, although group A exhibited no significant differences in terms of female age, BMI, and infertility time, group B indicated significant differences (both P < 0.05). For group A, the higher the estrogen growth rate, the shorter the Gn time required and the lower the Gn dose used (both P < 0.001), whereas for group B, the opposite was true (both P < 0.005). However, when estrogen growth rate increased, groups A and B produced superior IVF-ET outcomes in terms of the number of embryos harvested, blastomeres, embryos, CCPR, CLBR, etc. (both P < 0.05). The ratio in group A increased as the ratio in group B decreased (P < 0.001). When the ratio in group B was larger, the corresponding ratio in group A was smaller. When the values of the two groups were higher, E2 levels on the hCG day were also higher (both P < 0.001).

Table 3 Parameters and outcome of IVF-ET as a function of serum E2 ratios

Serum estradiol ratio of different ages

As illustrated in Table 4, there were 252 patients in the < group with age below 35 years old and 83 in the other group. The number of eggs obtained and high-quality embryos were statistically significant with the different E2 ratios of Gn5/Gn1, Gn8/Gn5 and HCG/Gn1 in both groups with age < 35 and age ≥ 35 (both P < 0.001). For people with age < 35, the higher ratios of Gn8/Gn5 and HCG/Gn1 could achieve the higher CCPR and CLBR (both P < 0.05). In contrast, the difference was not statistically significant with the E2 ratio of Gn5/Gn1. For people with age ≥ 35, the CCPR and CLBR were higher with the E2 ratio of HCG/Gn1. However, the differences were insignificant with Gn5/Gn1 and Gn8/Gn5.

Table 4 Relationship between the ratio of serum E2 and the outcome according to age

Serum estradiol ratio of different infertility factors

Infertility factors included tubal factors (231, 68.9%), ovulation disorders (64, 19.1%), male factors (27, 8.1%), pelvic endometriosis (8, 2.4%), and other factors (5, 1.5%). Table 5 indicates that groups A and B did not correlate with varying infertility factors.

Table 5 Relationship between serum E2 increase and infertility factors

Estradiol levels and ratios of different ovarian responses

Estradiol levels and ratios of different ovarian responses were displayed in Table 6. We divided the population into three parts according to the number of oocytes retrieved, high ovarian responders (> 18 oocytes retrieved), low ovarian responders (< 6 oocytes retrieved), and normal ovarian responders groups. After analysis respectively, just the E2 ratio of Gn8/Gn5 had statistically significant. For the low ovarian response and normal ovarian response, the CLBR positive one had higher E2 ratio (P < 0.001; P = 0.017, respectively). In the high response group, the E2 ratio with CLBR positive was lower than that with CLBR negative (P = 0.004). However due to the small sample size post-treatment, this result requires further verification.

Table 6 The level and the ratios of serum E2 for different ovarian responses

Effects of estradiol ratios on IVF-ET outcome

As indicated in Table 7, logistic regression analysis revealed that both group A1 (OR = 0.376 [0.182–0.779]; P = 0.008*, OR = 0.401 [0.188–0.857]; P = 0.018*) and B1 (OR = 0.363 [0.179–0.735]; P = 0.005*, OR = 0.389 [0.187–0.808]; P = 0.011*) had opposite influences on CCPR and CLBR compared with groups A4 and B4, whereas groups A3 were weakly but significantly associated with CCPR.

Table 7 Binary logistic regression on CCPR and CLBR with the ratio of serum E2

Discussion

The role of E2 in IVF-ET is well known to the stage of the trigger day of hCG injection, and it indicates follicular maturation when estrogen level reaches 250 pg/mL; however, its role before that stage remains controversial. In this study, the serum estrogen levels of Gn1, Gn5, Gn8, and HCG were measured, and their ratio was calculated to evaluate ovarian response and predict treatment outcomes. In the statistical analysis, the estrogen levels of Gn5, Gn8, and HCG, as well as the ratios of Gn5/Gn1, Gn8/Gn5, and HCG/Gn1, all had clinical guiding significance, and the low growth level and ratio significantly reduced CCPR and CLBR. The increment coefficient of estrogen was observed at different stages of IVF-ET, and it was discovered that during Gn stimulation, the change in estrogen in the early and middle stages was also associated with pregnancy outcome, which may be linked to the follicle growth mode. During the early follicular stage, a group of antral follicles is recruited and induced to develop. The collected follicular fluid had low estrogen levels. At this point, estrogen growth may be linked to recruited follicles. As the follicle grows, follicular granulosa cells increase in number and exhibit aromatase activity [35]. Therefore, follicular fluid contains high levels of estrogen. At this stage, the increase in estrogen levels may be related to follicular quality.

Second, no study has explored the relationship between the ratio of E2 increase during Gn treatment and prognosis with an antagonist regimen. According to Table 2, we revealed that serum E2 ratios of groups A (Gn5/Gn1) and B (Gn8/Gn5) were statistically significant compared with pregnancy outcomes. Therefore, we chose these two indicators for further analysis and grouping according to the 25th, 50th, and 75th percentiles. The Chi-square test and constructing a binary logistic regression analysis model aimed at pregnancy outcomes revealed that patients with lower serum E2 ratios in groups A1 and B1 had lower CCPR and CLBR, and group B had more significance. It is suggested that, in clinical medication, estrogen levels can be observed after five days of Gn treatment, and medication can be adjusted when the increase in estrogen is not ideal to obtain satisfactory efficacy.

Third, we analyzed the factors influencing the E2 ratio during the Gn stimulation cycle. According to the statistical analysis, the increase in estrogen levels during the middle stage of Gn stimulation (Gn8/Gn5, group B) was associated with age, infertility years, and BMI but not with the increase in estrogen levels during the early stage of Gn stimulation (Gn5/Gn1, group A). These results imply that the basic characteristics of patients greatly affect the rate of estrogen increase during the Gn8/Gn5 stage and may be a key factor affecting follicle quality. In follicular growth, a higher ratio of estrogen increase can result in higher clinical pregnancies and live births, regardless of whether they are in groups A or B. However, the link between groups A and B was the opposite. The estrogen ratio in group B decreased as the ratio in group A increased. Estrogen growth was fastest in the early stage and slowest in the middle stage in the corresponding patients. An insufficient increase in estrogen levels in patients who recruit more follicles in the early stage could be due to an insufficient dose of Gn. When the association between group A and Gn usage was examined, it was discovered that the longer the Gn days and the higher the dose of Gn, the slower the early-stage estrogen increase. This may be related to the patient’s baseline conditions. Typically, patients with a high BMI, age, or more antral follicles receive a higher Gn initiation dosage, although a lower estrogen growth rate is generally obtained. In contrast, in the group B study, the longer the Gn days and the larger the required dosage of Gn, the faster estrogen growth in the middle stage. This may indicate the regularity of Gn dosage in follicular development process, and the dosage of Gn8/Gn5 is more critical at this stage, and it is also the time to increase Gn dosage in a clinical setting.

Fourth, the main purpose of monitoring estrogen in IVF-ET is to assess the availability of adequate quantity and quality of mature oocytes on the trigger day. This study revealed that estrogen ratios increased during early and middle ovulation induction (Gn5/Gn1, Gn8/Gn5), as well as estrogen levels on hCG day (HCG), were significant for IVF-ET outcome. However, the ratio of estrogen increase was not significantly different in the late stages of Gn-stimulated follicular growth (HCG/Gn5 and HCG/Gn8). Tan et al. [36] discovered no statistically significant differences in pregnancy rates among three groups of patients who respectively received hCG on the day of the leading follicle reaching 18 mm, on the second day, and the third day. At the moment of follicular maturation, estrogen concentration may be more important than estrogen growth ratio, necessitating a rethink of the role of estrogen in trigger day selection.

Predicting outcomes in ART may allow for earlier treatment strategy adjustment and protect patients from unnecessary physical and financial burdens throughout the treatment cycle. It is currently being monitored using recurrent transvaginal ultrasonography or serum E2. We considered that ultrasound could measure follicle growth, whereas serum E2 levels mainly reflected follicle function. As a result, estradiol plays a critical role despite its relatively low predictive value as a single factor. Additional parameters are required to identify more sensitive biochemical markers that may predict the probability of achieving clinical pregnancy before hCG administration. Our study demonstrated that accurate monitoring of E2 ratios is a key aspect that supports the prognosis of IVF-ET outcomes, dose adjustment, and cycle cancellation evaluation. This provides the clinic with a new, simple, and convenient predictive method. By observing and calculating the range of E2 increase ratio, we found that both Gn5/Gn1 and Gn8/Gn5 could predict IVF outcome success, but Gn8/Gn5 was more accurate. Maintaining a serum estradiol increase ratio above 6.44 on Gn5/Gn1 and 2.39 on Gn8/Gn5 may result in a higher pregnancy rate.

The main limitation of our study was related to the sample size. It was difficult to recover all the necessary data for calculating the estradiol increase ratios throughout the stimulation treatment. Therefore, we excluded patients with insufficient information and those who had achieved pregnancy but had not yet given birth. Furthermore, we need to evaluate whether using different stimulation gonadotropin doses will lead to a deviation in our results in a single center.