Background

Controlled ovarian hyperstimulation (COH) is the key step in in vitro fertilization and embryo transfer (IVF-ET). High ovarian response (HOR) refers to the abnormal sensitivity of the ovary to gonadotropin, which leads to simultaneous development of multiple follicles and increases the risk of ovarian hyperstimulation syndrome (OHSS) [1]. Supraphysiological steroid hormones during COH affect endometrial receptivity by changing the endometrial immune environment and gene expression [2,3,4], resulting in poor pregnancy outcomes [5]. Adopting a freeze-all strategy in HOR patients can greatly reduce the risk of OHSS and avoid the influence of COH on endometrial receptivity [6]. An increasing number of studies have confirmed that frozen embryo transfer (FET) has better pregnancy and perinatal outcomes than fresh embryo transfer [7,8,9].

However, the best time to perform FET following COH in HOR patients is controversial in clinical work. Postponement of FET may increase the anxiety of patients [10]; in the immediate FET cycle, poor endometrial receptivity or physical condition may not be fully recovered to the pre-stimulation state, which may affect pregnancy outcomes [11]. It is unclear whether the detrimental effects on endometrial receptivity caused by COH would be sustained over a long period of time, up to the subsequent menstrual cycle, especially in patients with HOR who are most affected by COH. Moreover, the use of different gonadotropin-releasing hormone (GnRH) analogues in the process of COH act on the pituitary in different ways [12], and it is controversial whether the timing of FET affects pregnancy outcomes in different COH protocols [11, 13].

Thus, this study aimed to investigate whether the FET timing affects pregnancy outcomes in HOR patients undergoing freeze-all cycles, whether different COH protocols affect pregnancy outcomes in the subsequent FET cycle, and to provide reference results for the HOR population to choose the optimal time to start FET.

Materials and methods

Study population and design

We conducted a retrospective cohort study including all patients from January 2015 to March 2019 at our reproductive medicine center and the study was conducted in accordance with ethical standards (2020PS006F); informed consent was obtained from all subjects. The inclusion criteria were as follows: (1) patients on their first IVF or intracytoplasmic sperm injection (ICSI) cycle who were diagnosed with HOR and adopted a freeze-all strategy. The diagnostic criteria for HOR was > 5000 pg/ml of estradiol on human chorionic gonadotropin (HCG) day or more than 15 oocytes retrieved [14, 15]; (2) three different selected GnRH analogues stimulation protocols, including short-acting GnRH agonist (GnRH-a) long protocol, long-acting GnRH-a long protocol, and GnRH antagonist (GnRH-ant) protocol; (3) women 20–45 years old; and (4) hormone replacement therapy (HRT) for endometrial preparation in the FET cycle. The exclusion criteria were as follows: (1) presence of uterine abnormalities; (2) patients with endometriosis and adenomyosis; (3) presence of autoimmune, endocrine, and metabolic diseases; (4) previous diagnosis of uterine adhesion; (5) patients with chromosomal abnormalities; (6) patients who underwent blastocyst biopsy for pre-implantation genetic testing; (7) patients using frozen donor semen; (8) patients using long-acting GnRH-a as pretreatment for FET after the freeze-all cycle; (9) patients with a natural cycle for endometrial preparation for FET; (10) patients with no embryo formation for FET; and (11) patients with ectopic pregnancies.

Patients were divided into the immediate FET group and the delayed FET group, which were defined as FET that took place within the first menstrual cycle following oocyte retrieval or afterwards.

Ovarian stimulation protocol, endometrial preparation protocol, and luteal support

According to age, anti-Müllerian hormone, body mass index (BMI), number of antral follicles in bilateral ovaries, and prior response to stimulation, we can predict the HOR population and determine the initial dose of gonadotropin to prevent the occurrence of OHSS [16, 17]. All patients were treated with the following three COH protocols: the short-acting GnRH-a long protocol involved daily subcutaneous injection of 0.05 mg of short-acting GnRH-a triptorelin (Diphereline, 0.1 mg, IPSEN, France) at the middle luteal phase of the menstrual cycle as pituitary down-regulation for 14 days, and gonadotropin introduction at the subsequent menstruation; the long-acting GnRH-a long protocol involved a single-dose intramuscular injection of a quarter to a full dose (0.75–3.75 mg) of long-acting GnRH-a (Diphereline, 3.75 mg, IPSEN, France) on the second day of menstruation, with gonadotropin given 20–30 days later when the follicle diameter reached 3–5 mm; the flexible GnRH-ant protocol involved starting gonadotropin on the second day of menstruation, and GnRH antagonist (Cetrotide, Merck Serono, France) was added when the lead follicle reached 13–14 mm in diameter or when the estradiol was > 300 pg/ml. Follicle development was detected by transvaginal ultrasonography, and the dosage of gonadotropin was adjusted according to the different ovarian responses.

When the follicles reached a mean diameter of > 17 mm, final oocyte maturation was triggered. HCG or triptorelin was used alone or in combination in the GnRH-ant protocol. Only HCG was used for the trigger in the GnRH-a long protocol. Oocyte retrieval was performed 36 h after triggering by transvaginal ultrasound-guided aspiration. Cleavage stage embryo quality was evaluated at day three based on the Istanbul consensus to confirm extended embryo culture, and the day three embryos with a cell number of ≥7 and fragmentation of < 20% were graded as good quality [18]. Blastocyst morphology was evaluated in the morning of days five and six according to the Gardner criteria [19], and only blastocysts scoring 4BB or higher were graded as good quality. We adopted a freeze-all strategy for HOR patients to avoid the occurrence of OHSS. The patients were informed that they could chose the menstrual cycle following oocyte retrieval for FET all by themselves.

Hormone replacement therapy (HRT) was used for endometrial preparation in the FET cycle, with 4–8 mg of estradiol valerate (Progynova, Bayer, Germany) taken orally for at least ten days from the 3rd to 5th day of menstruation to promote the growth of endometrium. Ultrasonic examination should be completed before medication, when the thickness of the endometrium is < 6 mm and the drug can be used, otherwise the FET in this cycle will be cancelled. Vitrification and warming procedures were performed using Embryo Vitrification/Thawing Media (KITAZATO) according to the manufacturer’s instructions. Cleavage-stage embryos were warmed on the day before transfer, cultured for approximately 24 h, and then transferred. The blastocysts were warmed on the day of transfer, kept in culture for approximately one hour, and then transferred.

The luteal phase was supported by 90 mg per day of vaginal progesterone gel (Crinone, Fleet Laboratories Ltd., UK) administered vaginally, and estradiol was maintained at the original dose. Luteal support was continued until 11 weeks of gestational age.

Statistical analysis

The primary outcome of our study was live birth rate (LBR). Secondary endpoints were clinical pregnancy rate (CPR) and spontaneous abortion rate (SAR). Clinical pregnancy was defined as the detection of a gestational sac through ultrasound imaging at seven weeks of gestational age [20]. In China, spontaneous abortion was defined as the loss of pregnancy spontaneously after clinical pregnancy and before 28 weeks of gestational age, and live birth was defined as the survival delivery after 28 weeks of gestational age.

As an observational study, multiple maternal and IVF characteristics were considered as potential risk factors that could moderate pregnancy outcomes, and the potential risk factors between the immediate and delayed FET groups were unbalanced distribution (Table 1). Thus, we used propensity score matching (PSM) to make the potential risk factors between the immediate and delayed FET groups balanced and comparable. We used 1:1 nearest-neighbor matching without replacement to compare the variables and tried the match tolerance value from 1 to 0 until P values of the variable between the two groups were 1.000. χ2 test was performed for comparison of the categorical variables of the immediate and delayed FET groups (Table 1).

Table 1 Baseline characteristics of immediate and delayed FET groups in the entire and propensity score-matched cohorts

As the effect of FET timing on different COH protocols were controversial, subgroup analysis was performed. Multivariable logistic regression models were calculated on each COH cohort, with the timing of FET as the main exposure of interest. Potential risk factors entered into the multivariable regression model were those that showed clinical relevance or showed a univariate relationship with pregnancy outcomes. The included variables were carefully selected based on the number of events available to ensure the stability of the regression equation. Adjusted odds ratio (OR) and their 95% confidence interval (CI) were calculated to analyze the independent effect of immediate and delayed FET on the pregnancy outcomes.

Statistical significance was set at P < 0.05. All statistical analyses were performed using IBM SPSS 26.0 (IBM Corp., Armonk, NY, USA).

Results

Potential risk factors between immediate and delayed FET groups in the entire and propensity score-matched cohort

A total of 2128 HOR patients adopting a freeze-all strategy underwent their first IVF/ICSI cycle (Fig. 1). The immediate and delayed FET groups consisted of 1130 and 998 patients, respectively. Patients’ and IVF characteristics in the immediate and delayed FET groups, which were the potential risk factors, are presented in Table 1. Before matching, the distribution of these risk factors were not absolutely balanced. The distribution of the COH protocol, number of embryos transferred, embryo stage, and multiple pregnancies were significantly different between the two groups (P < 0.05). No significant differences were found in maternal age, body mass index (BMI), insemination method, top or good quality embryo transfer, and infertility causes (P > 0.05). We obtained 1366 patients by PSM, and all potential risk factors and pregnancy outcomes of the multiple pregnancies were balanced and comparable (Table 1).

Fig. 1
figure 1

Flow chat showing the selection of the study polulation

Multivariable logistic regression analysis on FET timing and potential risk factors for pregnancy outcomes in the entire before-matching cohort

Multivariable logistic regression analysis on the entire before-matching cohort demonstrated no statistical differences on pregnancy outcomes between the immediate and delayed FET groups [CPR, adjusted odd ratio (OR), 0.939, 95% confidence interval (CI), 0.781–1.129; SAR, adjusted OR, 0.831, 95% Cl, 0.628–1.098; LBR, adjusted OR, 1.056, 95% Cl, 0.883–1.263] (P > 0.05), which were adjusted for maternal age, BMI, COH protocol, insemination method, number of embryos transferred, embryo stage, trigger type, top or good quality embryo transfer, and cause of infertility (Table 2).

Table 2 Multivariable logistic regression analysis on potential risk factors for pregnancy outcomes in the entire cohort

Immediate versus delayed FET cycles on pregnancy outcomes in the entire cohort and propensity score-matched cohort

The CPR (entire cohort, 61.0% versus 63.4%; PSM cohort, 60.5% versus 63.5%), SAR (entire cohort, 12.6% versus 10.1%; PSM cohort, 11.6% versus 12.3%), and LBR (entire cohort, 49.9% versus 49.2%; PSM cohort, 48.0% versus 49.3%) had no significant differences between the immediate and delayed FET groups in the entire cohort and PSM cohort (P > 0.05) (Table 3).

Table 3 Pregnancy outcomes of immediate FETa and delayed FETb groups in the entire cohort and propensity score-matched cohort

Differences between the groups were evaluated by χ2 test.

Subgroup analysis on immediate versus delayed FET on pregnancy outcomes in different COH protocol cohorts

To investigate the effect of FET timing on pregnancy outcomes in different COH protocols, multivariable logistic regression were performed on each COH protocol cohort, including 918 patients in the antagonist protocol, 1000 patients in the short-acting GnRH-a long protocol, and 210 patients in the long-acting GnRH-a long protocol. Multivariable logistic regression analysis demonstrated no statistical differences on pregnancy outcomes between the immediate and delayed FET groups in the short acting and long acting GnRH-a long protocols (P > 0.05). However, the SAR of the immediate FET group was lower than that of the delayed FET group in the GnRH-ant protocol (adjusted for maternal age, BMI, trigger type, insemination method, embryo stage, number of embryos transferred, top or good quality embryo transfer, and cause of infertility) (adjusted OR, 0.645, 95% CI, 0.430–0.966) (P < 0.05). No significant differences were found on CPRs and LBRs in the GnRH-ant protocol (P > 0.05) (Tables 4, 5, 6).

Table 4 Multivariable logistic regression analysis on potential risk factors for pregnancy outcomes in GnRH-ant protocol (n = 918)
Table 5 Multivariable logistic regression analysis on risk factors for pregnancy outcomes in short-acting GnRH-a long protocol (n = 1000)
Table 6 Multivariable logistic regression analysis on risk factors for pregnancy outcomes in long-acting GnRH-a long protocol

Discussion

In clinical work, the timing of when to start the FET cycle after COH is controversial, especially for patients with HOR who are most affected by COH. It is known that in the COH process, increased levels of supraphysiological steroid hormones and premature progesterone affects the gene expression and immune environment of the endometrium, which alters the embryo-endometrium asynchrony and negatively affects endometrial receptivity, reducing the CPR and LBR [5, 21]. Moreover, the secretory activity factors produced by the residual luteal cysts derived from COH last longer after oocyte retrieval in HOR patients. However, no study has investigated the specific duration of these adverse effects. To avoid this concern, some clinicians recommend the conservative scheme which is to start the FET cycle at the second or third withdrawal bleeding after oocyte retrieval. This prolongs the IVF treatment period and increase the anxiety of patients who had experienced infertility for many years, increasing their mental and economic loses and affecting their pregnancy outcomes [10, 22]. Lattes et al. reported that there were no differences on the first FET and subsequent FET cycles after oocyte retrieval [23]. A retrospective study by Huang et al. demonstrated that immediate FET was associated with a higher live birth than delayed FET [24], and meta-analysis indicated that immediate FET was not associated with negative pregnancy outcomes [25]. In our retrospective study, we only selected the HOR population who adopted HRT as the endometrial preparation protocol in the past five years and found no significant differences for CPR, SAR, and LBR between the immediate and delayed FET groups. We believe it is not accurate to assume that COH will still affect endometrial receptivity in the first withdrawal bleeding cycle after oocyte retrieval and that it is not necessary for the HOR population to wait for several menstrual cycles to begin FET after the freeze-all strategy.

In the process of COH, different GnRH analogues have different degrees and properties of inhibition effects on hypothalamic-pituitary-ovarian axis, and they also have different effects on the corpus luteum [26], which might have an impact on endometrial receptivity and pregnancy outcomes [27]. A retrospective study showed that immediate FET had similar CPR to delayed FET in patients with GnRH-ant protocol [13], which is in agreement with our subgroup results. However, our results are contrary to a population-based study on the short-acting GnRH-a long protocol, which found that delayed FET was better for pregnancy outcomes. They believed that the initial flare up effect of short acting GnRH-a during the down-regulation period caused an early rise of progesterone, which affected the outcomes in the immediate FET cycle [11]. However, a limitation of that study was the small sample size of 67 patients in the immediate FET group and 62 in the delayed FET group. In this study, 1000 patients with short acting GnRH-a long protocol were studied (434 in immediate FET group and 566 in delayed FET group), and we found that the timing of FET did not affect pregnancy outcomes in the short-acting GnRH-a long protocol. However, in the GnRH-ant protocol, we found that the SAR in the delayed FET group was significantly higher than that in the immediate FET group. Among all the COH protocols, the GnRH-ant protocol has the shortest treatment period, where oocyte retrieval takes place after a mean of 8–10 days of ovarian stimulation; the short-acting GnRH-a long protocol requires 14 days of down-regulation on that basis; while the long-acting GnRH-a long protocol needs a down-regulation of more than 20 days. Therefore, the immediate FET cycle in the GnRH-ant protocol can result in the shortest treatment period. Psychological factors can lead to infertility and spontaneous abortion, and its potential impact on neuroendocrine and immune changes could affect early pregnancy risk [28, 29]. We considered that longer treatment periods could result in increased patient anxiety, causing the increase in the SAR. It is worth mentioning that one study has shown that residual luteal cysts may increase the expression of relaxin in circulation [30], which is related to endometrial angiogenesis and prevents recurrent abortion [31]. Therefore, the effects of residual luteal cysts in immediate FET cycles that we had previously worried about may be beneficial to endometrial receptivity and pregnancy outcomes.

The limitation of this study lies in the retrospective nature as well as the possibility of unmeasured confounding factors such as smoking habits and alcohol consumption. Although we have obtained many cases with HOR and made the immediate and delayed FET groups comparable by PSM, we have lost 762 cases without successful matching in this process. In addition, we do not know whether these cases will affect the actual situation. Notably, the pregnancy outcomes we have studied were not the only endpoint, as other obstetric outcomes and neonatal outcomes should also be recorded. Moreover, the self-selection of patients into immediate or delayed FET remains a major limitation of the study. We believe that further prospective, randomized, and controlled studies are needed to confirm these results.

In summary, this study indicated that FET timing might not affect pregnancy outcomes in high responder patients undergoing freeze-all cycles. Immediate FET might be associated with lower odds of spontaneous abortion.

Conclusion

The pregnancy outcomes of immediate FET were no different to delayed FET in high responder patients undergoing freeze-all cycles.