Introduction

The choice of a simple, safe, and cost-effective protocol in controlled ovarian hyperstimulation (COH) is crucial for improving the quality of care in assisted reproduction, through high pregnancy rates. The main parameters are the numbers of oocytes retrieved and good-quality embryos available for transfer; both of them are closely related to high live birth rate [1]. Still, there is an ongoing clinical uncertainty among fertility specialists in the optimal way for treating couples with a low-cost regimen and especially those at a relatively advanced reproductive age.

We know that the increasing luteinizing hormone (LH) in the follicular phase of the cycle leads to a number of key changes in oocytes and follicular cells, modifying the micro- and macro-environment of the ovary, and is linked to the normal oocytes’ maturation, the ovulation process, and subsequent fertilization and implantation [2,3,4]. Human chorionic gonadotropin (hCG) has been used as a substitute for LH, due to the high degree of similarity between the α-subunits of the two hormones, similarly to FSH, LH, and TSH. Both hCG and LH are heterodimeric glycoproteins with high cysteine ​​content and exert similar activity in inducing ovulation and supporting luteal cells [5]. In contrast, their differences at the molecular level, mainly due to the extra 24 amino acids in the β-chain, give hCG the ability of possessing additional glycosylation sites and thus a longer half-life, providing a more sustained ovarian stimulation [6], compared to LH: thus, hCG has a higher binding affinity for the receptor [inducing high proapoptotic activity in vitro] and mediates a more potent cAMP increase in human primary granulosa cells [3, 4, 7, 8].

Concerning the supplementation with r-LH, there is evidence on its positive impact on the outcome parameters in IVF own fresh cycles [9,10,11], causing a low intrafollicular concentration of vascular endothelial growth factor [12], in women having at least 2 previous failed IVF attempts [13], in poor responders and women over 35 years [14, 15], and in oocyte donors [16]. In contrast, there are reports with opposite conclusions [2, 17]. The current evidence through systematic reviews is rather conflicting: some have failed to confirm a positive impact in the general infertile population [18, 19], whereas others report significantly higher clinical pregnancy rates [20], especially in poor responders, in women of advanced reproductive age [21, 22], and in those with the previous hypo-response to exogenous FSH stimulation [23].

The rationale for adding daily HCG during COH in IVF is that, similar to LH, it improves the clinical outcome; notably, the earlier that HCG is added (i.e., at day 1), the more favorable the outcome will be. This starts with an optimal follicular development—a threshold level of 1.2 IU/L of LH is necessary to avoid premature luteinization and reduced COH response [6, 24]—due to increased interaction at the LH/hCG receptor level [25]. It was reported that higher serum hCG levels are correlated with lower progesterone and a higher number of follicles/oocytes and pregnancy rates [26, 27]. Day 6 hCG concentrations have been shown to predict the frequency of top-quality embryos, ongoing pregnancy, and live birth rates [28]. Concerning dosage, supplementation with hCG from the first day of COH at a range of 0–150 IU/day was reported as “compatible with good live birth rates” [29]. Similar findings have been reported when hCG is added during the early follicular phase [13, 30,31,32]. The full mechanism still remains unknown, but hCG seems to enhance the production of a different profile of cytokines and gene expression in both cumulus and endometrial cells, generating a better environment to produce healthier—perhaps fewer—oocytes compared with rLH, higher endometrial receptivity, and implantation potential [28]. Moreover, when added—or replace rFSH—during the late follicular phase, it yields similar results concerning IVF outcomes [33, 34]: the rationale is based on the higher expression of LH receptors during that phase; thus, rFSH can be replaced by lower doses of hCG, which are less expensive, and associated with lower incidence of ovarian hyperstimulation syndrome [OHSS]. Similarly, the meta-analysis by Checa et al. supports the use of hCG mainly in the late follicular phase as an efficient alternative to conventional modalities for COH as it was associated with improved clinical pregnancy rates and reduction in the administered rFSH doses [35].

In this context, we performed a RCT to test the efficacy and safety of adding daily low-dose hCG in a short protocol with GnRH agonist and ovarian stimulation with rFSH from the onset of the follicular phase and throughout the duration of stimulation in women of advanced reproductive age undergoing IVF/ICSI.

Materials and Methods

Study Population

The present study was designed as a prospective, multicenter, randomized, double-blind, placebo-controlled, two parallel groups of patients, phase IIIb clinical study to deal with the specific research question: to test the efficacy and safety of adding daily low-dose hCG in a short protocol with GnRH agonist and ovarian stimulation with recombinant FSH (rFSH) from the onset of the follicular phase and throughout the duration of stimulation towards the increase of outcome parameters.

Consent to use anonymous data for research purposes was obtained from all participants, and the research protocol was approved by the Bioethics Committee and the Scientific Council of the Hospital, prior to study initiation (EVD no 47/1/13-3-2017; 4/13-3-17) and the relevant committees (EED 46-17/23-6-2017; EOF IS 44-17/40345/2017). The protocol was registered in the relevant scientific databases (NCT03423527; study name, hCG-GR-001-2016; EudraCT Number 2016-005208-24).

Data was collected on 81 IVF/ICSI cycles, at the assisted reproduction units of the Third Department of Obstetrics and Gynecology; “Attikon” University Hospital, the First Department of Obstetrics and Gynecology; “Alexandra” University Hospital, Athens; and the Department of Obstetrics and Gynecology of University of Thessaly, Greece, and from July 2017 to April 2019. For the eligible population, conclusive data was available on demographics, medical/reproductive history for both partners. Inclusion criteria were age between 35 and 40 years, physiological menstrual cycles (24–35 days), normal endocrine function (normal PRL and TSH, FSH ≤ 15 IU / ml), transvaginal ultrasound without pathological findings, free personal medical history, indication for IVF/ICSI [36], and first or second IVF/ICSI cycle. Exclusion criteria included endocrine or metabolic disorders, e.g., PCO (S), pathology of the uterus and/or endometrium, basal FSH levels> 15 IU / ml, surgery in the ovaries, body mass index (BMI) ≥ 35 kg/ m2, and age <35 years and > 40 years old.

No other interventions were applied during cycles, apart from the routine protocols described in the regular practice of all IVF Units, which conform to international trends and European standards for appropriate medical management. The GnRH agonist short protocol was used to promote ovarian stimulation, which has been described in one of our previous studies [13]. All procedures, culture conditions and success rates, along with medical and paramedical personnel, were stable throughout the years of the more than 10 years function of all Units, all of which were under standard operating procedures. Embryo transfer took place 3 days after the oocyte retrieval; the maximum of embryos placed was two, according to the Hellenic legislation.

The study group (group 1 [HCG [+] group]) indicated the application of the protocol by adding hCG with the initiation of standard short GnRH agonist protocol for IVF/ICSI with 200 IU rFSH (BEMFOLA, Fertilland Pharma/Gedeon Richter Plc, Hungary), while the control group (group 2 [HCG [-] group]) indicated the application of the same protocol, adding placebo. The addition of hCG (Pregnyl®, MSD Greece) at a dose of 100 IU per day was administered subcutaneously and continued until the administration of hCG for follicular maturation/triggering; flacons were prepared centrally and distributed to all involved Units, by one study midwife, using 5000 IU commercial ampoules. As for placebo, an identical flacon with 100 IU containing N/S 0.9% per day, was injected.

A dynamic database was generated with Microsoft Excel and used for data storage and preprocessing. In total, 62 parameters were available, as categorically reported here: infertility duration and factor, medical and reproductive history, lifestyle information and demographics linked to fertility dynamics, pathology linked to infertility factors for both partners, age at cycle, parity, hormonal profile along with selected information on menstrual cycle characteristics for the female partner and ultrasonographic tests, such as antral follicle count [AFC], and sperm analysis for the male partner. Cycle characteristics of the previous ART cycle and their outcome, together with parameters of ovarian response to ovarian stimulation, were monitored. The full set of parameters is presented in the supplementary file [Supp. file].

Participants were randomly divided into two groups: group 1 and group 2. Randomization was done using sealed opaque envelopes marked as “group 1” and “group 2”. The randomization list was prepared centrally for the three participating Units [research centers]. Each patient’s assignment to a treatment group was made by calling the Unit’s midwife who had 2 envelopes with the above indications and who was not present when the couple was informed by two members of the research team.

Outcome Measures

The primary outcomes were the number of top-quality transferred embryos and clinical pregnancy rate. Secondary endpoints included the number of follicles [> 11,> 14,> 18 mm] and endometrial thickness on the day of triggering; the total number of retrieved and MII oocytes; the total number of total, top-rated, and frozen embryos; biochemical, ectopic, and multiple pregnancy; miscarriage; and OHSS rates. Definitions of the outcomes were according to the International Glossary on Infertility and Fertility Care [37].

Statistical Analysis and Power Calculation

As for the sample size calculation, using similar trials as references, we observed that the main differences of outcomes (not very profound in all studies) were achieved in the level of embryos, and especially their quality, leading to higher pregnancy rates [13, 17, 29, 30]. Thus, we calculated that samples of 50 participants from each arm would be required, to provide a significance of 0.05 and a power of 0.8 detecting a difference equal to 0.58*standard deviation in the top-quality embryos, including dropouts and missing data. So, the total number of participants required was 90. Unfortunately, the study was prematurely ended (while relative authorities were notified officially), as one of the main investigators moved to another IVF Unit and mainly because of the Covid-19 pandemic: IVF protocols in all IVF Units worldwide changed from fresh to frozen, so that it was impossible to recruit more patients during the last year.

The statistical analysis encompassed descriptive statistics, univariate, and multivariate analysis. Regarding descriptive statistics, variables were summarized separately by group/treatment arm; categorical variables were summarized as frequency (%), whereas median and interquartile range (IQR) were presented for continuous variables. Deviations from normality were evaluated with the Shapiro-Wilk test. For categorical variables, differences between the two treatment arms were evaluated with Pearson’s chi-squared test or Fisher’s exact test, as appropriate. Regarding continuous variables, differences were assessed with t test for independent samples in case of a normal distribution, whereas the Mann-Whitney-Wilcoxon test for independent samples was implemented for variables that were not distributed normally.

Univariate and multivariate logistic regression analysis was performed to evaluate differences in outcomes between the two groups. The outcomes were set as dependent variables; at the multivariate models, the effects of group (group 1 as reference category) were adjusted for age (≥38 vs. <38 years), BMI (≥25 vs. <25 kg/m2), baseline AMH (≥1 vs. <1 ng/mL), and AFC (>4 vs. ≤4). Odds ratios (ORs) and their 95% confidence intervals (95% CI) were estimated.

Outcomes were categorized a priori as detailed below, to be converted to binary dependent variables in the logistic regression models. Clinical pregnancy (yes vs. no) and excellent quality embryos on day 3 (≥2 vs <2) were the primary outcomes. Concerning secondary outcomes, the ≥median vs <median cutoff was adopted for the number of follicles >11mm, >14mm, and >18 mm at triggering, E2 and P4 at triggering, endometrial thickness at triggering, number of MII oocytes, ET, and frozen embryos; the ≥2 vs <2 cutoff was utilized for the number of oocytes, moderate-quality embryos, and poor-quality embryos on day 3; the yes vs. no classification was inherently utilized to treat biochemical pregnancy, cycle cancellation, and ovarian hyperstimulation syndrome as dependent variables.

The statistician was blinded to the treatment arm corresponding to each treatment arm/group coding in the dataset. The level of significance was set at 0.05. Statistical analysis was performed with STATA/SE version 13 statistical software (Stata Corp. College Station, TX, USA).

Results

An ensemble of 62 parameters and variables was available for 81 IVF/ICSI cycles, sourcing from a total of 81 infertile couples, with overall successful clinical outcome with clinical pregnancy in 21.6% of the cycles included.

Descriptive statistics for the two treatment arms are presented in Table 1. The median age was equal to 38 years in both arms. No significant differences were noted regarding sociodemographic and lifestyle features, including BMI, previous parity, smoking, and alcohol consumption. Unexplained infertility (n=16, 40%, and n=13, 31.7%), as well as male factor infertility (n=16, 40%, and n=17, 41.5%) was the most common underlying cause in both groups. No significant differences were documented in reproductive and clinical history variables, including chronic disease, age at menarche, duration of menstruation, type of infertility, duration of infertility, and the examined cycle being the first IVF/ICSI cycle between the two treatment groups.

Table 1 Descriptive statistics in the two study arms

Similarly, no significant differences were documented in baseline heart rate, systolic blood pressure, baseline FSH, TSH, LH, E2, AMH, PRL, and AFC, or features of the male partner, including BMI, smoking, alcohol consumption, chronic disease, and proven fertility, between the two treatment arms.

At the univariate analysis examining differences in primary and secondary outcomes between the two groups, no significant differences were noted either for primary or secondary outcomes (Table 2). Specifically, regarding primary outcomes, the clinical pregnancy rate was equal to 25% (10/40) and 24.4% (10/41), p=0.949, whereas the proportion of two or more excellent quality embryos on day 3 was 52.5% (21/40) and 34.2% (14/41) in groups 1 and 2, respectively, p=0.095.

Table 2 Primary and secondary outcomes in the two study arms

The results of the multivariate logistic regression analysis comparing the odds of the primary and secondary outcomes between the two groups are detailed in Table 3. No significant associations were noted for primary outcomes (clinical pregnancy: adjusted OR=0.89, 95% CI: 0.29–2.75; two or more excellent quality embryos on day 3: adjusted OR=0.54, 95% CI: 0.21–1.42, with group 1 set as reference category). With respect to secondary outcomes, no differences were noted, except for increased odds of two or more poor-quality embryos on day 3 occurring in group 2 (adjusted OR= 11.69, 95%CI: 1.29–106.19).

Table 3 Univariate and multivariate logistic regression analysis examining outcomes in the two study arms. Group 1 was set as the reference category

Discussion

We performed a prospective, multicenter, randomized, phase IIIb clinical study to test the efficacy and safety of adding 100IU hCG/day in a short IVF protocol with GnRH agonist and rFSH from the onset of the follicular phase and throughout the duration of stimulation. The two groups demonstrated similar demographic baseline characteristics. The analysis showed similar rates of the number of top quality at day 3 and clinical pregnancy in the two groups studied. The same results were observed in the secondary endpoints of this study, including numbers of total, top-rated, and frozen embryos; biochemical, ectopic, and multiple pregnancy; miscarriage; and OHSS rates.

The rationale of daily hCG supplementation is based on its positive effects on the follicular microenvironment, endometrial receptivity, and the immune system response during COH [38]. Notably, this effect is dose dependent, as larger doses could result in an androgenic environment; a daily dose of 100–150 IU was proposed as optimal [29, 39], constituting the testing dose in our trial.

The comparison with the current literature has resulted in conflicting results.

On one hand, high serum hCG levels along with the addition of hCG to rFSH have been correlated with a high number of follicles/oocytes and pregnancy rates, irrespective of the COH protocol used [26, 27, 40]. In a retrospective study, the addition of hCG led to favorable IVF outcomes in women above 40 years of age [41]. Similarly, in a dose-response RCT, increasing doses of hCG supplementation stimulated the intrafollicular concentration of both estradiol and androgens, with a shift toward a more androgenic milieu which was apparent in large follicles with oocytes giving rise to good-quality embryos [39].

In contrast, our study did not reveal any statistically significant difference and thus clinical benefit through the addition of daily hCG, in terms of the number of top-quality embryos and clinical pregnancy. Notably, the difference demonstrated in the number of top-quality embryos available for transfer and the poor-quality embryos at day 3, between the two groups (52.5% vs 34.2% and 2.5% vs 17.7%, respectively), did not manage to reach statistical significance. A different, compared to our study, practice of hCG administration has been investigated in a Cochrane review (33); authors aimed to assess the effectiveness and safety of low-dose hCG when replacing FSH for COH: that did not reduce the chance of ongoing and clinical pregnancy and resulted in an equivalent number of oocytes retrieved, while the consumption of FSH was lower. In a similar study carried out later [42], there were no significant differences between pregnancy rates in the two study groups. In a recent RCT, the administration of low-dose hCG daily from the early follicular phase in PCOS patients through a progesterone protocol did not manage to improve the number of retrieved and mature oocytes and good-quality embryos, along with implantation and clinical pregnancy rates [43].

Concerning the secondary outcomes preset for this RCT, we found no significant differences between the two groups. For example, the level of E2 at triggering was higher in the hCG group (1770 vs. 1100 pg/ml), indicating better cytoplasmic maturation and probably better expected quality of oocytes/embryos [44], but not significantly. Similar results have been reported in fertilization, implantation, and ongoing pregnancy rates [40, 41] and when HCG was administered in the late follicular phase [45]. Finally, we observed a nonsignificant increase of the incidence of OHSS in the hCG group (7.5% vs. 2.4%); all cases concerned mild/moderate OHSS not requiring hospitalization; all of them were canceled, as no HCG triggering was applied, except from one case that ended in clinical pregnancy.

Importantly, two meta-analyses ended up with conflicting conclusions [34, 35]: reasons for that were: the significant diversity of the inclusion and exclusion criteria of the included studies, the heterogeneity of the reported outcomes, and the methodological assessment of the outcomes leading to different interpretations of the results.

Among the strengths of our trial, we have to point out the properly taken steps from the conception of the protocol up to the publication of the results, in an effort to minimize all domains of biases. Limitations include the small cohort size, which was based on the power calculation on the number of top-quality embryos at day 3 and not on live birth. Others include the use of the specific COH protocol and the small dose of hCG (100 IU/day) used.

The co-administration of daily hCG during COH for IVF seems a promising alternative to conventional modalities, mainly related to the enhancement of LH activity. The results of this RCT showed that the addition of 100 IU of hCG/day from the start of stimulation with rFSH in a GnRH agonist short protocol does not seem to increase significantly the rate of top-quality embryos and clinical outcomes. As yet, the relevant data in the literature is conflicting, necessitating more studies to clarify the role, pathways of action, and the effectiveness of hCG and other synthetic gonadotropins—with individual isoform composition and receptor binding properties—in COH for IVF/ICSI. Also, other infertile populations, womens’ age groups, protocols, and doses should also be tested to prove or disprove the value of daily hCG administration.