Introduction

A diagnosis of unexplained infertility is considered in couples who have tried to conceive for at least 1 year without success, despite clinical evidence of ovulation, tubal patency, and normal semen parameters [1]. Unexplained infertility accounts for approximately 30% of infertility in couples seeking in vitro fertilization (IVF) [2], though up to 50% with a diagnosis of unexplained infertility may conceive spontaneously after initial assessment [3].

Unexplained infertility comprises a heterogeneous group of conditions [4, 5], including the gradual decrease in fecundity with advancing age (age-related infertility) [6]. In women aged 40 years or more, the cumulative live birth rate after IVF is reportedly around 7% [7]. Given the higher likelihood of unsuccessful IVF outcome, immediate IVF is recommended, particularly in women over 40 years [8, 9] in which there is a rapid decline in oocyte quantity and quality [10]. But how best to treat unexplained infertility in older women poses several challenges. A potential overuse of infertility treatments carries unnecessary health risks for the patient and incurs wastage of resources [11,12,13]. Whether reproductive aged women with unexplained infertility might benefit from immediate IVF treatment or from delayed treatment in the hope of achieving spontaneous conception remains an open question [14].

Previous studies that indirectly sought to address this issue reported higher pregnancy rates in older women with unexplained infertility after immediate IVF than after intra-uterine insemination (IUI) [15]; nonetheless, a therapeutic space for IUI remains [16]. A recent meta-analysis found higher pregnancy rates after immediate IVF, although the quality of evidence was moderate and based on outcomes in 725 couples with a poor prognosis of natural conception [17].

Couples are increasingly concerned about access to IVF treatment since the outbreak of the COVID-19 pandemic [18], which prompted discontinuation of treatment due to limited knowledge about the potential effects of SARS-CoV-2 infection on pregnancy [19,20,21,22]. A silent cause of distress is the uncertainty surrounding the possible consequences of prolonged waiting for treatment outcomes [23, 24]. Just how long a couple with unexplained infertility and advanced woman’s age can wait before starting IVF treatment is unknown. Some experts claim to wait a maximum of 6 months before accessing IVF treatment, but there is no strong evidence on how much further waiting can harm the couple in terms of IVF outcomes. As in other countries, couples entitled to assisted reproductive technology (ART) treatment covered by the Italian public health care system are put on a waiting list. The alternative option is to pay for treatment out of pocket. With the above points in mind, we compared live birth rates after delayed management versus immediate IVF treatment in couples with unexplained infertility and women aged 39 or older. To do this, we retrospectively retrieved data from a cohort of women who had waited 1 year before starting IVF treatment and from a cohort that had undergone immediate self-paid IVF treatment.

Methods

Patient Selection and Follow-up

The study was performed in accordance with the Helsinki Declaration and with approval of the City of Health and Science Institutional Review Board (n. 0,040,486). The informed consent of the patients to the treatment and data collection was obtained according to the indications of the City of Health and Science Institutional Review Board.

In this retrospective analysis, we included a cohort of women aged 39–43 years seeking infertility diagnosis and treatments, who were admitted to the IVF unit of the S. Anna academic hospital and to its private appendix between 2010 and 2019. Couples who enter a waiting list for publicly funded IVF and those receiving immediate self-paid treatment were compared. In both cases, the same diagnostic and therapeutic protocols were applied.

Only couples with unexplained infertility were selected. In detail, all women included in the study had ovulatory cycles and patency of at least one fallopian tube at hysterosonosalpingography (HSSG), whereas all male partners had normal basic semen parameters according to the indications of WHO 2010. Exclusion criteria were any known cause of female infertility (i.e. previous history of pelvic inflammatory disease, positive anti-Chlamydia IgG, endometriosis, anovulation, etc.), early follicular phase follicle stimulating hormone (FSH) > 20 UI/l and/or anti-Mullerian hormone (AMH) < 0.1 ng/ml and female body mass index (BMI) > 32 kg/m2. A previous diagnosis of unexplained infertility at other IVF centres was not considered complete if these criteria were not met.

Couples were considered as part of the ‘waiting’ group if they abided by national protocol to wait for about 1 year after diagnosis, before IVF treatment. Couples were considered as part of the ‘immediate’ group if they opted to pay for access to immediate private IVF. We will also refer to these groups as ‘strategies’. ‘IVF cycle’ was defined as the completion of all transfers deriving from the same ovarian stimulation and oocyte retrieval. Follow-up for the waiting group was up to their first IVF cycle. Follow-up for the immediate group was 1 year after their last embryo transfer from their first IVF cycle (‘last embryo transfer’ defines the last transfer of cryopreserved embryos obtained from the same IVF cycle). In the immediate strategy, some couples went through multiple cycles in the follow-up period. Only the first IVF cycle was analysed in order to allow follow-up for both groups to be similar. Figure 1 shows the timeline of 12 months plus 1 IVF cycle in total for both groups.

Fig. 1
figure 1

Timeline of 12 months plus one cycle of IVF in the two groups (A). In (B), we show the two analyses for 473 couples in the immediate group who have received more than one IVF cycle: we either follow them between and 474 after unsuccessful IVF (primary analysis) or censor them after their 1st cycle (sensitivity)

IVF Procedure and Clinical Outcomes

Controlled ovarian stimulation (COS) was performed either with recombinant FSH (rFSH), human

Menopausal Gonadotropin (HP-hMG) or rFSH plus recombinant luteinizing hormone (rLH), under

pituitary suppression. The choice of the starting gonadotropin dose was based on age, BMI, antral

follicular count (AFC), AMH circulating concentrations, as well as on the response to previous COS. In the absence of any pre-fixed criteria, the COS regimen (type of protocol and type of medication) was decided and prescribed by different physicians of the IVF unit, according to their own clinical experience, as per real-life clinical practice.

Both long (Gn-RH agonist for pituitary suppression) and short (Gn-RH antagonist for pituitary suppression) protocols were used. COS was monitored by serial transvaginal US and serum estradiol (E2) measurements performed every second day from stimulation days 6–7. COS continued until at least two follicles reached 18 mm in mean diameter, when ovulation was triggered by injecting subcutaneously either 10,000 international units (IU) of hCG (Gonasi HP, IBSA, Lugano, Switzerland) or 250 mcg of rHCG (Ovitrelle, Merck, Darmstadt, Germany). US-guided transvaginal oocyte pick-up (OPU) was performed approximately 36–37 h after hCG administration, under local anesthesia (paracervical block). Conventional IVF or ICSI was performed on all available oocytes within 4 h after oocyte collection or after 2 h from cumulus cell removal, respectively. Normal fertilization was assessed for the presence of two pronuclei (2PN) and the extrusion of the second polar body after 16–18 h post-insemination.

All cleaved embryos were morphologically evaluated under a conventional stereomicroscope, using the IMCS score by Holte [25]. According to the policy of our IVF unit during the time period under study, embryo transfer (ET) was performed either on day 2 or day 3 based on the presence of 1–3 embryos or ≥ 4 embryos, respectively. Embryos were selected and transferred in uterus using a soft catheter, under ultrasound guidance, applying the method previously published by our group [26]. The remaining embryos (if any) were kept in culture until day 6 of development for cryopreservation at the blastocyst stage or discharge. The luteal phase was supported by administering 180 mg/day natural progesterone (Crinone 8, Merck, Darmstadt, Germany) for 15 days. Pregnancy was assessed by serum hCG assay after 14 days from embryo transfer and then confirmed if at least one gestational sac was visualized on transvaginal US after 2 further weeks.

The primary outcome was the cumulative live birth rate (cLBR). The cLBR was defined as live deliveries (at least one live birth) per women over the full follow-up period. The full follow-up period was defined as either a period of expectant management up to the first IVF/ICSI cycle (including both fresh and frozen/thawed embryo transfers) or 1 year after the last transfer (fresh or frozen/thawed) of the first IVF/ICSI cycle. Live birth was defined as the delivery of a live-born infant (> 24 weeks of gestation).

Analyses

We tabulated the baseline patient characteristics for the two strategies separately.

As a crude analysis, we simply calculated the proportion of live births in both groups and compared these using a two-sample Z test for equality of proportions. We then calculated the crude odds ratio for immediate IVF versus waiting using a logistic regression model.

Next, we adjusted for confounding characteristics in the logistic regression model. In our study design, the only difference between strategies was the timing of IVF. However, as there was no random allocation to the waiting and immediate strategy, these groups could still differ in terms of prognostic characteristics associated with opting for waiting or immediate IVF. We had to adjust for these confounding characteristics that explained both the chance of live birth and the choice between waiting or immediate IVF. We identified the following characteristics a priori as confounders: female age, duration of infertility, primary or secondary infertility, previous miscarriage, previous IUI/IVF treatment, AMH, AFC and previous induced abortion.

Sensitivity Analyses

We also considered a more technical, complicated statistical analysis: the idea is to emulate a randomized controlled trial [27]. Rather than a simple adjustment in the logistic regression model, which is not strictly the same as a randomized trial, we conducted inverse probability weighting that weights some couples more than others to obtain a balance between groups [28]. We regressed the previously identified patient characteristics (female age, duration of infertility, primary or secondary infertility, previous miscarriage, previous IUI/IVF treatment, AMH, AFC and previous induced abortion) on the strategy groups i.e. ‘immediate group (a ‘1’) versus the waiting group (a ‘0’)’ using a logistic regression model. This model was used to predict the probability of receiving immediate IVF or waiting given a couples’ individual characteristics. For couples who received immediate IVF, we divided the proportion of couples who received immediate IVF by a couple’s individual probability of choosing immediate IVF [28]. For couples who waited, we divided the proportion of couples who waited by a couple’s individual probability of waiting. The result was an individual weight value. This weight value counts certain couples more often than others in order to balance the cohort in terms of the mentioned baseline characteristics. This process was repeated until balance was found, defined as standardized mean differences below 0.1.

The standard errors obtained from standard software are incorrect as they ignore the weighting procedure [28]. To obtain valid standard errors, we resampled 5000 times, repeated the procedure, calculated the statistic and reported their 2.5th and 97.5th percentile as the bootstrap results.

After we derived the weights, we first checked with a new tabulation whether the patient characteristics were similar between weighted groups, then repeated the previously conducted analyses in the weighted data: we calculated the proportion of live births in both weighted groups and compared these using a two-sample Z test for equality of proportions. In addition, we calculated the odds ratio for immediate IVF versus waiting using a logistic regression model including the weights.

Couples who opted to pay for immediate IVF could receive multiple IVF cycles, which leads to follow-up timelines that are slightly different. In the previous analyses, we assumed that these couples could still conceive naturally between IVF cycles and after their last IVF cycle. To assess the influence on this assumption on our results, we now considered couples who received more than one IVF cycle as censored, i.e. no longer pursuing expectant management after their first unsuccessful IVF cycle. After censoring, we estimated the proportion of live birth as the cumulative probability over one IVF cycle and 1 year of expectant management using the Kaplan–Meier method. Finally, we compared proportions between groups with Z tests as before.

Software

Data were prepared in Microsoft Excel. All analyses were conducted in R version 3.6.0 and RStudio (R Core Team, 2013) using the rms, mice, dplyr and CreateTableOne packages.

Results

Data on 635 couples were available. Out of 359 couples in the immediate group, 70 (19.5%) had a live birth of which 11 after natural conception and 59 after IVF. Out of 276 couples in the waiting group, 57 (20.7%) had a live birth of which 37 after natural conception and 20 after IVF. Couples in the waiting group started their first IVF cycle after a median of 12 months (25th–75th percentile: 11–14).

Characteristics in both groups are depicted in Table 1.

Table 1 Baseline characteristics of couples in the immediate or waiting group

There was no statistically significant difference between the two groups in terms of the cLBR (19.5% immediate versus 20.7% waiting, 95%CI for difference: − 0.07 to 0.05). In addition, the crude odds ratio (OR) did not significantly differ from 1 (OR 0.93, 95%CI: 0.63–1.38).

The adjusted OR for the immediate group compared to the waiting group was 0.69 (95%CI: 0.39–1.22) and thus also not significant. In Table 2, we present the ORs for all characteristics.

Table 2 Adjusted odds ratios in the logistic regression model on the outcome live birth. This was the extended model, including adjustments

Sensitivity Analyses

After weighting, there was an adequate balance between groups in terms of female age, duration of infertility, AMH, AFC, primary or secondary infertility, previous miscarriage, previous IUI/IVF treatment and previous induced abortion as shown by the point estimates provided in the Table 3.

Table 3 Baseline characteristics of couples in the waiting or immediate group after weighting

In the weighted cohort, there was no significant difference between the two groups in terms of the cLBR (16.8% immediate versus 26.6% waiting, bootstrap 95%CI for difference: − 0.20 to 0.01). In addition, the OR did not significantly differ from 1 (OR 0.56, bootstrap 95%CI: 0.32–1.08).

In the sensitivity analysis in which we censored couples in the immediate group after their first unsuccessful IVF cycle if they received at least one addition cycle, the cLBR was 20.8% for the immediate group instead of 19.5%. There was no statistically significant difference between the two groups in terms of the cLBR (20.8% immediate versus 20.7% waiting, 95%CI for difference: − 0.07 to 0.06). In the weighted cohort, there was no significant difference between the two groups in terms of the cLBR (17.8% versus 26.6%, bootstrap 95%CI for difference: − 0.20 to 0.03).

Discussion

The main finding of this retrospective study involving couples with unexplained infertility and woman’s age 39–43 years was that the cumulative 1-year live birth rate after one IVF cycle was comparable between the patients who had received delayed and immediate IVF treatment.

Recent studies involving large cohorts of infertile couples with varying woman’s age have reported that the probability of achieving pregnancy is higher after IVF than after expectant management; however, the chances decrease with advancing woman’s age [9]. The overall effect of increasing female age (per year) was also tested in our study, and we found this to be non-significant associated to the cumulative live birth rate (Table 2): this somewhat surprising result can be explained in two ways. First, the range of female age is relatively narrow in our population (39 to 43), so there are not many between-couple comparisons to be drawn. In addition, the effect of female age here represents a combination of age on natural conception and age of conception after IVF, which is difficult to interpret [9]. This raises the question of how long we can delay treatment without causing the patient harm and allow time for the couple to conceive spontaneously. According to a recent retrospective study, a delay of up to 6 months is not associated with poor IVF outcomes in patients with reduced ovarian reserve [29]. But because the chances of achieving pregnancy without IVF were not investigated, the dilemma remains to be solved.

Treatment strategies for reproductive aged women have gained greater interest with the shift to later childbearing [30, 31]. Recommendations by medical societies suggest rapid diagnosis starting at age 35 years; however, consensus is lacking on the best treatment strategies when common causes of female and male infertility have been ruled out [17, 32, 33]. This uncertainty persists also in women categorized as poor responders by commonly shared criteria [34]. Though such patients should receive IVF treatment as soon as possible, it may be too early to establish whether the infertility is due to reduced ovarian reserve or perhaps to other unexplained factors. The idea that IVF can overcome age-related subfertility in women is doubtful: IVF does not enhance oocyte quality, and ovarian responsiveness decreases with time, raising the risk of poor response to ovarian stimulation [35,36,37].

Our findings indicate that an expectant management strategy before IVF need not be considered ineffective or potentially detrimental; we found no evidence for a negative effect on live birth rate for the women in the delayed treatment group. The two cohorts included women aged 39 or older with acceptable ovarian reserve, as measured by AFC and AMH levels: whether these results are reproducible also in women with severely diminished ovarian reserve is uncertain. While couples who have experienced or will experience delays in accessing IVF treatments due to the COVID-19 pandemic may find these findings reassuring [29], they should be advised that after age 39 neither delayed nor immediate IVF treatment increases the probability of pregnancy with a live birth. In brief, reproductive counselling is warranted in younger women wishing to delay motherhood.

Our study findings add to the current debate on the optimal therapeutic approach to unexplained infertility, as couples that conceive after IVF might have otherwise conceived naturally [1, 12, 38]. The 37 (13%) couples in the delayed treatment group achieved spontaneous pregnancy with a live birth during the observation period. This rate was far lower compared to a previous study with a design similar to ours [39], but the inclusion of younger patients may explain the higher pregnancy rates. Similar to the rates we recorded, a Dutch study involving couples waiting for IVF reported a 14% cumulative live birth rate over 1 year [40]. Both of these early studies [39, 40] and a more recent larger one [9] found immediate IVF superior to expectant management also in couples with unexplained infertility. Since such couples can still conceive naturally, the question is not whether but when they should receive IVF. If treatment is delayed and the couples conceive naturally, they are spared potentially premature, expensive and invasive treatment [1, 41, 42] and the offspring are spared exposure to the potential risks associated with ART [43, 44].

Confirmation of the present findings in larger series of patients may necessitate revision of protocols for treating unexplained infertility in reproductive aged women. Waiting before treatment or selecting women who are expected to benefit most from IVF could save couples from unnecessary invasive treatments while optimizing allocation of economic resources [9, 13, 14, 18, 42].

A major strength of our study is that the patients in both cohorts were managed by the same medical and embryologist team that followed identical diagnostic and therapeutic protocols. Furthermore, couples in Italy have the option to wait or pay out of pocket for immediate IVF treatment. This condition cannot be easily accepted in a prospective randomized controlled trial (RCT). RCTs investigating unexplained infertility that include a no treatment arm (i.e. expectant management) are lacking since clinicians might be reluctant to include such an arm. Furthermore, couples may perceive further expectant management as a waste of time and may be unwilling to volunteer in a study in which they might be allocated to the expectant management arm, whereas they will receive IVF if they refuse to enter the study [45]. Likewise, couples informed of their chances of spontaneous conception may refuse to adhere to IVF treatment or to seek further diagnostic testing. So, it is not surprising that an RCT similar in design to our study was stopped due to a low adhesion rate (Trial NL5384-NTR5484). Because of lack of evidence from RCTs, most studies published so far have compared treatment arms or separate observational databases [15, 46, 47].

A major limitation to our methodological approach is the potential selection bias associated with choosing either treatment strategy. Although the ovarian reserve was similar in both groups (see AFC and AMH levels in Table 1), they differed by several baseline characteristics known to influence chances of pregnancy: prevalence of primary/secondary infertility, history of previous miscarriage, and previous IUI/IVF. By applying inverse probability weighting [28], we were able to reasonably account for these confounding variables after weighting (Table 2). Furthermore, the duration of infertility was often longer than the 12 months usually required for diagnosis. This reflects an underestimation of the infertility diagnosis, which is often overlooked by couples or insufficiently investigated by general practitioners. We cannot know whether such a long period of infertility impacted our analysis. Since there was no significant difference in this parameter between the two groups, the comparison can be assumed to be reliable.

The couples undergoing multiple IVF cycles in the immediate treatment group might have had follow-up timelines (time available for spontaneous conception) that were not strictly comparable with the other group. However, most of the couples in the immediate treatment group who received multiple IVF cycles were still at risk of natural conception for most of the year. This is likely, given that the majority of couples that received multiple cycles in the immediate treatment group did not conceive after their first IVF cycle and remained free for several months between cycles for potential spontaneous conception.

We believe our primary analysis is realistic. The sensitivity analysis showed an estimated (unadjusted) proportion of live births of 20.8% versus 19.5%, which did not affect the study conclusions. We argue that, given the study design, this difference does not substantially influence the results. Finally, it is increasing common to perform a single blastocyst transfer also in older couples. A further limitation of our study is that fresh-transfer was performed with embryos on day 2/3 and only those from the frozen to the blastocyst stage. In the absence of strong evidence for the one or the other strategy, we believe that our results can still be considered useful, given that many IVF treatment centres still work on day 2/3 fresh-transfers.

In conclusion, our study showed similar cumulative live birth rates in couples with unexplained infertility and advanced woman’s age that received delayed and immediate IVF treatment. We found no evidence of cause for concern in couples currently awaiting IVF treatment. This may be comforting for couples for which treatment has been or will be delayed due to the COVID-19 pandemic. Further studies are needed to confirm these findings and to optimize economic resources, particularly in a setting of publicly funded IVF treatment.