Background

According to the World Health Organisation, infertility affects up to 15% of reproductive-aged couples worldwide [1], and the same percentage has been estimated also in Italy [2]. Among couples hit by infertility, in most cases, causes can be attributable to either males (qualitative and quantitative alterations of semen parameters) or females (tubal disease, ovulatory disorders, endometriosis), while in approximately 15% of couples’ infertility remains unexplained Carson and Kallen [3].

The assisted reproductive technology (ART) was born in order to help couples with infertility issues in having a baby. The first treatments of in vitro fertilisation used the spontaneous cycle of the women, with the retrieval of only one oocyte. Further studies have shown that it is possible to induce ovulation by administrating gonadotropins (Gonas) during the menstrual cycle, in order to obtain a higher number of oocytes.

The availability of different Gona formulations in terms of extractive (highly purified) urinary Gonas (EXT) and recombinant Gonas (REC), and the authorisation of follitropin alpha biosimilars make the choice of the individual treatment complex.

Evidence from clinical studies regarding different efficacy and safety profiles of urinary versus recombinant formulations is controversial: some authors report findings which favour one formulation over the other: a paper by Out et al. [4] states favourable results in terms of pregnancy among women treated with recombinant FSH. And there is also evidence for advantages in using a combined protocol of both formulations (CBD): a prospective randomised study Pacchiarotti et al. [5] showed that using a combination of both urinary and recombinant FSH for ovarian stimulation improves oocyte maturity and embryo cleavage and increases pregnancy and implantation rates. The advantage of the combined protocol over the single formulations was confirmed by a review of literature Pacchiarotti et al. [6]. At the same time, the risk of ovarian hyperstimulation syndrome was observed to be more frequent in women treated with recombinant formulations with respect to extractives Pachiarotti et al. [7].

A recent review of the results of comparative clinical trials, Cochrane analyses, and meta-analyses concluded that the available evidence from the published literature does not show significant differences between urinary and recombinant gonadotropins in terms of safety and efficacy Patki et al. [8]. In the absence of differences in efficacy and safety, several authors recommend considering other factors when choosing a gonadotropin regimen, such as costs, patient acceptability, and drug availability [9,10,11].

Since their introduction, recombinant Gonas have steadily gained market, outranging the use of urinary formulations. While in the first years, this might have been partly explained by better patient acceptability due to a simpler administration (subcutaneous) with respect to the urinary formulations (intramuscular), and this does not hold true in recent years, as almost all urinary formulations are administered subcutaneously now.

Initial disadvantages of recombinant formulations due to higher costs with respect to extractives have been mainly overcome after the introduction of the recombinant FSH biosimilar, which led to price reductions of the originator as well.

To date, evidence from real-world settings is missing, but might add useful information for clinicians, when choosing the treatment protocol for their patients, and for policy makers, when deciding on recommendations and targeting policies with respect to refunding the different formulations.

The present study tries to close the gap on evidence under real-world conditions and provide evidence through an observational population-based approach, aiming to investigate the use patterns of the different Gona formulations and perform a comparative evaluation of the risk–benefit profile of different formulations of Gonas used for ovarian stimulation in female infertility, to answer to the question if recombinant formulations present advantages over urinary formulations.

Methods

Study design

We performed a retrospective observational population-based study.

Data sources

The present study used data from the administrative healthcare databases of the Lazio region, namely: the regional healthcare assistance file, which contains demographic and residence information about all residents alive and registered in the regional health service at a specific year’s date. The regional drug dispensing registry (Pharm). The registry is limited to drugs dispensed to outpatients and reimbursed by the healthcare system. Drugs are identified by the national drug register code, which refers to the international Anatomical Therapeutic Chemical Classification System (ATC). Individual patient data and the date of drug claim are reported for every prescription. The hospital information system (HIS), which provides, for every hospitalisation in a regional hospital, information on patients’ demographic characteristics, discharge diagnoses, and procedure codes according to the International Classification of Disease, Ninth Revision, Clinical Modification (ICD-9-CM). The regional mortality information system (MIS), providing information on demographic characteristics, as well as date, place, and cause of death (codified by ICD-9 codes). The regional database of birth certificates [12], which collects information of socio-demographic (mother) and health (newborn) status referring to all deliveries in the region.

Study population

From administrative healthcare databases, all drug claims of Gonas to women enrolled in the regional healthcare system in the 24 months before each drug claim (characterisation period), and aged 18–45 years were retrieved for the period 2007–2019. Women treated with human chorionic gonadotropin (HCG) only or with more than 9 treatment cycles were excluded. Our unit of observation was the treatment cycle, which was defined as a 21-day mobile time window (Fig. 1). Thus, each woman could contribute with up to 8 observations. The analysis related to the outcomes comes as far as the updating of the information source, CeDAP, permits (births occurring through 2018).

Fig.
figure 1

1 Definition of the treatment cycle from drug dispensing data *For each assisted woman in the Lazio region, therapeutic cycles are identified based on the dates of drug delivery in a 21-day mobile window. The drugs assigned to each window allow to associate one of the following formulations to each cycle: recombinant, extractive, combined (recombinant + extractive)

Exposure

All Gonas dispensed in the study period were retrieved, considering the ATC group G03GA (Appendix Table 3). Exposure was defined based on the treatment cycle, i.e., the 21-day mobile time window and classified into treatment with EXT, REC, or a combination of both (CBD) (Fig. 1).

Validation of the treatment cycle

The availability of the data gathered in the Medically Assisted Procreation Center of the San Filippo Neri Hospital in Rome (in accordance with the GDPR procedure), referring to patients treated between 2016 and 2019, allowed to evaluate the concordance between cycles registered in the clinical dataset and those retrieved from administrative data, for the same woman, in particular in relation to the definition of the therapeutic cycle (REC, EXT, CBD).

Follow-up

For each treatment cycle, the observation started on the day of the first drug dispensing and ended at the first occurrence of study outcome, death, or loss to follow-up.

Outcomes

Effectiveness

From HIS and CeDAP, pregnancies, both ending in abortion and in childbirth, were retrieved and linked with cycles within the 9 months from the last Gona claim.

Safety outcomes

Infantile

From CeDAP, the following parameters were considered: number of births, caesarean birth, twin births, sex ratio, preterm births, weight/length/cranial circumference at birth, Apgar score, still birth, and admission to emergency neonatal therapy (Appendix Table 5).

Maternal

From HIS pre/eclampsia, ovarian hyperstimulation syndrome and gestational diabetes were retrieved (Appendix Table 4).

Confounding

A pre-defined list of potential confounders (based on the literature Bateman et al. [13] and integrated in collaboration with clinicians) and measures of overall status of women and socio-demographic characterises, at the moment of each therapeutic cycle, was taken into account for risk-adjustment when looking at their healthcare outcomes (Appendix Table 4).

Analysis

Treatment patterns

For each woman, the transition between treatment cycles from the first up to max the fourth cycle was described using a Sankey diagram. After the first cycle, classified as one of the three exposure categories, the second event could be either pregnancy (abortion or birth), a second cycle (categorised in the three exposure categories), or the end of treatment. The same was investigated for those women undergoing a second treatment cycle, and so on.

Efficacy and safety

The association between exposure to EXT, REC, CBD, and each outcome was investigated applying log-binomial regression models.

Results

After applying inclusion and exclusion criteria, 104,938 cycles prescribed to 41,858 women were retrieved (Fig. 2). The validation performed through linkage with 268 clinical records shows satisfactory concordance with our approximation (K = 0.9017). On average, women underwent 2.51 cycles and each cycle was defined based on 2.65 prescriptions. Accounting for the availability of data from birth certificate up to the end of 2018, and allowing 12 months of observation, our cohort included a total of 90,292 cycles related to 35,899 women (Table 1). Of these, almost half were REC, and a quarter EXT or CBD, respectively. Analysing the characteristics of women prescribed with different cycles, we found that women over 40 were prescribed more often with CBD (37.9%) and EXT cycles (34.0%) compared to REC (30.9%). In terms of access to healthcare, we observed differences in use of co-medication and access to ambulatory visits during the 6 months preceding the cycles, with REC cycles attributable to lower proportions. In general, women included were in good health condition. The only comorbidities with higher frequency are those which can be considered related to resorting to ART—like female infertility, cervical polyp, and endometriosis (Table 2).

Fig. 2
figure 2

Flow-chart of the study cohort: women, cycles, and drug dispensing

Table 1 Characteristics of the study cohort
Fig. 3
figure 3

Pathway analysis: treatment patterns and transition between treatments and pregnancy (abortion and pregnancy)

Through the alluvial diagram, it is possible to map the main events of a woman within a path of ART and visualise the attempts she makes to obtain a pregnancy. The magnitude of each stream represents the proportion of women moving from a status to another (Fig. 3).

Each woman is observed over time. The 1st observation represents the proportion of women starting with the first gonadotropin-based drug cycle: 51.5% REC, 26.3% EXT, 22.2% CBD.

In subsequent observations, it is possible to verify whether a woman obtains the desired result (child delivery, box filled with points), reaches pregnancy followed by abortion (black category), or continues the treatment with further attempts (same therapy or switch). In the white category, “loss to follow-up,” we observe women for whom we have no more information. The same setting is repeated for subsequent observations.

The 2nd observation shows that 15.2% of women successfully achieve a pregnancy at first attempt (whether it results in a delivery or an abortion), with differences between Gona types, i.e., 15.9% for women exposed to REC, 12.7% to EXT, and 17.0% to CBD.

Considering the entire pathway cumulatively, even beyond the 4th event, 28.4% of women overall achieve a pregnancy with delivery (this accounts for 11.6% of therapeutic cycles).

23.3% of women are lost to follow-up after the 1st observation. Most women who achieve pregnancy over time do not encounter additional information from information systems.

Table 2 presents the frequencies of treatment success and maternal and infantile outcomes for the three exposure categories.

Table 2 Cycles related to pregnancy, characteristics of outcomes

The conception rate, i.e., successful achievement of pregnancy, independently from childbirth, was highest for CBD cycles (16.7%) and lowest for EXT (14.4%). On the contrary, the number of cycles needed to achieve pregnancy was higher in the CBD group (19.1%).

Regarding the effectiveness of therapeutic treatment, REC cycles, compared to EXT, showed a higher association with pregnancy (conception rate RR adj 1.06, 95% CI: 1.01–1.12) and with childbirth (birth rate, RR adj 1.07, 95% CI: 1.02–1.13). The association was stronger when we compared the effectiveness of CBD vs EXT cycles: conception rate RR adj 1.17, 95% CI: 1.11–1.24 and birth rate, RR adj 1.19, 95% CI: 1.12–1.27.

The characteristics of deliveries related to treatment with different Gona formulations were similar. Generally, multiple births are considerably more frequent in medically assisted pregnancies compared to the overall population, independently of the type of Gona used (18.1% respect to 1.8% in the Lazio birth register (13CeDAP)). Caesarean section was applied in almost 64% of deliveries, about one-fifth of children were born at preterm, and 11% were treated in emergency neonatal therapy. We observed very few stillbirths (0.2%), and most children were in excellent health conditions at birth (97.1%). The three maternal outcomes considered, occurred rarely (4.1% gestational diabetes, 2.0% pre-eclampsia, and 0.6% ovarian hyperstimulation syndrome), with similar proportions across exposure categories.

Results of the regression models investigating maternal and infantile outcomes are presented in Figs. 4 and 5, respectively, comparing REC and CBD treatments to EXT treatment. Regarding infantile outcomes, similar results were obtained, and all estimates included the null difference level. Yet, some slight advantages were found for REC and CBD treatments: for both with respect to relative and absolute measures of neonatal weight and length (SGA: RRREC: 0.93, RRCBD: 0.94), for CBD treatments regarding caesarean section (RRCBD: 0.93), and for REC treatment with respect to admission to neonatal emergency therapy (RRREC: 0.87). For all maternal outcomes, point estimates were slightly higher for REC and CBD cycles, but no significant differences were observed. At the same time, CBD and REC cycles were associated with a slightly increased risk for preterm (RRREC/CBD: 1.03) and multiple births (RRREC: 1.06, RRCBD: 1.15).

Fig. 4
figure 4

Association between exposure to the three exposure categories and infantile outcomes

Fig. 5
figure 5

Association between exposure to the three exposure categories and infantile outcomes

Discussion

In the present study, efficacy and safety profiles of recombinant gonadotropins have proved to be like those of extractive gonadotropins. Regarding the efficacy in terms of conception rate and birth rate, protocols using the combination of both formulations were found to perform slightly better. Outcomes related to maternal and infantile safety were generally very rare, and safety features were very similar between Gona formulations.

Our findings of overlapping safety and efficacy are in line with the available evidence provided by clinical studies, reviews, and meta-analyses. The higher efficacy of the mixed approach with respect to monotherapy confirms findings reported in the Italian study Pacchiarotti et al. [5] based on clinical data.

As contextual information, the average number of therapeutic cycles per woman is in line with the refund policies in use in the Lazio region.

The added value of our study stems from its nature, being, to our knowledge, the first observational approach performed in a real-world setting and comprising treatments offered at the level of the resident population over a long period, granting a robust number of observations. This was possible thanks to the availability of administrative healthcare data and the collaboration in a multidisciplinary approach, comprising epidemiologists, statisticians, and specialist clinicians.

Still there are some limitations to be mentioned. Most importantly, our data do not comprise information on treatment cycles, and consequently, we had to develop an algorithm for its approximation. Fortunately, we had the opportunity to use data from a clinical setting which allowed to evaluate the performance of our algorithm, and this comparison proofed satisfactory concordance about the categorisation of the cycle.

Another weakness of our data stems from the lack of information regarding clinical details, and lifestyle data. This may have multiple impacts: first, it limits the choice of the outcomes we can measure. For example, our information on spontaneous abortions is restricted to cases requiring access to emergency room or hospital. This probably implies an underestimation of the conception rate. Similarly, intermediate treatment results, such as oocyte maturity or embryo cleavage, cannot be considered. Second, outcomes may be sensitive to different doses of gonadotropins, an information which is not currently available from our administrative data. Third, we do not have the possibility to distinguish different levels of medically assisted procreation with the right accuracy level. Last, we cannot account for some potential confounders such as BMI, alcohol consumption, or smoking. Yet, we do not expect any differences in the prevalence of these latter variables between treatment regimens.

To date, there are no agreed treatment guidelines which favour one gonadotropin formulation over the other, enhance combined treatment, or recommend specific doses. Treatment choices are rather made by the specialist physician upon a complex evaluation of the specific case, which accounts not only for clinical aspects of the woman to be treated but also for parameters referring to the partner. These considerations cannot be retrieved in our data and leave space for indication bias.

Taken all these considerations together, we believe that the findings of our study add an important piece of information to the complex field of infertility treatment with gonadotropins, both for clinicians and health policy makers.

Future research might investigate the cost–benefit profiles of the different Gona formulations, to complete the picture.