Introduction

Following their discovery in the 1920s, the pituitary gonadotropins, follicle-stimulating hormone (FSH) and luteinizing hormone (LH), were first used successfully for ovulation induction in 1958 [1]. Since then, exogenous hormone preparations of these gonadotropins have been widely used for the treatment of oligoanovulatory infertility, as well as for stimulation of multiple follicular development in assisted reproductive techniques.

The first commercially available gonadotropin preparation, human menopausal gonadotropin (hMG), was purified from the urine of postmenopausal women and contained equal amounts of FSH and LH activity [2]. With the development of sophisticated purification techniques [3], more purified FSH preparations containing small amounts of LH became available in 1986. Thereafter, a highly purified FSH preparation was developed and approved for use in 1996.

Advances in molecular biology techniques made it possible to produce recombinant FSH, which is secreted from genetically engineered Chinese hamster ovary cells [4]. Despite identical amino acid sequences, recombinant and natural FSH preparations have different glycosylation patterns, which are known to influence the isoform profile of each product [5, 6]. The clinical implications of differing isoform profiles for FSH products are unclear and there are no convincing data to suggest marked superiority of either human-derived or animal-derived FSH formulations for ovarian stimulation in terms of efficacy or safety [714]. Nonetheless, since subtle differences due exist between human derived and recombinant forms of FSH, efforts continue to identify clinically relevant differences between the two FSH preparations.

This report presents the pooled data from two independent trials, nearly identical in design, which compared the efficacy and safety of highly purified human-derived FSH (Bravelle®) with recombinant follitropin-β (Follistim®), administered subcutaneously in infertile women with regular ovulatory menstrual cycles undergoing in vitro fertilization procedures. The present data form part of the clinical development program describing the comparative efficacy and safety of Bravelle®, the newest approved, highly purified (96–98%), human-derived FSH which contains ≤ 2% LH activity.

Materials and Methods

Two randomized, parallel group, clinical trials (one recently published [8]) compared single cycle treatment with subcutaneously administered, purified, human-derived FSH (Bravelle®; n = 120) vs. recombinant follitropin-β (Follistim®; n = 118) in infertile female patients undergoing IVF. Data were collected from 19 IVF-ET centers, each of which obtained institutional review board approval. Eleven centers participated in the first study while 13 centers participated in the second study. Both studies were conducted in accordance with Good Clinical Practice guidelines and written informed consent was obtained from all participants prior to screening and enrollment.

Patients were non-smoking, infertile, premenopausal females, aged 18–39 years, with body mass index (BMI) ≤ 34 kg/m2 and regular ovulatory menstrual cycles (24–35 days). Causes of infertility were attributed to tubal factor, endometriosis (American Fertility Society revised stage I or II) or unexplained. Participants had ultrasound-confirmed normal ovaries with healthy uterus and adnexae. Further eligibility criteria included: unremarkable medical history and physical examination with normal baseline hematology and clinical chemistry; normal range serum estradiol (E2), follicle stimulating hormone (FSH), luteinizing hormone (LH), testosterone, dihydroepiandrosterone sulfate, prolactin and thyroid stimulating hormone. All patients needed to be seronegative for hepatitis B and C, HIV and syphilis and have a negative pregnancy test before starting treatment. Male partner or donor semen was required to meet World Health Organization criteria [15] for normal, within 6 months of study entry.

Patients who fulfilled screening criteria were instructed to inject leuprolide acetate (TAP Pharmaceuticals, Deerfield, IL), 0.5 mg subcutaneously each morning, for pituitary down-regulation. Leuprolide acetate was injected into the thigh or arm, beginning 7 days before the anticipated onset of menses and administration continued for ≤ 20 days or until their serum E2 concentration was ≤ 45 pg/mL and endometrial thickness was ≤ 7 mm on transvaginal ultrasound. If no menstrual bleeding occurred during this time, the patient was withdrawn from the trial, otherwise, leuprolide acetate was continued at the same dose until the day before hCG (Novarel™, Ferring Pharmaceuticals, Inc, Suffern, New York) administration.

Patients who met pituitary down-regulation requirements were randomized, using randomization codes generated with SAS® Proc Plan, to receive Bravelle® (n = 120; Ferring Pharmaceuticals, Inc, Suffern, New York) or Follistim® (n = 118; Organon Inc., West Orange, New Jersey) once daily, in a dose of 225 IU, subcutaneously, for 5 days. Study center staff administered the first dose of gonadotropin and provided guidance on abdominal injection technique, including alternation of injection sites, for subsequent dosing at home. Patients were instructed to maintain a daily, gonadotropin diary of the injection site and the presence and intensity of any pain.

After the initial 5 days of FSH treatment, ovarian response was evaluated by transvaginal ultrasound and serum E2 concentrations. If necessary, the dose of gonadotropin was increased in increments of 75–150 IU/day on alternate days, to a maximum of 450 IU/day. The duration of the controlled ovarian hyperstimulation cycle was not to exceed 12 days. At their discretion, investigators could reduce the dose or discontinue treatment at any time if concerned about patient safety.

Eligibility criteria for Novarel™ administration were the presence of at least three follicles with a diameter of ≥ 16 mm, calculated as the mean of three perpendicular planes by transvaginal ultrasound, along with a serum E2 level that was judged by the investigator to be appropriate for the number of follicles. A single dose of 10,000 IU USP Novarel™ was administered intramuscularly, 1 day after the final dose of gonadotropin. Patients who had not achieved eligibility criteria for Novarel™ treatment following a maximum of 12 days of gonadotropin stimulation were allowed to 'coast' for a further 2 days. If the follicular response remained suboptimal, participants were considered non-responders and were withdrawn from the trial.

Standard site-specific procedures were used for oocyte retrieval and IVF-ET. Intracytoplasmic sperm injection (ICSI) and assisted hatching were prohibited. A maximum of four embryos were allowed to be transferred to each patient. Progesterone for luteal phase support was either self-administered (Crinone™ 8%, 90 mg q.d., Serono Laboratories Inc, Randolph, MA) or administered in the form of progesterone in oil (50 mg, i.m., q.d., Schein Pharmaceuticals, Florham Park, NJ), beginning 2 or 3 days after oocyte retrieval. Luteal phase support was maintained at least until there was fetal heart motion in an intrauterine pregnancy or a negative pregnancy test (β-hCG) was documented. The first serum pregnancy test was performed 14 days after embryo transfer and, if positive, was followed by a confirmatory β-hCG measurement 2 days later. Transvaginal ultrasound was performed approximately 2 weeks later to verify a clinical pregnancy (gestational sac and fetal heart motion) and repeated 1 week later to confirm a continuing pregnancy.

The primary efficacy parameter for this single-cycle analysis was the number of oocytes retrieved. Secondary efficacy variables included the percentage of cycles with oocyte retrieval; duration of gonadotropin therapy; total FSH dose; peak serum E2 levels; embryo transfer and implantation rates; chemical pregnancies (positive serum β-hCG); clinical pregnancies (intrauterine gestational sac with fetal heart motion); continuing pregnancies and live birth rates.

Statistical Analysis

The validity of pooling data from the two independent studies was confirmed by statistical examination, which concluded that the two studies were homogeneous by excluding treatment-by-study interaction for the number of oocytes retrieved, using an ANOVA model with factors for 'treatment', 'study' and 'treatment-by-study' interaction. The treatment-by-study interaction was not statistically significant (P = 0.87) and a fixed effects approach was therefore appropriate for data analysis. Data were pooled and adjusted using the 'study' variable as a fixed effect, to account for between-study variability.

Safety was evaluated by analysis of the nature and incidence of adverse events, irrespective of their relation to study medications. Injection site pain was self-rated using a daily diary with an analog scale of 1 (no pain) to 10 (severe or unbearable pain).

For the first study, sample size calculations were based on the clinical assumption that 10 oocytes per patient would be retrieved in the reference group. Power calculations were based on an α level of 0.05 (two-tailed test) with 80% power to detect a 30% difference with 50 evaluable patients per group. A sample of 60 patients per group was selected. For the second study, the assumption for the number of oocytes retrieved from the reference group was changed to 13.7, based on the results of the first study. The power calculation was based on an α level of 0.025 (one-tailed test) with 80% power to detect a 30% difference with 56 evaluable patients per group.

Continuous variables, including primary and secondary outcome parameters, were analyzed using a two-way ANOVA model with treatment and study as fixed effects. In addition, an analysis of covariance (ANCOVA) model was used to compare primary and secondary efficacy variables after correcting for baseline age and BMI. The difference between treatments (Bravelle® vs. Follistim®) was evaluated using a one-sided, 95% confidence interval. Bravelle® could be declared equivalent in terms of efficacy if the lower limit of the 95% confidence interval was greater than a difference of -2.2 oocytes.

Categorical variables, including adverse events, were compared using Cochran-Mantel-Haenszel (CMH) analyses, stratified by study, and a logistic regression model using treatment, study, baseline age and BMI as covariates.

Injection site pain scores on each day were compared using the two-way ANOVA model with treatment and study as fixed effects and also using a linear mixed model to compare data across all time points. The latter allowed analysis of continuous data to account for intrasubject variability. The Restricted Maximum Likelihood Method was used with a compound symmetric within-subject covariant structure, and the sandwich variance estimator.

Results

A total of 323 patients were enrolled in the two studies and received leuprolide acetate for pituitary down-regulation, 297 were randomized between the study arms. The most common reasons for non-randomization was failure to down-regulate to the protocol-specified criteria or patients had a positive pregnancy test prior to down-regulation on LA therapy. The initial study contained a third arm consisting of 59 patients, which are not included in this pooled analysis. The remaining 238 patients were randomized to receive Bravelle® (n = 120) or Follistim® (n = 118).

In the Bravelle and Follistim groups, 94.2% and 97.5% of women respectively, received Novarel™ and underwent oocyte retrieval. Since nearly all randomized patients received Novarel™, there were no clinically or statistically meaningful differences between the intent-to-treat analysis and the primary efficacy responders (those who received Novarel™) analyses. Therefore, efficacy data presented in this report are based on the primary efficacy responder population for the primary efficacy parameter, number of oocytes retrieved, and subsequent analyses. Baseline demographic characteristics and safety data comparisons are presented for all enrolled patients regardless of administration of Novarel™.

Demographic and baseline data, including endocrine status, primary infertility diagnosis and IVF history are presented in Table 1. The groups were generally well matched demographically, although women in the Bravelle® group had a significantly lower mean weight, and therefore lower BMI (23.3 vs. 24.5 kg/m2, respectively, P = 0.021). The modest difference in BMI was not considered to be clinically meaningful, in addition, for the ANCOVA analysis of efficacy parameters, BMI and age were used as covariates to account for the effect these differences may have had on outcomes.

Table 1 Baseline demographic characteristics (prior to leuprolide acetate) of the 120 patients who received Bravelle® and the 118 patients who received Follistim®

During the course of treatment, there was no statistically significant differences between groups for the primary efficacy measure, mean number of oocytes retrieved per cycle, the numbers of oocytes that underwent normal fertilization or the number of embryos transferred (Figure 1). The implantation rate did not differ between groups, as it was 21.3% for the Bravelle® group and 20.3% for the Follistim® group.

Figure 1
figure 1

Clinical response for the 113 patients that received Bravelle® (black bars) and the 115 patients that received Follistim® (gray bars). Mean value per patient of oocytes retrieved, oocytes fertilized and embryos transferred.

No significant differences were observed in the total dose of FSH administered, the duration of treatment or peak estradiol levels between groups (Table 2). Patients receiving Bravelle® had a numerical trend towards higher chemical, clinical, continuing pregnancy rates as well as live birth rates (Figure 2), but the results did not attain statistical significance.

Table 2 Clinical response for the 120 patients who received Bravelle® and the 118 patients who received Follistim®.
Figure 2
figure 2

Pregnancy and live birth results for the 113 patients that received Bravelle® (black bars) and the 115 patients that received Follistim® (gray bars).

There were no significant differences in the number of patients reporting adverse events (P = 0.680), severe adverse events (P = 0.307) or serious adverse events (P = 0.161) between the Bravelle® or Follistim® groups. The most frequently reported adverse events were headache, abdominal cramps, vaginal bleeding/spotting and nausea (Table 3). One woman taking Bravelle® experienced a 'severe' adverse event, compared with three patients in the Follistim® group. 'Serious' adverse events were reported by two patients in the Bravelle® group.

Table 3 Summary of adverse events

A two-way ANOVA model used to compare mean daily injection-site pain revealed a significantly lower pain score in Bravelle® patients on most treatment days and during the entire treatment period (P < 0.05). The difference had greater significance when a linear mixed model, which controls for intrasubject variation, was used to compare data across all time points [3.1 and 3.9, respectively (P < 0.001)] (Figure 3).

Figure 3
figure 3

Comparison of injection site pain scores on a self-assessment scale of 0 (no pain) to 10 (severe pain) for the 113 patients that received Bravelle® (black bars) and the 115 patients that received Follistim® (gray bars).

Discussion

The data from these two randomized, controlled, clinical studies were used to evaluate the efficacy and safety of a new, highly purified, human derived FSH (Bravelle®) compared to follitropin-β (Follistim®) in patients undergoing controlled ovarian hyperstimulation for IVF-ET. Clinicians continually update their safety and efficacy data as patients are treated, in order to better understand their treatment protocols. This paper is a similar exercise in that it combines two regulatory grade IVF studies to confirm the efficacy and safety results from the previous study (8). The fact that the results and conclusions of the second study are similar to the first further highlights the consistency and reliability of Bravelle® in patients undergoing in vitro fertilization.

The baseline weight and BMI differences between Bravelle® and Follistim® patients were modestly different, but were unlikely to have affected trial outcomes since neither group could be categorized as obese (BMI ≥ 28 kg/m2) or underweight (BMI < 20 kg/m2), variables thought to correlate with reduced success during assisted conception [1618]. Nonetheless, an ANCOVA analysis was performed using BMI and age as covariates, to ensure that the difference in BMI did not effect outcome measures.

Recombinant DNA engineering technology allows for the production of human FSH in Chinese hamster ovary cells [4]. The secreted protein is collected from the culture medium and purified. Efforts have been made in an attempt to identify differences in efficacy between recombinant and human derived gonadotropin preparations. It is recognized that differences exist in the glycosylation patterns of these preparations [5, 6]. However, collectively, the results from numerous, prospective, clinical studies [714], and recent meta-analyses [19, 20] of published data have failed to show any meaningful or reproducible differences.

Nonetheless, it is well known that the success of a controlled ovarian hyperstimulation cycle is in part dependent on the presence of LH. Bravelle® provides a low level of LH activity which might benefit certain patients by enhancing intrafollicular estradiol levels, resulting in the excellent single cycle pregnancy rates observed in this analysis. The presence of small amounts of LH activity may be of specific benefit in patients with low endogenous LH levels that undergo pituitary down regulation. Overall, Bravelle® provides a safe and highly effective alternative to gonadotropins devoid of LH activity.

The adverse event data from this study demonstrate that Bravelle® is at least as well tolerated as Follistim®, but causes significantly less injection-related pain. This finding, reported in a previous study [8] may be related to the purity of Bravelle® and the differences in excipients used in the manufacturing processes for the two compounds. For example, Follistim® contains sodium citrate which has previously been implicated with injection site pain by some [22] while Bravelle® does not.

In conclusion, this analysis further supports the clinical data indicating the comparable efficacy and overall safety of Bravelle® as compared to Follistim® in controlled ovarian hyperstimulation for IVF-ET, and highlights the potential benefit of Bravelle® which produced less injection site pain.