Introduction

Hodgkin lymphoma (HL) accounts for approximately 5–6% of all childhood cancers (Siegel et al. 2023). Owing to improved treatment strategies, survival rates nowadays exceed 90% in pediatric patients (Borchmann et al. 2012; Mauz-Körholz et al. 2022). First-line HL treatment consists of consecutive multi-agent chemotherapy, with additional radiotherapy for some cases, depending on treatment response (Mauz-Körholz et al. 2022).

Both chemotherapy and radiotherapy are associated with late health effects, such as fertility issues that can substantially affect quality of life in survivorship (Landier et al. 2018; Robinson and Hudson 2014). With an increasing population of childhood cancer survivors (CCSs), there is a growing need to increase knowledge on the occurrence and, where possible, prevention of late effects of treatment.

Girls are born with a fixed pool of primordial follicles that progressively declines throughout life. The rate of oocyte quantity- and quality loss accelerates near the end of the fertile life, culminating in a fully depleted follicle pool at time of menopause at an average age of 51 years old (range 40–60 years old) (Broekmans et al. 2009; Morabia 1998; Te Velde and Pearson 2002). Cytotoxic treatments can induce apoptosis, which accelerates the natural process of oocyte depletion and could cause premature ovarian insufficiency (POI) (Bines et al. 1996; Chapman 1982; De Vos et al. 2010). POI is characterized by the combination of elevated gonadotropin levels, low estradiol levels and cycle irregularity (amenorrhea or oligomenorrhea) before the age of 40 years (De Vos et al. 2010; Webber et al. 2016). The decisive state of ovarian insufficiency is often preceded by incipient ovarian failure (IOF), during which fertility issues already tend to exist (Absolom et al. 2008).

Clinical markers to estimate the size of the remaining follicle pool include serum anti-Müllerian hormone (AMH) and the antral follicle count (AFC) measured by (transvaginal) ultrasound. Moreover, elevated serum follicle stimulating hormone (FSH) and low inhibin B levels can indicate impaired ovarian function, although levels significantly fluctuate throughout the menstrual cycle and tend to become abnormal relatively late in the process of follicle pool depletion (Broer et al. 2014; Parry and Koch 2019).

Studies among CSSs have shown that specific subgroups of survivors are at risk of having abnormal reproductive markers (low AMH, low AFC, high FSH) which may be a sign of fertility impairment or a reduced fertile life span (Charpentier et al. 2014; Krawczuk-Rybak et al. 2013; Roshandel et al. 2021; van den Berg et al. Van Den Berg et al. 2018, 2021). Moreover, several reports suggested reduced pregnancy rates, increased time to pregnancy and risk of premature delivery among CSSs (Anderson et al. 2018, 2022a, b; Armuand et al. 2017; Brämswig et al. 2015; Chow et al. 2016; Green et al. 2009; Madanat‐Harjuoja et al. 2010; Madanat et al. 2008; Magelssen et al. 2008; Oktem et al. 2018; Reulen et al. 2017; van de Loo et al. 2019; van Dijk et al. 2020). Adverse outcomes appear to specifically present after treatment with high doses of alkylating agents (particularly procarbazine) and abdominal/pelvic radiotherapy (van de Loo et al. 2019; Van Den Berg et al. 2018; van Dijk et al. 2020). These high-risk modalities have been essential components of HL treatment for decades, although current trials specifically aim to reduce and replace gonadotoxic modalities in standard HL treatment (European Network-Paediatric Hodgkin Lymphoma Study Group (EuroNet-PHL) 2015; Mauz-Körholz et al. 2022). Current evidence on fertility after treatment for HL mainly derives from cohort-studies including many different types of CCSs or adult (HL) survivors. Studies are small and heterogeneous, with a short follow up, and comparisons with a control population are sparse.

The aim of the current study was to assess the effect of childhood HL treatment on markers of ovarian reserve (AMH, AFC), ovarian function (FSH, inhibin B) and pregnancy rates and -outcomes (live birth, miscarriage, time to pregnancy, use of ART (assisted reproductive technology), premature delivery and having a small-for-gestational-age (SGA) infant). In addition, we aimed to identify treatment- and diagnosis related factors associated with impaired reproductive function.

Methods

Study design and study population

The present study is part of the Dutch Childhood Oncology Group (DCOG) LATER-VEVO study, a nationwide, multicenter, retrospective cohort study among Dutch female CCSs, that aimed to evaluate the effects of cancer treatment during childhood on reproductive function, ovarian reserve, and the risk of premature menopause. Details about the study design, the study population, and data collection methods, have been described previously (Overbeek et al. 2012; Van Den Berg et al. 2014, 2018). In short, the studied survivor-group consisted of adult women treated for cancer before the age of 18 between 1963 and 2002 who survived at least five years from diagnosis. The control group consisted of sisters from participating CCSs and women from the general population recruited through general practitioners’ offices. Control subjects had never been diagnosed with cancer.

In total 1106 female CCSs and 819 controls participated in the VEVO study. In the current study, only participating survivors treated for childhood HL were included, as well as controls. Survivors and controls who did not complete the questionnaire but only participated in the clinical part of the study were excluded from the analyses.

Data collection and outcomes

Data were collected by questionnaire, blood sampling and a transvaginal ultrasound of the reproductive organs (Overbeek et al. 2012). Primary outcomes of the present study include (low) AMH and (low) AFC, while secondary outcomes comprise (high) FSH, (low) inhibin B and pregnancy rates- and outcomes.

Questionnaire data Baseline characteristics, including socio-demographic factors, smoking, weight and height, use of hormonal contraceptives, cycle characteristics and menopausal status were obtained by a questionnaire as was information on number of pregnancies, pregnancy outcomes, age at first birth, time to pregnancy (TTP, months of unprotected sexual intercourse until pregnancy) and method of conception (spontaneous versus using artificial reproductive technologies (ART) such as IUI or IVF/ICSI). Pregnancy outcomes included live birth, still birth (pregnancy loss after the 20th week of gestation), miscarriage (pregnancy loss before the 20th week of gestation), induced abortion (deliberate termination of pregnancy before the 24th week of gestation) and ectopic pregnancy. Live births before the 37th week of gestation were considered preterm and infants with birthweight below the 10th percentile of Dutch reference curves [PRN, (Visser et al. 2009)] were considered SGA. In this paper, only the results of the first attained pregnancy were assessed.

Educational levels up to and including lower technical and vocational training were categorised as ‘low’. Education up to and including secondary technical and vocational training was classified as ‘medium’ and educational levels up to and including higher technical and vocational training and university were classified as ‘high’. Menopause was defined as cessation of the menstrual cycle for at least 12 consecutive months, and menopause before the age of 40 was considered premature (Webber et al. 2016).

Hormonal and ultrasound markers of reproductive function Levels of FSH, AMH and Inhibin B were evaluated from collected serum samples, while AFC, defined as the number of all ovarian follicles sized 2-10 mm in both ovaries, was determined by transvaginal ultrasound. Blood sampling and ultrasound measurements were timed based on hormonal status, i.e. day 2–5 of a natural cycle or anytime in case of amenorrhea (no menses > 6 months). Females on hormonal contraceptives (HCs) were asked to discontinue HC use at least two months prior to study measurements. In females not wishing to temporarily cease HCs, measurements were planned on day 7 of the HC-free week.

Minimum detectable concentration for AMH was 29 pg/mL with an intermediate coefficient assay variation of 2.5% (ultra-sensitive immunoassay, pico-AMH, AnshLabs, USA). Lower limit of quantitation (LLOQ) for FSH was 0.5 IU/L, with 3–5% intra-assay variation and 6–7% inter-assay variation (immunometric assay, Delfia, Perkin Elmer, Wallac, Turku, Finland). Serum inhibin B levels were analyzed by the Gen II Inhibin B Elisa (Bechman Coulter), with LLOQ 11 ng/l and intra-assay and inter-assay variations of < 9% and < 10%, respectively (Van Den Berg et al. 2018).

Transvaginal ultrasounds were performed by trained personnel using a HD11 XE ultrasound system with 3D imaging. 3D data were analysed by two independent trained researchers, using customized software (provided by Philips Ultrasound, Inc.) and a pre-specified protocol. If the ovary was not found during the ultrasound evaluation, the AFC measurement (of that ovary) was imputed with ‘0’.

Data on diagnosis and treatment Data on (age at) HL diagnosis and received anti-cancer treatment were retrieved from (original) medical files. The cumulative alkylating agent exposure was estimated by calculating the cyclophosphamide equivalent dose (CED in mg/m2) (Green et al. 2014).

Age at time of study was determined based on date of birth and date of clinical assessment. For participants who did not participate in the clinical assessment, age at time of questionnaire was used.

Statistical analysis

Missings in primary (AMH, AFC) or secondary outcomes (FSH, inhibin B, pregnancy rates- and outcomes) were listwise excluded. FSH serum levels that were reported as “ < 0.5” by the laboratory were replaced by 0.5 (1 HL survivor, 4 controls) to enable statistical analyses. Likewise, Inhibin B serum measurements “ < 10” were changed to 5.0 (10 HL survivors, 35 controls). Entries “0 months” and “0.5 months” for TTP within the questionnaire were corrected to 1 month (1 HL survivor and 13 controls).

AMH and AFC measurements below the p10 values of controls (determined by quantile regression analysis) were considered low. High FSH and low inhibin B were determined based on cut-off values (> 10 U/l for FSH and < 20 ng/l for Inhibin B, respectively) (Jiao et al. 2021; Laporte et al. 2011). Pregnancy outcomes were evaluated in the subgroup of females who stated they are currently attempting to become pregnant, ever had been pregnant or ever had tried to become pregnant (referred to as “ever attempted to become pregnant”). TTP was dichotomized during the analysis (i.e. < 12 months and > 12 months), as time to conceive exceeding a year was considered clinically relevant.

Baseline characteristics, clinical measurements, pregnancy rates and pregnancy outcomes were compared between HL survivors and controls. Results were presented as number (percentage) or median (IQR: interquartile range). Differences were analyzed using chi-square test or Fisher’s exact tests for categorical variables and Mann–Whitney U-test for continuous variables (because of non-normal distribution).

Moreover, clinical markers (i.e. AMH, FSH, Inhibin B and AFC) were evaluated in linear regression analyses and presented as beta-coefficient with 95% CI. The residuals of regression analyses of AMH values were non-normally distributed, and therefore, the regression model on AMH was performed on log-transformed values. The beta-coefficient was retransformed into the original scale and presented as Geometric Mean Ratio (GMR) with 95% confidence interval (CI).

Abnormal clinical markers (i.e. low AMH, high FSH, low Inhibin B and low AFC) and pregnancy rates and -outcomes were evaluated using logistic regression models. Results were expressed as odds-ratio with corresponding 95% CI. In a subgroup analysis, first pregnancy outcomes in HL survivors who received pelvic radiotherapy were compared to pregnancy outcomes in HL survivors who had no pelvic radiotherapy. A similar analysis was performed to compare HL survivors who had received treatment with CED score > 6000 mg/m2 and CED-score ≤ 6000 mg/m2. Regression models on hormonal and ultrasound data were adjusted for age at time of study, current smoking and current use of hormonal contraceptives. Analyses on overall pregnancy-, live birth- and miscarriage rates were adjusted for age at time of study, age at first pregnancy, educational level and marital status. The regression models on first pregnancy outcomes were adjusted for age at first pregnancy, educational level and marital status.

The contributing effects of diagnosis- and treatment-related factors on clinical markers were assessed using four different multivariable models, with each model evaluating a specific factor: (1) Age at time of diagnosis (< 10, 10–13, > 13 years), (2) Chemotherapy agents (cyclophosphamide, dacarbazine, procarbazine), (3) CED-score (0 mg/m2, 0–6000 mg/m2, > 6000 mg/m2) and (4) Radiotherapy body site (abdominal/pelvic area, other body sites, no radiotherapy). The control group was used as the reference group in all models. All models were adjusted for age at time of study, smoking and HC use. Depending on the factor of interest, additional corrections were added for pelvic radiotherapy (model 1, 2, 3) and/or CED score (model 1, 4).

All analyses were executed using R [version 4.0.3, (R Core Team 2018)]. P values < 0.05 were considered statistically significant.

Results

Included females

A total of 84 HL survivors and 798 controls were included in the present study (see Fig. 1). Of these, 45 (53.5%) survivors and 413 (51.8%) controls provided a blood sample, while ultrasound data were available in 40 (47.6%) survivors and 351 (43.9%) controls, respectively. Questionnaires were completed between December 2004 and January 2014. Median interval between the date of questionnaire and date of clinical assessment was ± 1 month (range – 11; 25 months).

Fig. 1
figure 1

Selection process of female childhood Hodgkin lymphoma survivors and controls from the LATER VEVO study– cohort. AMH anti– Müllerian hormone, CCSs childhood cancer survivors, FSH Follicle stimulating hormone, HL Hodgkin lymphoma

Baseline characteristics are presented in Table 1. Median age at time of study was 29.6 years (IQR 19.8; 51.4) in survivors and 32.7 years (IQR 19.7; 49.6) in controls (p = 0.08). Median BMI ranged from 22.6 to 23.0 kg/m2 (p = 0.47), most participants had medium to high educational levels (94.1–97.2%) and approximately 80% of both groups were married or in a relationship.

Table 1 Baseline characteristics of childhood Hodgkin lymphoma survivors and controls

There were no statistically significant differences in menarcheal age or cycle characteristics between the two groups of interest. There were 27 postmenopausal women included in this study (6 (7.1%) survivors and 21 (2.7%) controls). Of the menopausal survivors, only 1 (1.2%) was diagnosed with POI. However, only 9 survivors (10.7%) and 168 controls (21.1%) were aged > 40 years at time of the study.

In total, 40 survivors (47.6) and 399 controls (50.7%) used a form of hormonal contraceptives at time of study invitation. Approximately 50% of these women did not participate in the clinical part of the study. Timing of clinical measurements of the other 50% is reported in Table 1. There were no statistically significant differences in use of hormonal contraceptives and timing of clinical measurements between survivors and controls.

Among the survivors, median age at HL diagnosis was 13.4 years old (IQR 6.4; 16.4), with a median time since diagnosis of 16.5 years (IQR 8.4; 36.6) (Table 1). Ann Arbor stage was 1 or 2 in approximately 75% of cases. Many women (n = 71, 87.7%) received chemotherapeutic agents that are commonly considered as high risk for gonadotoxicity (i.e. cyclophosphamide, melphalan, procarbazine and/or ifosfamide). CED-score exceeded 6000 mg/m2 in 56 (66.7%) women. Three survivors (3.6%) did not receive any chemotherapy and were treated locally with radiotherapy only. In total, 47 women (56.0%) received radiotherapy, of whom 14 (16.7%) were irradiated to the abdominal/pelvic area (20-40 Gy). Two women (2.4%) received autologous stem cell transplantation.

Baseline characteristics of the subset of survivors and controls in whom laboratory measurements and/or ultrasound data were available are described in separate columns in Table 1. There were no substantial differences in baseline data, when compared to all included participants.

Clinical measurements

Anti-mullerian hormone and antral follicle count

Unadjusted median AMH serum levels were 0.5 ng/ml (IQR < 0.1; 7.5) in 45 survivors and 1.8 ng/ml (IQR < 0.1; 11.8) in 403 controls (p < 0.001), see Table 2. Adjusted GMR was 0.6 (95% CI 0.6; 0.7; p < 0.001).

Table 2 Hormonal and ultrasound markers of reproductive potential in childhood Hodgkin lymphoma survivors and controls

In Fig. 2 all individual AMH serum measurements in survivors are depicted in a scatterplot. Included reference lines represent percentile lines of the measurements in the control group. There were 21 (46.7%) survivors with low AMH serum levels (< p10 of controls). This proportion was statistically significantly higher in the survivor group, when compared to the control group (32 (7.9%) controls with AMH < p10; p < 0.001). Adjusted odds for having a low AMH as a survivor was 10.1 (95% CI 4.9; 20.6; p < 0.001).

Fig. 2
figure 2

Scatterplot of anti– Müllerian hormone (AMH) measurements in Hodgkin lymphoma survivors. AMH anti– Müllerian hormone, HL Hodgkin lymphoma. Scatterplot including all AMH serum measurements in the included HL survivors (n = 45). Each black dot represents a single measurement. Depicted reference lines are based on parameter estimates by quantile regression analysis in the control group (n = 406 AMH measurements) P3: − 0.021* age + 0.994; P5: − 0.024 * age + 1.135; P10: − 0.043 * age + 1.953; P50: − 0.125 * age + 6.155; P90: − 0.282 * age + 16.079; P95: − 0.277 * age + 18.040; P97: − 0.317 * age + 20.652

In total, 393 women (40 survivors, 353 controls) underwent a vaginal ultrasound. Two assessments were excluded from the analyses because of ovariectomy (2 controls). Median AFC was 11.0 (IQR 0.0; 26.3) in survivors and 15.0 (IQR 0.8; 48.5) in controls. A significant effect of HL survivorship on the AFC measurement was seen in adjusted regression analysis, β – 6.9 (95% CI – 10.2; – 3.7; p =  < 0.001). A total of 14 (35.0%) survivors had low AFC measurements (< p10 of controls). The adjusted odds-ratio for having a low AFC as a survivor was 4.6 (95% CI 2.1; 9.9; p < 0.001). Results are also presented in Table 2.

Follicle stimulating hormone and inhibin B

As reported in Table 2, median FSH levels were significantly higher, and Inhibin B levels were significantly lower in the survivor group, when compared to controls (FSH levels: 7.7 U/l (IQR 1.8; 51.9) in survivors, 5.9 U/l (IQR 1.8; 55.9) in controls; p = 0.006 and Inhibin B levels: 50.1 ng/l (IQR 5.0; 123; 0) in survivors, 70.4 ng/l (5.0; 183.9) in controls; p = 0.010, respectively). Adjusted analyses resulted in similarly significant results. Roughly 30% of the HL survivors had high FSH (≥ 10 U/l) levels and/or decreased inhibin B levels (≤ 20 ng/l), compared to 10–15% of controls. Odds ratios for having these abnormal markers of gonadal function as a survivor were 15.3 (95% CI 5.7; 41.1) in case of elevated FSH and 3.6 (95% CI 1.7; 7.7) in case of decreased inhibin B (both p < 0.001).

Consistency in clinical markers of reproductive function

Data on all included clinical markers were available in 39 survivors and 340 controls. In 46% (18) of survivors and 74% (253) of controls, AMH, AFC, FSH and Inhibin B levels were all normal. All markers were abnormal in 8 (20.5%) survivors and 4 (1.2%) controls. Six (15.4%) survivors and 9 (2.6%) controls only had low AMH levels, while the other measurements were still within normal range (see Supplementary file 1).

Time to (first) pregnancy and pregnancy outcomes

Of all participating women, 47 (56.0%) survivors and 430 (53.9%) controls stated in the questionnaire that they had (ever) tried to become pregnant. Women who did not attempt pregnancy were mainly young (74.7% of survivors and 69.3% of controls were below age of 30 years) and reported that there were circumstances currently preventing them to attempt a pregnancy (see Online Resource 2). In both groups, approximately 92% of the women who ever attempted to become pregnant succeeded, 81% ever achieved a live birth and 21% ever experienced a miscarriage. There were no statistically significant differences between survivors and controls.

Forty-three survivors (100%) and 389 out of 396 controls (98%) reported details on their first pregnancy (Table 3). Median age of survivors at first pregnancy was 2 years younger when compared to controls (27.0 years old (IQR 20.0; 35.0) versus 29.0 years (IQR 18.0; 37.0); p = 0.04, respectively). Median interval between HL diagnosis and first pregnancy was 13.8 years (IQR 3.1; 24.8). Most females conceived spontaneously (95.4% of survivors and 94.6% of controls).

Table 3 First pregnancy in childhood Hodgkin lymphoma survivors and controls

There was no difference in use of ART among the two studied groups (4.6% in survivors versus 5.4% in controls; p = 1.000, respectively). Time to pregnancy was 4.5 months (IQR 1.0; 24.0) in survivors and 3.0 months (IQR 1.0; 38.8) in controls (p = 0.15).

The number of women with a time to pregnancy over a year appeared to be higher in HL survivors (n = 10, 23.8%), when compared to controls (n = 45, 2.4%) (p = 0.07), but only the adjusted odds-ratio for TTP > 12 months reached statistical significance (odds-ratio 2.5 (95% CI 1.1; 5.6); p = 0.03). These survivors had a median age at first pregnancy of 29.5 years (IQR 21.5; 37.2), CED-score exceeded 6000 mg/m2 in 8 (80%) women and 4 (40%) had received pelvic radiotherapy.

When looking at the pregnancy outcomes in the survivor- and control group, no statistically significant differences were observed in live birth-, still birth-, miscarriage-, induced abortion- and ectopic pregnancy rates. Median birthweight and gestational age at delivery were comparable. In addition, no statistically significantly differences were observed in pregnancy rates and -outcomes when comparing HL survivors based on whether they received pelvic radiotherapy (n = 10 ‘yes’, n = 33 ‘no’) or the CED-score of received treatment (n = 9 ‘ ≤ 6000 mg/m2’, n = 32’ > 6000 mg/m2’) (see Online Resource 3 and 4). However, all survivors who experienced a preterm birth (n = 3) or gave birth to an SGA infant (n = 4) had received high-risk treatment with CED > 6000 mg/m2, p = 0.566 and p = 1.000.

Diagnosis and treatment related factors

Results of the regression models including diagnosis- and treatment related factors are included in Table 4. Overall, the deteriorating effect of treatment on reproductive markers appeared to be specifically present in protocols with procarbazine (GMR AMH 0.4 (95% CI 0.3; 0.6 95% CI); p < 0.001, B-coefficient AFC – 9.2 (95% CI – 17.9; – 0.4); p = 0.04). A higher CED-score had a negative effect on all clinical markers. There was no such pronounced effect of (pelvic) radiotherapy on the assessed measurements, although there was a trend towards the most abnormal values of the clinical markers in the subset of patients receiving pelvic radiotherapy. No clear effect of age at diagnosis was observed.

Table 4 Diagnosis and treatment-related factors associated with abnormal markers of reproductive potential in childhood Hodgkin lymphoma survivors

Discussion

This study broadly evaluated (indirect) clinical markers of ovarian reserve and -function, and (first) pregnancy results in a group of childhood HL survivors, compared to siblings and controls from the general population. All assessed clinical markers, including serum AMH, FSH, inhibin B and AFC measured by ultrasound, were significantly worse in survivors compared to controls. Survivors had a lower age at first pregnancy and a longer time to first pregnancy (> 12 months). However, when they did become pregnant, pregnancy outcome and live birth rates were comparable between the survivor and the control group.

Ovarian reserve and ovarian function

HL survivors were more likely to have lower AMH serum levels and AFC measurements, when compared to the control population. Almost half of the 45 assessed survivors in this study had AMH levels below p10 of healthy controls. Two previous cross-sectional studies on female HL survivors reported comparable results, even though applied cut-off values to define low AMH varied (Beek et al. 2007; Charpentier et al. 2014). The age-dependent changes of AMH serum levels should be considered when determining the cut-off value to classify ‘low’ levels (Fleming et al. 2012; Kelsey et al. 2011). In this study, the cut-off value of p10 was established to identify survivors at risk of a shortened reproductive life span. This cutoff value is relatively conservative and may have resulted in an over-estimation of low AMH in this cohort. However, the timely recognition of survivors at risk of impaired fertility is crucial in order to be able to act upon it, i.e. survivors could be advised to not postpone an active childwish or preserve fertility during survivorship.

During and directly after cancer treatment, a significant drop in AMH is often seen (Anderson, et al. 2022a, b). Subsequent recovery may occur. Previous studies reported peak concentrations of AMH 2–3 years post-treatment followed by a continuous, gradual decline (Irene Su et al. 2020). The present study included HL survivors with a median time of 16.5 years (IQR 8.4; 36.6) since diagnosis, hence additional improvement or recovery of serum AMH seems highly unlikely in this cohort. A longitudinal follow-up study among female CCSs demonstrated that the pattern of AMH decrease over a 3-year period (median 16 years post-treatment) was consistent with the pattern of AMH decline observed in healthy women (Van Der Kooi et al. 2017). These results suggest that the follicle reserve pool could be reduced after gonadotoxic cancer treatment, while the process of oocyte decay (of the remaining follicle pool) may not necessarily be accelerated during survivorship (Anderson and Su 2020).

High FSH serum levels and decreased inhibin B concentrations, as seen in the survivor group, are associated with impaired reproductive function. Most women who did not use hormonal contraceptives had regular cycles. One of the survivors had experienced premature menopause, with a group prevalence of 1%. However, only 9 out of the 84 survivors had reached the age of 40 years at time of the questionnaire, and premature deterioration of ovarian function could still occur.

Pregnancy

Despite significant differences in clinical markers between survivors and controls, not in favor of survivors, the reported overall live birth rate and miscarriage rates were similar in both groups (~80% and ~20% in both the survivor- and control-group). These rates are very similar to pregnancy outcomes in the general population (Gnoth et al. 2003; Wang et al. 2003; Wesselink et al. 2017). Most previous (cohort)studies mentioned number of female HL survivors achieving pregnancy or a live birth during follow up, without evaluating their wish to conceive, time to conceive or other fertility-related factors that may affect their reproductive potential (Beek et al. 2007; Drechsel et al. 2023; Fernandez-Pineda et al. 2018; Gözdasoglu et al. 1995; Green and Hall 1988; Horning et al. 1981; Licht et al. 2021; Mackie et al. 1996; Madsen et al. 1995; McCullough et al. 2010; Papadakis et al. 1999; Perrone et al. 1989; Reulen et al. 2009; Swerdlow et al. 1996; Sy Ortin et al. 1990; van der Kaaij et al. 2012; Wilimas et al. 1980). Among the two studies who specifically assessed pregnancy rates in females attempting pregnancy after diagnosis, reported pregnancy rates were comparable to our observations (77% of 26 survivors and 81% of 218 studied survivors, respectively) (Horning et al. 1981; van der Kaaij et al. 2012).

Pregnancy rates can be underestimated if participants who never had an active child wish were to be included. However, females who know they are subfertile or infertile could also indicate they have no child wish because they adjusted their future perspectives and lifegoals. Within the questionnaire, potential reasons for not wanting to become pregnant were asked. Most women felt they were still too young or reported that their current life situation (relation, study, financial, home) was not (yet) suitable for children. At the same time our questionnaire unfortunately did not include the decision making towards having children and whether having had a potential gonadotoxic treatment urged them to start their family at a younger age.

So although the probability of achieving pregnancy, resulting in a live birth, appeared to be within normal range, it should be noted that HL survivors were relatively young at time of conception when compared to the control group, and they had their first child ~ 3 years earlier than the general population (median age at first pregnancy 27 years in HL group versus 29 years in controls; Dutch population ~ 30 years old (CBS 2022)). Hypothetically, survivors were well informed about the risk of an impaired reproductive lifespan and pursued their child wish earlier than their peers. Adverse effects on pregnancy rates (or outcomes) could be more pronounced among survivors who attempt pregnancy at an older age.

In the general population, 80% of the couples conceives a pregnancy within a year (Gnoth et al. 2003; Taylor 2003; Wang et al. 2003). Several reports suggested time to conceive may be increased in cancer survivors (Barton et al. 2013; van Dijk et al. 2020). A study among childhood HL survivors, reported a median TTP of 42 months (3–100 months) among 20 women who were treated with at least five highly gonadotoxic MOPP (mechlorethamine, vincristine, prednisone, and procarbazine) courses and ~ 50% received pelvic radiation (> 3000 rads) (Horning et al. 1981). In the current study, median TTP in survivors was not as high as the previous report (4.5 months, 1–24 months) and there were no statistically significant differences observed in median TTP, compared to controls. However, adjusted analyses in which corrections were made for age at time of study, age at first pregnancy, educational level, and marital status, resulted in a significantly higher odds for having TTP > 12 months as a survivor.

Even though TTP may have been prolonged in survivors, use of assisted reproductive technology (ART) was minimal. Only 2 (4.6%) survivors and 21 (5.4%) controls achieved pregnancy via intra-uterine insemination (IUI) or in-vitro fertilization (IVF). Gonadotoxic treatment is associated with risk of impaired reserve status with fewer oocytes, but the effect on the quality of the oocytes is harder to study, so far there is no evidence demonstrating an impaired quality (Somigliana et al. 2019; van Dijk et al. 2020). Fecundability is also influenced by many other factors (such as female age, uterine function, immunology, etc.) but low AMH levels in women with an ovulatory cycle are not necessarily linked to impaired fecundity (Steiner et al. 2017; Depmann et al. 2017). Today it is unknown whether HL treatment has a lasting effect on other fertility factors explaining a prolonged time to pregnancy.

Received HL treatment

Although the power to perform additional analyses on treatment-related factors was limited and gonadotoxicity of existing treatment protocols could not be evaluated individually, results were consistent with literature; especially after treatment with procarbazine, abnormal levels of reproductive markers were present (Drechsel et al. 2023). Analysis showed no significant effect of cyclophosphamide, although only 9 of the survivors were treated with this agent of whom 3 did not participate in the clinical assessment, meaning the power of that analysis was impaired.

A clear gonadotoxic effect of a CED-score > 6000 mg/m2 was observed. Our results support recent recommendations from the PanCare-LIFE consortium and IGHG guidelines-group to lower the CED cut-off from 8000 to 6000 mg/m2 to determine high risk treatment (Mulder et al. 2021).

It is hypothesized that the alkylating agent dacarbazine is less gonadotoxic, compared to more traditional alkylating drugs such as nitrogen mustard, cyclophosphamide, and procarbazine (Mauz-Körholz et al. 2010, 2022). The recent EuroNET-C1 study demonstrated that the substitution of dacarbazine for procarbazine in the standard HL consolidation regimen (COPP to COPDAC) reduced gonadal toxicity, with less detrimental effects on FSH levels at least one year post-treatment, without compromising event-free survival rates (Mauz-Körholz et al. 2022). In our regression analysis, we observed a potential trend towards relatively higher AMH and AFC levels after treatment with dacarbazine (corrected for other chemotherapeutics, CED-score and pelvic radiotherapy), but results did not reach statistical significance. Available fertility data is limited, and additional, long-term follow-up studies are needed to evaluate the safety of dacarbazine.

After adjusting for CED-score, no individual significant effect of abdominal radiotherapy was seen on the assessed clinical measurements. However, it is unknown if the patients who had abdominal radiation (n = 14) underwent an oophoropexy to prevent radiation effect on the ovary. Previous studies revealed that uterine radiation can negatively affect birthweight and gestational age at delivery, in a dose-dependent matter (Green et al. 2002; Sudour et al. 2010; van Dijk et al. 2020). We attempted to evaluate the effect of pelvic radiotherapy on the obstetric results of the survivor-cohort in sensitivity analyses. No statistically significant differences were observed between HL survivors who did and did not receive pelvic radiotherapy. However, presented results should be carefully interpreted due to power issues. A previous study on all CSS-subgroups of the VEVO-LATER cohort reported a clear (dose-dependent) effect of abdominal radiation on hormonal and ultrasound markers of ovarian reserve (Van Den Berg et al. 2018).

Strengths and limitations

This study evaluated multiple clinical markers of reproductive ability and self-reported data on pregnancy outcomes in childhood HL survivors. Results were compared to a large control cohort and data cover a relatively long-time off treatment.

However, several limitations should be addressed. The studied cohort comprised a relatively young population. Risk of POI could not be assessed, and a considerable number of women indicated they considered themselves too young to aim to achieve pregnancy. Some women were still pregnant at time of questionnaire. Data were partly collected by self-report, information on for example the regularity of the menstrual cycle and TTP may not always be reliable and there may have been recall bias. Interval between clinical measurement and time of questionnaire varied widely and therefore data of both assessments could not be combined. Moreover, not all women participated in the clinical part of this study and some measurements were taken while using hormonal contraceptives. No adjustments were made to control for a potential confounding effect of polycystic ovary syndrome (PCOS) on AFC and AMH levels due to lacking data. Nevertheless, there were no statistically significant differences in cycle regularity, use of hormonal contraceptives or timing of clinical measurements between survivors and controls.

Heterogeneity in received treatment and power issues complicated the sensitivity analyses. The multivariable regression models on treatment-related factors could not be performed on dichotomized data, due to limited sample size.

Clinical implications and future research

Survivors should be well informed about their potential risk of a reduced fertile life span after cancer treatment. Data derived from follow-up studies should be used to improve recognition of patients at high risk of adverse effects on fertility and update guidelines on fertility preservation.

Included survivors were treated between 1973 and 2001. Since then, HL regimen has evolved considerably. Most patients of the studies cohort were treated with (high doses of) procarbazine, which is completely omitted in current regimens. Study outcomes may specifically be useful to counsel HL survivors who were treated with these preceding HL regimens. There is limited data available on current HL treatment protocols and there is also a gap in knowledge on gonadal-toxicity profiles of new immunotherapeutics (e.g. Brentuximab, Nivolumab). Additional, large, prospective studies are needed to fully assess fertility after treatment for HL. Ideally, studies should have a follow-up until after the age of 40, to evaluate risk of POI and the fulfillment of the desired number of children. Multiple consecutive assessments of markers (including at least AMH and AFC) during follow-up will result in a comprehensive evaluation of reproductive ability. Logically, comparable prospective studies should be executed among male HL patients and survivors.

Summarizing conclusions

Female HL survivors are at risk to have a reduced fertile life span. Low AMH, low AFC, elevated FSH and low inhibin B measurements were frequently present among female childhood HL survivors who were treated between 1970 and 2000s. Chance to conceive and pregnancy outcomes appear to be reassuring when attempting pregnancy at a relatively young age. Newly diagnosed patients and survivors should be counselled individually with respect to family planning and potential use of fertility preservation methods in survivorship. Additional research is needed to improve knowledge on reproductive ability after treatment of childhood HL.