Introduction

Male malignant lymphoma survivors treated with chemotherapy have a higher prevalence of sexual dysfunction than men from the general population [1, 2]. The dysfunction is at least partly caused by a reduced serum concentration of testosterone [1, 3]. Calculated free testosterone (cFT) may be considered the biologically active testosterone and therefore the golden standard for measuring testosterone deficiency [4]. Nevertheless, most studies of sexual dysfunction use serum total testosterone (TT) for screening purposes, whereas only few include sexual hormone binding globulin (SHBG) with subsequent calculations of FT (systematic review submitted by this research group). Sexual dysfunction caused by testosterone deficiency may only be apparent when calculated free testosterone (cFT) is accounted for and thus, testosterone deficiency (TD) may be underestimated if the assessment is solely based on TT [5].

Serum concentrations of SHBG are susceptible to changes in the body and are shown to increase with age [5, 6] but decrease in obese [5, 7] and type 2 diabetic men [8]. However, altered liver function can also result in altered concentrations of SHBG, including increased levels, which may lead to lower cFT if TT does not increase accordingly [9]. Lymphoma patients are often treated with chemotherapy with potential impact on liver function, and some studies [10,11,12,13,14] but not all [15,16,17] have detected a trend toward increased SHBG concentrations among treated patients. However, most of these studies have evaluated the effects of treatments that are not in accordance with current intensified standard regimens. If such intensive therapies impact SHBG concentration, this may lead to reduced cFT in case the Leydig cells also loose capacity for testosterone production. Thus, patients may have reduced concentrations of cFT despite a normal concentration of TT.

Our aim with this study was to evaluate whether serum TT is sufficient for screening malignant lymphoma survivors with sexual dysfunction, or if the cFT improves diagnostic accuracy. Secondly, we aimed at investigating differently treated subgroups to identify if some of these might be at increased risk of testosterone deficiency. Thirdly, analytically immunoassays (IA) are frequently used in routine diagnostics of testosterone deficiency, and we therefore also aimed to elucidate if IA can be used interchangeably with the gold standard liquid chromatography with mass spectrometry (LC–MS/MS) [4], for the assessment of TD in male survivors of malignant lymphoma.

Study population and methods

Study population

We included data from a cohort of adult male survivors of Hodgkin lymphoma (HL) and diffuse large B-cell lymphoma (DLBCL) diagnosed and treated with first-line treatment in the period 2008 to 2018 and in complete remission at inclusion. Survivors were identified through the Danish Lymphoma Registry [18]. The cohort consisted of 172 male survivors and has been described in a previous article [3]. As outlined in Fig. 1, eight patients with TT below the lower reference limit and additional 21 patients not available for blood samples at FU2 were not included in our data analysis (no contact: 11 survivors, declined inclusion because of lack of time: 3, did not complete the additional blood samples: 3; Online resource). Therefore, 143 survivors were included in this data analysis. The included cohort was divided into 83 survivors with TT of at least 15 nmol/l at FU1 (defined as “High normal TT”) and 60 survivors with TT < 15 nmol/l but above lower age adjusted reference limit at FU1 (defined as “Low normal TT”). Median time from diagnosis to follow-up was 7 years (range 2–13) and 8 years (range 4–14), respectively. Reference levels for age 18–49 years were 8.6 nmol/l and for age > 49 years 6.7 nmol/l (Fig. 1).

Fig. 1
figure 1

Flowchart of included male long-term survivors of malignant lymphoma. Gray box: the primary samples for this article. TT, total testosterone. aTotal testosterone alone was measured in a cross-sectional study (VitalityObs), and results are used as follow-up 1 samples. b − 2 SD = age adjusted reference levels: lower cutoff for age 18–49 years = 8.6 nmol/l, for age 50 years and above = 6.7 nmol/l. cTotal testosterone, calculated free testosterone, sexual hormone binding globulin, and luteinizing hormone were measured for 60 survivors included in a subsequent study (VitalityCheck) and represent follow-up 2. Blood samples available at diagnosis = samples collected from medical journal including total testosterone, calculated free testosterone, sexual hormone binding globulin, and luteinizing hormone

Clinical factors, including date of diagnosis, lymphoma diagnosis, treatment, and follow-up, were obtained from the Danish Lymphoma Registry. Information on sociodemographic factors, e.g., relationship status, sexual activity, erectile function, education, employment, children status, BMI, smoking, alcohol consumption, exercise, and comorbidity, was obtained by telephone interview or during a clinical visit.

Hormones

At FU1, all 172 participants had a fasting venous blood sample drawn before 10 am. Samples were analyzed at the department of clinical biochemistry at Copenhagen University Hospital—Rigshospitalet, Denmark. Analyses were performed on a Cobas e801 analytical unit, with Elecsys Testosterone II assays from Roche. These results were used to stratify the men into one of three groups: eight men with TT below lower reference level (− 2 SD), 81 men with TT above − 2 SD but < 15 nmol/l (“Low normal TT”), and 83 men with TT ≥ 15 nmol/l (“High normal TT”) (Fig. 1).

The 81 men with “Low normal TT” were eligible for an extended hormonal determination, and 60 men accepted referral to the Department of Growth and Reproduction (Dept. of GR), Copenhagen University Hospital—Rigshospitalet, Denmark, for an extended hormonal determination, FU2. Twenty-one men were not included in these additional examinations as stated above. The extended investigation included renewed venous blood sampling in the morning hours determined for TT, SHBG, and LH. TT and SHBG were measured by chemiluminescence immunoassays using the Access 2 instrument (Beckman Coulter, Brea, CA, USA). The limits of detection (LODs) were 0.35 nmol/l for T and 0.33 nmol/l for SHBG, and the corresponding coefficients of variation (CVs) were below 4% and 8%, respectively. Total testosterone was also assessed by liquid chromatography–tandem mass spectrometry (LC–MS/MS). Assessment of serum concentrations of LH chemiluminescence immunoassays was performed using the Atellica instrument (Siemens Healthineers, Tarrytown, NY, USA) with LOD of 0.07 IU/l and inter-assay CV < 15%. All analyses were accredited by The Danish Accreditation Fund for medical examination according to a European and international standard approved in Denmark (the standard DS/EN ISO 15189). Free testosterone was calculated based on TT assessments and SHBG concentrations, using the formula proposed by Vermeulen et al. assuming a fixed albumin concentration of 43.8 g/l [4, 19].

Among the 143 men included in the current study, 20 men in the “Low normal TT” and 30 men in the “High normal TT” subgroups also had hormonal assessment performed at the time of diagnosis of lymphoma as part of semen cryopreservation prior to onset of chemotherapy. Venous blood samples had been taken during daytime from 8 am to 1 pm and assessed at Dept. of GR, and TT also assessed by use of immunoassay.

Statistics

Statistical analyses were performed using the statistical software R, version R-4.1.2. Medians and range were used for descriptive statistics because of small subgroups, where both means and medians with standard deviations and percentiles were used for hormonal analyses. For evaluation of differences between paired samples, hormonal analyses were transformed using the natural logarithm to calculate changes in percent. Fisher’s exact test was used for comparison of groups for categorical variables to calculate p values, and paired t-test was used for continuous variables to calculate confidence intervals for paired samples, and unpaired t-tests for unpaired samples.

For evaluation of comparability of analyses-methods, we used linear regression analyses and Bland–Altman plots.

Results

Among the 143 men included, 67 (47%) had been treated for HL, of whom 38 men (56%) were in stage I/II, and 76 (53%) had been treated for DLBCL, of whom 45 men (59%) were in stage I/II, at the time of diagnosis. Table 1 shows the sociodemographic and clinical features of the men stratified according to group: “High normal TT” or “Low normal TT” and diagnosis. HL survivors were younger when diagnosed than DLBCL survivors in both groups (“High normal TT”: p < 0.002, “Low normal TT”: p = 0.04). HL survivors with “Low normal TT” were not significantly older than HL survivors with “High normal TT” (p = 0.13), and no age difference was seen for DLBCL survivors (p = 0.63). Most men had received standard first-line treatment with either R-CHOP, ABVD, or BEACOPP according to lymphoma type. Distribution of treatment intensity and chemotherapy regimens received did not differ between the “High normal TT” and “Low normal TT” groups (p = 0.80 and p = 0.38). The number of men in a committed relationship (p = 0.68), offspring status (p = 0.74), education (p = 0.14), employment status (p = 1.0), hours of exercise per week (p = 0.28), number of standard drinks per week (p = 0.81), number of pack years (p = 0.23), and prevalence of ED (p = 0.24) were approximately equally distributed within the two groups. Survivors with “Low normal TT” had higher comorbidity scores (p = 0.03) and higher body mass indexes (BMI) (p < 0.002) compared to survivors with “High normal TT.”

Table 1 Clinical and epidemiological baseline characteristics of adult male lymphoma survivors according to testosterone at first follow-up and diagnosis [3]

The analyses shown in Tables 2 and 3 reflects the androgen status at (1) the time of diagnosis, (2) the time of inclusion in the VitalityObs study (FU1 with TT alone), and (3) the time of inclusion in the VitalityCheck study (FU2 with full androgen status). TT concentrations at follow-up for the 20 survivors with both a diagnosis and a FU2 sample did not differ from the 40 men where TT was not determined prior to lymphoma treatment (mean difference 0.4 nmol/l, 95% CI: − 1.0–1.9), and the 20 men may therefore be considered “representative” of all men in the group of “Low normal TT” at diagnosis (Table 2). Median age was slightly higher for the 20 survivors with “Low normal TT” than for the 30 with “High normal TT,” both at time of diagnosis (median 28.0 and 26.0 years) and follow-up (37.5 and 33.0 years). Despite the difference, the age may be regarded as similar from a biological point of view. In survivors who had cryopreserved semen at time of diagnosis, and therefore had a sample taken at diagnosis, BEACOPP treatment was more common than in survivors who did not cryopreserve semen, most pronounced in survivors with “Low normal TT” at inclusion. There was no difference in treatment intensity between any groups.

Table 2 Clinical and epidemiological characteristics of adult male lymphoma survivors according to testosterone group, with subdivisions according to the availability of two sample, at diagnosis and follow-up [3]
Table 3 Hormonal analyses of adult male lymphoma survivors according to testosterone group, with subdivisions according to the availability of two sample, at diagnosis and follow-up

In the group of 20 men, in the “Low normal TT” group with both a diagnosis and a follow-up sample, the TT decreased from 14.2 nmol/l at time of diagnosis to 10.6 nmol/l at FU2 corresponding to a decrease of 21.1% (95% CI: 5.2–34.4%) (Table 2). The decrease was not a reflection of change in the concentration of SHBG, and thus cFT also decreased by 21.1% (95% CI: 5.2–34.3) (Table 3). SHBG, LH, TT/LH, and cFT/LH showed non-significant trends toward decreases over time: 4.4% (95% CI: − 18.2–11.6), 13.9% (95% CI: − 38.2–21.5), 11.1% (95% CI: − 39.8–31.4%), and 4.2% (− 41.0–34.2%), respectively. Nine of the 60 survivors (15%) with “Low normal TT” were identified with cFT testosterone concentration below age adjusted reference levels. Of these 5/9 were diagnosed with DLBCL and had received R-CHOP/R-CHOEP, and 4/9 mere diagnosed with HL where three received ABVD and one received BEACOPP. Median TT was comparable to others in the “Low normal TT” group, but cFT was lower (median 190 pmol/l vs 279 pmol/l), SHBG and LH were higher, comorbidity scores were higher, and IIEF5 scores below cutoff for erectile dysfunction (Table 4).

Table 4 Androgen status, questionnaire scores, and comorbidity score of nine male survivors of malignant lymphoma with normal TT but low cFT

For the group of “High normal TT,” TT concentrations increased from 15.4 to 18.0 nmol/l corresponding to an increase of 22.6% (95% CI: 1.6–47.9). Medians for the TT/LH ratios and the FT/LH ratios were all lower for survivors with “Low normal TT” (2.8 and 2.3 at follow-up for both subgroups) than for the “High normal TT” (4.4 at follow-up), which is a result of dividing groups according to T concentrations (Table 3). Both TT/LH and FT/LH ratios at the time of diagnosis for the 20 survivors with “Low normal TT” decreased approximately 4% (4.2 and 4.6%). For the group of 40 men with “Low normal TT” but no diagnosis sample, TT was similar to the TT of the 20 men, but cFT was lower, and SHBG concentration higher (34%, 95% CI: 10.6–63.3%) at FU2. No significant differences were seen for the concentrations of LH or the T/LH ratios between the 20 and the 40 men of the group with “Low normal TT.” Thirty survivors with “High normal TT” at FU1 also had a diagnosis sample and differed from the group with “Low normal TT” by having a significantly higher TT at the time of diagnosis and an increase over time of 22.6% (95% CI: 1.6–47.9%) (Table 2), and a higher TT/LH ratio. Furthermore, SHBG concentration in the group of 30 men was comparable to the group of 40 men, and thereby higher than for the group of 20 men (Table 3).

Figure 2 shows that TT/LH and cFT/LH ratios are comparable at diagnosis for all 50 men who had a diagnosis sample, but that ratios for the 20 men, in the “Low normal TT” group with both a diagnosis and a follow-up sample, decreased over time, because of decreased T concentration combined with stable LH concentration, indicating some degree of downregulation of the pituitary gonadotropin secretion.

Fig. 2
figure 2figure 2

Testosterone-LH ratios for the subgroup of 50 survivors with serum testosterone at both diagnosis and follow-up. TT, total testosterone; cFT, calculated free testosterone; LH, luteinizing hormone. Dashed lines represent reference levels, used at the Department of Growth and Reproduction at Copenhagen University Hospital—Rigshospitalet, for testosterone and luteinizing hormone based on the median age of included survivors (age 45–50 years): TT = 7.5 nmol/l, cFT = 150 pmol/l, LH = 8 IU/l. A TT/LH (IA) and B FT/LH (IA) at diagnosis the group with “Low normal TT” and “High normal TT.” Survivors with “High normal TT” (black dots) at diagnosis show no sign of compensated TD (no increase in LH), whereas the group with “Low normal TT” (white dots) show signs of compensation with some survivors with LH above reference levels. C TT/LH (AI) and D FT/LH (IA) at diagnosis and FU2 in the group with “Low normal TT.” For both the TT/LH and the FT/LH ratios, T levels decrease over time. For both time points, some compensation with increased LH is seen

Analyses included in this study were performed with either IA or LC–MS/MS. IA was used for the analyses performed in all three parts of this assessment (at diagnosis, at FU1, and FU2) and LC–MS/MS was performed on FU2 alone. Differences between the analyses were tested, and IA was found comparable between FU1 and FU2. Furthermore, LC–MS/MS and IA were found comparable on Bland–Altman plots (supplementary data). Only IA results are therefore stated in this article for simplicity.

Discussion

The two main findings of this study are (1) that men treated with BEACOPP are at increased risk of decreasing TT and (2) that screening by use of TT alone is insensitive in diagnosing men with biologically low testosterone. Surprisingly, TT was comparable between younger HL survivors and older DLBCL survivors, despite the expectation of older men having lower Leydig cell function than younger men. One difference between the two age groups was the difference in treatment regimen. The younger HL survivors were more often treated with harsh chemotherapy—BEACOPP. This indicates that even younger survivors of malignant lymphoma, who receive treatment with BEACOPP, should be assessed endocrinologically because of an increased risk of lower Leydig cell function. Routinely, cryopreservation is offered to younger patients with HL who are expected to receive more toxic chemotherapy regimens, meaning that the low age of the subgroup of 20 survivors from the “Low normal TT” group might be expected. We have previously described BEACOPP-treated survivors with a trend toward worse erectile function [3] and others have found BEACOPP-treated survivors to have worse sexual health than ABVD-treated and/or controls [20, 21]. Behringer et al. found male lymphoma survivors with impaired fertility and testosterone deficiency in up to 28% of BEACOPP-treated [22]. Furthermore, the toxic effect of BEACOPP on gonadal function, with impaired fertility as a consequence, is a well-established late effect [23]. In the current study, a trend toward more severe gonadal damage, as evidenced by lower testosterone levels, was also found. BEACOPP must be regarded as a high-risk treatment for gonadal function as well as sexual health of male malignant lymphoma patients. The differential decrease in TT for the two groups of survivors with “High normal TT” and “Low normal TT” at inclusion into the current study supports the hypothesis that BEACOPP-treated survivors constitute a high-risk group. Furthermore, decreasing testosterone concentrations with no compensatory increase in LH over time could indicate some level of downregulation of pituitary secretion of LH, perhaps an effect by treatment with BEACOPP, even though evidence from previous research is not compelling [24].

In the era of new targeted therapy, patterns of side effects will undoubtedly change. This can also have an impact on the occurrence of gonadal toxicity. In DLBCL, the combination of the antibody–drug conjugate polatuzumab targeting CD79b combined with R-CHP has shown promising results in the POLARIX study [25]. Bispecific monoclonal antibodies are also tested first line in early clinical trials. In Hodgkin lymphoma, a phase III study tested the antibody–drug conjugate brentuximab vedotin plus AVD versus ABVD [26], and another large phase III study is underway investigating the brentuximab vedotin regimen BrECADD (brentuximab vedotin, etoposide, doxorubicin, cyclophosphamide, dacarbazine, dexamethasone) versus BEACOPP [27]. Both therapies have been approved in several countries for first-line therapy, and it is expected that these regimens will be more common in the near future. Despite a changed side effect profile, many new biologically targeted treatments also affect gonadal function and both polatuzumab and brentuximab are accompanied by neurotoxicity and immune-related adverse reactions in a substantial proportion of patients. Research investigating these adverse gonadal reactions are scarce, and further focus on both spermatogenesis and Leydig cell function ought to be initiated. In the present study, we found a comparable decrease in TT and cFT over time for the 20 men in the “Low normal TT” group. For TT concentrations to be mirrored by cFT, SHBG has to be unaltered after chemotherapy. We found SHBG to be slightly non-significantly decreased at follow-up, but higher in the older survivors constituting the 40 men of the “Low normal TT” group. This is in line with SHBG being known to increase by increasing age [6]. Despite the increase, cFT was within reference levels for 85% of the “Low normal TT” group, where nine survivors (15%) had cFT below age adjusted reference levels in combination with normal TT indicating that some survivors with TD goes undiagnosed with TT screening alone. In the general population (40–79 years of age), the prevalence has been shown to be 8.3% [5], and normal TT with cFT below reference level is therefore more common in our cohort of male lymphoma survivors. This indicates that using TT alone for the screening of testosterone deficiency in the initial study VitalityObs was not fully sufficient to detect TD and would have increased the initial prevalence of TD from 4.7 to 9.9% in the VitalityObs study. A previous study has also shown an independent relationship between the TT and the cFT [4]. The nine survivors are not visible as below reference levels in Fig. 2, because the reference levels are shown for the median age of the included survivors. This was accepted because this figure was meant to show the overall course of the T/LH ratios.

Many subgroups were identified in this study, and it is important to clarify whether results from the small subgroup of 20 are representative of the full “Low normal TT” group, and whether the 60 survivors included in the VitalityCheck study are representative of the 172 survivors included in the VitalityObs study. The 20 survivors with hormonal evaluation at both diagnosis and FU2 differed from the 40, but combined the 60 survivors are considered representative of the 172 survivors of the initial study, because gonadal function in this group was the most impaired. Therefore, the identification of younger BEACOPP-treated survivors and older survivors as high-risk groups is deemed appropriate. Twenty-one survivors were not included in the “Low normal TT” group in the VitalityCheck study but were comparable to the group of 40 men without a diagnosis sample, where 15/21 would have been characterized as such (data not shown). Including these 21 men would probably not have changed results.

Some limitations of the current study design exist. Sample sizes were small, and our study is based on collection of samples from archives combined with samples taken at two different times of follow-up. Analyses compared have therefore been performed on different analysis machines (between archive samples, VitalityObs samples and VitalityCheck samples), and at different laboratories (VitalityObs performed at different laboratory), which could impact reliability of the results. Furthermore, sampling was not consistently performed in the morning for all archived samples, which could result in affected testosterone concentrations being less comparable to those sampled in the morning, because of diurnal changes in TT.

We investigated the differences between TT measured using immunoassays at FU1 and 2 and found them comparable. However, TT measured at FU2 was consistently 2 nmol/l higher than for samples taken at FU1. Because of this difference, we did not compare these directly. However, if taking this difference into account, the difference between FU1 in the “High normal TT” group and FU2 in the “Low normal TT” groups would only be even more pronounced. Furthermore, if mean TT at diagnosis in the “Low normal TT” group was 2 nmol/l higher than the stated, the difference between the diagnosis and the FU2 value would also be more pronounced. Therefore, we did not consider this difference to be a problem. However, a prospective design would provide more stringent groups of equal numbers, and a proper control group would be preferable.

We have omitted LC–MS from these analyses, because validation of our results found the IA analyses and the LC–MS analyses to yield the same results (online source). Because we have only IA analyses for comparison between diagnosis and follow-up, we chose to focus on these analyses. However, LC–MS is considered the gold standard, meaning future prospective studies should include LC–MS alongside IA samples. IA is still relevant in research studies because it is used frequently in the clinical endocrinologic evaluation of patients and survivors.

For the calculation of cFT, the method proposed by Vermeulen et al. was used [19], which is considered to be a precise method. However, because we did not have albumin analyses available, the albumin value was fixed for the calculations. Earlier, we hypothesized that liver damage could result in altered SHBG concentrations, and if this is correct albumin concentrations could be hypothesized to be altered as well. However, in this study, we did not find SHBG levels to be above reference levels, making this of less concern.

When investigating pre-treatment testosterone, it would be optimal to collect samples before the onset of disease in a large cohort study investigating gonadal health over time in the general population, and not at diagnosis of a lymphoma disease. In times of disease, gonadal function can be impaired, reducing production of testosterone. In this study, pre-treatment testosterone concentrations could therefore be somewhat reduced [28, 29], and the true effect of treatment even greater.

Conclusion

When using TT as screening, the presence of biologically relevant testosterone deficiency in at least 5.2% of lymphoma survivors may be missed. Thus, screening of older malignant lymphoma survivors and survivors treated with harsh combination chemotherapy, BEACOPP, should rely on cFT. Especially men treated with BEACOPP appear to be of increased risk of TD and should be followed more closely during follow-up. Prospective studies with longitudinal follow-up of patients at high risk of TD are needed.