Introduction

Glioblastoma (GB) is the most common malignant tumor of the central nervous system in adults [1], accounting for approximately 69% of all malignant adult brain tumors in Germany [2]. The prognosis remains dismal, with a median overall survival (OS) for the whole patient population below 21 months and 5-year OS rates below 10% after standard-of-care trimodal therapy, consisting of surgery followed by radiotherapy with concomitant and adjuvant temozolomide (TMZ) chemotherapy [3,4,5].

Prognostic factors for patients with GB are related to patient-, treatment- and tumor characteristics. Patient-related prognostic factors include age at diagnosis, clinical and neurological performance at diagnosis or recurrence [6,7,8,9], as well as body mass index [10,11,12], and blood glucose levels [13,14,15,16,17]. Treatment-related factors include gross total resection, concomitant and adjuvant use of TMZ, tumor-treating fields, and aggressive salvage therapy (if possible) [4, 18,19,20,21]. Molecular factors include methylation status of the O6-methylguanine-DNA methyltransferase (MGMT) promoter [22] and mutation status of isocitrate dehydrogenase 1 and 2 (IDH 1/2) [23,24,25]. The role of telomerase reverse transcriptase (TERT) promoter mutation remains ambiguous to this date [26, 27].

The analysis of Brain Tumor Study Group protocols (1966–1975) revealed a radiotherapeutic dose–effect relationship [28], suggesting dose escalation as a viable possibility of treatment optimization. The main focus of accelerated hyperfractionated radiotherapy (HFRT) was to exploit the differences in redistribution, reoxygenation, repopulation, and DNA damage repair between normal and tumor cells [29, 30] on the way to dose escalation.

Several prospective glioma trials from the pre-TMZ era have evaluated the potential of HFRT [31,32,33,34]. HFRT with or without dose escalation however failed to show any superiority in OS or progression-free survival (PFS) in comparison to normofractionated radiotherapy (NFRT) [35]. A more recently published randomized controlled trial comparing chemo-radiation protocols of dose escalated HFRT with NFRT did not point towards any benefit of HFRT in terms of OS [36].

In addition to the evidence for non-inferiority of HFRT in the abovementioned trials, HFRT schemes significantly reduce treatment time, even though the absolute number of radiotherapy treatments is higher in NFRT schemes. Therefore, the rationale to use it as an alternative scheme for patients is to shorten the time of irradiation from 6 to less than 4 weeks. Furthermore, radiobiologists have hypothesized a reduction of late radiation injury as well as reduced repopulation rates in tumor cells [37]. The current treatment standard for primary GB is the combination of radiotherapy and TMZ according to the Stupp protocol [38] plus an eventual adjuvant therapy with tumor treating fields which may add a few months to the PFS and OS [4]. To the best of our knowledge, there are no prospective randomized trials investigating the efficacy of HFRT versus NFRT in combination with TMZ, with only a couple of monocentric retrospective studies reporting comparable outcomes [39, 40]. To further explore any differences that might exist between HFRT and NFRT, we performed a retrospective multicenter analysis of GB patients which allowed us to model the effects of both schemes on PFS and OS while accounting for known prognostic factors.

Materials and methods

This work is based on data of patients with newly diagnosed GB treated from 10/2004 to 7/2018 at four tertiary care institutions. Variables of interest were retrospectively collected by each center and inserted into a Microsoft® Access database, which contained pre-specified selection possibilities for the value assignment of categorical variables. The anonymized center-specific databases were then exported and pooled into a single Microsoft® Excel file, which was further processed with R statistical software.

Inclusion criteria for this analysis were at least one follow-up of OS, age ≥ 18 years and having received either HFRT (37 × 1.6 Gy or 30 × 1.8 Gy twice/day) or NFRT (30 × 1.8 Gy, 30 × 2 Gy or 30 × 1.8 Gy with 2 Gy simultaneously integrated boost). This resulted in 484 patients being eligible for analysis of which 310 had received NFRT and 174 HFRT. Differences between the NFRT and HFRT group were assessed using the Wilcoxon rank sum test and Fisher’s exact test for continuous and categorical variables, respectively. PFS was calculated as the time difference between the start of treatment and first clinical or radiological progression. The latter had to be confirmed according to the Response Assessment in Neuro-Oncology (RANO) criteria [41] by a board-certified neuroradiologist. Follow-up included clinical and radiological evaluation every 3 months or depending on the patients’ performance status. Salvage treatment was defined as any treatment initiated after tumor progression (repeat irradiation, surgery, bevacizumab, TMZ, and combinations thereof).

The primary outcome of interest for this study was OS, with PFS as a secondary outcome, both calculated according to the Kaplan–Meier method. PFS and OS were correlated with patient-, tumor- and treatment-related variables via univariable and multivariable Cox frailty models. The frailty model accounts for clustering of patients within centers and therefore for any unmeasured center-specific factors that may affect the outcome (e.g. patient selection bias) [42, 43]. The Cox frailty model can be written as

$$h_{i} \left( t \right) = h_{0} \left( t \right)\exp \left( {{\varvec{X}}_{i} \beta + \alpha_{j} } \right) = h_{0} \left( t \right)\exp \left( {\alpha_{j} } \right){\text{exp}}\left( {{\varvec{X}}_{i} \beta } \right)$$

where \({{\varvec{X}}}_{i}\) is a vector of covariates for patient \(i\), \({h}_{0}\left(t\right)\) the baseline hazard function and \({\alpha }_{j}\) is the random effect associated with the \(j\)-th cluster (clinic) that acts as a multiplier on the baseline hazard. A log-normal distribution was assumed for the frailty terms, equivalent to a normal distribution of the random effects \(\mathrm{exp}\left({\alpha }_{j}\right)\).

For multivariable modelling, we tried to utilize as many variables as possible [44, 45] by imputing the covariates with missing information using the multiple imputation by chained equations R package ‘mice’ [46]. A “missing at random” mechanism was assumed being responsible for missing variables, with all variables given in Table 1 as well as follow-up time and OS being added into the imputation model. Variables were imputed in the order of their number of missing cases. Logistic regression and predictive mean matching were used for imputing binary and continuous variables, respectively. A total of 100 imputation data sets were created. Each was used to fit a Cox frailty regression model, and the final Cox model was obtained by pooling the coefficients of these 100 Cox models together. For sensitivity analysis, a Cox model using only the complete cases (with no missing values) was built and compared to the Cox model resulting from the imputed datasets.

Table 1 Baseline characteristics

Statistical significance was defined as p-values < 0.01. This threshold was chosen based on the conversion between p-values and minimum Bayes factors [47]. Bayes factors (or likelihood ratios in case of simple hypotheses) measure the strength of evidence between two competing hypotheses [48]. In exploratory analyses, a p-value of 0.01 corresponds to a minimum Bayes factor of 1/6.5, providing moderate-strong evidence against the null hypothesis [47]. All analyses were performed within the statistical programming language R version 4.0.3 [49].

Results

Table 1 shows the baseline characteristics of the 484 patients fulfilling the inclusion criteria for this study. There was a significant difference between the NFRT and HFRT group with respect to the treating institution, because HFRT patients mainly stemmed from two hospitals. Other significant differences between the NFRT and HFRT groups existed for planning target volume (PTV) (larger in the HFRT group), steroid administration (more frequent in the HFRT group), and salvage treatment (less frequent in the HFRT group). The exact form of salvage treatment was known for 266 patients and most frequently consisted of surgery (23.3% of cases), repeat radiotherapy (16.5%), TMZ (16.2%), and the combination of surgery and TMZ (14.3%).

Kaplan–Meier survival curves stratified according to treatment protocol are shown in Fig. 1. Median follow-up for the whole cohort was 15.7 months (range 0.8–88.6 months). During individual follow-up, tumor progression occurred in 385 out of 410 patients (it was unknown for 74 patients), and 394 of the 484 patients had died. Median PFS was 10.0 months (95% confidence interval [CI] 9.2–11.0) in the NFRT group and 7.9 months (6.6–9.3 months) in the HFRT group, which was almost significantly different (p = 0.012). Median OS was 16.9 months (15.0–18.7 months) in the NFRT group and 14.9 months (13.2–17.3 months) in the HFRT group (p = 0.26). OS was 16.0 months in both groups combined (95% CI 15.0–17.6 months).

Fig. 1
figure 1

Progression-free and overall survival probability as a function of follow-up time computed with the Kaplan–Meier estimator and stratified according to treatment schedule (statistical significance at p < 0.01). HFRT: Hyperfractionated radiotherapy NFRT: Normofractionated radiotherapy

A difference in PFS and OS between centers was evident from univariable Cox regression analysis (Table 2). Given that these differences might reflect differences in some unmeasured hospital-specific variables, frailty models were used to fit the other univariable models reported in Table 2. Factors significantly associated with longer PFS were higher KPS, frontal versus multifocal tumor location, gross total resection, MGMT hypermethylation, IDH mutation and simultaneous plus sequential TMZ versus no TMZ administration. Factors significantly associated with longer OS were younger age, higher KPS, frontal versus multifocal tumor location, gross total resection, MGMT hypermethylation, IDH mutation, simultaneous plus sequential TMZ versus no TMZ administration, no steroid administration and salvage treatment.

Table 2 Results of the univariable Cox frailty model analyses

After creating 100 imputation datasets, we fitted a separate multivariable Cox frailty model to each of these datasets and pooled the regression coefficients and associated p-values. The average hazard ratios and p-values are given in Table 3. After controlling for many possible confounders, higher KPS, gross total versus not gross total resection, MGMT hypermethylation, IDH mutation, smaller PTV size and salvage therapy were significantly associated with longer OS (all p < 0.01). RT schedule (HFRT versus NFRT) had no significant effect on OS (p = 0.108). Also, in contrast to univariable analysis, multifocal disease was no longer associated with worse OS. The frailty term modelling heterogeneity between hospitals was on the threshold of statistical significance (p = 0.010), showing the importance of its inclusion into the model.

Table 3 Results of the multivariable Cox model analyses on imputed datasets

A multivariable Cox model fit to the original dataset with complete observations only resulted in similar hazard ratios (Table 4). However, the uncertainties were larger due to the smaller number of observed events (258 patients were not considered due to missing variables), and only MGMT hypermethylation, surgery extent, PTV size, and salvage therapy reached the threshold of statistical significance (p < 0.01).

Table 4 Results of the multivariable Cox model analyses on complete observations only

Discussion

The main aim of this study was to compare patients who had received HFRT with those having received NFRT with respect to PFS and OS. The strength of this analysis is that patients were pooled from four different hospitals, resulting in a large cohort of 484 patients for which several putatively prognostic factors were known. The analysis indicates that HFRT yielded comparable outcomes to NFRT.

This confirms previous findings that accelerated HFRT achieves treatment effects comparable to NFRT in a shorter time frame [39, 40]. The lower α/β ratio of normal brain tissue for late reactions, for which an α/β value of 2 Gy is widely accepted, favors a hyperfractionated acceleration instead of a hypofractionated one. The biologically effective dose (BED) inside the tumor of HFRT protocols may vary considerably because of uncertainties about the kick-off time when repopulation sets in, \({T}_{k}\), and the doubling time \({T}_{d}\) of repopulation. Radiobiological estimates for GB resulted in \({T}_{k}\) = 37 days and \({T}_{d}\) = 15.4 days, indicating a negligible influence of fractionation, but these estimates were obtained based on PFS rather than on the more reliable endpoint OS [50]. Using the formula provided by Lee et al. [51]

$${\text{BED}} = \left\{ \begin{gathered} nd\left( {1 + \frac{d}{{{\raise0.7ex\hbox{$\alpha $} \!\mathord{\left/ {\vphantom {\alpha \beta }}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$\beta $}}}}} \right),\quad T \le T_{k} \hfill \\ nd\left( {1 + \frac{d}{{{\raise0.7ex\hbox{$\alpha $} \!\mathord{\left/ {\vphantom {\alpha \beta }}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$\beta $}}}}} \right) - \frac{{\ln 2 \left( {T - T_{k} } \right)}}{{\alpha \cdot T_{d} }},\quad T > T_{k} \hfill \\ \end{gathered} \right.$$

and adopting \(\mathrm{\alpha }/\upbeta\) = 8 Gy and \(\mathrm{\alpha }\)=0.12 Gy−1 for GB [50], we calculated BEDs for the different schemes as follows:

  • 30 × 1.8 Gy twice/day (\(T\)=21 days): BED = 66.2 Gy;

  • 37 × 1.6 Gy twice/day (\(T\)=24.5 days): BED = 71.0 Gy;

  • 30 × 2 Gy daily (\(T\)=42 days): BED = 73.1 Gy.

Thus, the BEDs of the three different schedules are not too different and—given an additional clonal heterogeneity and clonal selection by radiotherapy—may explain why the particular fractionation scheme had no significant effect on PFS or OS.

To the best of our knowledge, there was no favorable patient selection for HFRT protocols. Quite opposite, in one center (center 3) the HFRT scheme was preferably used in patients with involvement of or a tumor position close to the brain stem; this was also the case in some patients from center 1. Furthermore, some of the prognostic factors indicated a less favorable patient selection into the HFRT group: A significantly higher percentage of patients did receive concomitant steroids (76.1% versus 55.3%, Table 1), which is associated with poorer OS [52]. PTV sizes were also larger in the HFRT group, and a significantly lower percentage of patients received salvage therapy after tumor progression. For these reasons, we performed multivariable regression analysis with frailty terms in order to reduce any possible selection bias that is a general problem of retrospective analyses.

We were able to confirm the effect of several prognostic factors in univariable and multivariable analysis. The significant predictors were gross total versus non-gross total resection, MGMT hypermethylation, IDH mutation status, and salvage treatment. Mutations in the IDH1 or IDH2 gene may predict a better prognosis by downregulating several hypoxia-inducible factor 1α (HIF-1α) target genes which are involved in glycolytic energy metabolism [53], among them lactate dehydrogenase (LDH) which catalyzes the conversion of pyruvate to lactate and thereby contributes to the acidification of the tumor microenvironment [54]. Accordingly, the LDH serum concentration was shown to be negatively correlated to OS in GB patients [33]. Both univariable and multivariable analyses also showed that OS was comparable between the HFRT and NFRT groups. However, there was a trend towards different PFS rates between the two groups, eventually pointing towards a systematic difference between centers in the evaluation of PFS within the scope of the RANO guidelines. The differentiation between pseudo-progression and real tumor regrowth is a difficult task in practice and our data reflect the real-world differences that exist between different applied criteria. In this respect, the difference in OS between institutions, which is an unambiguous endpoint, is harder to explain, but could be related to hospital-specific factors such as patient selection. For example, patients from center 3 had on average significantly larger PTVs than patients from the other institutions (all p < 0.0001). The negative selection was obvious in this patient cohort also by admission of HFRT in case of proximity to the brainstem. In addition, there might have been other unmeasured differences, so that the use of frailty models was a reasonable choice. This was confirmed by the importance of the frailty terms in multivariable modelling (Tables 3 and 4).

Another modality to reduce treatment time besides HFRT is hypofractionated accelerated radiation therapy. The use of hypofractionated protocols has been established in several randomized controlled trials [55,56,57,58]. This method is however primarily recommended for elderly patients and its role in the treatment of GB in younger individuals and those with good prognosis requires further research [59]. The other potentially interesting point of hypofractionation is the theoretical possibility of overcoming immanent or acquired radioresistance of tumor cells [44]. An alternative way of overcoming radioresistance could be a local dose escalation in biologically active tumor volumes, albeit the clinical data of dose escalation are needed to be proven in phase 3 trials [60]. Another possibility could be an intensification of chemotherapy regimens in selected patients which may also improve the treatment results with acceptable toxicity [61, 62].

Despite the large sample size, this study had several limitations. First, the presented data are retrospective in nature, which implies potential uncertainties in some variables, missing data, as well as the possibility of systematic confounding and biases. We took account of the problem of missing variables by using multiple imputation by chained equations, which is preferred over just using the data with complete cases because it makes better use of the full information within the data [45]. Sensitivity analysis with only the complete cases yielded quantitatively similar results as the imputed dataset analysis and therefore confirmed the latter, although uncertainties in regression coefficients were larger, as would be expected due to the smaller sample size. For example, IDH mutation status was a highly significant predictor of OS in multivariable regression on the imputed datasets but failed to reach the significance threshold in multivariable regression on the original dataset. Steroid administration was associated with worse OS in multivariable regression on both the imputed and original datasets but failed to reach the significance threshold. Nevertheless, the obtained hazard ratio > 1 is consistent with the detrimental effects of steroids found by Pitter et al. [52] which Klement and Champ [16] have attributed in part to the well-known effects of corticosteroids to raise blood glucose levels. In general, it is reassuring for the quality of our data that the hazard ratios of all variables are consistent with their expected effects on PFS and OS which are known from previous studies and tumor biology.

Another limitation is that the tolerability of the treatments was not directly evaluated because of the retrospective nature of this work. The need for steroids was assessed as an indicator of both tumor progression and treatment-related brain edema. Steroid administration was more frequent in the HFRT group. However, since prophylactic steroid administration was also performed, it cannot be considered an accurate surrogate parameter of toxicity. Previous data from Kaul et al. [39] and Lewitzki et al. [40] reported good tolerance of simultaneous TMZ and HFRT.

In conclusion, our multicenter analysis suggests that radiochemotherapy with HFRT and TMZ is a safe option for patients wishing to shorten their treatment time and does not affect OS, even in the context of larger tumors or poorer performance status. MGMT hypermethylation, smaller tumor size, and salvage therapy were confirmed to have the most significant positive impact on OS. To the best of our knowledge, both HFRT protocols were well tolerated without excessive acute or late toxicity, albeit no dedicated analysis was performed due to the retrospective nature. Analyses of neurocognition and quality of life in particular should also be performed in the future for a better assessment of tolerance. The rate of treatment interruptions was low in all protocol groups.