Introduction

Glioblastoma progresses in almost all patients. At progression a standard of care is unavailable and management varies from best supportive care or re-challenge of temozolomide to additional combination treatments or clinical trial participation [1,2,3]. The definition of progressive disease is ambiguous due to unreliable differentiation between treatment-related imaging phenomena and true disease progression. As a result progressive disease can often only be assumed after substantial growth or occurrence of new distant tumor locations. Detection of glioblastoma progression is important to make decisions on cessation of current treatment or initiation of additional therapy, to enroll patients in clinical trials at comparable stages of advanced disease, and to measure end of treatment response in time and in location. As progression free survival (PFS) has been demonstrated to correlate with overall survival, progression is a useful surrogate endpoint for response assessment of the first treatment in glioblastoma [4].

To standardize the time of progression, the response assessment in neuro-oncology (RANO) working group established radiological criteria for treatment response evaluation and for clinical trial entry, as an update to the MacDonald criteria [5,6,7,8]. These RANO criteria are based on standard FLAIR, T2-weighted and T1-weighted MR imaging before and after gadolinium-based contrast agents, which have widely-acknowledged shortcomings to identify pseudoprogression (i.e., treatment-related increase in contrast enhancement without disease progression), and pseudoresponse (i.e. treatment-related decrease of contrast enhancement without disease regression) [9, 10]. In addition, extensive nonenhancing true disease progression can remain unidentified.

In clinical practice, decisions to stop current treatment or start additional therapy for patients with glioblastoma progression are made in multidisciplinary tumor board meetings and can be based on factors other than radiological progression, such as patient condition and motivation, and expectations from additional treatment. This may result in treatment decisions that diverge in timing from progression criteria fulfillment. We considered this as clinical practice progression (CPP).

In this study we focus on the use of progression to evaluate initial local treatment, such as surgery or radiotherapy. For this purpose we postulate earliest radiological progression (ERP) that enables an accurate measurement of the location of first progression. Earliest radiological progression is determined by consensus in a multidisciplinary team that retrospectively reviews all clinical and radiological follow-up information. In this review the development of end stage of disease and its radiological appearance can be taken into account, which could improve the determination of the time of progression and consequently PFS [11].

Here, we quantify the differences in PFS and in tumor volume at the first time of progression using three definitions of progression: ERP, CPP, and RANO criteria in a consecutive patient cohort treated at a tertiary referral center for brain tumors. The aim of this study is to quantify the differences in time to progression and tumor volume based on existing definitions for progression and the novel definition of earliest radiological progression. These definitions are not mutually exclusive and may serve different purposes. Our motivating purpose to compare definitions is to estimate the location of progression within the brain as accurately as possible for treatment evaluation.

Methods

All patients were identified from an electronic department database, who had newly-diagnosed glioblastoma according to the WHO 2007 classification [12] and were treated in the VU University Medical Center between Jan 1, 2012 and Dec 31, 2013. Brain imaging was retrieved from the hospital PACS. Data closure for radiological follow-up, most recent hospital visit and death status was on March 1, 2016. No patient was lost in follow-up. Time intervals were censored at most recent hospital visit for patients alive at data closure. Informed consent was obtained from all individual participants included in the study.

Of the 97 identified patients, 73 had at least one MRI in follow-up after the postoperative imaging and were therefore included to evaluate progression. The other 24 patients had died early without follow-up MRI, because they had a condition too poor for imaging that would not have had consequences for oncological treatment. Ten patients were still alive at data closure. The median follow-up of patients still alive was 33 months. The patient and treatment characteristics were obtained from the electronic medical records and are listed in Table 1.

Table 1 Patient and treatment characteristics

Clinical treatment decisions for these patients were made in twice-weekly multidisciplinary tumor board meetings including neurologists, neurosurgeons, neuroradiologists, neuropathologists, radiation- and medical oncologists. The standard of first treatment was surgery in the form of resection or biopsy, followed by chemoradiation consisting of a total dose of 60 Gy conformal radiotherapy in daily fractions of 2 Gy with concomitant temozolomide of 75 mg/m2 daily from the first to the last day of radiotherapy, followed by six cycles of adjuvant temozolomide of 150–200 mg/m2 for 5 days every 28 days. As we included all newly-diagnosed patients, for some patients treatments were individualized.

Radiological follow-up with MRI was in accordance with national guidelines [13], typically including an immediately postoperative MRI considered as reference, a first follow-up MRI halfway during adjuvant chemotherapy and scheduled MRIs at approximately 3 month intervals after completion of chemotherapy and additional MRIs as necessary based on alterations in patient condition. A standardized MRI protocol included T2-weighted-, FLAIR-, T1-weighted before and after gadolinium-based contrast agents, diffusion-, and perfusion-weighted imaging as previously described in [14].

Three definitions of first progression were used. First, for ERP, a consensus panel (a neurosurgeon, two radiation oncologists, and a neuroradiologist) re-examined all treatment decisions, the patient’s condition over time and date of death together with all MRIs to trace back the first MRI that demonstrated the radiological progression. Radiological changes were correlated with possible response to additional treatment. In the review process, the scan clearly showing progressive disease was identified and consequently the location of progression was examined on previous scans. ERP was defined as the first scan showing contrast enhancement on the T1-weighted MRI after gadolinium-based contrast agent, at the location that evolved into fatal progressive disease. As the last progression before death was identified, this was considered true progression, thereby excluding pseudoprogression. To facilitate an efficient review a timeline was plotted for each patient with enhancing tumor volumes for every MRI and clinical events, such as oncological treatment, start of dexamethasone, and neurological deterioration (see Fig. 1). MRIs were coregistered for each patient to facilitate review of multiple modalities over time. Second, for CPP, the notes, conclusions and treatment decisions of the multidisciplinary tumor board meetings were reviewed to extract the care team’s interpretation of first progression during care over the course of disease. Third, for RANO progression, the RANO criteria for first progression were applied to follow-up MRIs: a new measurable contrast-enhancing lesion outside of the radiation field or an increase of more than 25% in the sum of the products of largest axial perpendicular diameters of the contrast-enhancing lesion(s) [6].

Fig. 1
figure 1

Consensus review progression example. The enhancing tumor volumes and clinical events are plotted in time from diagnosis. Below the graph the corresponding MR images are shown, from top to bottom: the T1-weighted with contrast agent, T1-weighted and FLAIR images. In this case the earliest radiological progression was at the seventh MRI (547 days), the clinical practice progression at the eighth MRI (637 days), and the RANO progression at the ninth MRI (739 days)

The number of days was determined between the date of histopathological diagnosis and the date of progression, the date of death—if progression was unobserved, or the most recent hospital visit—if progression was unobserved and the patient was alive at data closure, for all three definitions. For ERP, as progression date we used the date of the MRI, that most accurately represented the origin of first progression; for CPP, the date of the MRI that the tumor board interpreted as first progression; and for RANO, the date of the MRI that fulfilled the criteria.

To determine the tumor volumes a semi-automated volumetric tumor segmentation of the gadolinium-enhanced T1-weighted imaging MRI sequences was used (Brainlab Smartbrush Suite software, BrainLAB AG, Münich, Germany). We considered gadolinium-enhancing tumor and -nonenhancing enclosed necrosis or cyst on T1-weighted images as tumor. Segmentations were performed on the gadolinium-enhanced T1-weighted imaging (generally 1 × 1 × 1 mm voxel size), while taking other modalities into account. The volumetric reconstructions of the segmentations were edited in three orthogonal planes and verified by the expert panel.

For statistical comparison of tumor volumes at progression, we have used the nonparametric Quade test for repeated measures on three groups and the Conover’s Quade post hoc test for group comparisons [15]. For comparison of PFS, we have used Kaplan–Meier survival curve plots and a proportional hazards random effects model to compare the PFS according to the three progression definitions with patients as model frailties with normal distribution for repeated measures [16].

Results

The 73 patients had on average 3.9 MRIs [interquartile range (IQR) between 2 and 5] in follow-up. The first follow-up MRI after postoperative imaging was obtained at a median of 5.4 months after the date of diagnosis (IQR between 3.2 and 8.8 months).

Not all patients fulfilled all three definitions of progression. ERP was established in 63 (86%) of 73 patients, CPP in 64 (88%), RANO progression in 42 (58%). Of the 63 patients who had died, 57 (90%) did with prior ERP, 57 (90%) with prior CPP, and 37 (59%) with prior RANO progression. Of the 10 patients still alive, six (60%) have had ERP, 7 (70%) CPP, and 5 (50%) RANO progression.

The time differences between diagnosis, progressions, and death are plotted in Fig. 2. Some noticeable differences were observed. In one patient CPP was 16 months before the ERP, because the consensus panel considered the CPP false-positive as the presumed progression remained stable for 16 months without treatment. In two other patients the RANO progression was 8 and 11 months before EPR, because the consensus panel considered the RANO progression false-positive based on spontaneous regression of the presumed progression. The 26 (41%) patients who died without prior RANO progression can be considered false-negative RANO progressions. The 6 (10%) patients who died without prior CPP can be considered false-negative clinical decisions.

Fig. 2
figure 2

Histograms of time differences in months between diagnosis, progression according to earliest radiological detection, clinical practice, and RANO criteria, and death. The time difference is from the event mentioned on the left to the event at the bottom

The PFS curves according to the three definitions and the overall survival are plotted in Fig. 3. As a reference, the median overall survival was 15.3 months and the percentage of patients alive at 12 months was 63%. The median PFS was 8.8 months until ERP, 9.5 months until CPP, and 11.8 months until RANO progression. The percentage of patients with PFS at 12 months was 29% for ERP, 40% for CPP, and 47% for RANO progression. PFS until ERP was shorter than until CPP [Hazard ratio (HR) for progression 0.57, 95% CI 0.38–0.84, p = 0.004], and shorter than until RANO progression (HR 0.29, 95% CI 0.19–0.43, p < 0.001). PFS until RANO progression was longer than until CPP (HR 1.75, 95% CI 1.19–2.58, p = 0.004). In a subgroup analysis with 54 patients who had a gross total resection and chemoradiation, similar differences in PFS were observed (Supplementary Fig. 1).

Fig. 3
figure 3

Progression free survival Kaplan–Meier curves for the definitions according to earliest radiological progression (orange), clinical practice progression (green), and RANO criteria (blue). As a reference, the overall survival curve is plotted in grey

The tumor volumes at progression were different among the three definitions (Quade F = 16.9, df = 2, p < 0.0001) as shown in Fig. 4. The smallest tumor volumes were observed at ERP with a median of 8.8 mL (IQR 1.6–27 mL), followed by volumes at CPP with a median of 17 mL (IQR 5.5–42 mL) and at RANO progression with a median of 22 mL (IQR 10–36 mL). The median difference in tumor volume between ERP and CPP was 0 mL (IQR 0–5 mL) and between ERP and RANO was 10 mL (IQR 3.5–27 mL).

Fig. 4
figure 4

Box plot of tumor volumes at progression according to progression definitions of clinical practice, RANO criteria, and consensus review

Discussion

In this study we postulate earliest radiological progression as comprehensive definition of glioblastoma progression by consensus of a multidisciplinary panel, that is only available after the course of disease using all available imaging, treatment decisions, neurological deterioration and ultimately death from disease. We demonstrate that the PFS using earliest radiological progression is shorter and detects smaller tumor volume than the tumor board meeting’s interpretation of progression, and even more so than the RANO criteria for progressive disease.

The unambiguous assessment of glioblastoma progression on MRI is notoriously difficult for several reasons. First, standard MRI sequences do not adequately reflect the extent of glioblastoma infiltration with T2-weighted imaging being too sensitive for tumor detection and gadolinium-enhanced T1-weighted imaging not being specific enough [17]. Second, treatment can induce MRI abnormalities, that mimick glioblastoma progression, such as resulting from surgery [18], radiotherapy [19], antiangiogenetic agents [20], and immunotherapy [21]. Third, the correlation between patient condition and MRI abnormalities is not strong [22, 23]. Fourth, MRI monitoring is necessarily discontinuous, and the level of progression detection is as fine-grained as the scheduled time intervals between MRIs. In clinical practice this often results in postponed treatment decisions until additional MRI followup is obtained at shortened intervals to accumulate arguments for further radiological progression.

To improve the distinction of glioblastoma progression and treatment-related MRI abnormalities, advanced imaging techniques have been introduced [24,25,26,27], such as MR perfusion-weighted imaging [28, 29], MR diffusion-weighted imaging [30, 31], MR spectroscopy [32, 33], and Positron Emission Tomography [34, 35]. A multimodal combination of imaging techniques has been reported to be more accurate than single imaging modalities [36,37,38]. Another strategy is to determine progressive disease from routinely collected healthcare data on treatments and readmission without imaging criteria [3]. Although promising, these strategies have so far not been incorporated in standard care.

Clinical decisions were made earlier in our data than the RANO criteria were fulfilled. Several reports discussing the RANO criteria have mentioned that these criteria are not customarily used as standard in clinical practice [39,40,41]. In a recent survey among radiologists in Europe only 27% obtained measurements according to RANO criteria [42]. Kazda et al. identified a difference in time to progression between clinical decisions and RANO criteria in anaplastic astrocytoma [43]. No publications were identified that have reported on this comparison in glioblastoma.

Strengths of this study include a long term and complete followup of this patient cohort. Furthermore, notes and conclusions from tumor board meetings were well-documented to evaluate the progression definition of clinical practice. Finally a custom MRI viewing setup facilitated efficient imaging review for the multidisciplinary consensus review. Limitations of this study are that this was a single center study with clinical practice interpretation and consensus review by one multidisciplinary team. The interpretations in clinical practice and follow-up frequency may vary between centers. As a result the validity of extrapolation of our results to other teams and centers remains uncertain and should be investigated in future research.

The implications from our findings can be several. The earliest radiological progression should be considered to evaluate the relation between the location of progression and initial local therapy, such as radiotherapy or surgery. It can also be of added value for clinical trial inclusion by allowing correction of lead time bias and confounding due to tumor growth, inherent to the RANO criteria for trial enrollment and response evaluation. Furthermore, it can be useful for evaluation of the quality of clinical decision-making by multidisciplinary teams. The earliest radiological progression can only be determined at late advanced stages of disease and is therefore not a suitable alternative for routine care or for clinical trial enrollment. The minimum requirements for expertise of a multidisciplinary panel, the reproducibility and the optimal scheduled time interval between follow-up MRIs for determination of earliest radiological progression are undetermined.