Background

There is a clinical and economic need for improved outcomes for patients diagnosed with glioblastoma. Globally, over 300,000 primary brain and central nervous system cancers were diagnosed in 2020, with glioblastoma accounting for 48.6% of primary brain and central nervous system cancers in the United States (US) [1, 2]. The outcomes from care are poor; a patient’s relative survival post-diagnoses is under 7% at 5-years with current best practice [3]. In Australia, primary brain cancers account for more disability adjusted life-years lost per patient than any other adult cancer [4]. Further, brain cancer diagnoses have the largest financial cost of all cancers [5].

The World Health Organisation has classified four categories of diffuse gliomas: adult-type diffuse gliomas, pediatric-type diffuse high-grade gliomas, pediatric-type diffuse low-grade gliomas and circumscribed astrocytic gliomas with glioblastoma grouped with isocitrate dehydrogenase (IDH)-wildtype as adult-type diffuse gliomas [6]. The current standard treatment for glioblastoma consists of maximum safe surgical excision, followed by concurrent chemo-radiotherapy (typically 60 Gy/30 fractions or 40 Gy/15 fractions in elderly patients) with concurrent temozolomide (TMZ), followed by adjuvant TMZ [7]. Standard treatment has remained relatively unchanged in the last 17 years [8, 9]. Patient outcomes with standard treatment include a median overall survival of 14.6 months, a median progression-free survival of 6.9 months, a 12 months overall survival of 61.1% and a 12 months progression-free survival of 26.9% [9, 10]. Interestingly, the control arms of more recent randomised control trials which follow the Stupp et al. 2005 [11] guidelines report a median survival of approximately 20 months, most likely as a result of treatment refinement [11, 12].

Radiation therapy (RT) treatment quality has been shown to influence patient outcomes with target delineation identified as one of the largest variables in the treatment process [13, 14]. Currently, RT management of patients with glioblastoma is fragmented by the existence of variable target delineation guidelines which are based on magnetic resonance imaging (MRI) and computed tomography (CT) [15, 16]. This fragmentation in practice results in a significant difference in the outlined clinical target volume (CTV), for instance, in a study by Kumar et al. the CTV varied from 246 to 436 cm\(^3\) depending on the guidelines followed [17]. The guidelines documenting the acceptability of positron emission tomography (PET) scanning to support the RT planning of patients with gliomas have recently changed with PET scanning now more likely [18]. There is also increasing evidence to investigate dose-escalated RT in patients with glioblastoma, particularly in subgroups that do not receive TMZ or those who are known to have unmethylated DNA repair enzyme O-6-methylguanine-DNA methyltransferase (MGMT) [19]. Therefore, RT quality and patient outcomes are highly dependent on target voluming guidelines and the dose fractionation regimes used.

The main concern with dose escalation or increases in target volume size is an increased risk of side effects such as radiation necrosis [20]. This has prompted research to investigate biologically-derived target volumes and RT boosts to improve the specificity and sensitivity of the treated volumes and facilitate safe dose escalation [19, 21, 22]. Currently, 75% of patients receiving standard care for glioblastoma recur local to the high dose RT volume [23]. This highlights the need to improve both the local control and the sensitivity of target localisation, as nearly 100% of glioblastoma patients progress with current practice. Key to this goal, is capturing detailed recurrence information with geometric reference to the standard International Commission on Radiotherapy Units (ICRU) volumes and pre-treatment imaging, as this can be used to quantify the success of target contouring.

Functional imaging modalities have the advantage of being able to identify biochemical changes that often predate, or are distinct from, anatomical changes [24, 25]. A diverse range of functional imaging biological markers, including MRI, magnetic resonance spectroscopic imaging (MRSI) and nuclear medicine (NM), hold promise in this new era of functional image-guided radiotherapy (FIGR) [22, 25]. Additionally, the novel disciplines of radiomics and dosiomics, will have an increasingly synergistic role alongside functional imaging in the management of RT patients [26]. However, there is an evidence gap to support the development of guidelines to realise the benefits of FIGR. Therefore, the aim of this systematic review is to synthesise strategies and outcomes of functional imaging for RT planning in patients with glioblastoma. Outcomes related to RT target volumes, dose distributions, toxicity, recurrences and survival will be synthesised to support clinicians and research.

Method

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [27]. The review was prospectively registered on PROSPERO (CRD42021221021) [28]. The search strategy was conducted in consultation with a research librarian.

Search strategy

Literature searches were conducted in January 2021 in PubMed, CINAHL, Scopus, Cochrane Library, EMBASE, Web of Science and AMED. Three concepts were used to guide searches: functional imaging, glioblastoma multiforme and radiotherapy planning. Indexing thesaurus, keywords, MeSH terms, CINAHL headings, Indexed terms, Emtree terms, and synonyms were used, as relevant. Searches were restricted to texts available in English and published from January 2011 to January 2021, due to the rapid development in this field. A full overview of search terms, dates and boolean operations used for each database is available in Additional file 1.

Eligibility criteria

Eligibility criteria were defined according to participants, intervention, comparator, outcome and study type (PICOS framework). Participants and comparators were patients with primary glioblastoma undergoing external beam radiotherapy with a curative intent. Participants were 18 years of age or over. Mixed cohorts (i.e., glioblastoma and other high grade glioma) had to report separate outcomes for patients with glioblastoma to be eligible. Journal articles and published conference abstracts were included. Review articles and unpublished grey literature were excluded. Purely explorative articles (i.e., radiomics studies that were not trialled in patients with glioblastoma) were excluded.

Study selection

Duplicates were removed in Covidence [29]. Title and abstract screening against eligibility criteria was completed by two reviewers (from JR, MN, IG or LM). Conflicts were resolved by an independent reviewer (NH or JR). Eligibility of full text articles was determined by two reviewers (from JR, MN or IG) reading each paper in full. Conflicts were resolved by consultation. Data extraction was completed by JR. Figure 1 provides an overview of the screening, exclusion rationale and data extraction.

Fig. 1
figure 1

Flow diagram (adapted from the PRISMA guidelines) demonstrates the screening and evaluation process

Data extraction and management

Citations were grouped according to study cohort. Study characteristics were extracted using a Covidence template and the Template for Intervention Description and Replication (TIDieR) checklist [30]. Data were extracted according to study ID, title, author, information source, location, funding, conflicts, study aim, study design, participant description, sample size, attrition, confounding variables, intervention and TIDieR components. Study quality including biases for journal articles was assessed using the Newcastle–Ottawa Scale (NOS) assessment tool (Table 1) [31].

Table 1 Study characteristics

Studies were grouped according to their aims, including improvement of standard target delineation, facilitation of dose escalation, improvement of organ at risk (OAR) sparing, mapping recurrence and treated volumes, mapping functional and planning volumes, mapping functional volumes and survival, and carrying out voxel/radiomics analyses (Fig. 1 and Table 1). Study outcomes and confounders are captured in Tables 2, 3 and 4. A study quality score of less than nine in Table 1 indicates an issue with data reliability in an individual study, such as reporting in a short time frame post-intervention or not reporting select patient data that may have biased the results.

Study synthesis

The tables present a snapshot of the included studies (Table 1) and examine clinically relevant RT planning and associated patient outcomes (Tables 2, 3 and 4). An analysis of outcomes relating to median overall survival, median progression-free survival, 12-month overall survival percentage, and 12-month progression-free survival was completed (Table 5). In Table 5 studies were weighted based on their population number and the summed averages compared to a historical control. Data were otherwise reported narratively.

Results

A total of 32 studies (53 citations) met inclusion criteria (Table 1 and Fig. 1) . TIDieR components were inconsistently reported [25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76]. Information about who carried out the intervention and their expertise was lacking. Few studies used a prospective control group.

Included study overview

Included studies consisted of journal articles (n = 25) and conference presentations (n = 7). There were 20 retrospective cohort studies, eight prospective cohort studies, seven Phase I or Phase II trials and one Phase III randomised controlled trial.

Studies were grouped into seven categories comparing standard RT planning volumes with functional-derived volumes (n = 20), treated RT volumes with recurrences (n = 15), functional imaging into standard target delineations (n = 9), functional imaging volumes and survival (n = 8), functional imaging to guide dose escalation (n = 8), voxel or radiomics analyses (n = 4) and OAR sparing based on functional information (n = 3). There were 13 different technological interventions directly related to functional imaging, categorised according to NM, MRI, MRSI and FIGR boost. Dose escalation based on FIGR was implemented in eight studies using NM agents (n = 5), MRSI (n = 2) and MRI (n = 1).

Study outcomes

Twenty studies included outcomes on RT target volumes, 12 reported toxicity with a FIGR intervention, 10 reported survival with a FIGR intervention and 15 reported recurrence. Two of the 32 studies were not included in Tables 2, 3, 4 and 5 for analysis. Matsuo et al. [50, 51] conducted a volume comparison study and reported on the sensitivity and specificity of the CTV with different gross tumour volume (GTV) to CTV expansion margins. Lopez and colleagues [81] established a framework of co-dependencies between MRI, MRSI and radiotherapy planning volumes using radiomics.

Target volume size

Target volume size for RT planning volumes was recorded in Table 2. Target volume reporting between studies was varied, with 11 studies reporting median volumes and four studies reporting mean volumes. The GTV that incorporated functional and anatomical imaging was larger than the anatomical GTV in all studies. The intra-study variation in standard CTV size was greater than the inter-study variation in CTV size that incorporated functional and anatomical imaging. Target volume creation with functional imaging was heterogeneous. The most common approach was to incorporate the functional imaging data directly into the CTV or the planning target volume (PTV) without a specific uncertainty margin and by using standard GTV to CTV margin recipes. The CTV and PTV change with combined imaging was methodology dependent and varied between a 27% increase to a 50% decrease. The dose escalated boost volume was less than 100 cm\(^3\).

Table 2 Target volume outcomes

Dose distribution, toxicity and survival

Toxicity and survival metrics associated with the RT treatment process are presented (Table 3). Dose escalation was used in eight studies. The dose-escalated fractionation schedules had an equivalent 2 Gy doses (EQD\(_2\)) ranging between 74.4 and 104.8 Gy, with a given alpha/beta ratio of 10. Only one study used an EQD\(_2\) > 79.4 Gy [39]. Toxicity in the dose-escalated studies with an EQD\(_2\) of \(\leqslant\) 79.4 Gy were well tolerated, when delivered as a simultaneous integrated boost, with 60 Gy in 30 fractions prescribed to the standard PTV. A limited boost size \(\leqslant\) 65 cm\(^3\) was recommended in one study [85].

Table 3 Survival and toxicity outcomes

Recurrence patterns

Fifteen studies reported on recurrence outcomes (Table 4). These were broken down into three categories: studies that involved a retrospective comparison with a standard approach, studies that used functional imaging to derive the target volumes for treatment and studies that outlined extra target volumes with associated dose escalation. Table 4 indicates the relative location of a recurrence as a proportion of the total number of recurrences. Consistency was limited in the reporting of recurrence patterns and their location in relation to imaging and ICRU volumes. There were a higher proportion of recurrences central to the high therapeutic dose volume when functional and anatomical imaging were combined to outline the target.

Table 4 Recurrence outcomes

Analysis of the effect of functional imaging on survival

Survival analysis (n = 10 studies, N = 686 participants) indicated improved survival outcomes with FIGR compared to the 2005 Stupp et al. trial (Table 5) [9]. Dose escalation, guided by functional imaging (n = 6 studies, N = 235 participants) further increased median overall survival and median progression-free survival, of these studies, only the Brinkmann study used a local current control group [59,60,61].

Table 5 Survival analyses when functional imaging incorporated in radiation therapy planning

Discussion

To the authors knowledge this is the first systematic review of functional image-guided RT interventions in glioblastoma patients. Recent findings to support FIGR in glioblastoma patients have been shared and indicated FIGR can improve patient survival outcomes in certain patient cohorts [87, 88].

Standard target delineation

The classification of the target volumes for patients with glioblastoma tumours is not static, that is, it varies on the time-point, resources and local expertise available. Most studies using FIGR edited the CTV or PTV directly with historical margin recipes (Table 2). This is problematic as interventions that produce new functional imaging volumes have their own uncertainties that are inherent to the technologies and processes used. Study teams were typically cautious about incorporating functional imaging directly in the GTV, perhaps due to concerns about the final target volume size. However, an increased target size can be avoided with technology-specific GTV-CTV margin recipes (Table 3).

Dose escalation

Functional image-guided radiotherapy provides a mechanism to give a more personalised and possibly a more effective dose distribution. For example, where dose escalation was trialled based on a functionally-derived target volume there was a corresponding survival increase against the historical control (Table 5). This should be reviewed with the understanding that there has been substantial technological and practice refinement in glioblastoma treatment since 2005 and these changes may be due to other confounding factors like the categorisation of patients within individual studies, surgical practices, radiation therapy planning and delivery practices. There may be scope for further survival improvements with a more tailored approach to dose escalation, as boosting was done with a small number of discrete dose intervals.

Presently, research is concerned with ensuring the safety profile of dose-escalated RT for glioblastoma patients. Simultaneous-integrated boost dose prescriptions of 72–76 Gy delivered to volumes less than 100 cm\(^3\), with the standard PTV receiving 60 Gy, appeared relatively well tolerated (Table 3). The omission of a CTV-PTV boost volume margin in most studies may have limited the effectiveness of dose escalation. Further research is needed to compare the intrinsic differences in the boost volumes indicated by the different imaging agents and to derive ways to account for imaging agent variation via uncertainty margin recipes.

Organs at risk sparing

The specificity of target localisation in glioblastoma patients is directly linked to OAR sparing, particularly with inverse planning and intensity modulated radiation therapy (IMRT) delivery. Dose escalation with simultaneous-integrated boost techniques was common in the studies. Switching to IMRT delivery created OAR sparing capacity [41, 61]. However, current segmenting guidelines are based on anatomical imaging and thus inherently have to use relatively large isotropic expansions [89, 90]. Berberat and colleagues [69] demonstrated the feasibility of using functional imaging to map the white matter tracks in the brain and incorporated this information when deciding on target volumes. This mapping resulted in a 15% reduction in the PTV. This smaller PTV will most likely result in increased OAR sparing and increased potential for dose escalation via isotoxic planning. Both Wang [70] and Altabella [72] also used diffusion tensor imaging magnetic resonance imaging (DTI MRI) to map white matter tracks, but instead of using the information for target localisation, information was used as an IMRT OAR optimisation structure. A limitation with NM approaches to FIGR, is the lack of information regarding OAR functionality that can be incorporated into the RT plan.

Volume comparisons and radiomics

Volume comparison studies were the most prevalent study type (n = 21). Volumes were compared using absolute comparisons (mean/median), dice scores, Hausdorff distance metrics and radiomic voxel comparisons. The variation in data (Table 2), highlighted the need for volume comparison study guidelines. For example, two of the five studies that altered treated target volumes, based on FIGR, did not report the treated CTV size [43, 46].

A current gap in the literature is the unclear relationship between volumes outlined with different functional imaging agents for FIGR. None of the studies compared target volumes based on different functional scan information. The similarity between targets based on fluorine-18-fluoroethyltyrosine positron emission tomography (\(^{18}\)F-FET PET) and carbon-11-methionine positron emission tomography (\(^{11}\)C-MET PET) is well established but information about the crossover to other types of functional scan volumes is not present in the literature.

The value of recurrence analysis in volume comparison studies cannot be underestimated as they indicated where tumour volume was either missed or did not fully respond to treatment. However, the varied terminology used in the studies to describe recurrence location in relation to the outlined volumes makes interpretation and utilisation challenging. Despite this, there was a trend to suggest that a combined approach to target voluming would improve target sensitivity (Table 4).

Volume comparison studies that report survival are key to assessing the benefits of FIGR and critical to any practice reform. The methodologies used in the studies that implemented FIGR were heterogeneous, however, survival outcomes were reported in a uniform way. The improvements in survival with FIGR (n = 10) were modest, yet consistent (Tables 3 and 5). Further improvements may be possible with optimised dose escalation and outlining.

The radiomics studies presented a pathway to incorporate functional imaging data from diverse imaging techniques with a unified approach and could address limitations associated with each imaging technology.

Limitations and recommendations

Our study has several limitations. Variation in patient cohorts between studies should be acknowledged. Factors that influence outcomes, such as patient demographics, extent of surgery, MGMT status, chemotherapy protocol and RT treatment delivery were not clearly reported across all studies. It is not possible to verify the diagnosis and classification processes that led to patients being diagnosed with glioblastoma and thus meeting inclusion criteria for individual studies. The inclusion of published conference articles resulted in some studies having limited background detail to support data extraction.

Carrying out this review highlighted key recommendations for future FIGR studies. Implementation protocols should be published. Guideline development and implementation is needed for RT volume comparison and recurrence studies. Further, there is a need for prospective volume comparison studies between different FIGR intervention agents with radiomic analyses.

Conclusion

Functional image-guided radiotherapy is not currently standard practice for glioblastoma patients due to the lack of conclusive Phase III evidence. However, there are many variations possible when implementing FIGR and this makes assessing competing methodologies difficult. This review highlights the different approaches to FIGR for glioblastoma patients and relevant successes.

A three-pronged approach to FIGR for the RT treatment of patients with glioblastoma is recommended with optimised target voluming, dose prescription and OAR sparing. There is a need for a more structured approach to the testing and implementation of competing methodologies, with practical recommendations to account for the variations in available technology.