Introduction

Glioma grade 4 (GG4) tumors, which include astrocytoma IDH-mutant grade 4 and the astrocytoma IDH wt (that currently defines the glioblastoma class) are the most common and aggressive primary tumors of the central nervous system [1,2,3]. Compelling evidence, based on objective tumor volume analysis, supports the role of the extent of resection (EOR) in high grade glioma patients as the first step of patients management. Surgical treatment, however, can rarely be considered as radical, due to infiltrating nature, multifocal presentation, and ill-defined tumor margins [3,4,5]. Although the Stupp protocol was introduced as postoperative standard treatment more than 15 years ago, alternative treatments have not been recently developed, and the poor 5-year survival rate has not changed significantly in these last decades [6,7,8,9]. The infiltrative growing, the rapid proliferative rate of malignant cells and the appearance of treatment-resistant cell clones shortly after initial therapy induces recurrence within 2 cm of resection margins, regardless the initial EOR. In addition GG4 are characterized by high spatial and temporal molecular heterogeneity, which implies a poor prognosis, especially in glioblastoma cases with a median survival of less than 15 months [1, 2, 6,7,8, 10,11,12].

The intraoperative treatment with Carmustine Wafers (CW) implantation [marketed as Gliadel, biodegradable copolymers discs impregnated with the alkylating agent Bis-ChloroethylNitrosoUrea (BCNU)], for newly high grade glioma was introduced in 2003 as a therapeutic bridge between surgery and Stupp protocol onset. After an initial enthusiasm, its employment has been gradually reduced, mainly for contrasting results in terms of efficacy and safety, but not completely abandoned [13,14,15,16,17,18,19].

The differences in overall survival (OS) and response to treatment are largely due to the wide heterogeneity of GG4, with a variable distribution of biological features associated with aggressiveness between tumors as well as within a single tumor. Several prognostic factors for GG4 have been proposed including MGMT promoter methylation status, IDH1, IDH2 mutation, EOR and residual volume [10, 20,21,22,23]. In addition, comprehensive multiplatform genome-wide analyses have demonstrated that GG4 heterogeneity is dependent from specific molecular/genetic features [12, 24,25,26]. Finally, a high tumor mutation burden (TMB) in glioma has been significantly associated with short OS [27]. However, the prognostic value of TBM in GG4 has not been fully elucidated [27,28,29].

In the last years, machine learning (ML) techniques have been applied in the oncology field with the aim of improving diagnosis, prediction and prognosis of cancers, being characterized by the capability of taking in account interaction effects and nonlinearities. In particular, several ML techniques have been adapted in order to be applied on censored data for survival analyses. In this context, survival tree methods are non-parametric, flexible methods that have been developed with the aim of dealing with high dimensional covariate data [30,31,32].

In the present study, with the aim of better defining prognostic markers for OS, we performed a comprehensive molecular characterization of surgical specimens of a monoistitutional cohort of 102 patients who underwent surgical resection of newly GG4 and adjuvant postoperative Stupp protocol. By a next-generation sequencing (NGS) approach using a broad exome panel of 523 genes, analysis of copy number variations and of types and distribution of nonsynonymous mutations was performed. ML models were developed by using eXtreme Gradient Boosting for survival (XGBoost-Surv) to investigate the contribution in predicting OS of the different available data (e.g. clinical data including follow-up, radiological data, or molecular data derived by panel-based sequencing such as presence of somatic mutations and amplification).

Methods

Patient cohort

The study included 102 patients who underwent a surgical resection of a newly diagnosed GG4 at the Neurosurgery Department of Udine Hospital between 2014 and 2019 [1,2,3]. Median age of diagnosis was 60 years. Written informed consent was obtained for surgery. Patients provided informed consent in accordance with the local institutional review board requirements and the Declaration of Helsinki. After surgery, all patients were treated with combinations of concomitant adjuvant radiotherapy and chemotherapy, followed by adjuvant chemotherapy, as recommended by Stupp [7, 8], and were included in a study approved by the local Ethics Committee (protocol N. 0036566 /P/ GEN/EGAS, ID study 2538). Treatment with CW was applied in 39 out of 102 GG4 cases. All the 102 cases were characterized for the main clinical, radiological and molecular parameters with prognostic significance that were reported in Additional file 1: Table S1 [20,21,22,23].

DNA extraction

DNA material was extracted from frozen GG4 sample sections using the Allprep DNA / RNA / miRNA Universal Kit (Qiagen). Alternatively, DNA was extracted from formalin-fixed, paraffin-embedded (FFPE) GG4 samples using the AllPrep DNA / RNA FFPE Kit (Qiagen).

Sequencing analysis and “Tumor Mutational Burden” (TMB) determination

The 102 GG4 cases included in the series were all sequenced using the TruSight Oncology 500 DNA kit (TSO500). The library was prepared manually according to the manufacturer’s protocol. NGS was performed using the NextSeq 550 instrument (Illumina) with eight libraries per sequencing run. Further information regarding sequencing analysis and determination of TMB is reported in the Additional file 1: Methods section S1.

The calculation of the “Tumor Mutational Burden” (TMB) using the data obtained from sequencing employing the TSO500 panel (Illumina) was determined by using the local TruSight Oncology 500 version 2.3 (Illumina) app. The manufacturer’s quality control criteria were used to determine whether a TMB determination result with TSO500 was valid, including NGS library concentration ≥ 1 ng / µL, median insert size ≥ 70 bp, median coverage exon ≥ 50, count and percentage of exons with coverage of at least 50 counts ≥ 90%. TMB metrics were calculated following small variant calling. TMB was determined as the ratio of the number of eligible variants (mutations) to eligible DNA regions [expressed in megabases (Mb)]. Eligible variants (representing the numerator) include only coding variants with a frequency ≥ 5% and coverage ≥ 50 reads. Single Nucleotide Variants (SNV) and insertion/deletion (indel) were included, but multi-nucleotide variants (MNVs) and variants with a cosmic count ≥ 50 were excluded. Variants in blacklisted regions with poor mapability were also excluded. For the denominator, all eligible coding regions (with coverage ≥ 50x) were included, except for the blacklist regions. Small variants were exported from the TSO500 pipeline. All passed filter data, including TMB and “small variants” (SNV and indel) were provided in the CombinedVariantOutput.tsv file for each sample. All the small variants calls were merged into one file. An additional quality control for the call of “small variants”, TMB, copy number variations (CNV) was used to select samples with the best quality.

Evaluation of the TMB in the TCGA cohort

The tumor mutational burden (TMB) was calculated from the MC3 Public MAF file as previously described [33]. Only primary tumors from the high grade glioma TCGA cohort were included in the analysis [25].

Machine learning approach for survival prediction

The approach used to predict patient OS was based on the application of ML methods [31, 32, 34]. The datasets were composed by integrating different information. The main resource variables, divided into clinical (radiological and clinical variables), and molecular including gene mutation status, and copy number results (amplifications and deletions), are reported in Additional file 1: Table S2.

EXtreme Gradient Boosting for survival (XGBoost-Surv) modeling t was performed in Python (V3.8) using the xgboost and sklearn_surv library partially evaluated using the scikit-learn library [35]. In each of the datasets, string-based categorical variables were converted to numerical values, and each continuous variable was standardized. All XGBoost models were trained using the survival Cox objective function. Additionally, for each dataset, hyperparameter tuning was performed using the HyperOpt package [36, 37]. The mean Harrell’s concordance index (c-index) was computed using the five-fold cross-validation (CV) approach and used as metric [31, 38, 39]. After identifying the hyperparameters, the model was subsequently evaluated on the same total dataset using five-fold CV with the randomly selected folds being distinct from those used in the tuning of the hyperparameters to accurately assess the generalizability of the model [31]. Further details have been included as Additional Information.

Because of the high number of variables in the datasets used compared with the number of cases, the dimensionality of the input in the datasets was reduced by feature selection. In this context, selection was applied firstly considering all mutations included in the output and then used the merged TMB trace to filter out germline variants. Other variants were excluded by filtering the mutation type. Specifically, only in the case in which a mutation found by the sequencing was considered to perturb the gene (i.e., missense variant, stop loss, start loss), the mutation and therefore the gene were selected. Thus, only genes with at least one non-silent mutation were recorded into a presence or absence mutation (binary-valued patients)-by-genes matrix. Multiple hit mutations and singular mutations were both indicated as presence of mutation (representing in both cases a mutated status for the gene). Of note, from this list of variants that was generated, clinical data were integrated into the previous produced matrix.

In detail, to perform ML modeling, 4 different datasets were created. Two datasets (dataset 1 and dataset 2) were created considering the whole series of the 102 GG4 cases (Additional file 1: Tables S3 and S4). Two other datasets (dataset 3 and 4) were created in the context of a subgroup of 71 out of 102 cases that were characterized by values above the 75 percentile for the following parameters: median insert size, median coverage exon, and PCT_exon 50x (Additional file 1: Tables S3 and Table S4). Specifically, in dataset 1 all the available variables arisen from the 102 cases were considered. In dataset 2 and dataset 3, feature selection was applied by selecting somatic gene mutations present in at least 4 GG4 cases. For both dataset 2 and dataset 3, the selection of most important features was obtained by using the EXtreme Gradient Boosting for survival (XGBoost-Surv) modeling t performed in Python (V3.8), which provides a way to compute feature importance by measuring how concordance index (c-index) decreases when a feature is not available. In the survival framework, the remotion of the relationship of a certain feature with the survival time is executed by random shuffling of its values: the weight of each feature is quantified by the drop on average of the c-index [40]. Finally, dataset 4 was obtained by applying expert selection, by considering only genes included in the PI3K-AKT-mTOR signaling pathway (WP3844) [41].

In order to understand how the models yielded their predictions, Shapley Additive exPlanations (SHAP) values were used to obtain a visualization of the overall feature importance for the models [30,31,32]. Then, SHAP dependence plots were generated for each model in order to compare how the features contributed to the corresponding model’s output.

Results

Impact of established prognosticators in the whole GG4 cohort

The main clinical and molecular markers with established prognostic significance in the 102 GG4 cases entering the study cohort are reported in Additional file 1: Table S1. Median age of diagnosis was 60 years. Eight/102 (7.8%) GG4 cases were IDH mutated and 94/102 cases (92.2%) were IDH wild type. Sixty-two/102 GG4 cases (60.8%) showed a methylated MGMT promoter whereas 40/102 GG4 cases showed an unmethylated MGMT promoter. Median OS was 15.5 months (range 2–48 months). Treatment with CW was applied during surgical resection in 39/102 GG4 cases (38.2%).

By univariate Cox regression analysis, preoperative volume (hazard ratio (HR) = 1.10, p = 0.033), EOR (HR = 0.92, p < 0.001), a methylated status of the MGMT promoter (HR = 0.44, p < 0.001) and CW treatment (HR = 0.53, p = 0.003) showed a significant impact on OS. In a multivariate Cox regression analysis model in which all parameters with significant impact in the univariate Cox regression analysis were included, EOR (HR = 0.85, p = 0.002), MGMT promoter methylation status (HR = 0.47, p < 0.001) and CW treatment (HR = 0.62, p = 0.032) retained an impact on OS (Table 1) [10, 13,14,15,16,17,18,19,20].

Table 1 Univariate and multivariate Cox regression analysis of OS in the whole cohort (102 GG4 cases)

Impact of molecular and clinical variables on OS by a ML approach

The most frequently found gene mutations for the 102 cases are shown in Fig. 1. In particular, the 3 most frequently altered genes were EGFR, TP53 and BRAF [25]. The gene with the higher rate of amplification was EGFR gene (Additional file 1: Figure S1) [25].

Fig. 1
figure 1

Distribution and classification of the gene mutations found in the 102 sequenced GG4 cases. The distribution of the type of mutations (e.g. missense mutations, nonsense mutations and multihit) and mutation frequencies for the 20 genes most frequently found mutated are shown. The number of mutated cases for each gene is also shown

To account for genomic features in predicting OS of GBM cases, we used a ML approach to build a multiparametric clinical-molecular prediction model that includes clinical information, somatic mutations, and amplifications [31, 32, 34, 42]. In this ML modeling, functional mutations and amplifications were considered only on the basis of their presence or absence, regardless of their specific location, according to the generated simple matrix, as reported in methods’ section.

To perform ML modeling, several feature selection methods were employed. By using this approach, we obtained 2 datasets (dataset 1 and dataset 2) for the entire cohort of 102 GG4 cases and 2 datasets (dataset 3 and dataset 4) for the subset of 71 patients in which TMB calculation met the appropriate metrics according to the Illumina pipeline (Additional file 1: Tables S3 and S4). A first preliminary comprehensive model developed using 417 gene mutations, 45 gene amplifications and 12 clinical/radiological variables (dataset 1) showed a concordance - index (c-index) of 0.537 in the test set. To better define the contribute of the most relevant somatic mutations in predicting OS, in the context of a model that also considered clinical variables, the number of gene mutations was reduced by selecting those present in at least 4 GG4 cases. By this approach, the model developed using 107 variables constituted by 95 gene mutations and 12 clinical/radiological variables (dataset 2) showed the best c-index (c-index of 0.682 in the test set, Additional file 1: Table S3). Figure 2 shows the SHAP analysis reporting the first 20 variables that explain the model developed using dataset 2. The directions in the SHAP plots of these variables showed a significant contribution in predicting OS for the well-established prognosticators: residual volume, preoperative volume, EOR and methylation status of MGMT, also in keeping with results of Cox regression analysis. With this model, IDH1 mutation status had also a prognostic significance [10, 20, 23, 43]. The direction in the SHAP plots was also consistent with the multivariate Cox regression analysis for the treatment with CW, that was considered as a protective variable [13,14,15,16,17,18,19]. SHAP analysis in Fig. 2 also reported the contribution of gene mutations in predicting OS. Among gene mutations, the higher contribution was reported for BRAF. Several other genes such as POLE, PTEN, NOTCH3 and TP53 emerged as important for OS prediction [24,25,26, 44,45,46,47,48,49].

Fig. 2
figure 2

Feature importance ranked by “mean absolute magnitude” of SHAP values using dataset 2. The model was developed for the 102 GG4 series using 107 variables constituted by 95 genes and 12 clinical/radiological variables (dataset 2). Upper panel: mean absolute values corresponding to the magnitude of feature importance. Lower panel: summary plots for SHAP values; for each considered feature, a single patient is represented by one point. Along the x axis the position of a point corresponds to the logarithm of the mortality risk associated with that feature for a specific patients. This value corresponds to the impact that the feature had on the model output for that specific patient. Data clusters with SHAP values around zero indicate low impact on the model. Along the y axis, the different features are disposed according to their importance corresponding to the mean of their absolute SHAP values. Features with the higher importance are disposed on the upper part of the summary plots. SHAP, Shapley Additive exPlanation

To evaluate the effect not only of genes but also of the new emerging biomarker TMB as OS prognostic indicator, a further dataset (dataset 3) was considered (Additional file 1: Tables S3 and S4). In detail, by using dataset 3, composed by 108 features constituted by 95 gene mutations, 12 clinical/radiological variables, and TMB (dataset 3), a model with a c-index of 0.625 in the test set was obtained (Additional file 1: Table S3). Of note, SHAP analysis highlighted the importance for the predicting of OS for TMB, with higher TMB values associated with shorter OS (Fig. 3).

Fig. 3
figure 3

Feature importance ranked by “mean absolute magnitude” of SHAP values using dataset 3. The model was developed for the 71 GG4 cases with available TMB values using 108 variables constituted by 95 genes, 12 clinical/radiological variables and TMB (dataset 3). Upper panel: mean absolute values corresponding to the magnitude of feature importance. Lower panel: summary plots for SHAP values; for each considered feature, a single patient is represented by one point. Along the x axis the position of a point corresponds to the logarithm of the mortality risk associated with that feature for a specific patients. This value corresponds to the impact that the feature had on the model output for that specific patient. Data clusters with SHAP values around zero indicate low impact on the model. Along the y axis, the different features are disposed according to their importance corresponding to the mean of their absolute SHAP values. Features with the higher importance are disposed on the upper part of the summary plots. SHAP, Shapley Additive exPlanation

The results of the 3 ML models (dataset1, dataset 2, dataset 3, Additional file 1: Tables S3, S4) were summarized by a circular barplot to comprehensively consider variable importance of clinical and radiological variables, TMB and signaling pathways putatively dysregulated by gene mutations. Figure 4 shows a main contribution for clinical and radiological variables together with a contribution for genes belonging to several categories including “activated NTRK2 signals through PI3K”. These results confirmed the importance of clinical and radiological variables in predicting OS of GG4 cases as well as a role for TMB. Moreover, these results confirmed the role of gene mutations involved in the PI3K pathway alteration in affecting the clinical outcome of GG4 cases [25, 50].

Fig. 4
figure 4

Contribution of clinical, radiological, and molecular data in predicting survival of GG4 cases by ML. Circulating barplot, obtained by overall considering the results of the 3 different ML models (obtained with dataset 1, dataset 2 and dataset 3), recapitulating the overall contribution in explaining GG4 clinical outcome of clinical/radiological parameters, TMB values and dysregulated signaling pathways by detected gene mutations

Recently, several signaling pathways have been highlighted as a potential focus for therapeutic intervention strategies [51]. Among these, a key role was attributed to the PI3K-AKT-mTOR signaling pathway [50]. Thus, to further investigate the role of the PI3K-AKT-mTOR pathway, we restricted the ML approach to the genes belonging to this pathway (WP3844)[41], in a dataset also comprising clinical and radiological data, and TMB data (dataset 4; Additional file 1: Tables S3 and S4). In this model, 27 features were included, constituted by 16 genes, 10 clinical and surgical variables and TMB (dataset 4). With this approach, a c-index of 0.670 in the test set was obtained (Additional file 1: Table S3). SHAP analysis confirmed the importance of clinical and radiological variables and further suggested a role for TMB in predicting OS (Additional file 1: Figure 2A, B). Moreover, SHAP analysis defined a contribution of genes belonging to PI3K-AKT-mTOR signaling pathway in particular BRAF and PI3KCA (Additional file 1: Figure S2A) [25, 44, 50].

Impact of TMB as OS prognosticator

Based on the results of the ML modeling, the role of TMB as OS prognostic indicator was further explored to define a TMB cutoff to separate GG4 cases into 2 subgroups with different OS. In the context of the 71 GG4 cases in which the TMB calculation met the appropriate metrics, maximally selected log-rank statistics identified the cutoff value of 1.7 mutations/MB as the most appropriate cutoff value (not shown). By dividing this 71 GG4 cases into 2 categories, 56 cases were included in the TMB category ≥ 1.7 mutations/MB and 15 cases were included in the TMB category < 1.7 mutations/MB. By using this cutoff, GG4 cases with a TMB ≥ 1.7 mutations/MB (median OS = 16 months) had shorter OS compared with GG4 cases with a TMB < 1.7 mutations/MB (median OS = 21 months, p = 0.047, Fig. 5A). Of note, in our GG4 series, high TMB values were frequently present in GG4 cases characterized by other biomarker of worse prognosis (Fig. 5B). Finally, using the cutoff value of 1.7 mutations/MB for TMB, a difference (although not significant) was also observed in the context of high grade glioma cases from the TCGA series in which, however, data were obtained by a whole exome sequencing approach at difference from the panel-based sequencing approach we used (Additional file 1: Figure S3).

Fig. 5
figure 5

Impact of TMB in the clinical outcome of GG4 cases. A Impact of TMB on survival in the series of 71 GG4 cases for which TMB calculation met the appropriate metrics according to the Illumina pipeline. Kaplan-Meier curves comparing the OS intervals of GG4 cases with high TMB (TMB ≥ 1.7 mutations/MB, 56 cases, red line) and GG4 cases with low TMB (TMB < 1.7 mutations/MB, 15 cases, blu line). The p-value reported refers to the log-rank test. B Line graph recapitulating the association between clinical/radiological parameters and TMB values for each GG4 case

Discussion

Newly diagnosed GG4 represents a heterogeneous group of brain tumors, characterized by minimally effective genotype-targeted therapies, in which surgery followed by Stupp protocol is still the current first-line treatment despite decades of research [1, 2, 6,7,8]. In the present study, a comprehensive molecular characterization using a NGS approach with a wide panel of 523 genes was performed in the context of a mono-institutional cohort of newly diagnosed GG4 patients who underwent neurosurgery followed by Stupp protocol and CW treatment. Then, ML modeling was applied with the aim of defining the prognostic value of clinical, radiological and molecular parameters in predicting OS.

Several challenges need to be overcome in order to define prognostic markers for OS in cancer diseases, including GG4. They mainly include the different variables to be considered and the way these variables interact with each other. This has became crucial with the introduction of NGS approaches capable to generate high amount of genomic data of which interpretation could become complex [30,31,32]. To overcome this complexity, ML approaches have been proposed in cancer research. In our study a ML approach was applied using XGBoost to develop multi-parametric predictor models for survival analysis [31, 32, 34]. Moreover, to describe the contribution of each variable to prediction, SHAP values were employed [30,31,32].

We chose Cox regression analysis as the starting point for our study to confirm the prognostic significance of established prognosticators for survival in the context of our GG4 case cohort, as this analysis is the most commonly used approach for survival analysis in a variety of fields, including oncology, where it is mainly used to identify the prognostic factors that have an impact on patient occurrence or survival [52]. However, Cox regression analysis is generally considered an inappropriate model for high-dimensional associations and relies on some other restrictive assumptions such as proportionality of hazard functions for any two patients and uncorrelated characteristics. Furthermore, Cox regression analysis is not able to model non-linearities and interaction effects [30,31,32]. In this context, we introduced ML modeling to incorporate the large amount of genomic data obtained by NGS using the panel of 523 genes in the analysis [30,31,32].

In particular, the ML approach allowed us to take in consideration the specific contribution on the prediction of the high amount of data obtained by the panel-based sequencing regarding the presence of somatic genomic alterations (e.g. somatic mutations and amplifications). Thus, in the present study, different ML models were developed both in the context of the entire cohort of 102 GG4 cases and in the subgroup of 71 GG4 cases in which TMB values were considered useful. Specifically, in addition to the preliminary comprehensive dataset 1 model, the model using dataset 2 focused mainly on determining the contribution of the major somatic mutations as OS prognosticators, when clinical variables were also considered. The dataset 3 model was mainly focused on defining the contribution of TMB as a OS prognosticator. Finally, the dataset 4 model focused specifically on the role of the PI3K-AKT-mTOR pathway given the potential implications for targeted therapeutic strategies.

The intra-operative treatment with CW was developed according to the rational of locally interfering with the potential tumor re-growth in the proximity of the original tumor site [13,14,15,16,17,18,19]. Although this treatment option seems to have lost clinical importance in the recent few years, recent long-term follow up investigations have demonstrated a survival benefit in selected cases, shedding thus the light on the effectiveness of this therapeutic option [16]. Despite controversies in current literature [13,14,15,16,17,18,19], our data from ML modeling showed that CW treatment was associated with longer OS. This result was also in keeping with multivariate Cox regression analysis. Even if such data prevent to drive final conclusions due to the small sample size, they suggest to further re-consider the application of CW in sequential combination with the Stupp protocol in selected cases to avoid the known side effects (young patients with small lesion and without ventricle opening during surgery) [16].

The ML approach also showed that, for all the developed models, the radiological variables EOR, residual volume and preoperative volume, and the MGMT methylated status were located in the top positions of the SHAP diagrams, thus highlighting their role as OS prognosticators in GG4 cases undergoing surgery followed by Stupp protocol, in keeping with previous investigations [10, 20,21,22,23].

Considering somatic gene mutations, results of the ML approach showed that a mutated status for different genes was defined as having a role in OS prediction. The outcome of the developed predictor models with different identified genes was dependent by the choices of the employed feature selection. However, in this context, the great majority of altered genes was previously identified as associated with high grade glioma and contributing in the dysregulation of key pathways involving tumor suppressors or oncogenes [25, 44,45,46,47,48,49,50,51, 53]. In particular, a role for gene mutations of BRAF was highlighted by the ML modeling. Specifically, the presence of BRAF mutations was defined as associated with longer OS, as previously reported [44]. ML modeling also suggested a prognostic relevance for genes involved in the PI3K-AKT-mTOR signaling pathway [50, 53]. This give emphasis for further studies to better clarify the functional impact of the presence of these gene mutations in the progression and treatment resistance of high grade glioma. Moreover, these results could suggest useful targets for personalized therapy for the treatment of patients affected by GG4 [47, 50, 51, 53]. In this context, a panel-driven approach such as that evaluating 523 genes that has been adopted in the present study may allow an accurate molecular diagnosis of tumor specimens of GG4 patients [53, 54]. This panel-driven approach also represents an useful tool for identifying and elucidating complicated pathways such as the PI3K pathway, therefore allowing the repositioning of drugs or the development of novel drugs for personalized therapy. It also allows the possibility of combining therapies targeting different pathways in GG4 [50, 51, 53].

Finally, the application of XGBoost in the subgroup of 71 cases, in which TMB calculation met the appropriate criteria according to the Illumina TMB metrics, suggested an association between a higher TMB and shorter OS [55]. In our study, 1.7 mutations/MB was proposed as the optimal cutoff value for TMB to separate GG4 cases into 2 subgroups with different OS. Different cutoffs have been previously proposed by other studies [27]. The lack of a common cutoff between the different studies might depend on the different intrinsic characteristics of the analyzed series (e.g., volumetric or subjective evaluation of EOR, WHO classification applied, number of needle biopsies included) as well as on the sequencing method (whole exome sequencing, such as in the TCGA cohort, vs. panel-based sequencing). Direct comparisons between TMB cutoffs obtained with different panels can be very problematic, as a TMB cutoff certainly depends on the genomic features and bioinformatics platform used in a given panel [27, 28, 54]. Nevertheless, the correlation between higher TMB and shorter OS found in the present study for GG4 cases is consistent with a previously published analysis of TCGA and CGGA databases [27]. This evidence is in keeping with the concept that a high TMB could be considered as an indicator of a high rate of somatic mutations putatively associated with cell proliferative advantages as well as chemoresistance molecular mechanisms [12, 24,25,26]. The TMB has been proposed as a predictive marker for response to immunecheckpoint inhibitors [56]. However, an association of high TMB with high responsiveness to immunecheckpoint inhibitor treatment in cohorts of aggressive glioma patients undergoing such immunotherapy remains to be fully elucidated [28, 29, 55,56,57].

The present study suffers for several limitations mainly due to the retrospective nature of this investigation and the simple size, expecially in the subgroup of 71 GG4 cases with available TMB values., In this context, the retrospective acquisition of data could be associated to the lack of standardized follow-up. However, the fact that the present study analyzed a mono-institutional cohort determined that all the GG4 cases included in this study were homogeneously treated, thus attenuating the possible lack of standardization. On the other hand, the analysis of a mono-institutional cohort could challenge the generalizability of results when an external patient population is considered. The ML approach we employed was chosen in an attempt to overcome these limitations. However, in the present study, our modeling principle was a trade-off between a minimal number of features and the ability to make good predictions, thus avoiding overfitting. Our aim was to strike a balance between interpretability of the model and improved accuracy. Although interpretable models are preferred in the clinical setting, it is possible that a black box model would have resulted in better performance [30,31,32].

Further integrative analysis, including a MRI radiomic approach may contribute to refine a preoperative prognosis, to plan personalized surgical treatment and to offer patients preoperative counseling. In the future, a prospective multicenter study with a larger sample size is needed to optimize prediction models for clinical practice, and to overcome the intrinsic limitations of retrospective studies.

Conclusions

In conclusion, the results of the present study showed that the proposed ML approach using XGBoost was capable to define a key role for radiological variables in predicting OS of GG4 cases undergoing surgical resection and Stupp protocol. A protective role for CW treatment was also suggested. Moreover, this approach was capable to recapitulate the contribution as OS prognosticators of somatic mutations of several genes, including BRAF and other genes involved in the PI3K-AKT-mTOR signaling pathway. In addition, with the same approach, we showed that, the TMB, as defined by the wide target gene panel of 523 genes, appeared to be associated with shorter OS. Therefore, TMB could be useful to refine the prognosis of GG4 patients undergoing to surgical resection and Stupp protocol.