Introduction

Gliomas are the most common primary neoplasms in brain, which represent approximately 30% of all central nervous system (CNS tumors and 80% of all malignant brain tumors [1]. Malignant glioma often develop in the higher ages, the high incidence being mostly within the 5th and 6th decades of life. Male and female incidence ratio of glioma is reported 1.47:1 [2]. Despite many advances have obtained in recent years, the average survival time still ranges from 17 weeks to 3 years [3]. The genotype status of glioma have been identified helpful recent years. The prognosis and survival rate of glioma is different with different genotype status. Accurate predicting the genotype status of glioma pre-operation can effectively predict prognosis and guide treatment strategy.

There have obtained many identifications about some phenotypes of glioma which are useful to prognosis, such as methylation of the MGMT, mutations in IDH1/2 and so on. More recently, co-deletion of 1q/19q and mutation in TERT is also important in prognosis and treatment option of glioma. However, there are little literature about radiomics combined with clinical characteristics to predict status of multiple phenotypes.

Isocitrate dehydrogenase (IDH catalyzes oxidative decarboxylation of isocitrate and plays an irreplaceable role in the Krebs cycle and cell homeostasis. IDH gene mutations occur in a variety of malignancies, including glioma, acute myeloid leukemia, cholangiocarcinoma, chondrosarcoma and thyroid cancer [4, 5]. Among gliomas, IDH mutations were found in 80% of WHO grade II/III gliomas [6]. A follow-up investigation showed that IDH mutation patients had a better prognosis. The median survival of GBM was 15 months in IDH wild type versus 31 months in IDH mutants. While anaplastic astrocytomas was 20 months in IDH wild type versus 65 months in IDH mutants [6]. Therefore, it is essential to predict IDH mutation status before treatment selection and patient stratification [7].

Human O6-methylguanine DNA methyltransferase (MGMT), also known as O6-alkylguanine DNA alkyltransferase (AGT), is a simple DNA repair protein involved in protecting the genome of normal cells from the mutagenicity of alkylating agents. The anti-mutation function of MGMT can block the cytotoxic effect of anticancer alkylation agents and make tumors resistant to chemotherapy drugs. When the MGMT promoter is methylated, the epigenetic expression of MGMT is inactivated and its resistance to alkylation agents lost, which manifested sensitive to chemotherapy. It will achieve better chemotherapy response, greater overall survival rate and longer progression time. Therefore, the detection of methylation status of MGMT promoter has become an important clinical procedure for the prognosis of glioma patients [8].

1p/19q co-deletion is that both the short arm of chromosome 1 (1p and the long arm of chromosome 19 (19q are deleted [9]. Lower-grade gliomas with both an IDH mutation (i.e., a mutation in either IDH1 or IDH2 and deletion of chromosome arms 1p and 19q (1p/19q codeletion), which occurs most often in oligodendrogliomas, have better responses to radiochemotherapy and are associated with longer survival than diffuse gliomas without these alterations [10]. A research showed that patients without chromosome 1p/19q co-deletions was associated with poor overall and progression-free survival [11].

Telomerase reverse transcriptase (TERT is important for the biology of diffuse gliomas. When DNA is replicated during mitosis, telomeres shorten, leading to apoptosis. Telomerase is an RNA-dependent DNA polymerase, which can prolong telomere DNA and maintain telomere homeostasis, leading to cell immortalization and the occurrence of malignant tumors. TRET is the rate-limiting catalytic subunit of telomerase. TERT promoter mutation can cause TERT transcription upregulation and activate telomerase [12]. TERT promoter mutations have been found in central nervous system tumors, including primary glioma (up to 80%), medulloblastoma (20%), and meningioma (6.4–11%) [13]. The frequency of TERT promoter mutations in gliomas increases with the increasing WHO grade. TERT promoter mutations were independently associated with poorer overall survival in all glioma subtypes [14].

Radiomics predicting prognosis of glioma pre-operation noninvasively is invaluable as biopsy is invasively and test of phenotypes is difficult available. Thus, radiomics can bringing huge benefits to patients who just not complete the test of phenotypes and to guide clinical pathway of treatment. So far, it has been confirmed that many molecular biomarkers are related to the prognosis of glioma, including IDH, MGMT, TERT, 1p/19q, EGFR, TP53, BRAF etc. In this study, we aimed to explore the valuable of radiomics in conventional (T2W/T1W/T1W + C and diffusion (DWI/ADC MR combined with clinical features (clinical and MRI imaging characteristics to predict four kinds of glioma molecular biomarkers (IDH, MGMT, TERT, 1p/19q).

Material and methods

Patient data

Patients who were initial diagnosed with glioma between January 2018 and September 2021 were collected. A total of 235 patients with initial diagnosed as glioma were reviewed (Fig. 1). Inclusion criteria were (1 Glioma confirmed by surgical pathology and containing all genetic diagnostic information (IDH1, MGMT, TREF, and 1q/19q and (2 MRI scan performed within 30 days before surgery and including four sequences (T2W, T1W, T1W + C and DWI). Exclusion criteria were no surgery or surgery in another hospital (n = 135), poor MR image quality (motion artifact, n = 8), prior surgery (n = 5 or radiotherapy (n = 6). Finally, 81 patients met the criteria, including infiltrative and circumscribed gliomas (Table1). Our study complied with the Declaration of Helsinki and informed consent was waived due to the retrospective nature of the study by ethical committee of Affiliated Hospital of Inner Mongolia Medical University. All methods were carried out in accordance with relevant guidelines and regulations.

Fig. 1
figure 1

Flowchart of study population

Table 1 Demographic data for each glioma biomarker

Tissue samples were obtained from patients undergoing resection. Then sample section was used to perform diagnosis of neuropathology and molecular evaluation. Multiplex PCR combined with next generation sequencing were used to detect IDH1/2, TERT and 1q/19q co-deletion status. Pyrosequencing of bisulfite-treated genomic DNA (CpG sites 74–78, QIAGEN was used to detect MGMT promoter methylation status.

MR imaging was obtained using 2 MRI scanners (Skyra 3 T from Siemens Healthineers, Germany and Discovery 3 T from GE, Healthcare within our radiology department. The sequences and parameters of MR scan including: T2W (TR/TE, 5000–8000/100–200 ms, voxel size: 0.5 × 0.5 × 1 mm3), T1W (TR/TE, 1000–1800/10-25 ms, voxel size: 0.5 × 0.5 × 1 mm3), DWI (TR/TE: 3000–3800/66–81 ms, with b values of 0 and 1000 s/mm2, voxel size: 0.9 × 0.9 × 5.0 mm3 and post-contrast T1W imaging (TR/TE, 1000–1800/10–25 ms, voxel size: 0.5 × 0.5 × 1 mm3). A total volume of 0.1 mmol/kg of gadobutrol (Gadovist, Bayer Healthcare was injected intravenously for post-contrast T1W imaging.

Clinical and MRI imaging data

Clinical features: patient sex, age, WHO grade (I, II, III, IV), genotype status of molecular biomarkers (IDH wild or mutant, MGMT nonmethylated or methylated, TERT wild or mutant, 1p/19q wild or codeletion (Table1).

MRI imaging features: tumour size (< 6 or > 6 cm), tumour centre location (left side or right side), cerebral lobe involving (frontal lobe, occipital lobe, parietal lobe, temporal lobe, insular lobe), cortex matter involving (involving or not), deep white matter involving (involving or not), pial matter involving (involving or not), ependymal membrane involving (involving or not), tumour cross midline (cross or not), oedema cross midline (cross or not), tumour border (clear or not), haemorrhage (yes or no), cystic_necrosis (no, < 25%, 25–50%, > 50%), oedema degree (< 1.6 cm or > 1.6 cm), enhancemengt_style (no, ring-enhancement, nodular-enhancement, irregular-enhancement), enhancement_degree (no, slight, obvious), signal characteristics (homogenous, heterogenous) [3].

Models establishment

Clinical model Univariate analysis was used for clinical and imaging features, and the features with p < 0.05 were retained. Then multivariate stepwise regression was used to retain the smallest AIC (Akaike Information Criterion feature set. Finally, multivariate logistic regression was used to establish a multivariate clinical model.

Radiomics model For each patient, tumor segmentation was delineated manually slice by slice on T2WI by a 6 years of experience radiologist and under supervision of a 10 years of experience neuroradiologist. They were blinded to the final diagnosis and molecular biomarkers status. T1WI, T2WI, DWI, ADC and T1WI + C were co-registered after interscan motion corrections. The volume of interest (VOI was generated encompassing the entire region of T2WI hyperintensity by segmenting the region of interest (ROI slice by slice on axial scans. After additional reviewing session, any discrepancies in ROI were resolved by consensus. The VOI overlaid onto co-registered datasets, which obtained five VOI from each sequence. All segmentations and registrations were carried out by ITK-SNAP software (version 3.8.0, https://www.itksnap.org).

Image processing and radiomics features extraction was performed on PHIgo-AK software (GE Healthcare, China which integrate Pyradiomics toolkit [15]. Automatic preprocessing was standardized for each case involving resampling (set vovel for 1 × 1 × 1mm to reduce the influence between different layer thicknesses and adopt linear interpolation), intensity normalization (z-score)and discretization (set binwidth to 5). The above VOI corresponding to each sequence is input into the corresponding image in the software of Pyradiomics toolkit to extract the radiomics features. The workflow is presented in Additional file 1: Fig. S2. At last, each sequence can extracted a total of 107 radiomic features. These included 18 first-order metrics, such as the mean, standard deviation, skewness, and kurtosis, 14 shape features and second-order metrics including 24 grey level run length matrix (GLCM), 16 grey level run length matrix (GLRLM), 16 grey level size zone matrix (GLSZM), 5 neighboring gray tone difference matrix (NGTDM), and 14 grey level dependence matrix (GLDM). The radiomic features selection procedure were performed in the cross-validation stage as follow: (1 Using univariate rank sum test, the characteristics of p < 0.05 were retained (2 Correlation analysis was used to remove features with a correlation greater than 0.8; (3 Least Absolute Shrinkage and Selection Operator (LASSO [16] regularization was employed to remove collinear features; (4 Multivariate stepwise regression was used to retain the feature set of minimal AIC. Each sequence is screened in this way.The independent radiomic model of each sequence radiomic features were established by multivariate logistic regression, including T1WI model, T2WI model, T1 + C model, DWI model and ADC model. Meaningful features from five sequences were combined to establish a combined radiomics model called multi sequence model (It is named ‘All’ model in the Figs. 2, 5).

Fig. 2
figure 2

Receiver-operating characteristic (ROC curves for prediction of each biomarker status. combined model which unites clinical and radiomics features shows significant improvement in predicting each biomarker status, especially in group of IDH (0.928 and MGMT (0.878). T2W, T2WI model. T1W, T1WI model. DWI, DWI model. ADC, ADC model. T1C, T1 + C model. All, multi sequence model. Clinical, clinical model. COMB, combined model

Combined model The combine model was established by combining the radscore output from multi sequence model with the clinical characteristics by multivariate logistic regression.

All the above methods and models are completed separately for each group of molecular biomarkers.

Statistical analysis

Statistical analysis was performed using R (version:4.1.0, https://www.rproject.org). A two-sided p value of less than 0.05 was considered statistically significant. Differences in the clinical and imaging characteristics between group of each genic phenotype were evaluated using the independent samples t test, the Mann–Whitney U test, and the chi-squared or Fisher’s exact tests, as appropriate. All models which include T2WI, T1WI, T1 + C, DWI, ADC, All, clinical, combined were established by logistic regression. Three-fold cross-validation repeated 20 times strategy was used to select best features subset. The cross-validation voting strategy [17], features selected in more than onefold were used as the final feature set. Receiver-operating characteristic (ROC curves and area under the curve (AUC was obtained. A comparative analysis between ROCs of all models in each group was performed using nonparametric methods described by DeLong et al. [18]. Decision curve analysis (DCA was carried out to assess the clinical usefulness of each model by evaluating the net benefit at various threshold probabilities.

Results

Clinical model assessment

Comparison of clinical and MRI imaging features of phenotype status for each molecular biomarker are summarized in Table 2 (Only the features with significant differences are listed, all features are detailed in Additional file 1: Table S1). After univariate analysis and multivariate stepwise regression, the differences in the following characteristics were statistically significant. Nine characteristics are predicting factors of IDH status including Age, Grade, Frontal lobe, Parietal lobe, Involving cortex matter, Oedema degree, Enhancement style, Enhancement degree and Signal characteristics. Five characteristics are predicting factors of MGMT status including Frontal lobe, Involving cortex matter, Border, Oedema degree and Enhancement degree. Enhancement style is predicting factor of 1p/19q status. Involving pial matter is predicting factor of TERT status. Above characteristics set were further selected to perform multivariate logistic regression analysis. The results are shown in Table 3. IDH mutant type is easier involving frontal lobe than wild type. IDH wild type is more common showed ring or irregular enhancement than mutant type. MGMT Methylated type is more likely involve frontal lobe and have a non-clear border than Nonmethylated type. Oedema degree of MGMT Methylated type is less than Nonmethylated type. TERT mutant type is more likely involving pial matter than wild type. Ring-enhancement is more likely presenting in 1p/19q wild type than co-deletion type. Then, a clinical model of each biomarker was established based on the independent variables mentioned above. IDH: the AUC was 0.88 (95% confidence interval, 0.81–0.95); MGMT: the AUC was 0.78 (95% confidence interval, 0.68–0.88); TERT: the AUC was 0.66 (95% confidence interval, 0.55–0.76; 1p/19q: the AUC was 0.66 (95% confidence interval, 0.58–0.73 (Fig. 2).

Table 2 Comparison of clinical and MRI imaging features of phenotype status for each molecular biomarker
Table 3 Multivariate logistic regression analysis of clinical characteristics for each Biomarker

Radiomics model assessment

Following LASSO regularization and logistic regression analysis, multi sequence model was established. As a result, a total of 4 in IDH, 5 in MGMT, 4 in TERT, 2 in 1p/19q radiomic features (Additional file 1: Table S2; Fig. S3 remained as contributors in our predictive model with resultant AUC of 0.87 in IDH, 0.83 in MGMT, 0.72 in TERT, 0.68 in 1p/19q (Fig. 2). In different genotypes, the AUC of predictive performance of the model established according to each sequence was (IDH: 0.75 in T2W, 0.83 in T1W, 0.66 in DWI, 0.63 ADC, 0.83 in T1 + C; MGMT: 0.70 in T2W, 0.77 in T1W, 0.73 in DWI, 0.64 ADC, 0.72 in T1 + C; TERT: 0.68 in T2W, 0.64 in T1W, 0.64 in DWI, 0.65 ADC, 0.66 in T1 + C; 1p/19q: 0.63 in T2W, 0.64 in T1W, 0.66 in DWI, 0.67 ADC, 0.64 in T1 + C (Fig. 2).

Model comparisons

Nonparametric tests of DeLong were carried out between AUC of each model in each genotype. In IDH genotype, the AUC of multi sequence model is higher than that of T2WI model, DWI model and ADC model (All vs. T2W, DWI, ADC: 0.865 vs. 0.752, 0.656, 0.631, p < 0.05). The AUC of combined model is higher than that of T2WI model, T1WI model, T1 + C model and “All” model (COMB vs. T2W, T1W, T1C, All: 0.928 vs. 0.752, 0.827, 0.829, 0.865, p < 0.05). There was no significant difference in AUC between COMB model and clinical model (COMB vs. clinical: 0.928 vs. 0.880, p > 0.05). In MGMT genotype, the AUC of multi sequence model is higher than that of each single sequence model (All vs. T2W, T1W, DWI, ADC, T1 + C: 0.833 vs. 0.697, 0.765, 0.725, 0.643, 0.715, p < 0.05). The AUC of combined model is higher than that of T2WI model, T1WI model, DWI model, ADC model, T1 + C model and clinical model (COMB vs. T2W, T1W, DWI, ADC, T1C, clinical: 0.878 vs. 0.697, 0.765, 0.725, 0.643, 0.715, 0.781, p < 0.05). There was no significant difference in AUC between COMB model and “All” model (COMB vs. All: 0.878 vs. 0.833, p > 0.05). In TERT genotype, the AUC of multi sequence model is higher than that of DWI model and ADC model (All vs. DWI, ADC: 0.715 vs. 0.635, 0.654, p < 0.05). The AUC of combined model is higher than that of DWI model, ADC model and clinical model (COMB vs. DWI, ADC, clinical: 0.757 vs. 0.635, 0.654, 0.657, p < 0.05). In 1p/19q genotype, there is no significant difference in AUC value of each model. Detailed performances and ROC curves of all models are summarized in Fig. 2. A comparison of radiomic features which have significant predicting value in each biomarker is shown as a heat map in Fig. 3, respectively. Bar and box diagram according to radscore in each biomarker is shown in Fig. 4. Classification capability in IDH status group has a highest performance among all groups. DCA showed that the combined model has better predictive performance than the multi sequence radiomics model in each biomarker, and the best performance is in IDH group (Fig. 5).

Fig. 3
figure 3

Heatmap comparison of the radiomic features which have significant statistic difference in each phenotypes group

Fig. 4
figure 4

The bar (a) and box (b) chart of Radscore. The two charts showed a better performance in predicting each biomarker status, especially in group of IDH and MGMT

Fig. 5
figure 5

The decision curve analysis for multi sequence model (All, red curve and combined model (COMB, green curve). The Y-axis represents the net benefit

Discussion

Our results showed that constructed multiparametric model from MRI radiomics features can identify phenotype status of IDH, MGMT and TERT in preoperative MRI scans of patients with glioma. We specifically demonstrated that addition of clinical features can significantly improve predictive performance for IDH, MGMT, TERT.

There have had many literatures revealed the value of biomarkers status of glioma for diagnosis, therapy and prognosis. In this paper, four kinds of glioma biomarker were taken into consider to further investigate the value of predicting by radiomics add clinical model. The result showed radiomics model which include multiple parameters models obtained a higher predict performance (IDH AUC = 0.87, MGMT AUC = 0.83, TERT AUC = 0.72, 1p/19q AUC = 0.68 than each single parameter model in all kinds of phenotypes. These results are consistent with previous studies. A research showed an AUC of 0.884 and 0.669 for predicting IDH and TERT status, respectively [19]. Zhou et al. [20] demonstrated IDH and 1p/19q status prediction with AUC of 0.921 and 0.685 using a model from T1 + C and T2 FLAIR integrated with age. Kihira et al. [21] demonstrated IDH1 and MGMT status prediction with AUC of 1 and 0.79 using a combination model of texture features from conventional and diffusion.

IDH mutations are associated with increased survival in glioma patients [22] and increased sensitivity to temozolomide therapy [23] and radiation therapy [24]. LGG with IDH mutation along with 1p/19q codeletion has ever better clinical outcomes [10, 25]. MGMT promoter hypermethylation is the only known biomarker for TMZ response [26]. Mutation of TERT promoter as a genetic event is frequently detected in 60–75% of glioblastomas (GBM), and associated with a poor prognosis [27]. 1p/19q co-deletion is associated with better response to radiotherapy and alkylating agent chemotherapy, and longer progression-free and overall survival [28, 29].Therefore, preoperative identification of IDH, MGMT, TERT and 1p/19q status can play an important role with prognostic and treatment implications. Our results showing that IDH mutant type is easier involving frontal lobe than wild type. Feraco et al. demonstrated that IDH mutational status is more likely related to a frontal location [30]. IDH wild type is more common showed ring or irregular enhancement than mutant type, indicating a tendency for invasive behavior, which concurs with a study by Zhang et al. [31]. MGMT methylated type is more likely involve frontal lobe and have a non-clear border than Nonmethylated type, which are inconsistent with other studies. Previous studies suggested that MGMT promoter methylation is associated with GBM in parietal and occipital lobes [32]. These conflicting results may be need Multicenter studies further confirmed. Oedema degree of MGMT Methylated type is less than Nonmethylated type, indicating MGMT Methylated type is less aggressive, which is consistent with a research by Suh et al. [33]. In addition, our study also found that TERT mutant type is more likely involving pial matter than wild type and ring-enhancement is more likely presenting in 1p/19q wild type than co-deletion type. These results should be further confirmed by subsequent studies. Different genetic phenotypes status has different clinical imaging features. In-depth understanding of imaging features can better predict genetic phenotypes status of glioma. Establishing an accurate glioma biomarker prediction model integrate radiogenomics and clinical features will enable noninvasive prediction of prognosis and contributes to treatment planning.

Our study has several limitations. First, this is a retrospective study with a possible unknown bias. Second, as with other radiological studies, there is a risk of over-fitting given the large number of variables involved. Third, tumor segmentation is performed using the overall volume of the tumor with high signal on T2WI to increase inclusion. There was no differentiation of the cystic, edema and parenchyma areas. Further studies are needed to assess whether inclusion or exclusion of these components makes a significant difference in biomarker prediction. Finally, although the preprocessing uses techniques such as signal normalization and resampling to mitigate the impact of image changes caused by MR scanners from different vendors, external testing of the developed model through multi-institution collaboration is still required to improve the versatility and clinical utility of our model.

In conclusion, the described multi sequence model from all sequence radiomic features can better predict glioma biomarker status preoperatively. In particular, addition of clinical features provided significant added diagnostic value in determination of IDH, MGMT, TERT status.