Abstract
Background
Accurate and non-invasive estimation of MGMT promoter methylation status in glioblastoma (GBM) patients is of paramount clinical importance, as it is a predictive biomarker associated with improved overall survival (OS). In response to the clinical need, recent studies have focused on the development of non-invasive artificial intelligence (AI)-based methods for MGMT estimation. In this systematic review, we not only delve into the technical aspects of these AI-driven MGMT estimation methods but also emphasize their profound clinical implications. Specifically, we explore the potential impact of accurate non-invasive MGMT estimation on GBM patient care and treatment decisions.
Methods
Employing a PRISMA search strategy, we identified 33 relevant studies from reputable databases, including PubMed, ScienceDirect, Google Scholar, and IEEE Explore. These studies were comprehensively assessed using 21 diverse attributes, encompassing factors such as types of imaging modalities, machine learning (ML) methods, and cohort sizes, with clear rationales for attribute scoring. Subsequently, we ranked these studies and established a cutoff value to categorize them into low-bias and high-bias groups.
Results
By analyzing the 'cumulative plot of mean score' and the 'frequency plot curve' of the studies, we determined a cutoff value of 6.00. A higher mean score indicated a lower risk of bias, with studies scoring above the cutoff mark categorized as low-bias (73%), while 27% fell into the high-bias category.
Conclusion
Our findings underscore the immense potential of AI-based machine learning (ML) and deep learning (DL) methods in non-invasively determining MGMT promoter methylation status. Importantly, the clinical significance of these AI-driven advancements lies in their capacity to transform GBM patient care by providing accurate and timely information for treatment decisions. However, the translation of these technical advancements into clinical practice presents challenges, including the need for large multi-institutional cohorts and the integration of diverse data types. Addressing these challenges will be critical in realizing the full potential of AI in improving the reliability and accessibility of MGMT estimation while lowering the risk of bias in clinical decision-making.
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Glioblastoma multiforme (GBM), is a malignant and aggressive brain tumor that spreads aggressively (Jena et al. 2022) and typically originates in the adult cerebrum, the brain's largest region (Peri 2022). The median survival time in GBM patients is approximately nine months; however, those with standard-of-care surgery and adjuvant chemoradiation may extend to 15–16 months (Brain tumor segmentation and overall survival period prediction in glioblastoma multiforme using radiomic features-Das 2022; Tamimi and Juweid 2017). The economic burden has been prominently evidenced among the affected individuals receiving systemic medication (Raizer et al. 2015; Kumthekar et al. 2014). Neurological examinations and neuroimaging techniques are the primary diagnostic tools for GBM identification but could be employed only after the disease has significantly progressed. It is often treated with surgery to remove the tumour mass, followed by radiotherapy and chemotherapy. Regardless of the surgery's extent, GBM resection is frequently insufficient, resulting in relapse or even recurrence (Silantyev et al. 2019). Various imaging modalities are employed, such as magnetic resonance imaging (MRI), computer tomography (CT), digital subtraction angiography (DSA), and to a certain extent, even X-Rays for non-invasive determination of GBM. Although there are multiple neuro-imaging paradigms, MRI is preferred to other modalities for various reasons, including its ability to image discrete anatomical regions in arbitrary planes with excellent tissue contrast and the lack of evident negative health impacts on patients (An empirical study of different machine learning techniques for brain tumor classification and subsequent segmentation using hybrid texture feature|SpringerLink 2022).
O6-methylguanine-DNA methyltransferase (MGMT), a DNA-repairing enzyme, is located on the 10q26.3 chromosome (Methylguanine-DNA methyltransferase (MGMT)|Radiology Reference Article|Radiopaedia.org 2022). High MGMT activity reduces the effectiveness of alkylating drugs and is a poor prognostic indicator. However, when the MGMT promoter is methylated, alkylating agents are more potent (Methylguanine-DNA methyltransferase (MGMT)|Radiology Reference Article|Radiopaedia.org 2022). It has been examined as a potential biomarker of susceptibility to alkylating chemotherapy, particularly temozolomide (TMZ), because of its relatively high frequency in GBM, which may vary depending on the method employed for its assessment (Stupp et al. May 2009). In a study by Hegi et al. (MGMT Gene Silencing and Benefit from Temozolomide in Glioblastoma|NEJM”. 2022), the authors stated that “Among patients whose tumor contained a methylated MGMT promoter, a survival benefit was observed in patients treated with temozolomide and radiotherapy; “their median survival was 21.7 months as compared with 15.3 months among those who were assigned to only radiotherapy. In the absence of methylation of the MGMT promoter, there was a smaller and statistically insignificant difference in survival between the treatment groups”.
In current clinical practice in the era of precision medicine, invasive methods such as biopsy and surgery are the only reliable ways for MGMT methylation status determination. However, such invasive methods carry threats and difficulties, for example, neurologic injury, complications, cost, etc. Hence, researchers have been working on AI-based methods using medical imaging modalities for MGMT status determination in the last couple of years. The idea of AI is to strive to use computers to imitate human intelligence. This field has much potential in radiological-based medical applications (Jena et al. 2021). Deep Learning (DL) is a paradigm of AI wherein the programmer tries to create a mathematical model mimicking the human mind called Neural Networks (NN). These DL models, coupled with the traditional Machine Learning (ML) models, possess the potential to detect the methylation status of the MGMT gene, among other biomarkers, without needing a biopsy directly from the neuro-image sequences. In a study by Xi et al. (2018) Support Vector Machines (SVMs) have been used to analyze radiomics features for utilizing the full potential of medical imaging as biomarkers of MGMT promoter methylation. The results revealed an accuracy of 86.59% using the T1, T2, and enhanced T1CE image features of the MRI scans of the 98 GBM patients. This study stated that there is a space for the ML models to churn the neuro-imaging data into a viable option for biopsy. DL models, too, are showing promising results. Chen et al. (2020) devised a DL pipeline to automate the prediction of MGMT status. They have considered 106 GBM patients with contrast-enhanced T1W images and fluid-attenuated inversion recovery (FLAIR) images. Using a pipeline model consisting of both tumor segmentation and classification, they have concluded that their pipeline best works on FLAIR images with an 82.7 ± 5.6% accuracy. Their suggested pipeline reduced inter-observer variation in glioma segmentation, sped the tumour annotation process, and accurately predicted the MGMT methylation status. It would make finding molecular biomarkers from common medical imaging even easier. There is constant innovation in models designed to make the results indistinguishable from the biopsy ones.
In this study, we conducted a systematic review and analysis of 33 research articles on estimating MGMT promoter methylation using various AI methodologies and its ML and DL components. Thereafter, we have presented the recent developments in the contribution of AI as per the neurological perspective of MGMT methylation. Eventually, the bias analysis was performed on the selected studies, and principal findings and challenges were discussed.
Search strategy and statistics
Preferred reporting items for systematic reviews and meta-analyses (PRISMA) model
We conducted an extensive literature search on PubMed, ScienceDirect, Google Scholar, and IEEE Explore using the PRISMA strategy (Fig. 1). The keywords included were: (Glioblastoma AND MGMT AND Machine Learning AND Artificial Intelligence), (Gliomas AND MGMT AND Machine Learning AND Deep Learning), and (MGMT AND (Machine Learning OR Deep Learning OR Artificial Intelligence OR Radiomics OR Radiogenomics) AND (gliomas OR glioblastomas)).
Using Clarivate Analytics' EndNote software's "Find Duplicates" option, a maximum of 258 papers were found and replicas were eliminated, leaving 184 entries. Studies unrelated to AI, irrelevant papers, and articles with inadequate data were the three exclusion criteria. The final 33 references for this study were chosen after applying the exclusion criteria to 105, 21, and 25 studies (designated as E1, E2, and E3 in Fig. 1), which were then located and discarded.
Statistical distribution and analysis based on various parameters
Dataset size (DS)
We observed that the data set size ranged from 34 to 418 patients across the 33 publications. DS is the total number of patients used to capture images in MRI, CT, PET, or a combination of these three modalities. The distribution of the dataset in various studies is displayed in Fig. 2.
Studies with AI application
Figure 3a shows the prevalence of machine learning (ML) and deep learning (DL) methodologies. Most of the studies employed the ML methods rather than DL. This may be attributed to DL models requiring a good-quality dataset which is easily accessible.
AI methodologies
Though various modalities have been used to accurately detect MGMT status, the most prominent ones include Random Forests and Vector Machines. It has also been revealed that compared to individuals, the combination of modalities may be effective (Stupp et al. 2009). Moreover, DL models have higher accuracies than simple ML models. To detect MGMT status, various AI methodologies have been represented in Fig. 3b.
MRI—an efficient imaging modality to analyze GBM
MRI is a frequently employed technique that creates three-dimensional, intricate anatomical images that help diagnose disease and monitor therapy. It detects changes in the rotational axis of protons in the water present in the living tissues (Magnetic Resonance Imaging (MRI) 2022). MRI plays a significant role in AI-based analysis. It can scan discrete anatomical locations in vivo with excellent tissue contrast with images that can be taken in any plane. Though CT and PET modalities are also very informative but are used to a lesser extent, the distribution of which has been represented in Fig. 3c.
GBM and MGMT
Glioma’s pathology: the WHO grading system
Glioma is a type of primary tumor that originates in the brain and spinal cord and initiates in the gluey supportive cells (glial cells) surrounding nerve cells (Stupp et al. 2009). Due to its complex nature, gliomas are often referred to as intra-axial brain tumors. The prognosis and course of therapy are influenced by the type of glioma, and the treatment options include surgical, radiation therapy, chemotherapy, and targeted therapy.
Astrocytomas, ependymomas, and oligodendrogliomas are the three primary forms of gliomas, which are categorized based on phenotypic cell features (Magnetic Resonance Imaging (MRI) 2022). These cell gliomas are further divided into low-grade, atypical, and high-grade tumors based on cell morphology, mitotic activities, and molecular marker. The World Health Organization (WHO) grading system recommends molecular markers with proven prognostic and therapeutic implications. For example, GBM is a type of glioma that has progressed to the fourth-grade (Lopes Oct. 2017; Mesfin and Al-Dhahir 2022). Figure 4 shows an MR scan of the GBM-affected brain, and Table 1 shows the histologic type and grade of glioma.
MGMT in GBM
MGMT is a DNA "suicide" repair enzyme. Transfer of methyl group from guanine's O6 site to its cysteine residues restores damaged guanine nucleotides without causing gene mutation, cell death, or tumorigenesis from alkylating agents (Gerstner et al. 2009). MGMT gene is located on chromosome 10q26.3 (Fig. 5), with a total length of 300,437 bp (Yu et al. 2020). Methylation of the MGMT gene promoter significantly predicts prognosis for newly diagnosed GBM. MGMT has recently been linked to the therapeutic success of alkylating agent chemotherapy, specifically temozolomide (TMZ) treatment (Sharma et al. 2009). It is commonly believed that MGMT promoter methylation in patient tumors results in reduced MGMT protein production, and elimination of the DNA repair activity required for TMZ resistance as MGMT transcription may be repressed by promoter methylation in tumor cells (Brandes et al. 2017), according to Liu et al. (2006) and Pistollato et al. (2010), GBM stem cells, which the stem cell marker CD133 can recognize, express a high amount of MGMT and have significant tumor resistance to TMZ. According to CD133(-) glioblastoma-derived cancer stem cells show differential growth characteristics and molecular profiles-PubMed (2022), Beier et al. (2008), several stem cell types exhibit different MGMT protein expressions despite having equivalent MGMT promoter methylation status. It was further discovered that TMZ selectively destroys glioblastoma cancer stem cells in MGMT-negative cell lines, indicating this protein's potential in cancer treatment. The MGMT levels have been successfully manipulated to not only improve alkylating agent therapy but also to safeguard hematopoietic cells from the myelosuppressive effects of high-dose chemotherapy (Sharma et al. 2009).
Role of AI in current clinical practice
Recent development in predicting MGMT status.
Till now, numerous studies have demonstrated that MGMT promoter methylation is a significant predictive biomarker for TMZ resistance and poor progression-free survival in GBM patients (Yin et al. 2014; Gerstner et al. 2009; Butler et al. 2020; OncologyPRO 2019; Saxena et al. 2023a; Saxena et al. 2023b; Sareen et al. 2022). Methylation-specific polymerase chain reactions using surgical specimens are considered the gold standard for evaluating the MGMT methylation status; however, they require a large volume of tissue samples and strict sample cryopreservation procedures (Stupp et al. 2009). Other techniques, such as activity assays, immunohistochemistry, and methylation chip analysis, have technical limitations (Drabycz et al. 2010). These invasive procedures are also less helpful in hospitals due to the potential of insufficient biopsy samples, expensive detection costs, and the great complexity of the intralesional heterogeneity (18F-FDG-PET-based Radiomics signature predicts MGMT promoter methylation status in primary diffuse glioma|Cancer Imaging|Full Text 2022).
As discussed in recent decades, experts have switched to finding correlations between clinical symptoms and genetic traits utilizing non-invasive methods like radiomics (A Deep Learning-Based Radiomics Model for Prediction of Survival in Glioblastoma Multiforme-PubMed 2022) to quantitatively extract and evaluate various noninvasive image data, including intensity distributions, spatial relationships, and patterns of textural heterogeneity (McGarry et al. 2016). And it is noticed the developments of radiomics models in radiology for predicting survival rates, distant metastasis, and molecular characterization (Kickingereder et al. 2016). In addition, numerous computer models were created to preoperatively predict the MGMT methylation status based on magnetic resonance imaging since it is thought that the MGMT methylation status is a significant predictive indicator for guiding GBM treatment decisions (MRI) (Xi et al. 2018; Li et al. 2018; Wei et al. 2019). Le et al. (2020), recently proposed a radiomics-based eXtreme Gradient Boosting (XGBoost) model that demonstrated reasonably good performance for predicting the MGMT promoter methylation status, with an accuracy of 88.7% and an area under the receiver operating characteristics curve (AUC) of 0.896. Do et al. (Improving MGMT methylation status prediction of glioblastoma through optimizing radiomics features using genetic algorithm-based machine learning approach|Scientific Reports 2022), suggested a hybrid ML-based radiomics feature selection model to find the best radiomics feature sets and predict the MGMT promoter methylation status in response to the earlier work of Le et al. (2020). Most of the radiomics feature sets for categorizing MGMT methylation statuses provided by other studies were based only on one feature selection technique. This study is the first to use the genetic algorithm-based hybrid feature selection approach for classifying MGMT methylation statuses in GBM.
To identify a radiomics feature subset that could accurately predict the MGMT methylation statuses, their study set out to explore the viability of adopting a two-stage feature selection approach composed of feature selection carried out using the XGBoost algorithm followed by a GA wrapper model (GA wrapper is a feature selection mechanism where each feature is considered as a gene and a selected set of features as a chromosome). They noticed that the implementation of the GA resulted in a radiomics feature set, which displayed greater accuracy levels for MGMT methylation status prediction than most of those reported in prior research. Additionally, their findings demonstrated that a smaller degree of prediction accuracy might be caused by either the inclusion of too few features (F-score feature set) or too many features. As a result, the GA provides a viable method for producing highly effective predictors without knowing the ideal number of features to be included in advance. This model with the highest performance (GA-RF) was tested on an independent dataset, which demonstrated that the model might be generalized to similar diseases. This cutting-edge model's ability to predict MGMT methylation status might benefit clinical decision-making by allowing for treatment strategies for patients with GBM even before surgery.
Challenges and opportunities
Despite the enormous promise of AI in tumor diagnosis, prognosis, and prediction, translations into clinical settings are delayed because of several related difficulties (Ak et al. 2022). These substantial obstacles must be addressed to incorporate AI methods into healthcare settings. A key problem in predicting MGMT status using AI is the interpretation of the algorithms, which are exceedingly complicated. Interpreting their inner workings is not straightforward; it is called a 'black box' nature (Cuocolo et al. 2020). This makes it harder for such technologies to be used in healthcare. An algorithm that is simple to understand enables evaluation of its results and offers suggestions for improvement. Although important, these algorithms rely heavily on available data interpretation standards, which can also introduce bias (Elmore et al. 2016). The findings of these algorithms have consistently outperformed human readings regarding reproducibility and consistency; however, this leads to additional patient exams and overdiagnosis (Cuocolo et al. 2020).
The most challenging task in the next step is storing, managing, extracting, analyzing, integrating, visualizing, and communicating the information produced from the vast amount of accessible data (Pinta et al. 2021). Integrating such diverse and multivariate data in an economical, standardized, and safe way is crucial. Critical ongoing problems also include the nature and variability of the data. Despite the ease with which large amounts of imaging data are accessible, institutional heterogeneity (either intra- or inter) exists due to variations in scan protocols, technology, and post-processing procedures, which restricts the generalizability of findings (Pinta et al. 2021). In addition, there are variations in contrast enhancement procedures, arguments, and image acquisition settings. According to research, radiomic feature estimations still varied even when the identical scanning methodology was used for image acquisition. As a result, findings are less easily repeatable, hindering useful AI models' creation (Ger et al. 2018).
Another key problem connected with AI research is the restricted number of laboratories performing such research due to the costs and difficulties involved (Trivizakis et al. 2020). Furthermore, one critical issue of implementing AI is the need for appropriate nested cross-validation to minimize overfitting, which is typical in AI (Saxena et al. 2022). Finally, data on MGMT promoter status were only available for a selected patient subgroup of an overall trial population which can induce selection bias in the analysis (Yin et al. 2014).
AI in MGMT status prediction: a neuroimaging perspective
This section presents neuro-imaging perspectives on the recent advancements in the methods of MGMT methylation estimation under the artificial intelligence paradigm. Radiological scans have proved an effective non-invasive technique for early-stage MGMT prediction in patients suffering from GBM with screening and diagnosis, support for treatment regimens, prognosis evaluation, and follow-up for advanced-stage of glioblastoma (Jena et al. 2022; Saxena et al. 2022). Recent years have seen the evolution of radiological features from semantic to radiomic hand-crafted and deep features. Semantic features are the qualitative characteristics that a skilled radiologist will typically derive from the clinical imaging directly to describe the lesion (An empirical study of different machine learning techniques for brain tumor classification and subsequent segmentation using hybrid texture feature|SpringerLink 2022; Rizzo et al. 2018). To examine potential connections with biology and clinical outcomes, radiomic features, on the other hand, comprise extracting and evaluating quantitative information from medical images using mathematical algorithms, machine learning, and deep learning techniques. The anatomical and functional knowledge of MGMT genomics can be separately reflected by radiomic characteristics retrieved from structural and functional imaging (Srivastava et al. 2019).
Radiomics can be coupled with artificial intelligence (AI) due to its superior capacity to handle vast amounts of data when compared to conventional statistical methods. Together, these fields' main goal is to unearth and meaningfully evaluate as much buried quantitative data as possible for use in decision support. Due to their impressive accomplishment in a variety of radiological tasks, both radiomics and AI have recently attracted the attention (Radiomics with artificial intelligence: a practical guide for beginners-PMC 2022). The traditional radiomics workflow uses an AI framework. It includes a number of processes, such as image acquisition, pre-processing, region of interest segmentation, feature extraction, feature selection, model selection, evaluation, and validation with clinical implementation. Deep radiomics research is a part of deep learning (DL) technology, a branch of computer learning (ML). Image pre-processing is essential when dealing with clinical images with essential genomics information, such as MGMT, both for traditional and deep radiomics. Following pre-processing, the region of interest (ROI) has been identified, and radiomics features that contain genomics information have been retrieved from it. The radiomics feature may be hand-crafted or deep. The final step of the radiomics process is the model selection and data analysis from the radiomics feature for better clinical decision and treatment planning, as shown in Fig. 6.
Risk-of-bias (RoB) analysis
As mentioned before, by PRISMA strategy, we considered 33 studies for MGMT methylation using AI and its components, such as ML and DL. Moreover, we performed RoB analysis to check the bias in these studies to show that AI is viable for MGMT methylation status determination and analysis in GBMs. Each study has been analyzed on 21 AI-based attributes such as image modalities, the objective of the study, the dataset size (in the number of patients), patients demography, feature extraction and selection, data preprocessing and augmentation, performance evaluation parameters like accuracy, sensitivity, specificity, precision, AUC of the ROC, the F-score, performance analysis metrics like the confusion matrix and ROC, statistical analysis, regularization, hardware and software resources used. These attributes using AI features are initially qualitative and then quantified by assigning a number between 0 and 1 based on the consensus of the AI scientist’s experience. The value of AI-based attributes has been set based on the attribute's strength, which ranges from 0 to 1. Then each study’s aggregate score is the sum of all attribute values for that selected study. The mean of each study was then calculated by dividing by the number of AI attributes considered (i.e., 21 in our case). Using this principle, all 33 studies are ranked based on their mean scores, ranging from 0.820 to 0.410. We multiplied all the mean values with 10 to normalize the scores between 1 and 10, then plotted them in decreasing order (Fig. 7). The raw cut-off of 6.00 was determined based on the intersection of the “cumulative plot of the mean score” and “the frequency plot curve of the studies”. This raw cut-off mark estimates the whole number of studies into low-bias and high-bias categories. The higher the mean value, the lower the risk of bias; hence, studies above the cut-off mark belong to the low-bias category, while 27% belong to the high-bias category. The highly biased studies have not considered all AI attributes while evaluating the radiogenomics system or may have low proportioned values for the attributes considered.
Discussion
Principal findings
As per the best of our rigorous search and findings, this is the first study of its kind to demonstrate AI approaches in predicting the MGMT status in GBM and the most recent developments in its prediction. The PRISMA methodology, a well-established benchmark in the healthcare business, was used to identify 33 studies. The investigation revealed several statistical distributions based on many criteria, including (a) Dataset Size; (b) image modalities; (c) AI employed, and (d) AI modality. In the RoB analysis, we considered criteria such as; (a) image modalities, (b) the objective of the study, (c) the dataset size (in the number of patients), (d) the demography of the patients, (e) feature extraction and (f) selection, (g) data preprocessing, (h) data augmentation, (i) the number of performance evaluation parameters, (j) accuracy, (k) sensitivity, (l)specificity, (m) precision, (n) AUC of the ROC, (o) the F score, (p) performance analysis metrics like the confusion matrix and ROC, (q) statistical analysis, (r) regularization, (s) number of regularization methods, (t) hardware and (u) software resources.
The novelty of our study includes determining modality with higher efficacy of several AI models in predicting the MGMT status of GBM patients. Most research employed ML algorithms to predict the MGMT status of GBM patients. A DL model needs a huge number of training instances, which makes large, high-quality medical imaging pictures challenging or impossible to produce. It is many advantages over conventional approaches with hand-crafted features, including being resistant to distortions like changes in form and having a lower computational cost. In addition, DL models have the advantage of automatically extracting features from the images. We have observed that the majority of the images used for diagnosis purposes are MRI images because the advanced MRI techniques such as diffusion tensor imaging, perfusion MR techniques such as arterial spin labeling, dynamic susceptibility contrast, MR spectroscopy technique, and dynamic contrast-enhanced imaging can aid with the morphology and function of tumors. Though CT and PET modalities are also very informative but are used up to a lesser extent. Various modalities are used to predict the MGMT status, but we have observed that modalities like CNN, Boosting, Random Forests, and Vector Machines are mostly used. The observation has been made that a combination of modalities works better than a single modality. The models using DL techniques have resulted in higher accuracies than the models using ML methods. We have found that only 73% of the studies we considered have a low bias (mean > 6.0). Despite many exemplary studies with great performance evaluation metrics, we have found that they have not focused on large datasets.
Benchmarking table
The following benchmarking Table 2 compares some of the contributing research for MGMT promoter methylation estimation considered for the evaluation (Zhu et al. 2022; Zlochower et al. 2020; Alhasan 2021; Wu et al. 2021; Kempen et al. 2021), where 8 attributes were considered.
Recommendations and challenges
High-quality ground truth data, generalizable and interpretable methodologies, and the integration of user-centric workflows are major obstacles to the promises of AI in radiology. Concerns over the "black box" character of these algorithms have waned in light of the ongoing advancement of methods, such as saliency mapping or principal component analysis, that may "unbox" the networks by examining internal algorithm feature vectors. So, it is recommended that a better mechanistic understanding of feature patterns and underlying biology will be helpful both for clinical acceptance and for improving the biological and treatment relevance of the patterns revealed by these methods.
The need for robust and thoroughly annotated data sets is AI research's biggest challenge. However, studies with relatively small sample numbers are likelier to have measurement errors. TCIA and BraTS have significantly produced consolidated, well-labelled data for glioma image processing. In contrast, non-glioma-based research has been constrained by the absence of a publicly available data set. However, most data are still isolated within various organizations and hospital systems. To increase the generalizability of an algorithm's performance across multiple imaging sites, acquisition parameters, and patient groups, more extensive and more diverse data sets are recommended (AlBadawy et al. 2018). Other approaches to enhancing data sets include statistical techniques to harmonize the data sets and to introduce more consistent data collecting via the adoption of standardized neuro-oncology imaging protocols across institutions (Ellingson et al. 2017).
Although processing, expenses, and various institutions' ethical approval processes make managing multi-institutional data, it is advised to manage it meticulously so that the radiogenomics study will turn out to be the best and most clinically trustworthy. For instance, if institutions cannot disclose their data owing to ethical concerns, they may release the AI models they have generated and test them on their cohort so that researchers can efficiently integrate the models and conduct additional analyses. Consequently, researchers could conduct their research with more reliable and applicable results (Saxena et al. 2022). Finally, when dealing with high-dimensional, small-sized datasets, the issue of ML model overfitting may be avoided using cross-validation to ensure that the test component does not interfere with the training process (Improving MGMT methylation status prediction of glioblastoma through optimizing radiomics features using genetic algorithm-based machine learning approach|Scientific Reports 2022).
Conclusion
Here, multiple AI-based studies for MGMT promoter methylation estimation with numerous ML & DL architectures, datasets, accuracy, and other significant attributes have been presented. It is concluded that ML-based methods could be employed as filters, predictors, and classification methods to increase most cases' overall performance of the robust model. And DL-based methods demonstrated well performed for in-depth analysis of MGMT methylation estimation. An RoB analysis, considering 21 AI attributes, showed that 27% of studies belong to the high-bias category, and the remaining belong to the low-bias category. The highly biased studies have not considered all AI attributes while evaluating the radiogenomics system or may have lower proportioned values for the attributes considered. Though, there are specific challenges while implementing such AI-based methods for MGMT promoter methylation estimation. However, some promising results demonstrate that if the obstacles are carefully handled, these methods could play a vital role in the field of neuro-oncology in current clinical practice in the era of precision medicine.
Abbreviations
- AI:
-
Artificial intelligence
- ANN:
-
Artificial neural network
- AUC:
-
Area under curve
- BraTS:
-
Brain tumor segmentation
- CNN:
-
Convolutional neural network
- CT:
-
Computer tomography
- DL:
-
Deep learning
- DS:
-
Dataset size
- DSA:
-
Digital subtraction angiography
- E:
-
Exclusion criteria
- FCN:
-
Fully convolutional network
- FLAIR:
-
Fluid-attenuated inversion recovery
- GA:
-
Genetic algorithm
- GBM:
-
Glioblastoma multiforme
- I:
-
Inclusion criteria
- KNN:
-
K-Nearest Neighbours
- MGMT:
-
O6-methylguanine-DNA methyltransferase
- ML:
-
Machine learning
- MRI:
-
Magnetic resonance imaging
- NN:
-
Neural networks
- OOP:
-
Out-of-pocket
- OS:
-
Overall Survival
- PET:
-
Positron emission tomography
- PRISMA:
-
Preferred reporting items for systematic reviews and meta-analyses
- RF:
-
Random forest
- RoB:
-
Risk-of-bias
- ROI:
-
Region of interest
- SVM:
-
Support vector machine
- T1WI:
-
T1-weighted image
- T2WI:
-
T2-weighted image
- TCIA:
-
The cancer imaging archive
- TMZ:
-
Temozolomide
- WHO:
-
World Health Organisation
- XGBoost:
-
EXtreme Gradient Boosting
References
Kong Z et al. (2019) 18F-FDG-PETbased Radiomics signature predicts MGMT promoter methylation status in primary diffuse glioma. Cancer Imaging 19(1): 1–10
“A Deep learning-based radiomics model for prediction of survival in glioblastoma multiforme-pubmed. https://pubmed.ncbi.nlm.nih.gov/28871110/. Accessed 21 Dec 2022
Ak M, Toll SA, Hein KZ, Colen RR, Khatua S (2022) Evolving role and translation of radiomics and radiogenomics in adult and pediatric neuro-oncology. AJNR Am J Neuroradiol 43(6):792–801. https://doi.org/10.3174/ajnr.A7297
AlBadawy EA, Saha A, Mazurowski MA (2018) Deep learning for segmentation of brain tumors: Impact of cross-institutional training and testing. Med Phys 45(3):1150–1158. https://doi.org/10.1002/mp.12752
Alhasan AS (2021) Clinical applications of artificial intelligence, machine learning, and deep learning in the imaging of gliomas: a systematic review. Cureus 13(11):e19580. https://doi.org/10.7759/cureus.19580
An empirical study of different machine learning techniques for brain tumor classification and subsequent segmentation using hybrid texture feature|SpringerLink. https://doi.org/10.1007/s00138-021-01262-x. Accessed 21 Dec 2022
Baine M et al (2021) The potential use of radiomics with pre-radiation therapy MR imaging in predicting risk of pseudoprogression in glioblastoma patients. J Imaging 7(2):17. https://doi.org/10.3390/jimaging7020017
Beier D et al (2008) Temozolomide preferentially depletes cancer stem cells in glioblastoma. Cancer Res 68(14):5706–5715. https://doi.org/10.1158/0008-5472.CAN-07-6878
Brain tumor segmentation and overall survival period prediction in glioblastoma multiforme using radiomic features-Das-2022-Concurrency and Computation: Practice and Experience-Wiley Online Library. https://doi.org/10.1002/cpe.6501. Accessed 21 Dec 2022
Brandes AA et al (2017) Role of MGMT methylation status at time of diagnosis and recurrence for patients with glioblastoma: clinical implications. Oncologist 22(4):432–437. https://doi.org/10.1634/theoncologist.2016-0254
Butler M et al (2020) MGMT status as a clinical biomarker in glioblastoma. Trends Cancer 6(5):380–391. https://doi.org/10.1016/j.trecan.2020.02.010
“CD133(+) and CD133(-) glioblastoma-derived cancer stem cells show differential growth characteristics and molecular profiles-PubMed. https://pubmed.ncbi.nlm.nih.gov/17483311/. Accessed 21 Dec 2022
Chang P et al (2018) Deep-learning convolutional neural networks accurately classify genetic mutations in gliomas. AJNR Am J Neuroradiol 39(7):1201–1207. https://doi.org/10.3174/ajnr.A5667
Chen L et al (2017) Multi-label inductive matrix completion for joint MGMT and IDH1 status prediction for glioma patients. In: Med. Image Comput. Comput.-Assist. Interv. MICCAI Int. Conf. Med. Image Comput. Comput.-Assist. Interv., vol. 10434, pp. 450–458, 2017, https://doi.org/10.1007/978-3-319-66185-8_51.
Chen L et al (2018) Multi-label nonlinear matrix completion with transductive multi-task feature selection for joint MGMT and IDH1 status prediction of patient with high-grade gliomas. IEEE Trans Med Imaging 37(8):1775–1787. https://doi.org/10.1109/TMI.2018.2807590
Chen X et al (2020) Automatic prediction of MGMT status in glioblastoma via deep learning-based mr image analysis. BioMed Res Int 2020:e9258649. https://doi.org/10.1155/2020/9258649
Comprehensive Genomic Subtyping of Glioma Using Semi-Supervised Multi-Task Deep Learning on Multimodal MRI|IEEE Journals & Magazine|IEEE Xplore. https://ieeexplore.ieee.org/document/9654221. Accessed 21 Dec 2022
Crisi G, Filice S (2020) Predicting MGMT promoter methylation of glioblastoma from dynamic susceptibility contrast perfusion: a radiomic approach. J Neuroimaging off J Am Soc Neuroimaging 30(4):458–462. https://doi.org/10.1111/jon.12724
Cuocolo R, Caruso M, Perillo T, Ugga L, Petretta M (2020) Machine learning in oncology: a clinical appraisal. Cancer Lett 481:55–62. https://doi.org/10.1016/j.canlet.2020.03.032
de la Pinta C, Castillo ME, Collado M, Galindo-Pumariño C, Peña C (2021) Radiogenomics: hunting down liver metastasis in colorectal cancer patients. Cancers 13(21):5547. https://doi.org/10.3390/cancers13215547
Drabycz S et al (2010) An analysis of image texture, tumor location, and MGMT promoter methylation in glioblastoma using magnetic resonance imaging. Neuroimage 49(2):1398–1405. https://doi.org/10.1016/j.neuroimage.2009.09.049
Ellingson BM, Wen PY, Cloughesy TF (2017) Modified criteria for radiographic response assessment in glioblastoma clinical trials. Neurother J Am Soc Exp Neurother 14(2):307–320. https://doi.org/10.1007/s13311-016-0507-6
Elmore JG et al (2016) Variability in pathologists’ interpretations of individual breast biopsy slides: a population perspective. Ann Intern Med 164(10):649–655. https://doi.org/10.7326/M15-0964
Ger RB et al (2018) Comprehensive investigation on controlling for CT imaging variabilities in radiomics studies. Sci Rep 8(1):1. https://doi.org/10.1038/s41598-018-31509-z
Gerstner ER, Yip S, Wang DL, Louis DN, Iafrate AJ, Batchelor TT (2009) MGMT methylation is a prognostic biomarker in elderly patients with newly diagnosed glioblastoma. Neurology 73(18):1509–1510. https://doi.org/10.1212/wnl.0b013e3181bf9907
Hajianfar G et al (2019) Noninvasive O6 methylguanine-DNA methyltransferase status prediction in glioblastoma multiforme cancer using magnetic resonance imaging radiomics features: univariate and multivariate radiogenomics analysis. World Neurosurg 132:e140–e161. https://doi.org/10.1016/j.wneu.2019.08.232
Haubold J et al (2020) Non-invasive tumor decoding and phenotyping of cerebral gliomas utilizing multiparametric 18F-FET PET-MRI and MR Fingerprinting. Eur J Nucl Med Mol Imaging 47(6):1435–1445. https://doi.org/10.1007/s00259-019-04602-2
Haubold J et al (2021) Fully automated MR based virtual biopsy of cerebral gliomas. Cancers 13(24):6186. https://doi.org/10.3390/cancers13246186
Hegi ME, Diserens AC, Gorlia T, Hamou MF, de Tribolet N, Weller M, Kros JM, Hainfellner JA, Mason W, Mariani L, Bromberg JEC, Hau P, Mirimanoff RO, Cairncross JG, Janzer RC, Stupp R (2005) MGMT Gene Silencing and Benefit from Temozolomide in Glioblastoma. N Engl J Med 352:997–1003. https://doi.org/10.1056/NEJMoa043331
Improving MGMT methylation status prediction of glioblastoma through optimizing radiomics features using genetic algorithm-based machine learning approach|Scientific Reports. https://www.nature.com/articles/s41598-022-17707-w. Accessed 21 Dec 2022
Jena B, Saxena S, Nayak GK, Saba L, Sharma N, Suri JS (2021) Artificial intelligence-based hybrid deep learning models for image classification: the first narrative review. Comput Biol Med 137:104803. https://doi.org/10.1016/j.compbiomed.2021.104803
Jena B et al (2022) Brain Tumor Characterization Using Radiogenomics in Artificial Intelligence Framework. Cancers 14(16):4052. https://doi.org/10.3390/cancers14164052
Jiang C et al (2019) Fusion radiomics features from conventional MRI predict MGMT promoter methylation status in lower grade gliomas. Eur J Radiol. https://doi.org/10.1016/j.ejrad.2019.108714
Kanas VG, Zacharaki EI, Thomas GA, Zinn PO, Megalooikonomou V, Colen RR (2017) Learning MRI-based classification models for MGMT methylation status prediction in glioblastoma. Comput Methods Programs Biomed 140:249–257. https://doi.org/10.1016/j.cmpb.2016.12.018
Kickingereder P et al (2016) Radiogenomics of glioblastoma: machine learning-based classification of molecular characteristics by using multiparametric and multiregional MR imaging features. Radiology 281(3):907–918. https://doi.org/10.1148/radiol.2016161382
Kocher M, Ruge MI, Galldiks N, Lohmann P (2020) Applications of radiomics and machine learning for radiotherapy of malignant brain tumors. Strahlenther Onkol Organ Dtsch Rontgengesellschaft Al 196(10):856–867. https://doi.org/10.1007/s00066-020-01626-8
Korfiatis P et al (2016) MRI texture features as biomarkers to predict MGMT methylation status in glioblastomas. Med Phys 43(6):2835–2844. https://doi.org/10.1118/1.4948668
Kumthekar P et al (2014) Financial burden experienced by patients undergoing treatment for malignant gliomas. Neuro-Oncol Pract 1(2):71–76. https://doi.org/10.1093/nop/npu002
Le NQK, Do DT, Chiu F-Y, Yapp EKY, Yeh H-Y, Chen C-Y (2020) XGBoost improves classification of MGMT promoter methylation status in IDH1 wildtype glioblastoma. J Pers Med 10(3):128. https://doi.org/10.3390/jpm10030128
Li Z-C et al (2018) Multiregional radiomics features from multiparametric MRI for prediction of MGMT methylation status in glioblastoma multiforme: a multicentre study. Eur Radiol 28(9):3640–3650. https://doi.org/10.1007/s00330-017-5302-1
Liu G et al (2006) Analysis of gene expression and chemoresistance of CD133+ cancer stem cells in glioblastoma. Mol Cancer 5:67. https://doi.org/10.1186/1476-4598-5-67
Lopes MBS (2017) The 2017 World Health Organization classification of tumors of the pituitary gland: a summary. Acta Neuropathol (berl) 134(4):521–535. https://doi.org/10.1007/s00401-017-1769-8
Lu Y et al (2020) Machine learning-based radiomic, clinical and semantic feature analysis for predicting overall survival and MGMT promoter methylation status in patients with glioblastoma. Magn Reson Imaging 74:161–170. https://doi.org/10.1016/j.mri.2020.09.017
Magnetic Resonance Imaging (MRI) (2022) National Institute of Biomedical Imaging and Bioengineering. https://www.nibib.nih.gov/science-education/science-topics/magnetic-resonance-imaging-mri. Accessed 25 Dec 2022
McGarry SD et al (2016) Magnetic resonance imaging-based radiomic profiles predict patient prognosis in newly diagnosed glioblastoma before therapy. Tomography 2(3):223–228. https://doi.org/10.18383/j.tom.2016.00250
Mesfin FB, Al-Dhahir MA (2022) Gliomas,” in StatPearls, Treasure Island (FL): StatPearls Publishing, 2022. [Online]. http://www.ncbi.nlm.nih.gov/books/NBK441874/. Accessed 21 Dec 2022
Methylguanine-DNA methyltransferase (MGMT)|Radiology Reference Article|Radiopaedia.org. https://radiopaedia.org/articles/methylguanine-dna-methyltransferase-mgmt. Accessed 21 Dec 2022
MRI-Based Deep-Learning Method for Determining Glioma MGMT Promoter Methylation Status-PubMed. https://pubmed.ncbi.nlm.nih.gov/33664111/. Accessed 21 Dec 2022
“New Strategies Take on the Worst Cancer--Glioblastoma-Scientific American.” https://www.scientificamerican.com/article/new-strategies-take-on-the-worst-cancer-glioblastoma/. Accessed 21 Dec 2022
OncologyPRO (2019) “Factsheets on Biomarkers|OncologyPRO,” Factsheets on Biomarkers|OncologyPRO. https://oncologypro.esmo.org/education-library/factsheets-on-biomarkers/mgmt-promoter-methylation-in-glioma. Accessed 21 Dec 2022
Patel M et al (2021) Machine learning-based radiomic evaluation of treatment response prediction in glioblastoma. Clin Radiol 76(8):628.e17-628.e27. https://doi.org/10.1016/j.crad.2021.03.019
Peri C (2022) What is glioblastoma? WebMD. https://www.webmd.com/cancer/brain-cancer/what-is-glioblastoma. Accessed 21 Dec 2022
Pistollato F et al (2010) Intratumoral hypoxic gradient drives stem cells distribution and MGMT expression in glioblastoma. Stem Cells Dayt Ohio 28(5):851–862. https://doi.org/10.1002/stem.415
Predicting MGMT Promoter Methylation in Diffuse Gliomas Using Deep Learning with Radiomics-PMC. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9224690/. Accessed 21 Dec 2022
Qian J et al (2020) Prediction of MGMT status for glioblastoma patients using radiomics feature extraction from 18F-DOPA-PET imaging. Int J Radiat Oncol Biol Phys 108(5):1339–1346. https://doi.org/10.1016/j.ijrobp.2020.06.073
Radiological model based on the standard magnetic resonance sequences for detecting methylguanine methyltransferase methylation in glioma using texture analysis-PubMed. https://pubmed.ncbi.nlm.nih.gov/33932065/. Accessed 21 Dec 2022
Radiomic Analysis to Predict Histopathologically Confirmed Pseudoprogression in Glioblastoma Patients-ScienceDirect. https://www.sciencedirect.com/science/article/pii/S2452109422000239. Accessed 21 Dec 2022
Radiomics with artificial intelligence: a practical guide for beginners-PMC. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6837295/. Accessed 25 Dec 2022
Radiomics-based machine learning model for efficiently classifying transcriptome subtypes in glioblastoma patients from MRI-ScienceDirect. https://www.sciencedirect.com/science/article/pii/S0010482521001141. Accessed 21 Dec 2022
Raizer JJ et al (2015) Economics of malignant gliomas: a critical Review. J Oncol Pract 11(1):e59-65. https://doi.org/10.1200/JOP.2012.000560
Rasmussen BK et al (2017) Epidemiology of glioma: clinical characteristics, symptoms, and predictors of glioma patients grade I-IV in the the Danish Neuro-Oncology Registry. J Neurooncol 135(3):571–579. https://doi.org/10.1007/s11060-017-2607-5
Rizzo S et al (2018) Radiomics: the facts and the challenges of image analysis. Eur Radiol Exp 2(1):36. https://doi.org/10.1186/s41747-018-0068-z
Sareen H, Ma Y, Becker TM, Roberts TL, de Souza P, Powter B (2022) Molecular biomarkers in glioblastoma: a systematic review and meta-analysis. Int J Mol Sci 23(16):8835. https://doi.org/10.3390/ijms23168835
Saxena S et al (2022) Role of artificial intelligence in radiogenomics for cancers in the era of precision medicine. Cancers 14(12):12. https://doi.org/10.3390/cancers14122860
Saxena S et al (2023a) Prediction of O-6-methylguanine-DNA methyltransferase and overall survival of the patients suffering from glioblastoma using MRI-based hybrid radiomics signatures in machine and deep learning framework. Neural Comput Appl. https://doi.org/10.1007/s00521-023-08405-3
Saxena S et al (2023b) Fused deep learning paradigm for the prediction of o6-methylguanine-DNA meth yltransferase genotype in glioblastoma patients: a neuro-oncological investigation. Comput Biol Med 153:106492. https://doi.org/10.1016/j.compbiomed.2022.106492
Sharma S, Salehi F, Scheithauer BW, Rotondo F, Syro LV, Kovacs K (2009) Role of MGMT in tumor development, progression, diagnosis, treatment and prognosis. Anticancer Res 29(10):3759–3768
Silantyev AS et al (2019) Current and future trends on diagnosis and prognosis of glioblastoma: from molecular biology to proteomics. Cells 8(8):863. https://doi.org/10.3390/cells8080863
Sohn B et al (2021) Radiomics-based prediction of multiple gene alteration incorporating mutual genetic information in glioblastoma and grade 4 astrocytoma, IDH-mutant. J Neurooncol 155(3):267–276. https://doi.org/10.1007/s11060-021-03870-z
Srivastava SK, Singh SK, Suri JS (2019) Effect of incremental feature enrichment on healthcare text classification system: a machine learning paradigm. Comput Methods Programs Biomed 172:35–51. https://doi.org/10.1016/j.cmpb.2019.01.011
Stupp R et al (2009) Effects of radiotherapy with concomitant and adjuvant temozolomide versus radiotherapy alone on survival in glioblastoma in a randomised phase III study: 5-year analysis of the EORTC-NCIC trial. Lancet Oncol 10(5):459–466. https://doi.org/10.1016/S1470-2045(09)70025-7
Tamimi AF, Juweid M (2017) Epidemiology and outcome of glioblastoma. In: Glioblastoma, S. De Vleeschouwer, Ed. Brisbane (AU): Codon Publications, 2017. [Online]. http://www.ncbi.nlm.nih.gov/books/NBK470003/. Accessed 21 Dec 2022
Tixier F et al (2019) Preoperative MRI-radiomics features improve prediction of survival in glioblastoma patients over MGMT methylation status alone. Oncotarget 10(6):660–672. https://doi.org/10.18632/oncotarget.26578
Trivizakis E et al (2020) Artificial intelligence radiogenomics for advancing precision and effectiveness in oncologic care (Review). Int J Oncol 57(1):43–53. https://doi.org/10.3892/ijo.2020.5063
van Kempen EJ et al (2021) Accuracy of machine learning algorithms for the classification of molecular features of gliomas on MRI: a systematic literature review and meta-analysis. Cancers 13(11):11. https://doi.org/10.3390/cancers13112606
Verduin M et al (2021) Prognostic and predictive value of integrated qualitative and quantitative magnetic resonance imaging analysis in glioblastoma. Cancers 13(4):722. https://doi.org/10.3390/cancers13040722
Wei J et al (2019) A multi-sequence and habitat-based MRI radiomics signature for preoperative prediction of MGMT promoter methylation in astrocytomas with prognostic implication. Eur Radiol 29(2):877–888. https://doi.org/10.1007/s00330-018-5575-z
Wu Y, Guo Y, Ma J, Sa Y, Li Q, Zhang N (2021) Research progress of gliomas in machine learning. Cells 10(11):3169. https://doi.org/10.3390/cells10113169
Xi Y-B et al (2018) Radiomics signature: a potential biomarker for the prediction of MGMT promoter methylation in glioblastoma. J Magn Reson Imaging JMRI 47(5):1380–1387. https://doi.org/10.1002/jmri.25860
Yin A et al (2014) The predictive but not prognostic value of MGMT promoter methylation status in elderly glioblastoma patients: a meta-analysis. PLoS ONE 9(1):e85102. https://doi.org/10.1371/journal.pone.0085102
Yu W, Zhang L, Wei Q, Shao A (2020) O6-Methylguanine-DNA Methyltransferase (MGMT): Challenges and New Opportunities in Glioma Chemotherapy. Front Oncol 9:1547. https://doi.org/10.3389/fonc.2019.01547
Zhang S et al (2021) Automated machine learning to predict the co-occurrence of isocitrate dehydrogenase mutations and O6 -methylguanine-DNA methyltransferase promoter methylation in patients with gliomas. J Magn Reson Imaging JMRI 54(1):197–205. https://doi.org/10.1002/jmri.27498
Zhu M et al (2022) Artificial intelligence in the radiomic analysis of glioblastomas: a review, taxonomy, and perspective. Front Oncol. https://doi.org/10.3389/fonc.2022.924245
Zlochower A, Chow DS, Chang P, Khatri D, Boockvar JA, Filippi CG (2020) Deep learning AI applications in the imaging of glioma. Top Magn Reson Imaging TMRI 29(2):115–110. https://doi.org/10.1097/RMR.0000000000000237
Funding
The authors declare that no funds, grants, or other support were received for this study.
Author information
Authors and Affiliations
Contributions
All of the listed authors have participated actively in the entire study project, including design, conception, analysis, and manuscript preparation. Preparation, research papers collection and analysis were performed by MVSS, BJ, GM and SS. MVSS, BJ, GM drafted the original manuscript, while NKD, WCL and SS reviewed and edited the draft. All authors participated in and approved the final submission.
Corresponding authors
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethics approval
No ethical approval is required due to the retrospective nature of the study.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A
Based on the consensus of the experienced AI engineering team and deep literature review, we developed the scheme for the weight matrix, in which 21 AI-based attributes were for all 33 studies, thus a total of 693 attributes involved. These AI attributes are initially qualitative and then quantified by assigning a number between 0 and 1, as shown in Table 3, which is known as the weight matrix. The attributes are assigned to values ranging from A1 to A21 Table 3).
Assigning weights to the attributes
A1 (Image modality): MRI = 0.5, PET = 0.6, CT = 0.4, MRI + PET = 1, MRI + CT = 0.7, CT + PET = 0.8; A2(Study objective): classification or segmentation = 0.5, both classification and segmentation = 1; A3 (Dataset Size (# of patients)): < 100 = 0.6, 101–200 = 0.7, 201–300 = 0.8, > 300 = 1; A4(Demographic info): no = 0.5, yes = 1; A5(Feature Extraction):hand-crafted = 0.7 none = 0.5 automatic = 1; A6(Feature Selection): no = 0.2 and yes = 1; A7(Pre-Processing): No pre-processing used = 0.5 and pre-processing used = 1; A8(Data Augmentation): No data augmentation = 0.5, with data augmentation (yes) = 1; A9(number of PE Parameters): 5 or more parameters = 1, 4 parameters = 0.9, 3 parameters = 0.8, 2 parameters = 0.7, 1 parameters = 0.6; A10(Accuracy):converted to percentage and scored between 0 and 1 (eg, 50% = 0.5 and 100% = 1); A11(Sensitivity): converted to percentage and scored between 0 and 1 (eg, 50% = 0.5 and 100% = 1); A12(Specificity): converted to percentage and scored between 0 and 1 (eg, 50% = 0.5 and 100% = 1); A13(Precision):converted to percentage and scored between 0 and 1 (eg, 50% = 0.5 and 100% = 1); A14(F1 score): converted to percentage and scored between 0 and 1 (eg, 50% = 0.5 and 100% = 1; A15(AUC): Keep it as it is, if 0.99, the 0.99; A16(Performance Analysis Metrics):no ROC and Confusion Matrix and Boxplot = 0.2, Only confusion matrix = 0.6,Only Boxplot = 0.7, Only ROC = 0.8, Both ROC and Boxplot = 0.9, Both ROC and Confusion Matrix = 1; A17(Statistical Analysis):No statistical analysis = 0.2, Yes = 1; A18(Regularisation): Absence in study = 0.2, Presence = 1; A19(# of Regularisation Methods): no method = 0.2, 1-method = 0.8, 2-methods = 0.9, and 3-methods = 1; A20(Hardware Resources): Information available = 1, No information = 0.5; A21(Software Resources):Informationavailable = 1,Noinformation = 0.5.
-
1.
Image Modalities: Understanding the pivotal role played by various image modalities in offering distinct insights into tumors and the inherent challenges they pose in constructing effective ML/DL models based on these insights, we recognize the paramount importance of image modality in our analyses.
-
2.
Study Objectives: In our evaluation, we have duly considered the primary objectives of the studies. We have accorded higher scores to studies that encompass both classification and segmentation objectives, reflecting the prevalence of these objectives among the considered works.
-
3.
Dataset Size (Number of Patients): Recognizing the pivotal role of dataset size in determining the efficacy of statistical models, we have given preference to studies boasting larger datasets and substantial results.
-
4.
Patient Demographics: Comprehending the significance of patient demographics in model training, we have included demography as a metric to encourage a holistic approach, encompassing factors such as patient age and gender.
-
5.
Feature Extraction: To facilitate replicability, we have considered studies that explicitly detail their feature extraction procedures as they contribute to easier replication efforts.
-
6.
Feature Selection: Similarly, we have prioritized studies that elucidate their feature selection methods, promoting replicability and improved results.
-
7.
Data Preprocessing: Acknowledging the critical role of data preprocessing in the machine learning pipeline, we have rewarded studies that transparently describe their data preprocessing methods.
-
8.
Data Augmentation: Recognizing the benefits of data augmentation in bridging data gaps, we have evaluated studies based on their utilization of data augmentation techniques.
-
9.
Performance Evaluation Parameters: Given the multifaceted nature of model performance, we have favored studies employing a comprehensive set of evaluation metrics.
-
10.
Accuracy: We have acknowledged the importance of accuracy as a key evaluation metric.
-
11.
Sensitivity: Sensitivity, being a vital evaluation metric, has been given due consideration.
-
12.
Specificity: Specificity, as an essential evaluation metric, has been duly recognized.
-
13.
Precision: Precision, a crucial evaluation metric, has been accounted for.
-
14.
AUC of the ROC: The area under the ROC curve, a pivotal evaluation metric for classification models, has been considered.
-
15.
F1 Score: We have recognized the significance of the F1 score as a key evaluation metric.
-
16.
Performance Analysis Metrics: To gain a comprehensive visual understanding of model results, we have awarded higher scores to studies employing multiple analytic metrics such as confusion matrices, ROC curves, and box plots.
-
17.
Statistical Analysis: Studies that incorporate statistical analysis bolster the credibility of their models, and hence, we have acknowledged their contributions.
-
18.
Regularization: We have emphasized the importance of regularization techniques in preventing overfitting, making it a relevant criterion in our assessment.
-
19.
Number of Regularization Methods: Studies implementing multiple regularization methods have received higher scores for their versatility.
-
20.
Hardware Resources: To enhance replicability, we have considered the disclosure of hardware resources used in the study as a criterion.
-
21.
Software Resources: Similarly, the provision of information about software resources used in the study aids in replicability and has been included as a relevant metric.
Appendix B
See Table 3.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Samartha, M.V.S., Dubey, N.K., Jena, B. et al. AI-driven estimation of O6 methylguanine-DNA-methyltransferase (MGMT) promoter methylation in glioblastoma patients: a systematic review with bias analysis. J Cancer Res Clin Oncol 150, 57 (2024). https://doi.org/10.1007/s00432-023-05566-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00432-023-05566-5