1 Introduction

Over the years, cancer, by its unstable nature, has remained a curse to humankind [26]. Computer-aided diagnosis (CAD) applications are used to assist neurologists. Brain tumor detection, classification, and grading systems are presented in [9, 19, 22, 24]. These applications rely on magnetic resonance imaging (MRI) of the brain, which is preferred over computed tomography (CT) because it provides greater contrast for the soft tissues of the brain. In CAD systems, machine-learning techniques are widely used to detect and classify brain tumors. The basic step in these systems is feature extraction, where the system learns the important features in the MRI images. To this end, several feature extraction methods have been proposed [29]. The extracted features are then fed into a classification model to detect and classify brain tumors. However, to make this problem computationally tractable, these systems usually assume that the important features in the MRI images are independent, which limits their ability to capture the natural relationships between the features and, in turn, reduces their accuracy [30].

To address these limitations, several MRI-based brain tumor detection systems have adopted deep learning. Convolutional neural network (CNN) models and transfer learning models are commonly used as solutions for detection and classification problems. These solutions usually contain two phases. The first is an offline phase where the deep model is trained using a set of manually classified MRI images (training data). The second is an online phase that takes a brain MRI image and determines whether it contains a tumor or not.

The limitations of existing solutions are summarized as follows. The performance of current methods is not sufficient given the medical importance of the detection and classification problems. Previous solutions rely on manually delineated tumor areas before classification. This eliminates the detection problem and makes the classification problem easier; however, it prevents these systems from being fully automated. On the other hand, automated solutions developed using CNNs and their variants have not been able to significantly improve performance. This is because CNNs and deep learning models in general are data-hungry [27]; i.e., to achieve the expected good performance, they require large amounts of training data (classified images). This comes with additional cost due to the need to collect and classify the brain MRI images. Although pre-trained models (i.e. transfer learning solutions) can be used to mitigate the need for big training data, the performance of such solutions depends on the pre-trained model. If the target task (i.e. tumor detection or classification) and the task the pre-trained model was trained for are too dissimilar, the accuracy of transfer learning will be poor [28]. Finally, existing solutions using either CNNs or transfer learning are trained using class-unbalanced datasets (with respect to the number of training samples for each class), lowering their detection and classification accuracy [13, 16].

This paper provides a novel general framework for detecting and classifying brain tumors. The proposed framework can be seamlessly integrated with any of the existing MRI-based detection and classification systems by processing their small unbalanced training dataset to produce a larger balanced dataset that is suitable for training deep learning models. In particular, we employ two types of deep models. The first is an innovative deep model that captures the distribution of the important features in a small class-unbalanced image dataset. Using this distribution of the important features, the system can generate any number of brain MRI images for each class. Thus, the system can automatically convert a small unbalanced dataset into a larger balanced one. The second model is the classifier, which is trained using the large class-balanced dataset to detect brain tumors in MRI images. Note that while we use two models, one for generating new samples and one for detecting tumors, our work is different from previous work where two models are usually employed for feature extraction and for classification [12].

The generator model can provide the dense class-balanced dataset that is required for training the classifier model. This gives it a large impact on many real-world applications that require dense class-balanced training data; examples include brain tumor detection, classification, and grading. However, because the classifier model is trained using the newly generated data from the generator model, the accuracy of the classification (i.e. tumor detection) is affected by the accuracy of the generator model. This limitation can be addressed by carefully training the generator model until it reaches a sufficiently high accuracy.

The proposed framework achieves an overall detection accuracy of 96.88%, outperforming the most recent detection methods, which highlights the promise of the proposed framework as an accurate system for brain tumor detection.

The rest of the paper is organized as follows: Section 2 reviews the related work. The proposed framework and the details of the two models are described in Section 3. The dataset description and the evaluation of the proposed framework are presented in Section 4. Finally, Section 5 concludes the paper.

2 Related work

2.1 Deep learning solutions

Deep learning solutions can learn to generate high-level features directly from raw MRI images. The convolutional neural network (CNN) is a commonly used deep model in these systems [21]. It can automatically learn the representation of complex features directly from the data itself. CNN-based brain tumor detection systems usually have two phases: an offline phase where a deep CNN model is trained using a set of classified MRI images (training data), and an online phase that takes a brain MRI image and determines whether it contains tumors or not. CNN-based systems have been successfully applied to the problem of detection and classification of brain tumors. In addition, with the support of parallel GPUs, these techniques have gained tremendous success. In [12], the authors used two different models to develop an accurate brain tumor classification system: the first is a hybrid model for brain tumor feature extraction, and the second is a regularized extreme learning machine (RELM) for the classification itself.

On the other hand, CNNs and their variants have not been able to significantly improve performance. This is because CNNs and deep learning models in general are data-hungry [27]; i.e., to achieve the expected good performance, they require large amounts of training data (classified images).

2.2 Transfer learning solutions

Recently, a special class of deep learning, known as transfer learning, has demonstrated its potential for detecting brain tumors based on MRI [15, 33]. Transfer learning allows the use of a previously trained CNN model that was developed for another related application. Several techniques extract deep features from brain MRI images using pre-trained networks, demonstrating the ability of transfer learning to work with smaller datasets [2, 20]. Examples include [33], where AlexNet and GoogLeNet are employed for grading gliomas from MRI images, and [15], where a pre-trained VGG-16 network is used for the diagnosis of Alzheimer's disease from MRI images.

On the other hand, the performance of transfer learning solutions depends on the pre-trained model. If the target task (i.e. tumor detection or classification) and the task the pre-trained model was trained for are too dissimilar, the accuracy of transfer learning will be poor [28].

2.3 Deep learning-based segmentation

Recently, there has been much work on image segmentation [3,4,5,6, 11, 14, 23, 34, 35]. In [34, 35], the authors propose a novel cross-modality deep-learning-based framework to segment brain tumors from multi-modality MRI data. In [14], the authors employ nnU-Net for brain tumor segmentation. In [23], the authors propose an automated segmentation of brain tumors from 3D MRI images, using autoencoders to address the problem of the small training dataset. The authors of [3] also provide an efficient secure algorithm for 3D image segmentation. They propose an algorithm for segmenting medical volumes based on multiresolution analysis, where different reconstructed versions of the 3D volume are considered to obtain robust and accurate segmentation results [5].

However, these techniques handle 3D images to perform image segmentation, which makes them relatively computationally expensive compared to the use of 2D images, which are a better fit for the brain tumor detection problem [1, 4, 6].

3 The proposed framework

Figure 1 shows the proposed framework, which can be integrated with any of the current deep learning-based brain tumor detection systems. The framework takes as input the brain MRI images collected by traditional MRI-based systems, typically class-unbalanced and small in size. The input brain MRI images are then passed through the preprocessor module, which resizes and normalizes them. The framework contains two deep models: the Generator model G(x,𝜃g) and the Classifier model C(x,𝜃c), where x is the input to each model, 𝜃g is the generator network parameters, and 𝜃c is the classifier network parameters. The Generator learns the distribution of the important features in the preprocessed images. Then, given the distribution of the important features, the Generator can convert the small unbalanced preprocessed image dataset into a larger balanced one. Finally, the large balanced dataset is used to train another deep model, the classifier, which is used for detection and/or classification. In the rest of this section, we start by describing the input data format, followed by the details of the Preprocessor and the Generator modules; the latter is the core contribution of this paper. Finally, we describe the classifier that is used to detect tumors in MRI images.

Fig. 1 The proposed framework

3.1 Input data

The input to the system is a set of brain MRI images. This set is small in size and imbalanced with respect to the number of training images per class. Almost all MRI scanners output images in a standardized medical format (e.g. DICOM). These images are stored as two-dimensional (2D) grayscale images. Each entry in the grayscale image stores a value from 0 to 255. This range reflects the trade-off between the efficiency of storing the image information (256 values fit exactly in 1 byte) and the sensitivity of the human eye (humans distinguish a limited number of shades of the same color). The grayscale images are then passed through the preprocessor module.

3.2 Preprocessor

The goal of this module is to resize and normalize the input MRI images. First, since all the input images must have a fixed size, the grayscale input images are resized to 256 × 256 pixels. This allows them to be fed into deep models with a fixed input size. Second, normalizing the input data generally speeds up learning and leads to faster convergence. To this end, the intensity values of the grayscale input images are normalized to the range [0,1] using a minimum-maximum normalization technique [25]. The preprocessed images are then passed through the generator network.
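As an illustration, the following is a minimal sketch of this preprocessing step in Python. The 256 × 256 target size and the [0,1] min-max scaling follow the text; the use of OpenCV and the small epsilon guard against constant images are our assumptions.

```python
import numpy as np
import cv2  # OpenCV, for image loading and resizing

def preprocess(image_path: str) -> np.ndarray:
    """Resize a grayscale brain MRI image to 256 x 256 and min-max
    normalize its intensity values to the range [0, 1]."""
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)   # 2D uint8 array
    img = cv2.resize(img, (256, 256))
    img = img.astype(np.float32)
    # Minimum-maximum normalization of the intensity values.
    img = (img - img.min()) / (img.max() - img.min() + 1e-8)
    return img[..., np.newaxis]  # add a channel axis for the deep models
```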

3.3 Generator model

The goal of the generative model is to synthesize new brain MRI image samples for each class. The generative model is trained using the preprocessed images to generate new image samples that reflect the typical patterns in the preprocessed images. Our generative model is a modified version of the variational autoencoder [18]. In general, autoencoders are a specific type of feedforward neural network where the input is the same as the output [7]. They are used to learn key representational information (features) of a dataset within a low-dimensional latent space in an unsupervised manner: they compress the input image into a lower-dimensional latent-space representation z (embeddings) and then reconstruct the output image from this representation. Hence, the latent space learns to capture the most essential information required for reconstruction. However, the latent space embeddings may be sparsely distributed, so the key information is spread across several clusters in the latent space. Meanwhile, the empty space between the clusters does not capture any useful information, and sampling from it creates meaningless results.

To solve this problem, we use the variational autoencoder (VAE), which adds the constraint that the latent space embeddings must follow a certain predefined distribution p(z) [18]. This distribution is usually selected as the normal distribution [10]. By forcing the latent space embeddings to follow the normal distribution, the network is forced to fully utilize the latent space, so that information is distributed in a way that allows us to sample from any point in the latent space and generate new images that reflect the typical patterns in the original small brain MRI image dataset. Therefore, we rely on the VAE to generate new brain images.

Because brain tumors do not always appear with the same number, the same shape, and in the exact same position in brain MRI images, convolutional and deconvolutional layers are applied in the implementation of the encoder and the decoder instead of regular feedforward layers. These layers utilize sliding filter maps that can recognize the tumors' local patterns independently of their number, shapes, and positions in the brain MRI images. Hence, the generator model learns to generate new MRI images with different numbers of tumors, with different shapes, and in different positions in the images. Figure 2 shows our convolutional variational autoencoder (CVAE) architecture. The CVAE network has two main components: the encoder (conv) and the decoder (deconv). The encoder network consists of several convolutional layers followed by a fully connected layer, and the decoder network consists of a fully connected layer followed by deconvolutional layers. The encoder compresses the brain MRI image input x ∈ X to produce the hidden (latent) representation z, and has network parameters ϕ. The latent space z is typically referred to as a bottleneck because the encoder must learn an efficient compression of the brain image data into a lower-dimensional space. We refer to the encoder as Qϕ(z|x). We can sample from this distribution to get noisy values of the representation z. The decoder takes the latent representation z as input, produces the parameters of the probability distribution of the data, and has weights and biases 𝜃. The decoder is denoted by P𝜃(x|z). The loss function is the negative log-likelihood function defined as follows,

$$ L(\phi, \theta) = {\mathcal{L}}_{1} + {\mathcal{L}}_{2} $$
(1)
$$ {\mathcal{L}}_{1} = - E_{z\sim Q_{\phi}(z\vert x)}[\log P_{\theta}(x\vert z)] $$
(2)
$$ {\mathcal{L}}_{2} = KL(Q_{\phi}(z\vert x) \vert\vert p(z)) $$
(3)
Fig. 2 The generative network structure. An input image of size 256 × 256 is passed through the encoder network Qϕ(z|x), which has two convolutional and max-pooling layers to extract the most important features from the input image x. The output of the encoder is flattened into one vector and then passed through a fully connected layer (FC) to obtain the mean μ and the standard deviation Σ of the encoder distribution Qϕ(z|x). We then sample z from Qϕ(z|x) and pass it through the decoder network P𝜃(x|z), which has two deconvolutional and upsampling (i.e. nearest-neighbor) layers, to reconstruct the image \(\hat {x}\). We force the generative network to reconstruct the input images while making z follow the normal distribution

The loss consists of two terms. The first term is the reconstruction loss. It is the expected negative log-likelihood of the data. This term forces the decoder to learn to reconstruct the data, which prevents the VAE from generating meaningless image samples. The second term is the regularizer. It is the Kullback-Leibler divergence [17] between the encoder's distribution Qϕ(z|x) and the predefined distribution p(z). Assuming that p(z) = N(0,1), the regularizer forces the latent representations z to follow the standard normal distribution. We employ stochastic gradient descent to optimize the loss with respect to the CVAE network parameters ϕ and 𝜃.
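For concreteness, the loss in (1)-(3) can be written in a few lines. Below is a sketch in TensorFlow/Keras, assuming the encoder outputs the mean and log-variance of Qϕ(z|x) and the decoder outputs per-pixel Bernoulli parameters (a common VAE choice for [0,1]-valued images; the paper does not state the exact likelihood):

```python
import tensorflow as tf

def reparameterize(z_mean, z_log_var):
    """Sample z ~ Q_phi(z|x) via the reparameterization trick so that
    gradients can flow through the sampling step."""
    eps = tf.random.normal(tf.shape(z_mean))
    return z_mean + tf.exp(0.5 * z_log_var) * eps

def cvae_loss(x, x_recon, z_mean, z_log_var):
    """Negative ELBO: reconstruction term (Eq. 2) plus KL regularizer (Eq. 3)."""
    # L1: expected negative log-likelihood, here a per-pixel Bernoulli
    # likelihood (binary cross-entropy) summed over the image.
    recon = tf.reduce_sum(
        tf.keras.losses.binary_crossentropy(x, x_recon), axis=[1, 2])
    # L2: KL(Q_phi(z|x) || N(0, I)), in closed form for a diagonal Gaussian.
    kl = -0.5 * tf.reduce_sum(
        1.0 + z_log_var - tf.square(z_mean) - tf.exp(z_log_var), axis=1)
    return tf.reduce_mean(recon + kl)
```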

We train a convolutional variational autoencoder (CVAE) model for each class in the dataset to learn the joint distribution P(Xi) of the input features over the small set of training images Xi for class i. After the training phase, the network can generate new brain MRI images by sampling the latent variables \(z \sim N(0,1)\) and then decoding z to get new brain image samples \(\hat {x}\) from class i. In this way, we can convert a small unbalanced dataset into a larger balanced one.
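A sketch of this sampling step, assuming a trained per-class decoder. The decoder names, the latent dimensionality of 128, and the per-class counts are illustrative:

```python
import tensorflow as tf

def generate_samples(decoder, n, latent_dim=128):
    """Decode n latent vectors drawn from the prior p(z) = N(0, I) into
    new brain MRI images for one class."""
    z = tf.random.normal((n, latent_dim))
    return decoder(z, training=False)  # shape (n, 256, 256, 1), values in [0, 1]

# e.g. top up each class to 500 images (as done in Section 4.2.3);
# decoder_tumor / decoder_healthy are the trained per-class decoders
new_tumor = generate_samples(decoder_tumor, 500 - n_tumor)
new_healthy = generate_samples(decoder_healthy, 500 - n_healthy)
```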

3.4 Classifier model

This section describes our classifier model and how we increase the model robustness.

3.4.1 Basic model

Our classifier structure is shown in Fig. 3. The classifier model is assumed to be a binary classifier that detects whether a brain MRI image has a tumor or not; however, this model can be replaced with any other classifier. The classifier model is a convolutional neural network (CNN). The input to the CNN is a brain MRI image. The output is the probability distribution over the different classes. The classifier consists of three components. The first is the convolutional layers, which extract the most important features from the brain MRI image. The second is the pooling layers, which downsample each feature map to reduce its dimensionality and focus on the most important elements. Finally, the fully connected layers, applied after flattening the features identified in the previous layers into a vector, predict the probability that the brain MRI image contains a tumor.

Fig. 3 The classifier structure. An input image of size 256 × 256 is fed into the classifier. The convolutional layer uses 32 filters to generate the feature maps. The image is then passed through the max-pooling layer, which downsamples the image dimensions by a factor of 2. This process is repeated by the second convolutional and pooling layers. After that, we flatten the output of the second pooling layer into one vector. Finally, a fully connected layer followed by a softmax layer is used to obtain the class probability distribution
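A sketch of this classifier in Keras. The 32 filters, the factor-2 max-pooling, the flatten step, and the softmax output follow Fig. 3; the 3 × 3 kernel size, the 256-unit dense layer, and the placement of dropout are our assumptions:

```python
from tensorflow import keras
from tensorflow.keras import layers

classifier = keras.Sequential([
    layers.Input(shape=(256, 256, 1)),
    # First stage: 32 filters, then downsample by a factor of 2.
    layers.Conv2D(32, (3, 3), activation='relu', padding='same'),
    layers.MaxPooling2D((2, 2)),
    # Second stage repeats the conv/pool pattern.
    layers.Conv2D(32, (3, 3), activation='relu', padding='same'),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),                       # features to one vector
    layers.Dense(256, activation='relu'),   # fully connected layer
    layers.Dropout(0.5),                    # regularization (Section 3.4.2)
    layers.Dense(2, activation='softmax'),  # tumor / no tumor probabilities
])
```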

3.4.2 Increasing model robustness

To further increase the model robustness, the proposed system employs the dropout regularization technique during training [31]. The idea is to randomly drop neurons from the network during training. The temporarily removed neurons no longer contribute to the activation of downstream neurons in the forward pass, and the weight update process is not applied to them in the backward pass. This prevents the network from overfitting the training data.

3.5 Discussion

The proposed framework consists of two different models with two different functions. The goal of the generator model is to generate more data samples for each class in a small class-unbalanced dataset, while the classifier model is used only for classification. The generator model needs to be trained first to generate the synthetic samples. The original samples and the synthetic ones are then used for training the classifier. Hence, the two networks cannot be trained in an end-to-end manner.

The generator network can be employed independently of the classifier network. It can be used to generate more data samples for any image-based machine learning problem that needs a dense balanced dataset for training. The classifier, on the other hand, can be used for any image-based classification task. Hence, our framework is general for different detection and classification problems, as we can replace the classifier of the brain tumor detection task with another one that handles another problem (e.g. grading the tumors).

4 Experiments and results

This section starts with the dataset description, followed by an explanation of the performance metrics. Finally, we compare the performance of the proposed framework with other systems. We implement the generator and classifier networks using Google Colab.

4.1 Dataset description

Our experiments rely on a small dataset that contains only 253 samples. The dataset is openly available in [8] and is commonly used for evaluating classification, e.g. in [32]. Each sample in the dataset has a binary label that indicates whether the brain MRI image contains a tumor or not. We used 70% of the samples for training and the remaining 30% for testing.
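A sketch of this split, assuming the images have already been preprocessed; the stratification and the fixed random seed are our assumptions, not stated in the paper:

```python
from sklearn.model_selection import train_test_split

# X: array of preprocessed images, y: integer labels (1 = tumor, 0 = no tumor)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42)
```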

4.2 Framework settings

The performance of the brain tumor classification framework depends on a combination of generator model parameters and classifier model parameters. We evaluate three distinct framework settings.

4.2.1 Deep CNN model as a stand-alone system (CNN)

The deep convolutional neural network is used as a classification model. The network is trained using the small training dataset (after preprocessing). The network architecture is described in Section 3.4. The hyperparameters of the network are heuristically adjusted. RMSprop is chosen as the optimizer. A high learning rate makes the optimizer overshoot the minimum of the loss, while a small learning rate increases the training time; we choose a learning rate of 0.0001. The loss function is categorical cross-entropy, which measures the closeness of the predicted and actual distributions. The dropout rate is 0.5. Table 1 summarizes the classifier parameters.
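Under these settings, compiling and training the stand-alone CNN could look as follows. The optimizer, learning rate, loss, and dropout follow the text; the epoch count and batch size are assumptions, and `classifier` refers to the model sketched in Section 3.4:

```python
from tensorflow import keras

# one-hot encode the binary labels for the categorical cross-entropy loss
y_train_oh = keras.utils.to_categorical(y_train, num_classes=2)
y_test_oh = keras.utils.to_categorical(y_test, num_classes=2)

classifier.compile(
    optimizer=keras.optimizers.RMSprop(learning_rate=1e-4),  # lr = 0.0001
    loss='categorical_crossentropy',
    metrics=['accuracy'])

history = classifier.fit(
    X_train, y_train_oh,
    validation_data=(X_test, y_test_oh),
    epochs=50, batch_size=32)  # assumed values, not stated in the paper
```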

Table 1 Generator and classifier network parameters

4.2.2 Transfer learning

A pre-trained, modified VGG network, followed by a dense layer with 256 neurons and an output layer with two neurons, is trained using the small training dataset (after preprocessing). RMSprop is used as the optimizer with a learning rate of 0.0004. The loss function is categorical cross-entropy. The dropout rate is chosen to be 0.5.
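A sketch of this baseline, assuming a frozen VGG16 convolutional base from Keras; the paper does not state which layers are frozen or how the grayscale inputs are fed to the three-channel network, so those details are assumptions:

```python
from tensorflow import keras
from tensorflow.keras import layers

# VGG16 expects 3-channel inputs, so the grayscale MRI images are assumed
# to be replicated across three channels before training.
base = keras.applications.VGG16(
    weights='imagenet', include_top=False, input_shape=(256, 256, 3))
base.trainable = False  # assumed: the pre-trained weights are kept frozen

tl_model = keras.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation='relu'),  # dense layer with 256 neurons
    layers.Dropout(0.5),
    layers.Dense(2, activation='softmax'),  # output layer with two neurons
])
tl_model.compile(
    optimizer=keras.optimizers.RMSprop(learning_rate=4e-4),  # lr = 0.0004
    loss='categorical_crossentropy',
    metrics=['accuracy'])
```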

4.2.3 Proposed framework

Our proposed framework uses two networks: the generator network and the classifier. The preprocessed images are used to train the generator network described in Section 3.3. Figure 4 shows a sample of images generated by the generative network. At the start of training, the network was not able to generate clear brain MRI images. As the training time (i.e. the number of training iterations) increases, the network generates clearer images that eventually reflect the typical patterns of the brain MRI images. We used the generator network to convert the original small training dataset into a larger one with 1000 samples (500 for each class). The newly generated large dataset is then used for training the classifier network for detection. The classifier network is described in Section 3.4. We tried different architectures for the generator and the classifier networks; Table 1 contains the generator and classifier parameters that performed best in terms of detection accuracy.

Fig. 4 Synthetic brain MRI images. Initially, at iteration 100 of training the generator network, the generator produces vague brain images. As the number of training iterations increases, we get clearer brain images that reflect the typical patterns in the brain MRI images

4.3 Performance metrics

There are several evaluation metrics to assess a classifier; among them are the accuracy, the precision, the recall, and the F1-score. Accuracy is the proportion of correct predictions made by the classifier. It reflects the overall performance of the classifier. The accuracy is defined as,

$$ Accuracy = \frac{TP + TN}{TN + FP + FN + TP} $$
(4)

Precision, on the other hand, is the ratio of correctly predicted positive observations to the total predicted positive observations. High precision corresponds to a low false positive rate (FPR):

$$ Precision = \frac{TP}{TP + FP} $$
(5)

Recall in this context is also referred to as the true positive rate:

$$ Recall = \frac {TP}{TP + FN} $$
(6)

The F1-score is the harmonic mean of precision and recall. It is used as a statistical measure to rate the performance of the classifier. Therefore, this score takes both false positives and false negatives into account (Fig. 5):

$$ F_{1}=\frac{2*(Recall * Precision)}{(Recall + Precision)} $$
(7)
Fig. 5 Normalized confusion matrix for the different approaches
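Equations (4)-(7) can be computed directly from the test-set predictions; a sketch using scikit-learn, assuming integer class labels as in the split above:

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score, confusion_matrix)

y_pred = classifier.predict(X_test).argmax(axis=1)  # predicted class labels

print('Accuracy :', accuracy_score(y_test, y_pred))
print('Precision:', precision_score(y_test, y_pred))
print('Recall   :', recall_score(y_test, y_pred))
print('F1-score :', f1_score(y_test, y_pred))
# normalized confusion matrix, as shown in Fig. 5
print(confusion_matrix(y_test, y_pred, normalize='true'))
```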

4.4 Model accuracy

Figure 6 shows the loss and detection accuracy on the training and testing sets for the classifier network with an increasing number of training epochs. The figure shows that the detection accuracy for training and testing increases together, and that the training and testing losses decrease together. This indicates that the classifier network does not overfit the training data, mainly because the newly generated variational samples help the classifier generalize.

Fig. 6 Loss and accuracy

4.5 Effect of number of generated samples

Figure 7 shows the effect of the number of additional generated training samples on the system accuracy. The figure shows that, as expected, increasing the number of training samples increases the system accuracy due to the improved quality of the learned model. Notably, the system performance saturates after about 2000 samples.

Fig. 7 Effect of the number of generated samples on the accuracy

4.6 Comparison with other systems

We now compare our proposed framework with the CNN classifier model and the transfer learning model in terms of both accuracy and training time.

4.6.1 Accuracy comparison

We compare the proposed framework with two other approaches: training the classifier network directly on the original training data (CNN) and training the pre-trained VGG network directly on the original training data (transfer learning). Figure 5 shows the normalized confusion matrix for the different approaches.

Table 2 summarizes the performance metrics (accuracy, precision, recall, and F1-score) for the different approaches. The results show that the different approaches classify the positive samples (i.e. the samples that have a tumor) as well as the negative samples. The transfer learning technique is better than the CNN classifier as it relies on a large number of pre-trained weights. Finally, the table shows that our proposed framework detects tumors better than the other approaches. This is because of the ability of the generative model to augment the CNN with newly generated samples, which highlights the promise of the proposed framework as an accurate, low-overhead brain tumor detection system.

Table 2 Summary of performance metrics for the different classifiers

4.6.2 Time comparison

Figure 8 shows the training time for the three models. The CNN classifier model is the fastest to train as it contains few layers. Unlike the CNN classifier, the transfer learning classifier contains a relatively large number of layers to train. Finally, as our proposed framework trains the generative model before training the CNN classifier, it is the slowest. Note that the training is performed in an offline stage; hence, it does not affect the models' online running time, where all three models are fast.

Fig. 8 Training time for the different techniques

5 Conclusion

A novel generalized framework for brain tumor detection and classification has been introduced in this paper. The proposed framework uses two different deep models for two different tasks. The first is a convolutional variational generative model that converts a small class-unbalanced dataset into a large balanced one. The second is the classifier, a convolutional model used to detect tumors in brain MRI images. The proposed framework achieved an accuracy, precision, recall, and F1-score of 96.88% each, outperforming other recent systems in the literature. This highlights the promise of our framework as an accurate brain tumor detection system.