Abstract
Medical Resonance Imaging (MRI) is one of the preferred imaging methods for brain tumor diagnosis and getting detailed information on tumor type, location, size, identification, and detection. Segmentation divides an image into multiple segments and describes the separation of the suspicious region from pre-processed MRI images to make the simpler image that is more meaningful and easier to examine. There are many segmentation methods, embedded with detection devices, and the response of each method is different. The study article focuses on comparing the performance of several image segmentation algorithms for brain tumor diagnosis, such as Otsu’s, watershed, level set, K-means, HAAR Discrete Wavelet Transform (DWT), and Convolutional Neural Network (CNN). All of the techniques are simulated in MATLAB using online images from the Brain Tumor Image Segmentation Benchmark (BRATS) dataset-2018. The performance of these methods is analyzed based on response time and measures such as recall, precision, F-measures, and accuracy. The measured accuracy of Otsu’s, watershed, level set, K-means, DWT, and CNN methods is 71.42%, 78.26%, 80.45%, 84.34%, 86.95%, and 91.39 respectively. The response time of CNN is 2.519 s in the MATLAB simulation environment for the designed algorithm. The novelty of the work is that CNN has been proven the best algorithm in comparison to all other methods for brain tumor image segmentation. The simulated and estimated parameters provide the direction to researchers to choose the specific algorithm for embedded hardware solutions and develop the optimal machine-learning models, as the industries are looking for the optimal solutions of CNN and deep learning-based hardware models for the brain tumor.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The human body consists of different cells having individual special functions. The cells grow normally, old cells die and new cells grow again. Moreover, the cell loses its ability to control its growth, new cells are produced when the human body does not require them, and old cells do not die, as they should. The build-up of additional cells regularly procedures the mass tissue called tumor [2, 75]. The brain is the focal part of the central nervous system, which consists of non-replaceable soft tissues and spongy. The brain is a sensitive organ, which has three major parts (Cerebrum, Cerebellum, and Brainstem). It receives the information from our body, senses assist as a master for different operations inside the body and permit us to manage our environment. Children have brain tumors in the posterior cranial fossa and adults have in anterior parts, two-thirds of the intellectual or cerebral hemispheres. The tumor can distress any portion of the brain. In the beginning phase, tumors do not contain cancer cells. In the second phase, brain tumors [22] take shape as metastatic tumors. A malignant brain tumor contains cancer cells. The unnatural growth, uncontrolled growth, and division of the cells are due to cancer [32]. The uncontrolled cell growth in the brain tissues is called a brain tumor. The brain tumor is one of the most lethal cancers. The tumor originates from the brain tissue cells, called a primary brain tumor. In some cases, cells develop cancerous at any one part of the body and spread into the brain, called a metastatic tumor. Gliomas are also one type of tumor, which is originated from, glial cells. The early-stage diagnosis of gliomas improves the treatment possibilities of the patient.
A cancerous nub may evolve somewhere else in the human body, and these cell discharges from the primary tumor. The discharged cells pass in the blood vessels and the lymphatic system is deployed through the blood circulation accumulated in the brain. MRI [44] is an innovative medical image processing method, used for visualization of the internal structure of the body and produces ultra-resolution images of the body parts. In MRI, internal structures of the body are obtained for diagnosing using a strong magnetic field and radio waves. From these high-resolution images, we can examine the development, location, and segmentation of tumor abnormalities.
According to the study done by American Brain Tumor Association [60, 65], 14,000 people die annually due to brain cancers. In 2021, more than 84,000 people will be treated for primary brain tumors, while more than 700,000 people in the United States now have brain tumors. According to the annual statistics of Central Brain Tumor of the US [72], people have120 types of brain tumors. The statistics are horrible and cases are increasing day by day.
The main motivation for the doctors is to learn about brain tumor diagnosis methods and include new clinical methods to prevent it. The new methods will explore the imaging scans and better track for possible track and watch the tumor growth [1]. The researchers are inspecting biomarkers that will support the diagnosis of the brain tumor to predict whether any particular treatment can be applied to a patient’s prognosis. Radiologists use MRI pictures to examine the patient [15] and learn about the tumor’s location, type and size to plan treatment and provide an accurate diagnosis. It is quite difficult for a person who has survived a brain tumor to remain optimistic. Humans can experience cognitive and physical changes in themselves, making them unable to live their lives as they live before the tumor. Immunotherapy [19] is the biological therapy used to enhance the human body for the natural defense against the tumor. It follows the materials and vaccines to restore the immune system [79] for dendritic cells. The tumor detection and segmentation [6] help with oncolytic virus therapy, blood-brain disruptions, gene therapy for the replacement of tumor growth, and combining new drugs to enhance the resistance power of tumor cells. The tumor MRI segmentation study helps the practitioners to know the better ways to reduce the side effects and reduce its symptoms for better treatment and quality of life [13].
The segmentation and classification [18, 31] of MRI images desire to be very proficient in the suitable analysis of brain tumors. Perfect segmentation of brain tumors from MRI images is a very challenging and crucial task in treatment planning and diagnosis that comprises the extraction of the images from one or more regions and creating the area of interest. Different algorithms have been advanced in brain tumor detection such as region-based methods, threshold-based techniques, classification approaches, deformable methods, and deep learning models [62]. The deformable models are among the best popular approaches used in MRI images for brain tumor segmentation. Extensive work has been carried out in the direction to explore the capabilities of the MRI image segmentation that can provide some meaningful information from medical data. Research on brain tumors has been carried out to investigate different types of tumors, and the shape of cerebrospinal fluid that enhances the complexity of tumor detection. Regardless of the supremacy of the segmentation technique of MRI images employed, the quality of segmentation methods depends significantly on the contrast of medical images, incomplete boundaries, and the extent of noise. Deep learning plays a very important role in the detection of brain tumors. There are several algorithms for image segmentation such as thresholding method, region-based segmentation, fuzzy clustering, k-means clustering, neural networks, level set method, Otsu’s method, neutral networks, and watershed algorithm, etc. The CNN [3, 81] has been widely used for tumor classification and detection. The CNN has a deep-layered architecture [56] as well as model scaling, which allows the model size to be raised for improved accuracy. It is CNN’s most significant advantage over other segmentation methods and the realization of the specific hardware for artificial intelligence systems. The problem statement of the work is to analyze the performance of the different image segmentation techniques for brain tumor detection based on different parameters such as response time, accuracy, recall, and precision. The main contribution of the paper is to study the different image segmentation techniques including CNN and apply them to the BRATS dataset to estimate the simulation time and performance. This study will help the researchers to identify the best approach for tumor segmentation and pre-estimate the simulation and performance parameters.
2 Related work
Okaili et al. [4] discussed the strategy to differentiate intra-axial brain masses and determine the accuracy of MRI images. The institutional review board approved the strategy for conventional MRI, perfusion MRI, proton MR spectroscopy, diffusion-weighted MRI, and classification of intra-axial masses as low-grade primary neoplasms, metastatic neoplasms, and high-grade primary neoplasms, with a Bayesian statistical approach used to determine the system accuracy. Anila et al. [9] used the concept of multi-resolution and noise removal to detect the abnormal behavior of the brain. The multi-resolution was based on curvelet and countrelet-based approximation. The counterlet method has proved better results to recognize the abnormality in the brain. Balafar et al. [10] discussed the different techniques for brain tumor detection and segmentation for MRI images. The accuracy of the tumor detection depends on the segmentation methods. They discussed Markov’s random model, watershed algorithm, anatomical deviations, Atlas-based segmentation, multi-region-based method, self-organizing maps, and learning vector quantization methods. It is suggested that brain segmentation can be improved if the atlas-based method and parallelization are combined. Chaudhary et al. [14] studied the concept of image segmentation using the clustering technique, and a support vector machine (SVM) is used for detecting brain tumors. The classifiers were able to detect the seven features, and SVM proved 94.6% accuracy. Gholipour et al. [20] addressed that automatic functional localization and functional brain imaging are very important for temporal and higher resolutions of brain tumors. The functional maps are helpful for the identification of dementia that can provide many differences between healthy and tumor patients. It will lead to valid interpretations and conclusions. Ratan et al. [66] reviewed different methods for tumor segmentation from MRI data, as the process is time-consuming identified by medical experts. They pointed out different methods such as intensity, texture, region-based, clustering, classification, fuzzy, neural network, knowledge, edge, probabilistic, fusion, SVM, level set methods, watershed, Atlas guided, morphology, Fuzzy C means, and k- means clustering-based algorithms. They suggested that the combined approach of thresholding with SVM or Basian could provide the best results for brain tumor detection. Ratan et al. [68] proposed an algorithm to detect tumors from MRI images through symmetry analysis and calculate the area of the tumor. They use median filters, thresholding & morphological operation to detect a tumor. Li et al. [45] used a unified level set technique for semi-automatic liver tumor segmentation to integrate prior information on the tumor, image gradient, and regional competition. It was applied directly on contrast-enhanced computed tomography (CT) images, and unsupervised fuzzy clustering was used for the probabilistic distribution of liver tumors. Thapaliya et al. [74] used the level set method for brain tumor segmentation using an automatic selection of local statistics. The threshold values of images were rationalized, and adjusted automatically for all MR images. Goel et al. [23] discussed the watershed algorithm and level set method for MRI image segmentation of brain tumor regions detection. The comparative study is carried out to estimate the performance and response time in MATLAB. The level set method has provided a good response in comparison to Otsu’s method. Mustaqeem et al. [54] discussed that brain tumor detection is possible using watershed segmentation, threshold segmentation, and morphological operators. They successfully simulated the samples of human brains using scanned MRI images. Patil et al. [63] used a watershed algorithm and morphological operators for the detection of tumors from brain MRI images. The noise removal functions, regions segmentation, and morphological operators were used for the scanned MRI images. Remya et al. [67] used the Fuzzy- C means method for noise filtering of MRI images. The method exactly researches the exact identification of the cerebrum tumor. Otsu’s method was applied for image segmentation. The authors claimed that their approach of fuzzy-C means provided good results even the patient thinks about the tumor. Sain et al. [69] explained the structure of the human brain and proposed an algorithm to detect the tumor in the brain using Otsu’s method for segmentation. Amin et al. [8] used DWT for image fusion that provided complete information about the brain tumor regions of MRI. The partial diffusion filter is used to eliminate the noise and the global thresholding technique is applied for tumor segmentation. Khode et al. [38] used DWT for brain tumor detection. MRI is a very important method in many cases and is capable to provide a detailed image analysis of the human body. The MRI images were used for the test, and the tumor was segmented from these images. Kumar et al. [39] suggested the use of DWT for image segmentation and determined the vertical, inclined, and circular regions. They use HAAR DWT in which the image was distributed into four sub-bands “LL”, “LH”, “HL”, and “HH” with its coefficients either + 1 or -1. Shree et al. [71] addressed that the identification, detection, and segmentation of the exact position of a brain tumor in MRI images is a very time-consuming and tedious process. The system needs fast transformation and computations as MRI images have many modalities and noise. They used DWT for image segmentation, with morphological filters to remove the noise, and system performance was improved. Singh et al. [73] suggested the use of DWT and discrete cosine transform (DCT) for image processing applications. They used the DCT and DWT for image compression and watermarking. The processes involve image segmentation operations and the best results have been obtained from DWT. Viji et al. [76] suggested that watershed segmentation is used for the automatic detection of brain tumors and computer-aided design (CAD) tools can be used for tumor identification in the manual segmentation process. This helps for 2D and 3D tumor image visualization for accessing tumor and surgical planning. Liao et al. [49] used medical joint photographic experts group (JPEG) images for the adaptive data hiding applications to preserve the difference between the DCT coefficient values and the adjacent DCT blocks at the same position by embedding the inter-block changes for patient’s security. The concept of reversal hiding for the image data encryption and decryption [46] was used to estimate the complexity of each block with a minimum bit error rate. The discrete Fourier transform DFT [48] and compressive sensing were used to separate the data hiding for image encryption and decryption for security. Joseph et al. [35] explained tumor segmentation using K-means clustering and morphological operators were used to avoid misclustered regions. Zhang et al. [80] used the adaptive Wiener filtering method for denoising. Different morphological operators and functions were used for eliminating the non-brain tissue. The K-means + + clustering method was used with the combination of Fuzzy- C- means a method to segment images. The clustering technique not only advanced the algorithm’s stability but also decreased the sensitivity of clustering parameters. Moeskops et al. [52, 53] used CNN for the automatic segmentation of MR brain images into some tissue classes. The segmentation technique was applied to five different data sets with different age groups and accurate results were obtained. Olszewska et al. [58, 59] discussed the performance of the autonomous intelligent vision system that was evaluated based on false-positive rate, false-negative rate, accuracy, and precision. Pereira et al. [64] used CNN for image segmentation of MRI images of the brain with a small kernel (3 × 3). The small kernel helps in the design of deeper CNN architecture. The results were verified on the BRATS-2013 and BRATS 2015 databases. Seetha et al. [70] proposed automatic brain tumor detection with the help of CNN’s deeper architecture and classification. The simulation is done using the Python programming language. The CNN architecture consists of three layers with small kernels. The CNN achieved 97.5% accuracy on the (BRATS) 2015 testing dataset. Wang et al. [77] used the multiscale CNN to segment the dermoscopic image. The image was first preprocessed using contrast enhancement and segmented to enhance the dataset using the segmentation loss function. Kaur et al. [36] applied different machine learning methods such as random forest, SVM, decision trees, and K-NN support to evaluate the performance of the real-time data for remote halt monitoring in which random forest predicted 96.42% accuracy for breast cancer and other diseases, which was high in compassion to other algorithms. Dargan et al. [16, 17] presented a compressive review on the need for deep learning for medical images and further use of machine learning to evaluate the performance. Deep learning introduces new data-processing techniques and infrastructures, allowing computers to learn different representations and objects. The framework for the biometric recognition system was discussed that comprises the serval process and sequences for the identification of behavioral modalities. Kumar et al. [41] predicted the COVID-19 cases and deaths in Italy, Spain, Japan, India, US, and UK. The model was followed using auto-regressive integrated moving average (ARIMA), and long short-term memory (LSTM) methods. Ghosh et al. [21] applied the machine learning algorithms such as KNN, support vector machine, and multi-layer perceptron (MLP) for Deoxyribonucleic acid (DNA) microarray data. Bansal et al. [11] used machine-learning algorithms for object recognition. The features of the objects were extracted using the different classifies such as decision tree, KNN, and random forest with 80.8%, 74.8%, and 85.9% accuracy respectively. Kumar et al. [12] applied machine learning for face detection and face recognition from an arbitrary image to get the human insight and knowledge to recognize and study the face data. Gupta et al. [25] used the scale-invariant feature transform (SIFT) and speed up robust feature (SURF) with random forest and decision tree machine learning to study the different features for the face [42] and achieved 99.7% accuracy.
3 Image segmentation methods
There are various techniques for image segmentation. The description of some techniques is given.
3.1 Otsu’s method
Otsu’s technique [61] is an image segmentation method based on image thresholding. It is the non-linear operation, used to convert an image from grayscale into a binary. In this technique, two levels are assigned to this operation, foreground pixel and background pixel. It is also called a bimodal histogram. It is based on statistical approaches [47] used to configure the threshold range. This technique is based on the minimization of all weighted sums of one-class variances of foreground and background pixels to generate a prime threshold. The prime threshold is computed by dividing two classes so that the combined spread is minimal. Deriving the same concept and maximizing the class variance is beneficial. The statistical approach of stationary objects is used to identify the items, rather than spatial coherence. The approach has been locally adapted in nature, with trails that indirectly equalize brightness so that the presence of objects can cause differences due to changes in bimodal brightness behavior. The threshold values are searched in Otsu’s approach [29], which reduces intra-class variation. The weighted total of variances of the two distinct class variances determines it. The intraclass variance is defined as the variance of a specific class. Equation (1) gives the expression for ‘within-class variance’ in weighted probability
- \({w}_{1}\) :
-
probability related to class-1
- \({w}_{2}\) :
-
probability related to class-2
- x:
-
threshold that divides the probabilities of two classes
- S1:
-
variance related to class-1
- S2:
-
variance related to class-2
The equation of class probabilities, following the L bins of the histogram, is presented as
The equation of the class means is presented as
\({\mu }_{1}\left(x\right)\) and \({\mu }_{2}\left(x\right)\) presents the means relating to the class-1 and class-2 respectively and \({\mu }_{X}\) is total mean. The equation of the total mean is presented as
The summation of the class probabilities is equal to 1.
The sum of the within-class or weighted variances, as well as the variation across classes, determines the total variance. For the definite threshold, it is calculated by adding weighted squared distances between the total mean and class-specific averages. The individual class variance is calculated using the equation [74, 79].
The expression for the total variance is presented as
- \({S}_{w}^{2}\left(x\right)\) :
-
Variance within the class
- \({S}_{b}^{2}\left(x\right)\) :
-
Variance between the class
The total variance is unaffected by the variables ‘x’ and ‘constant.‘ The result is a change in which the aids of both phrases are simply moved back and forth. According to Otsu’s method, minimizing inter-class variance is the same as maximizing inter-class variance. The important point is that recursively calculating quantities in \({S}_{b}^{2}\left(x\right)\) is likely since it violates the boundaries of ‘x’.
-
Calculate the histogram and probability for each level of intensity. The levels are threshold values that are obtained by maximizing the inter-class variance of MRI Images [24, 55].
-
Set up original \({w}_{j}\left(x\right)\) and \({\mu }_{j}\left(x\right)\)
-
Forward with all possible numbers of x = 1, 2 … to achieve maximum intensity. For 8-bit MRI the values are in the range [0: 255], and the levels are defined in this range.
-
Update the values of \({w}_{j}\left(x\right)\) and \({\mu }_{j}\left(x\right)\)
-
Calculate the value of \({S}_{b}^{2}\left(x\right)\)
-
The probable threshold is attainable corresponding to the maximum calculated value of \({S}_{b}^{2}\left(x\right)\).
3.2 Watershed algorithm
The watershed algorithm [27, 37] is a transformation used in a grayscale image. The name indicates the word “Metaphorically” about a geological watershed or drainage divide, which means it, is used to separate the adjacent drainage basins. The algorithm treats the image as topographic maps and the brightness of each pixel denotes the height used to determine the lines adjacent to the top of the ridges.
The watershed is said as the bridge that separates the areas drained using different river systems. The catchment basin is the specific geographical region, which drains into a recursion or river. The algorithm is very much applicable for analyzing the behavior of biological tissues, studying different galaxies, and finding new things in semiconductor technology. The computer analysis is used to understand which pixels are related to each object and make a decision based on it. The process is the repetition of objects from the background. These objects can be anything like DNA microarray elements, printed pages, blood cells, brain tissues, semiconductor dots, etc. The watershed is obtained using distance transform in which the distance from each pixel is calculated for its nearest non-zero valued pixel. The computed image is one catchment basin spanning a complete image and one coherent basin is applicable for each object. It is required to negate the distance transform and get knowledge about two bright regions of catchment basins. Each catchment basin of each object is called a watershed function that contains a positive integer value for the position of each catchment basin. Then zero values elements are used at the location of watershed lines to separate the objects in the original image.
3.3 Level set method
The method is a conceptual framework based on level sets used for numerical analysis of shapes and surfaces. It is based on the non-parametric model estimation with minimum contour energy to determine minimum distance. The level set technique is relied on embedding two central images. First, inserting the interface at zero levels, which is set about the standard function, and the second interface is about setting the highest dimension level set function. The method is used to maintain the surface governed by the level set equation, obtained by the iterative operation and updating the values of ‘φ’ in each time interval. The equation of the level set method [7] is given as
\(\mathrm\varphi\) denotes the level set function, and \(\nabla\mathrm\varphi\) denotes the change in the level set function. The contour [57, 59] deforms based on speed ‘v’, which is based on contour curvature and image gradient features. ‘v’ is the velocity term and decides the level set evaluation in the different levels of the image. For image segmentation, ‘v’ depends on the data and related curvature function. The equation is given in the form [7]
Here, X (I) = data function works on target function and
Function, \(\frac{\nabla\mathrm\varphi}{\left|\nabla\mathrm\varphi\right|}\) updates the level set and keeps it smooth.
3.4 K -Means clustering
The K-means clustering algorithm [34, 55] is an unsupervised algorithm used to segment the interest regions from the background of an image. The given image is partitioned into clusters and each cluster has its centroid [50]. After that, get the sum of all squared distances between all points and the cluster center. For a given set of operations (x1, x2….xn), the objective function [5] is presented as,
Here,
- Ca:
-
presents the centroid for cluster ‘a’
- \({\text{x}}_{\text{b}}^{\left(\text{a}\right)}\) :
-
case ‘a’
- n:
-
denotes number of cases
- k:
-
denotes number of clusters
- Q:
-
objective function
- \(\left\|\mathrm x_{\mathrm b}^{(\mathrm a)}-{\mathrm C}_a\right\|^2\) :
-
denotes the distance function
3.5 Discrete wavelet transform (DWT)
The DWT [51] is based on the 2D HAAR decomposition method in which the image is segmented row-wise and column-wise. The multi-resolution decomposition method is used in 2D images to segment the image in the ‘L’ sub-band and ‘H’ sub-band. The image is divided into four sub-bands: LL, LH, HL, and HH which contain extensive information about the image [28]. The sub-bands in which the original image is processed through a combinational filter such as a low pass and a high pass filter are shown in detail in Fig. 1a & b. Figure 2 displays a HAAR DWT example in which the picture (64 × 64) is separated into four bands: LL(32 × 32), LH(32 × 32), HL(32 × 32), and HH(32 × 32). The LL band is further divided into four bands: LLLL (16 × 16), LLLH (16 × 16), LLHL (16 × 16), and LLHH (16 × 16). For better refining, the specific region of the decomposed image, morphological operations such as erosion, dilation, opening, and closing can be employed, and the coefficients of the wavelet transform can be obtained using image arithmetic operations.
The DWT decomposes the input brain MRI images into a collection of sub-images with varying resolutions and frequency regions. The input image is subdivided into multiple frequency bands, which is done via sub-band coding with the help of the filter bank. A filter bank is a set of filters in which all have the same input or output to share a common input that constitutes an examination unit, while they share a common output from a fusion bank. The filter bank converts the image in the frequency domain. The first step in seamless reconstruction is to assess the input image low filter bank and the tumor database-related similar perception. The filter channel bank is divided into two portions, the first of which is the analysis section and the second of which is the synthesis section. The analysis stage decomposes the input image into a series of sub-band components, which are then used to rebuild the original signal from its modules in the synthesis section. Subband analysis and synthesis filters are designed in such a way that the filter behavior is alias-free and meet the perfect signal reconstruction property. The elimination of aliasing, phase distortion, and amplitude in real-time should result in a faultless reconstruction of these filter banks, making them more suitable for sub-band coding and multi-resolution image decomposition. The filter bank separates the MRI image into two equal frequency bands during operation. Low-pass filters H0[z] and high-pass filters H1[z] are the filter banks in the DWT. These signal outputs at levels 1 and 2 after filtering are provided. At the low pass, filter level processing (1),
At the high pass filter level processing (2),
The sampling frequency of the signal is too high after this filtering procedure [33]. As a result, the down-sampling technique rejects half of the samples. After that, Eqs. (16) and (17) are used to calculate the Z-transform. At level (3) processing, the output is given as
At level (4) processing, the output is given as
Based on two filtered and decimated signals, the synthesis filter bank restructures the signal. Expansion or interpolation is a synthesis approach that involves multiplying the signals in each branch by two. By inserting zeros between consecutive samples, this interpolation is achieved. The Z-transform of the signal at level 5 and level 6 is given in Eqs. (18) and (19), respectively, after interpolation. At level (5) processing, the output is given as
At level (6) processing, the output is given as
The wavelet is used to extract the coefficients from the brain MRI, and localize the frequency details of the signal function for the classification. The MRI image is decomposed into spatial low-frequency components that are taken from the LL sub-bands. The HL frequency components are having high performance in comparison to LL. The high-frequency components LH, HL, and HH present the horizontal, vertical, and diagonal components in the first and second levels of decomposition. The LL and HL bands are combined for MRI image analysis. The quantitative information of the image is extracted such as color feature, contrast, and shape. The DWT gathers statistical features from the brain tumor image and feeds them into probabilistic neural network classifiers [71] as inputs for training and testing, as well as classifying the image as normal or abnormal.
3.6 Convolutional neural network (CNN)
The convolutional neural network [31, 32] is one type of deep neural network, in which the information passes through the right side of the model and no feedback is present in the model. It is a feed-forward artificial neural network, processed through different layers [30]. CNN has been widely used by many researchers for different applications in computer vision. The CNN model architecture used for brain MRI images is shown in Fig. 3. The CNN architecture consists of three layers with two sequential convolutional layers, pooling layers, and a third fully connected layer for classification. The 3D convolution operation is performed using the convolution layer, directly used to map the features of the image with the convolution kernel [62, 78]. The pooling layer, which is utilized for subsampling and is applied to a pooling function over a spatial window without the pooling kernel corresponding to each output feature map [81], is the next step. The pooling layer output is coupled to the third layer, which gives the feature vector. The original MRI images (64 × 64 × 1) processed by two-convolution kennels (32 × 32 × 1) and (16 × 16 × 1), embedded with (2 × 2 × 1) pooling kernel for subsampling. Table 1 presents the size of the different MRI images processed in the layered architecture of CNN.
The MRI images are processed using the LeNet architecture, which consists of convolutional and pooling layers followed by a fully connected layer based on Softmax activation. In deep learning of MRI images, LeNet CNNs use a shortcut connection layer called max pooling between convolutional layers to minimize the spatial size of images, preventing overfitting and allowing CNNs to train more successfully.
4 Methodology
The methodology of brain tumor detection is depicted in Fig. 4. The steps involved in the methodology are brain MRI inputs, preprocessing, image segmentation, and analysis. The MRI is the computation method to get the visual information of the brain-related medical data. MRI is the best method for monitoring, surgery, and diagnosis. The radio waves are used in MRI and detailed information is archived using a magnetic field to get the data about brain tissues and brain stem. The testing in MRI is different from CT scan, as it does not contain radiation. The preliminary section is the collection and acquisition of images from the BRATS data set (https://www.med.upenn.edu/sbia/brats2018/data.html). Pre-processing is the essential step and applied before segmentation that includes the required operations to make diagnostic more obvious from input MRI images. In the pre-processing stage of images, the visual quality is improved and filtering operation is carried out to eliminate the different noise levels. The presence of such disruptions makes abnormal/normal tissue perception and hence exact interpretations are more challenging. Segmentation divides an image into multiple segments and describes the separation of the suspicious region from pre-processed MRI images to make the simpler image that is more meaningful and easier to examine. In this process, different segmentation methods such as watershed transformation, thresholding, level set method, K-means clustering, Otsu’s method, DWT and CNN are used. Thresholding is also the part of segmentation used to create binary images. It uses a fixed constant value to compare the pixel of an image. If a pixel value is less than the threshold value, then it replaces that pixel value with the black pixel. If a pixel value is greater than the threshold value, then it replaces the pixel value with the white pixel.
MRI-BRAST data is divided into two sections: training and testing. The LeNet CNN architecture is loaded and compiled. The network is being trained and serialized network weights can be saved to disc and reused if desired short of having to retrain the network. The extracted output from the fully connected layer shows that the implementation is working properly. The CNN processes the MRI images through convolutional layers and pooling processes that are typically structured so that the spatial resolution of the representations decreases as the number of channels increases. The LeNet CNNs, process the representations stored by the convolutional blocks before producing output using one or more fully connected layers. In the DWT, the MRI images are processed in low and high filter banks using LL, LH, HL, and HH combinations. To simplify the complexity and increase the performance, DWT-based brain tumor region growth segmentation is used. The accuracy and performance parameters in the detection of tumor site in brain MRI images is trained and tested using a probabilistic neural network classifier.
Image segmentation focuses on regions of interest [13] and keeps track of all subsequent modules for analysis. Although segmentation is a critical stage, it is not always necessary when direct classification takes precedence over other goals. The feature extraction process follows image segmentation. The image features are extracted from the initial steps to measure and derive non-redundant features. It gives information about image dimensions and measures full data. The meaning of full tissues and regions are exacted from the image analysis. The morphological operations are used for the same purpose. It is called the image analysis process in which the actual size and shape of the tumor are known to medical practitioners. Finally, the generated features with the smallest dimension are utilized to train a classifier to distinguish between normal and diseased brain tissues. After training, the system performance is assessed using performance metrics.
5 Results & discussions
The level set method works on iterations only to get the tumor results. The watershed algorithm is a pre-segmented and faster method. The simulation is carried out in MATLAB 18.0 using an image-processing toolbox. We have developed our code in MATLAB for all the algorithms. The simulation is performed using 2D HAAR DWT, K-means, Otsu’s, watershed, and level set and CNN methods for 10 images available online in the BRATS 2018 dataset for the brain tumor. Table 2 presents the MATLAB response time of different methods for brain tumor segmentation. The comparative graph of all the techniques is shown in Fig. 5, determined by the MATLAB simulation tool.
The response time is important to model the system for any specific hardware. The response time of the CNN, DWT, K-means, level-set, watershed and Otsu’s methods is 2.519, 2.675, 4.571, 7.290, 9.219, and 12.500 s respectively. The significance of the MATLAB simulation time is that the particular algorithm can predict the estimated delay when the algorithm will be realized in the MATLAB-HDL simulator or Simulink for tumor detection system implementation. The MATLAB simulation predicts the time complexity for the algorithm and estimates the designer to choose the machine learning model [26, 40]. The performance of the algorithms is evaluated based on the performance measures such as precision, recall, F-measure, and accuracy [43]. The computation is done based on standard equations. Table 3 lists the computed values of all the algorithms against each parameter. A confusion matrix is formed that depicted how well the trained model predicted each target class relative to the experimental confusion matrix count.
Where,
- TP:
-
True Positives: The target is positive, as anticipated by the model.
- TN:
-
True Negatives: Occurs when the target is positive while the model predicts it to be negative.
- FP:
-
False Positives: The target is negative, yet the model predicts a positive outcome.
- FN:
-
False Negatives: The target is negative, as anticipated by the model.
Figure 6 presents the simulation results of the watershed algorithm, level set method, and intermediate steps with morphological operations, and related operators. Figure 7 shows the pictorial view of results using CNN and other methods. The original tumor image is converted from RGB to gray and histogram equalization is applied to adjust the contrast of the image by changing the intensity values [33]. After that, the noise filtering is used to remove the noise using a Gaussian filter, and the Watershed algorithm is applied. After that, the different mathematical operators and morphological operations are used such as opening, opening by reconstruction, opening-closing, open-close by reconstruction, region maxima, and superimposed maxima to determine the background markers and threshold values. After processing through the modified maximum values, the tumor is detected. In the same way, the level set algorithm is applied in which the original tumor image is converted from RGB to gray and histogram equalization is applied to adjust the contrast of the image by changing the intensity values, then the resultant image is processed for the contour operation for the minimum and maximum contour values. The morphological operations are applied to get the detected tumor.
6 Conclusion
In the medical field, MRI image segmentation is very tedious work and time-consuming. The paper addressed brain MRI tumor detection and image segmentation using Otsu’s method, level set, watershed, K-means, DWT, and CNN. The tumor detection using all the methods is done successfully in the MATLAB-2018 simulation environment. The response time for Otsu’s method, level set, watershed, K-means, DWT, and CNN is 12.50 s, 9.219 s, 7.280 s, 4.571 s, 2.675 s, and 2.519 s respectively. Otsu’s method consumes more time in comparison to all techniques as it processes the image one time and works on computations of unnecessary regions. On the other hand, the image is pre-segmented in the case of the level set method. The K-means divides the images into clusters and works on average values. The watershed algorithm follows morphological operators, region filtering, and the exact position of the outline. The DWT method divides the image into low and high-frequency bands with four sub-bands as LL, LH, HL, and HH, make easy to decompose a large image. The CNN architecture processed the image using a layered architecture. The algorithms are analyzed based on precision, recall, F-measure, and accuracy as performance measures.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy of Otsu’s method are 0.681, 0.892, 0.773, and 0.714 respectively.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy of the watershed algorithm are 0.782, 0.827, 0.804, and 0.782 respectively.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy of the level set method are 0.775, 0.863, 0.817, and 0.804 respectively.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy of the K-means algorithm are 0.931, 0.764, 0.839, and 0.843 respectively.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy of DWT algorithm are 0.885, 0.867, 0.876, and 0.869 respectively.
-
The computed values of performance measures: recall, precision, F-measure, and accuracy in CNN are 0.869, 0.952, 0.909, and 0.913 respectively.
In comparison to other algorithms for brain tumor application, CNN delivered the best performance based on performance measures and response time. We intend to use CNN with more layers in the future, as well as construct a machine-learning model to offer the same functionality.
References
Abd-Ellah MK, Awad AI, Khalaf AA, Hamed HF (2019) A review on brain tumor diagnosis from MRI images: practical implications, key achievements, and lessons learned. Magn Reson Imaging 61:300–318
Abiwinanda N, Hanif M, Hesaputra ST, Handayani A, Mengko TR (2019) Brain tumor classification using convolutional neural network. In: World congress on medical physics and biomedical engineering 2018, pp 183–189
Al-Galal SAY, Alshaikhli IFT, Abdulrazzaq MM (2021) MRI brain tumor medical images analysis using deep learning techniques: a systematic review. Health Technol 11(2):267–282
Al-Okaili RN, Krejza J, Woo JH, Wolf RL, O Rourke DM, Judy KD, Melhem ER (2007) Intraaxial brain masses: MR imaging-based diagnostic strategy-initial experience. Radiology 243(2):539–550
Alsabti K, Ranka S, Singh V (1997) An efficient k-means clustering algorithm. Electr Eng Comput Sci 43:5–10
Alshayeji M, Al-Buloushi J, Ashkanani A (2021) Enhanced brain tumor classification using an optimized multi-layered convolutional neural network architecture. Multimedia Tools Appl 80(19):28897–28917
Amarapur B (2020) Computer-aided diagnosis applied to MRI images of brain tumor using cognition based modified level set and optimized ANN classifier. Multimedia Tools Appl 79(5):3571–3599
Amin J, Sharif M, Gul N, Yasmin M, Shad SA (2020) Brain tumor classification based on DWT fusion of MRI sequences using convolutional neural network. Pattern Recognit Lett 129:115–122
Anila S, Sivaraju SS, Devarajan N (2017) A new contour let based multiresolution approximation for MRI image noise removal. Natl Acad Sci Lett 40(1):39–41
Balafar MA, Ramli AR, Saripan MI, Mashohor S (2010) Review of brain MRI image segmentation methods. Artif Intell Rev 33(3):261–274
Bansal M, Kumar M, Kumar M, Kumar K (2021) An efficient technique for object recognition using Shi-Tomasi corner detection algorithm. Soft Comput 25(6):4423–4432
Bisht A, Kumar A (2019) DWT chip design and FPGA synthesis for image processing. Int J Recent Technol Eng (IJRTE) 8:1–10
Chahal PK, Pandey S, Goel S (2020) A survey on brain tumor detection techniques for MR images. Multimedia Tools Appl 79(29):21771–21814
Chaudhary A, Bhattacharjee V (2020) An efficient method for brain tumor detection and categorization using MRI images by K-means clustering & DWT. Int J Inform Technol 12(1):141–148
Chen PY, Hsieh HY, Huang CY, Lin CY, Wei KC, Liu HL (2015) Focused ultrasound-induced blood–brain barrier opening to enhance interleukin-12 delivery for brain tumor immunotherapy: a preclinical feasibility study. J Translational Med 13(1):1–12
Dargan S, Kumar M (2020) A comprehensive survey on the biometric recognition systems based on physiological and behavioral modalities. Expert Syst Appl 143:113114
Dargan S, Kumar M, Ayyagari MR, Kumar G (2019) A survey of deep learning and its applications: a new paradigm to machine learning. Arch Comput Methods Eng 27(4):1071–1092
Davis FG, Malmer BS, Aldape K, Barnholtz-Sloan JS, Bondy ML, Brännström T, Kruchko C (2008) Issues of diagnostic review in brain tumor studies: from the Brain tumor epidemiology consortium. Cancer Epidemiol Prev Biomarkers 17(3):484–489
Di Giacomo AM, Valente M, Cerase A, Lofiego MF, Piazzini F, Calabrò L, Maio M (2019) Immunotherapy of brain metastases: breaking a “dogma.” J Exp Clin Cancer Res 38(1):1–10
Gholipour A, Kehtarnavaz N, Briggs R, Devous M, Gopinath K (2007) Brain functional localization: a survey of image registration techniques. IEEE Trans Med Imaging 26(4):427–451
Ghosh KK, Begum S, Sardar A, Adhikary S, Ghosh M, Kumar M, Sarkar R (2021) Theoretical and empirical analysis of filter ranking methods: experimental study on benchmark DNA microarray data. Expert Syst Appl 169:114485
Glover GH (1999) Simple analytic spiral k-space algorithm. Magn Reson Medicine: Official J Int Soc Magn Reson Med 42(2):412–415
Goel A, Chikara D, Srivastava AK, Kumar A (2016) Medical imaging with brain tumor detection and analysis. Int J Comput Sci Inform Secur 14(9):228
Gupta N, Khanna P (2017) A non-invasive and adaptive CAD system to detect brain tumor from T2-weighted MRIs using customized Otsu’s thresholding with prominent features and supervised learning. Sig Process Image Commun 59:18–26
Gupta S, Thakur K, Kumar M (2020) 2D-human face recognition using SIFT and SURF descriptors of face’s feature regions. Visual Comput 37(3):447–456
Gupta N, Jain A, Vaisla KS, Kumar A, Kumar R (2021) Performance analysis of DSDV and OLSR wireless sensor network routing protocols using FPGA hardware and machine learning. Multimedia Tools Appl 80(14):22301–22319
Hasan SK, Ahmad M (2018) Two-step verification of brain tumor segmentation using watershed-matching algorithm. Brain Inf 5(2):8
Hooda A, Kumar A, Goyat MS, Gupta R (2021) Estimation of surface roughness for transparent superhydrophobic coating through image processing and machine learning. Mol Cryst Liq Cryst 726(1):90–104
Huang DY, Wang CH (2009) Optimal multi-level thresholding using a two-stage Otsu optimization approach. Pattern Recognit Lett 30(3):275–284
Hussain S, Anwar SM, Majid M (2018) Segmentation of glioma tumors in brain using deep convolutional neural network. Neurocomputing 282:248–261
Iqbal S, Ghani MU, Saba T, Rehman A (2018) Brain tumor segmentation in multi-spectral MRI using convolutional neural networks (CNN). Microsc Res Tech 81(4):419–427
Isin A, Direkoglu C, Şah M (2016) Review of MRI-based brain tumor image segmentation using deep learning methods. Procedia Comput Sci 102:317–324
Jayaraman S, Esakkirajan S, Veerakumar T (2009) Digital image processing. TMH Publication 2, pp 31–121
Jose A, Ravi S, Sambath M (2014) Brain tumor segmentation using k-means clustering and fuzzy C-means algorithms and its area calculation. Int J Innov Res Comput Commun Eng 2(3)
Joseph RP, Singh CS, Manikandan M (2014) Brain tumor MRI image segmentation and detection in image processing. Int J Res Eng Technol 3(1):1–5
Kaur P, Kumar R, Kumar M (2019) A healthcare monitoring system using random forest and internet of things (IoT). Multimedia Tools Appl 78(14):19905–19916
Khan MA, Lal IU, Rehman A, Ishaq M, Sharif M, Saba T, Akram T (2019) Brain tumor detection and classification: a framework of marker-based watershed algorithm and multilevel priority features selection. Microsc Res Tech 82(6):909–922
Khode KMR, Salwe SR, Bagade AP, Raut RD (2017) Efficient brain tumor detection using wavelet transform. Int J Eng Res Appl 7:55–60
Kumar A, Rastogi P, Srivastava P (2015) Design and FPGA implementation of DWT, image text extraction technique. Procedia Comput Sci 57:1015–1025
Kumar A, Sharma P, Gupta MK, Kumar R (2018) Machine learning based resource utilization and pre-estimation for network on chip (NoC) communication. Wireless Pers Commun 102(3):2211–2231
Kumar M, Gupta S, Kumar K, Sachdeva M (2020) Spreading of COVID-19 in India, Italy, Japan, Spain, UK, US: a prediction using ARIMA and LSTM model. Digit Government: Res Pract 1(4):1–9
Kumar A, Kumar M, Kaur A (2021) Face detection in still images under occlusion and non-uniform illumination. Multimedia Tools Appl 80(10):14565–14590
Kumar A, Chauda P, Devrari A (2021) Machine learning approach for brain tumor detection and segmentation. Int J Organizational Collective Intell (IJOCI) 11(3):68–84
Kumaria A (2021) Tumor treating fields: additional mechanisms and additional applications. J Korean Neurosurg Soc 64(3):469
Li BN, Chui CK, Chang S, Ong SH (2012) A new unified level set method for semi-automatic liver tumor segmentation on contrast-enhanced CT images. Expert Syst Appl 39(10):9661–9668
Liao X, Shu C (2015) Reversible data hiding in encrypted images based on absolute mean difference of multiple neighboring pixels. J Vis Commun Image Represent 28:21–27
Liao PS, Chen TS, Chung PC (2001) A fast algorithm for multilevel thresholding. J Inf Sci Eng 17(5):713–727
Liao X, Li K, Yin J (2017) Separable data hiding in encrypted image based on compressive sensing and discrete Fourier transform. Multimedia Tools Appl 76(20):20739–20753
Liao X, Yin J, Guo S, Li X, Sangaiah AK (2018) Medical JPEG image steganography based on preserving inter-block dependencies. Comput Electr Eng 67:320–329
MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley, 1, pp 281–297
Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 11(7):674–693
Moeskops P, Benders MJ, Chiţǎ SM, Kersbergen KJ, Groenendaal F, de Vries LS, Išgum I (2015) Automatic segmentation of MR brain images of preterm infants using supervised classification. NeuroImage 118:628–641
Moeskops P, Viergever MA, Mendrik AM, De Vries LS, Benders MJ, Išgum I (2016) Automatic segmentation of MR brain images with a convolutional neural network. IEEE Trans Med Imaging 35(5):1252–1261
Mustaqeem A, Javed A, Fatima T (2012) An efficient brain tumor detection algorithm using watershed & thresholding based segmentation. Int J Image Graphics Signal Process 4(10):34
Nanda SJ, Gulati I, Chauhan R, Modi R, Dhaked U (2019) A K-means-galactic swarm optimization-based clustering algorithm with Otsu’s entropy for brain tumor detection. Appl Artif Intell 33(2):152–170
Nazir M, Shakil S, Khurshid K (2021) Role of deep learning in brain tumor detection and classification (2015 to 2020): a review. Comput Med Imaging Graph 91:101940
Olszewska JI (2015) Active contour based optical character recognition for automated scene understanding. Neurocomputing 161:65–71
Olszewska JI (2019) Designing transparent and autonomous intelligent vision systems. In: ICAART 2, pp 850–856
Olszewska JI, Houghtaling M, Goncalves PJ, Fabiano N, Haidegger T, Carbonera JL, Prestes E (2020) Robotic standard development life cycle in action. J Intell Robotic Syst 98(1):119–131
Ostrom QT, Cioffi G, Gittleman H, Patil N, Waite K, Kruchko C, Barnholtz Sloan JS (2019) CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2012–2016. Neurooncology 21:v1–v100
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66
Pardakhti N, Sajedi H (2020) Brain age estimation based on 3D MRI images using 3D convolutional neural network. Multimedia Tools Appl 79(33):25051–25065
Patil RC, Bhalchandra AS (2012) Brain tumour extraction from MRI images using MATLAB. Int J Electron Communication Soft Comput Sci Eng 2(1):1–4
Pereira S, Pinto A, Alves V, Silva CA (2016) Brain tumor segmentation using convolutional neural networks in MRI images. IEEE Trans Med Imaging 35(5):1240–1251
Porter KR, McCarthy BJ, Freels S, Kim Y, Davis FG (2010) Prevalence estimates for primary brain tumors in the United States by age, gender, behavior, and histology. Neurooncology 12(6):520–527
Ratan R, Sharma S, Sharma SK (2009) Brain tumor detection based on multi-parameter MRI image analysis. ICGST-GVIP J 9(3):9–17
Remya R, Parimala GK, Sundaravadivelu S (2019) Enhanced DWT Filtering technique for brain tumor detection. IETE J Res: 1–10
Roy S, Bandyopadhyay SK (2012) Detection and quantification of brain tumor from MRI of brain and it’s symmetric analysis. Int J Inform Communication Technol Res 2(6):477–483
Sain PK, Singh M (2015) Brain tumor detection in medical imaging using MATLAB. Int Res J Eng Technol 2(2):191–196
Seetha J, Raja SS (2018) Brain tumor classification using convolutional neural networks. Biomed Pharmacol J 11(3):1457
Shree NV, Kumar TNR (2018) Identification and classification of brain tumor MRI images with feature extraction using DWT and probabilistic neural network. Brain Inf 5(1):23–30
Siegel RL, Miller KD, Jemal A (2020) Cancer statistics. CA Cancer J Clin 70(1):7–30
Singh AK, Dave M, Mohan A (2014) Hybrid technique for robust and imperceptible image watermarking in DWT–DCT–SVD domain. Natl Acad Sci Lett 37(4):351–358
Thapaliya K, Pyun JY, Park CS, Kwon GR (2013) Level set method with automatic selective local statistics for brain tumor segmentation in MR images. Comput Med Imaging Graph 37(7–8):522–537
Tivaskar SP, Lakhkar BN, Dhande RP, Mishra GV (2021) Role of TE in MR spectroscopy for the evaluation of brain tumour with reference to choline and creatinine. Indian J Forensic Med Toxicol 15(2)
Viji KA, Jayakumari J (2011) Automatic detection of brain tumor based on magnetic resonance image using CAD system with watershed segmentation. In: 2011 international conference on signal processing, communication, computing and networking technologies, pp 145–150
Wang H, Roa AC, Basavanhally AN, Gilmore HL, Shih N, Feldman M, Madabhushi A (2014) Mitosis detection in breast cancer pathology images by combining handcrafted and convolutional neural network features. J Med Imaging 1(3):03400
Wang J, Zhang G, He Z, Wang S, Sun Y (2020) Research on dermoscopic segmentation based on multi-scale convolutional neural network. Procedia Comput Sci 174:443–447
Weathers SP, Gilbert MR (2015) Current challenges in designing GBM trials for immunotherapy. J Neurooncol 123(3):331–337
Zhang C, Shen X, Cheng H, Qian Q (2019) Brain tumor segmentation based on hybrid clustering and morphological operations. Int J Biomed Imaging 2019:1–10
Zikic D, Ioannou Y, Brown M, Criminisi A (2014) Segmentation of brain tumor tissues with convolutional neural networks. Proceedings MICCAI-BRATS, pp 36–39
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, A. Study and analysis of different segmentation methods for brain tumor MRI application. Multimed Tools Appl 82, 7117–7139 (2023). https://doi.org/10.1007/s11042-022-13636-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13636-y