Brain magnetic resonance image (MRI) segmentation using multimodal optimization

Akan, Taymaz; Oskouei, Amin Golzari; Alp, Sait; Bhuiyan, Mohammad Alfrad Nobel

doi:10.1007/s11042-024-19725-4

Open access
Published: 02 July 2024

Multimedia Tools and Applications (2024)

Taymaz Akan ORCID: orcid.org/0000-0003-4070-1058^1,2,
Amin Golzari Oskouei^3,4,
Sait Alp⁵ &
…
Mohammad Alfrad Nobel Bhuiyan¹


Abstract

Segmenting tumors from brain magnetic resonance imaging (MRI) is one of the most actively studied problems in the medical science community. Diagnosing malignant tumors at an early stage is necessary to provide treatment, and early detection improves the patient’s prognosis. Medical experts currently segment brain tumors manually when making a diagnosis. This study proposes a new approach to simplify and automate this process. In recent research, multi-level segmentation has been widely used in medical image analysis, and the effectiveness and precision of a segmentation method are directly tied to the number of segments used. However, choosing the appropriate number of segments is often left to the user and is challenging for many segmentation algorithms. The proposed method is a modified version of the 3D histogram-based segmentation method that automatically determines an appropriate number of segments. The algorithm has three main steps. First, a Gaussian filter smooths the 3D RGB histogram of the image, which eliminates unreliable and non-dominant histogram peaks that are too close together. Next, a multimodal particle swarm optimization method identifies the histogram’s peaks. Finally, each pixel is assigned to the cluster that best fits its characteristics according to a non-Euclidean distance. The proposed algorithm has been applied to data from The Cancer Imaging Archive (TCIA) and to the Brain MRI Images for Brain Tumor Detection dataset. Its results are compared with those of three clustering methods: FCM, FCM_FWCW, and FCM_FW. In the comparative analysis across various MRI slices, our algorithm consistently demonstrates superior performance: it achieves the top mean rank in all three metrics, indicating its robustness and effectiveness in clustering. The experiments show that the proposed method is effective and able to find the proper clusters.


1 Introduction

Clinicians can identify diseases earlier thanks to medical imaging, which improves patient outcomes. Appropriate medical image analysis is essential to aid specialists and promote a healthy community. Thanks to advanced medical equipment, image processing techniques have been applied very successfully to the analysis of medical images. Image segmentation, the first step in image processing, is one of the most significant and complex challenges in image analysis, particularly for medical images. Segmentation divides an image into meaningful, nonoverlapping, homogeneous, connected regions with respect to color similarity. Medical image segmentation aims to isolate anatomical objects of interest for analysis and is critical in medical imaging applications [1,2,3,4]. There is a wide variety of image segmentation methods, including threshold-based, clustering-based, region-based, and edge-based methods [5,6,7,8,9]. Hybrid image segmentation techniques combine multiple approaches [10]. Segmentation features and methods have been studied for many years. Nevertheless, one restriction remains: the appropriate number of segments is a parameter that must be established a priori, and determining this value is not an easy task [11]. In addition, the problem remains hard because the computing cost grows exponentially with the desired number of segments, which makes it infeasible to search exhaustively for all possible solutions with exact methods.

Computerized tomography (CT), magnetic resonance imaging (MRI), electroencephalography (EEG), and positron emission tomography (PET) are the four most prevalent techniques for imaging the brain, and MRI is the most frequently used. MRI does not expose patients to radiation, has minimal invasiveness, and is widely available [12]. Furthermore, MRI provides greater soft tissue contrast and can discern more clearly between fatty tissue, water, muscle, and other types of soft tissue [13]. The information contained in these images is provided to medical professionals, who can use it to assist in the diagnosis of a wide range of illnesses and ailments. The goal of segmenting tumors in the human brain is to differentiate between diseased tissue and the normal tissues of the brain, specifically cerebrospinal fluid, gray matter, and white matter. When investigating brain tumors, it is currently simple to identify abnormal tissues; nevertheless, the segmentation process is challenging, which makes it difficult to reproduce results, characterize abnormalities, and achieve precision [13]. The term “tumor” refers to a mass caused by the uncontrolled growth of cancer cells. Tumors come in various subtypes and manifestations, each of which can be treated with a specific modality according to its symptoms. Brain segmentation results in an image with labels that show the boundaries of the different regions, or in a group of contours. Segmentation methods can be further divided into bi-level methods, which divide an image into two parts, and multi-level methods, which divide an image into more than two parts [14]. Since MRI brain images contain more than two types of regions, each of which may correspond to a unique object, bi-level segmentation is not effective and results in under-segmentation.

As a result, multi-level image segmentation algorithms should be used to segment MRI brain images. Segmenting color (RGB) images is also more difficult than segmenting gray images because of the range of color intensities and the three color channels, whereas gray images have only one channel [15]. Various approaches can be taken to segment MRI brain images [1, 13]. Methods for segmenting MRI brain images can be divided into three broad classes: classification-based, region-based, and boundary-based methods [16, 17]. Fuzzy c-means is a typical classification-based approach widely used in medical image segmentation [18,19,20,21]. However, these clustering methods require the number of segments to be specified in advance.

Furthermore, the computational time is also a consideration since it depends on the number of clusters and image size. As a result, the paper’s motivation comes from automatically segmenting the image regardless of image size and selecting the appropriate number of clusters in addition to the actual peaks. To get around these problems, this study suggests using a 3D histogram and a modified multimodal particle swarm optimization method for brain MRI segmentation.

The cluster centers can be found by detecting the peaks in a three-dimensional histogram of a color image, which is created from the RGB values of the pixels and smoothed with a Gaussian filter. A multimodal variant of particle swarm optimization (PSO) with a local search strategy is used to find the global and local peaks in the histogram, which serve as the cluster centers. The contribution of the proposed clustering-based method is that a non-Euclidean distance metric replaces the Euclidean metric both for calculating pixel similarity and in the movement strategy of the multimodal optimization algorithm. The number of clusters in the image is determined automatically from the number of peaks detected by PSO. Once the peaks have been found, each pixel is assigned to the cluster to which it is closest in terms of the non-Euclidean distance, which yields the final segmented brain image. The proposed algorithm has been compared with the FCM [22] clustering algorithm.

The rest of this paper is organized as follows. Section 2 presents FCM clustering. Section 3 describes the multimodal PSO algorithm. The proposed method is described in Section 4. Section 5 presents the experimental and comparative results, and Section 6 concludes the paper.

2 Fuzzy c-means clustering

The k-means and c-means algorithms are two of the most well-known clustering methodologies in color image segmentation. These algorithms frequently yield good results and are widely used. Nevertheless, as mentioned before, one of the constraints is that the number of clusters is a parameter that must be determined in advance, and determining the value of this parameter is not easy. In addition, the computational time is a primary concern when solving the problem, as it depends on the required number of clusters and the image size. The c-means algorithm uses fuzzy memberships to put each pixel in the correct category.

$$J=\sum_{j=1}^N\sum_{i=1}^c{u}_{ij}^m{\left\Vert {x}_j-{v}_i\right\Vert}^2,$$

(1)

Where u_ij represents the membership of pixel x_j in the i-th cluster, v_i is the center of the i-th cluster, ‖∙‖ is a norm metric, N is the number of pixels, c is the number of clusters, and m is a constant. The parameter m determines the fuzziness of the final partition; usually m = 2.

The cost function is minimized when pixels close to their cluster’s centroid receive high membership values and pixels far from the centroid receive low membership values. The membership function expresses the probability that a pixel belongs to a particular cluster; in this technique, that probability depends only on the pixel’s distance from each cluster center. The membership functions and cluster centers are updated as follows:

$${u}_{ij}=\frac{1}{\sum_{k=1}^c{\left(\frac{\left\Vert {x}_j-{v}_i\right\Vert }{\left\Vert {x}_j-{v}_k\right\Vert}\right)}^{2/\left(m-1\right)}}$$

(2)

$${v}_i=\frac{\sum_{j=1}^N{u}_{ij}^m{x}_j}{\sum_{j=1}^N{u}_{ij}^m}$$

(3)

The algorithm starts by randomly selecting the cluster centers, that is, the average location of each cluster. Next, c-means assigns a random membership grade to each data point for each cluster. It then iteratively updates the cluster centers and the membership grades, moving the centers toward the correct locations within the data set and recalculating each data point’s degree of membership in each cluster. This iteration minimizes an objective function that measures the distance from each data point to each cluster center, weighted by that data point’s membership in the cluster.
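The iteration described by Eqs. (1) to (3) can be sketched in a few lines of plain Python. This is a minimal illustration rather than the implementation used in the experiments: the function name `fcm`, the stopping tolerance `tol`, the fixed random seed, and the small floor on distances (to avoid division by zero) are all assumptions made for the example.

```python
import random

def fcm(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Fuzzy c-means on a list of feature vectors (e.g. RGB triples).

    Returns (centers, u), where u[j][i] is the membership of point j in cluster i.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships, normalized so each point's memberships sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([r / total for r in row])
    centers = [[0.0] * dim for _ in range(c)]
    for _ in range(iters):
        # Eq. (3): each center is the membership-weighted mean of the points.
        for i in range(c):
            w = [u[j][i] ** m for j in range(n)]
            w_sum = sum(w)
            centers[i] = [sum(w[j] * points[j][d] for j in range(n)) / w_sum
                          for d in range(dim)]
        # Eq. (2): memberships from the ratio of distances to all centers.
        shift = 0.0
        for j in range(n):
            dist = [max(sum((points[j][d] - centers[i][d]) ** 2
                            for d in range(dim)) ** 0.5, 1e-12)  # floor is an assumption
                    for i in range(c)]
            for i in range(c):
                new = 1.0 / sum((dist[i] / dist[k]) ** (2.0 / (m - 1.0))
                                for k in range(c))
                shift = max(shift, abs(new - u[j][i]))
                u[j][i] = new
        if shift < tol:  # stop when memberships no longer change (assumed criterion)
            break
    return centers, u
```

Note that the number of clusters `c` must be supplied by the caller, which is exactly the limitation discussed above.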

3 Multimodal PSO

While unimodal optimization algorithms can only identify a single global optimum (solution) within the search space, multimodal optimization (MMO) algorithms can find many local and global optimum solutions [23, 24]. Although multimodal optimization approaches have not been explored nearly as extensively as unimodal methods, they have recently gained much attention. However, most of them share the same problem with niching parameters: existing approaches have trouble determining the right niching radius [25]. In the majority of studies, basic unimodal optimization algorithms, such as the Genetic Algorithm [26,27,28] and the PSO algorithm [29,30,31,32], have been modified to become MMO algorithms. PSO’s movement (crossover) method is well suited to adaptation to a multimodal form. PSO is a nature-inspired method for unimodal stochastic optimization. It has been used to solve many computational problems [33] and has been extended numerous times to a multimodal form in the literature. PSO mimics the swarming behavior of foraging birds to move particles toward the best solution, so it depends on the movement of particles in the search space to determine the best value. Each particle keeps track of its own best position so far (the personal best) and the best position found by the whole population so far (the global best). The i-th particle is associated with a position vector x_i and a velocity vector v_i, which are updated according to the following equation:

$${\displaystyle \begin{array}{c}{v}_i\left(t+1\right)=w{v}_i(t)+{R}_1{C}_1\left({p}_i^{best}-{x}_i\right)+{R}_2{C}_2\left({g}^{best}-{x}_i\right)\\ {}{x}_i\left(t+1\right)={x}_i(t)+{v}_i\left(t+1\right)\end{array}}$$

(4)

Where t is the iteration number, w indicates the inertia weight, and ${p}_i^{best}$ and g^best correspond to the locations of the personal best and global best, respectively. R₁ and R₂ are two uniformly distributed random numbers generated in the interval [0,1]. C₁ and C₂ represent the particle’s confidence in itself and in its neighbors, respectively. Unimodal PSO cannot locate multiple solutions because all particles move toward the global best (gbest). However, PSO’s mechanism for particle motion can be easily adapted to handle multimodal problems. Carrera and Coello Coello (2009) introduced a modified PSO variation for solving multimodal problems inspired by electrostatic charge interactions [32]. To locate multiple optimal solutions, each solution moves toward the solution with which it has the greatest electrostatic interaction, calculated from the current fitness values. These interactions are determined mathematically as follows:

$${F}_{ij}=\frac{Q_i{Q}_j}{4\pi {r}^2{\varepsilon}_0}$$

(5)

Where Q_i and Q_j, r ≠ 0, and ε₀ refer, respectively, to the electrical charges of the interacting particles, the distance between them, and the permittivity of the vacuum. To apply these ideas in an optimization context, each solution’s fitness value plays the role of the particle’s electric charge. Accordingly, Eq. 5 is adapted as:

$${F}_{ij}=\alpha \frac{f\left({p}_i^{best}\right)f\left({p}_j^{best}\right)}{{\left\Vert {p}_i^{best}-{p}_j^{best}\right\Vert}^2}$$

(6)

In this case, the constant factor 1/(4πε₀) is replaced by the variable α, computed based on Li [34]. g^best in Eq. 4 is replaced by the personal best with index ${index}_i=\underset{j=1,\dots,M}{\arg\max}\,{F}_{ij}$ or a constant index j. Here, M denotes the population size.

$${\displaystyle \begin{array}{c}{v}_i\left(t+1\right)=w{v}_i(t)+{R}_1{C}_1\left({p}_i^{best}-{x}_i\right)+{R}_2{C}_2\left({p}_{index_i}-{x}_i\right)\\ {}{x}_i\left(t+1\right)={x}_i(t)+{v}_i\left(t+1\right)\end{array}}$$

(7)
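To make Eqs. (6) and (7) concrete, the following sketch selects, for each particle, the personal best with the strongest interaction and then applies one velocity and position update. It is an illustration under assumptions, not the authors’ code: the function names, the default values of `w`, `c1`, `c2`, and `alpha`, and the choice to skip j = i and coincident points (where the denominator of Eq. 6 would be zero) are not taken from the paper.

```python
import random

def interaction(p_i, p_j, f_i, f_j, alpha=1.0):
    """Eq. (6): fitness-weighted attraction between two personal bests."""
    r2 = sum((a - b) ** 2 for a, b in zip(p_i, p_j))
    return alpha * f_i * f_j / r2

def guide_index(i, pbest, fitness, alpha=1.0):
    """index_i = argmax_j F_ij, skipping j == i and coincident points (assumption)."""
    best_j, best_f = i, float("-inf")
    for j in range(len(pbest)):
        if j == i or pbest[j] == pbest[i]:
            continue
        f = interaction(pbest[i], pbest[j], fitness[i], fitness[j], alpha)
        if f > best_f:
            best_j, best_f = j, f
    return best_j

def step(x, v, pbest, fitness, w=0.7, c1=1.5, c2=1.5, rng=random):
    """One Eq. (7) update for every particle; returns new positions and velocities."""
    new_x, new_v = [], []
    for i in range(len(x)):
        guide = pbest[guide_index(i, pbest, fitness)]
        r1, r2 = rng.random(), rng.random()
        vi = [w * v[i][d]
              + r1 * c1 * (pbest[i][d] - x[i][d])
              + r2 * c2 * (guide[d] - x[i][d])
              for d in range(len(x[i]))]
        new_v.append(vi)
        new_x.append([x[i][d] + vi[d] for d in range(len(vi))])
    return new_x, new_v
```

Because each particle follows its own guide rather than a single global best, distant particles with high fitness can attract different parts of the swarm, which is what allows several peaks to be retained.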

4 Proposed method

In the study that introduced the 3DHP method, all global and local peaks within a 3D color histogram, which correspond to the cluster center points, were located using the aforementioned multimodal approach with a local search strategy. In RGB-model images, a pixel’s color is derived from a combination of red, green, and blue components. Each pixel in an image can therefore be represented as a three-dimensional feature vector made up of its three color components, and a 3D histogram can be constructed from these three color axes [35]. The presence of multiple peaks in a histogram indicates that the image comprises multiple distinct segments, with each peak corresponding to a particular segment. In 3DHP, a three-dimensional Gaussian filter was applied to the three-dimensional histogram to reduce the effect of noise and produce a smoothed histogram. This procedure also eliminates insignificant smaller peaks that may have been present in the histogram. Figure 1 illustrates the three-dimensional histogram, the original color distribution, and the color distribution after the smoothing process for the image of Lena. The 3D histogram was treated as the objective function, and the positions in the histogram formed the solution space, so the number of pixels at a particular position corresponds to the fitness value. Moreover, an additional local search step proposed in [36] was integrated into the multimodal PSO to enhance its local search ability. In this step, fitness values are used to examine the nearest neighbor of the i-th particle, and the personal best of the i-th particle is updated as follows:

$$\begin{cases}temp_d={p}_{d,i}^{best}+{C}_1\cdot \operatorname{rand}\cdot \left({p}_{d,i}^{best\_ nearest}-{p}_{d,i}^{best}\right), & f\left( bestNearest_i\right)\ge f\left( pbest_i\right)\\ temp_d={p}_{d,i}^{best}+{C}_1\cdot \operatorname{rand}\cdot \left({p}_{d,i}^{best}-{p}_{d,i}^{best\_ nearest}\right), & f\left( bestNearest_i\right)<f\left( pbest_i\right)\end{cases}\qquad d=1,\dots ,D$$

(8)

$$f(temp)>f\left( pbes{t}_i\right)\longrightarrow pbes{t}_i= temp$$

(9)

Where bestNearest_i is the particle closest in distance to the i-th particle, D is the number of dimensions, and temp is a new candidate position for the i-th particle. The candidate replaces the particle’s personal best if its fitness is higher than that of the current personal best (Eq. 9). Consequently, the particles do not all need to move to a single global optimum, and other possible local solutions are not missed. In the next step, K dominant peaks are located. Then, K sets of peak intensity levels corresponding to cluster centers are automatically obtained for each RGB component. These peaks are represented as follows: ${p}_1^{rgb}=\left({r}_1,{g}_1,{b}_1\right),{p}_2^{rgb}=\left({r}_2,{g}_2,{b}_2\right),{p}_3^{rgb}=\left({r}_3,{g}_3,{b}_3\right)$,⋯, ${p}_K^{rgb}=\left({r}_K,{g}_K,{b}_K\right)$.
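The local search of Eqs. (8) and (9) can be sketched as follows. This is a hedged illustration: the function name `local_search`, the default `c1`, and the use of a single random number per particle are assumptions, and `fitness_fn` stands for any function to be maximized (in the method described here it would look up the smoothed histogram value at a position, which would also require rounding to valid histogram indices).

```python
import random

def local_search(pbest, fitness_fn, c1=2.0, rng=random):
    """Refine each personal best using its nearest neighbour (Eqs. 8 and 9).

    pbest is a list of position lists and is updated in place.
    """
    values = [fitness_fn(p) for p in pbest]
    for i in range(len(pbest)):
        others = [j for j in range(len(pbest)) if j != i]
        if not others:
            continue
        # Nearest other personal best by squared Euclidean distance.
        near = min(others, key=lambda j: sum((a - b) ** 2
                                             for a, b in zip(pbest[i], pbest[j])))
        # Eq. (8): move toward a fitter neighbour, away from a weaker one.
        sign = 1.0 if values[near] >= values[i] else -1.0
        r = rng.random()
        temp = [p + sign * c1 * r * (q - p) for p, q in zip(pbest[i], pbest[near])]
        # Eq. (9): keep the candidate only if it improves the fitness.
        t_val = fitness_fn(temp)
        if t_val > values[i]:
            pbest[i], values[i] = temp, t_val
    return pbest, values
```

Since a candidate is accepted only when it is strictly better, the fitness of every personal best is non-decreasing under this step.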

Additionally, to eliminate non-dominant clusters, it is beneficial to enforce a minimum distance between any two peaks. A dominant peak therefore eliminates all non-dominant peaks within its radius, which is set by a distance limit parameter. This procedure is not mandatory and can be skipped if desired. For 3DHP, this parameter was set to 80 pixels. Ultimately, every pixel is assigned to the peak closest to it in terms of the Euclidean distance. The Euclidean distance between the k-th peak and the (i, j)-th pixel was calculated as follows:

$$\left\Vert {p}_k^{rgb}-{I}_{i,j}^{rgb}\right\Vert =\sqrt{{\left({p}_k^r-{I}_{i,j}^r\right)}^2+{\left({p}_k^g-{I}_{i,j}^g\right)}^2+{\left({p}_k^b-{I}_{i,j}^b\right)}^2}$$

(10)
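The optional peak-pruning step and the distance of Eq. (10) can be illustrated with a short sketch. The greedy strategy below (visit peaks from tallest to shortest and keep a peak only if it is farther than the limit from every peak already kept) is one plausible reading of the description and is an assumption, as are the function names; the default limit of 80 is the value quoted for 3DHP.

```python
import math

def euclidean(p, q):
    """Eq. (10): Euclidean distance between two RGB triples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def suppress_peaks(peaks, heights, limit=80.0):
    """Keep a peak only if no taller kept peak lies within `limit` of it.

    peaks: list of RGB triples; heights: histogram value at each peak.
    Returns the surviving peaks, tallest first.
    """
    order = sorted(range(len(peaks)), key=lambda k: heights[k], reverse=True)
    kept = []
    for k in order:
        if all(euclidean(peaks[k], peaks[j]) > limit for j in kept):
            kept.append(k)
    return [peaks[k] for k in kept]
```

For example, with peaks at (10, 10, 10), (20, 20, 20), and (200, 200, 200) and heights 50, 90, and 30, the first peak is removed because it lies within the limit of the taller second peak.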

Also, a non-Euclidean distance criterion was proposed in [37] and then applied in [37] to color image segmentation using FCM. This distance is calculated as:

$$ned\left({x}_i,{c}_j\right)=\sum_{a=1}^A\left(1-{e}^{-{\left({x}_{i,a}-{c}_{j,a}\right)}^2}\right)$$

(11)

Where x_i is a data point, c_j is a cluster center, and A is the number of features.
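A direct transcription of Eq. (11) shows how this distance behaves. The function name `ned` follows the equation; everything else is a minimal sketch.

```python
import math

def ned(x, c):
    """Eq. (11): sum over features of 1 - exp(-(x_a - c_a)^2)."""
    return sum(1.0 - math.exp(-(a - b) ** 2) for a, b in zip(x, c))
```

Each term lies between 0 and 1, so the distance is bounded by the number of features A, and large differences in a single feature cannot dominate the total. The terms also saturate quickly: a per-feature difference of 3 already contributes about 0.9999, so the scale of the features matters when this distance is used.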

In the proposed method, the divisor of the fraction in Eq. 6, which is based on $\left\Vert {p}_i^{best}-{p}_j^{best}\right\Vert$, is replaced by the non-Euclidean distance of Eq. 11. For two personal best positions, this distance is expressed as:

$$ned\left({p}_i^{best},{p}_j^{best}\right)=\sum_{d=1}^{D}\left(1-{e}^{-{\left({p}_{i,d}^{best}-{p}_{j,d}^{best}\right)}^2}\right),\qquad D=3$$

(12)

Therefore:

$${F}_{ij}=\alpha \frac{f\left({p}_i^{best}\right)f\left({p}_j^{best}\right)}{ned\left({p}_i^{best},{p}_j^{best}\right)}$$

(13)

Also, after locating the histogram peaks, every pixel will be assigned to the peak (cluster head) closest to it in terms of the non-Euclidean distance instead of the Euclidean distance. Consequently, Eq.10 can be reformulated as:

$$ned\left({p}_k^{rgb},{I}_{i,j}^{rgb}\right)=\sum_{c\in \left\{r,g,b\right\}}\left(1-{e}^{-{\left({p}_k^c-{I}_{i,j}^c\right)}^2}\right)$$

(14)
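The assignment step can be sketched as below, applying the per-feature form of Eq. (11) with the three RGB channels as features. The `scale` parameter is an assumption introduced for this example and is not part of the paper: with `scale=1.0` the code uses raw channel differences, in which case the exponential terms saturate for most pixels, while the default divides differences by 255 first. The function names and the nested-list image layout are also illustrative.

```python
import math

def ned(p, q, scale=255.0):
    """Non-Euclidean distance over RGB channels; `scale` is an assumed normalizer."""
    return sum(1.0 - math.exp(-((a - b) / scale) ** 2) for a, b in zip(p, q))

def assign_pixels(image, peaks, scale=255.0):
    """Label each pixel with the index of its nearest peak (cluster head).

    image: list of rows, each a list of RGB triples.
    Returns a grid of the same shape holding cluster indices.
    """
    return [[min(range(len(peaks)), key=lambda k: ned(peaks[k], px, scale))
             for px in row]
            for row in image]
```

The number of distinct labels equals the number of detected peaks, so no cluster count has to be supplied separately.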

It is worth mentioning that, as a preprocessing step, Gaussian smoothing is applied to the RGB image before the 3D histogram is calculated. The σ and window size of this filter are set to 0.5 and 3 × 3, respectively.

The flow diagram of the overall method is illustrated in Fig. 2.
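The histogram stage of this flow can be sketched with NumPy as follows. This is an illustration under assumptions: the bin count, the σ and radius of the histogram filter, and the function name are chosen for the example (the paper’s values are listed in Table 1), the 3D Gaussian filter is applied as three separable 1-D convolutions, and the image-level smoothing mentioned above is omitted.

```python
import numpy as np

def smoothed_rgb_histogram(image, bins=32, sigma=1.0, radius=2):
    """image: (H, W, 3) uint8 array -> smoothed (bins, bins, bins) histogram."""
    pixels = image.reshape(-1, 3).astype(float)
    # 3D histogram over the R, G and B axes.
    hist, _ = np.histogramdd(pixels, bins=(bins,) * 3, range=[(0, 256)] * 3)
    # Separable 1-D Gaussian kernel, normalized to sum to 1.
    offsets = np.arange(-radius, radius + 1)
    kernel = np.exp(-(offsets ** 2) / (2.0 * sigma ** 2))
    kernel /= kernel.sum()
    # Convolve along each colour axis in turn (equivalent to a 3D Gaussian).
    for axis in range(3):
        hist = np.apply_along_axis(
            lambda v: np.convolve(v, kernel, mode="same"), axis, hist)
    return hist
```

The resulting array plays the role of the objective function: its value at a given (r, g, b) bin is the fitness that the multimodal PSO tries to maximize.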

5 Experimental results and performance evaluation

In this section, extensive experiments are performed on the proposed method. Its results are compared with those of the well-known FCM [22], FCM_FW [3], and FCM_FWCW [38] methods. The required parameters of the proposed method and their values are shown in Table 1. The fuzziness parameter in all soft (fuzzy) clustering methods is set to 2, and the maximum number of iterations for FCM, FCM_FW, and FCM_FWCW is 100.

Table 1 Required parameters of the proposed method


5.1 Dataset

We used the following three datasets to evaluate the proposed method:

The first dataset was taken from The Cancer Imaging Archive (TCIA). It is supported by the National Cancer Institute and contains medical imaging data corresponding to The Cancer Genome Atlas (TCGA) participants.
The second dataset [39] is available at https://huggingface.co/datasets/miladfa7/Brain-MRI-Images-for-Brain-Tumor-Detection/tree/main.
The third dataset is available at https://www.kaggle.com/datasets/ammarnassanalhajali/brain-tumor.

5.2 Evaluation metrics

As the brain MRI slices are heterogeneous, qualitative (visual) evaluation of different methods is insufficient to analyze the results accurately. Therefore, quantitative metrics are needed to evaluate the results of various methods [40]. In the experiments, the following two groups of metrics are used to measure the performance of algorithms.

1) Internal clustering metrics: for the metrics in this group, a lower value indicates a better segmentation result.

F: this metric penalizes over-segmentation [41] (segmenting one region of the image into more than one segment):

$$\boldsymbol{F}=\frac{1}{1000\ \left(\textrm{M}\times N\right)}\sqrt{R}\sum_{i=1}^R\frac{{e_i}^2}{\sqrt{A_i}}$$

(15)

where M and N represent the length and width of the input image, R is the number of segmented regions, A_i indicates the number of pixels in the i-th segmented region, e_i indicates the color error in region i, and $\sqrt{R}$ is a penalizing term that discourages over-segmentation.

$\boldsymbol{F}'$: this metric penalizes over-segmentation and is noise-robust [42]. Here, R(A) denotes the number of regions whose area is exactly A, and Max is the area of the largest region:

$$\boldsymbol{F}'=\frac{1}{10000\ \left(\textrm{M}\times N\right)}\sqrt{\sum_{A=1}^{\mathit{\operatorname{Max}}}{\left[R(A)\right]}^{1+\frac{1}{A}}}\sum_{i=1}^R\frac{{e_i}^2}{\sqrt{A_i}}$$

(16)

Q: this metric penalizes non-homogeneous regions [42]:

$$\boldsymbol{Q}=\frac{1}{10000\ \left(\textrm{M}\times N\right)}\sqrt{R}\sum_{i=1}^R\left[\frac{{e_i}^2}{1+\log {A}_i}+{\left(\frac{R\left({A}_i\right)}{A_i}\right)}^2\right]$$

(17)

2) External clustering metrics: for the metrics in this group, a higher value indicates a better segmentation result.

Accuracy: the number of correctly predicted pixels divided by the total number of pixels. This metric is calculated by Eq. (18).

$$Accuracy=\frac{TN+ TP}{TN+ TP+ FN+ FP}$$

(18)

Precision: the ratio of correctly predicted positive pixels to all pixels predicted as positive. This metric is calculated by Eq. (19).

$$Precision=\frac{TP}{TP+ FP}$$

(19)

Recall: the ratio of correctly predicted positive pixels to all pixels that are actually positive. This metric is calculated by Eq. (20).

$$Recall=\frac{TP}{TP+ FN}$$

(20)

F1 Score: the harmonic mean of Precision and Recall. This metric is calculated by Eq. (21).

$$F1\ Score=2\times \frac{Recall\times Precision}{Recall+ Precision}$$

(21)

Specificity: the proportion of negative pixels that are correctly classified as negative, relative to all negative pixels. This metric is calculated by Eq. (22).

$$Specificity=\frac{TN}{TN+ FP}$$

(22)

In Eqs. (18) to (22), TP, FP, TN, and FN represent True Positive, False Positive, True Negative, and False Negative, respectively.

In our experiments, these metrics are expressed as a percentage. A high percentage indicates a better performance.
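The metrics above can be computed with a short sketch. It covers F (Eq. 15), Q (Eq. 17), and the external metrics of Eqs. (18) to (22), using the constants as printed in those equations. Several choices are assumptions made for illustration: regions are taken to be groups of pixels sharing a label (not connected components), e_i² is computed as the summed squared distance of a region’s pixels to the region mean, the logarithm in Q is the natural logarithm, and ratios with a zero denominator are reported as 0.

```python
import math
from collections import Counter, defaultdict

def region_stats(pixels, labels):
    """Return (area, squared colour error) per region for flat pixel/label lists."""
    groups = defaultdict(list)
    for px, lab in zip(pixels, labels):
        groups[lab].append(px)
    stats = []
    for members in groups.values():
        area = len(members)
        mean = [sum(ch) / area for ch in zip(*members)]
        # Assumption: e_i^2 is the summed squared distance to the region mean.
        e2 = sum(sum((a - m) ** 2 for a, m in zip(px, mean)) for px in members)
        stats.append((area, e2))
    return stats

def f_metric(pixels, labels):
    """Eq. (15); len(pixels) plays the role of M x N."""
    stats = region_stats(pixels, labels)
    total = sum(e2 / math.sqrt(a) for a, e2 in stats)
    return math.sqrt(len(stats)) * total / (1000.0 * len(pixels))

def q_metric(pixels, labels):
    """Eq. (17); R(A_i) is the number of regions that share area A_i."""
    stats = region_stats(pixels, labels)
    same_area = Counter(a for a, _ in stats)
    total = sum(e2 / (1.0 + math.log(a)) + (same_area[a] / a) ** 2
                for a, e2 in stats)
    return math.sqrt(len(stats)) * total / (10000.0 * len(pixels))

def external_scores(pred, truth):
    """Eqs. (18)-(22) as percentages for binary masks given as flat 0/1 lists."""
    tp = sum(1 for p, t in zip(pred, truth) if p and t)
    tn = sum(1 for p, t in zip(pred, truth) if not p and not t)
    fp = sum(1 for p, t in zip(pred, truth) if p and not t)
    fn = sum(1 for p, t in zip(pred, truth) if not p and t)
    ratio = lambda a, b: 100.0 * a / b if b else 0.0
    precision, recall = ratio(tp, tp + fp), ratio(tp, tp + fn)
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": ratio(tp + tn, tp + tn + fp + fn),
            "precision": precision, "recall": recall, "f1": f1,
            "specificity": ratio(tn, tn + fp)}
```

A labeling whose regions are perfectly uniform in color yields F = 0, and mixing different colors within a region increases F, which matches the lower-is-better convention for the internal metrics.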

5.3 Experiment 1: Visualization-based analysis using internal metrics

In this section, we evaluate the proposed method against the other methods qualitatively and quantitatively. Several images have been selected from each dataset for the quality assessment. We tried to select images that cover different types of tumors: small and large tumor sizes, different tumor tissues, spherical and non-spherical tumor shapes, and different lighting conditions. In total, 23 images are selected from the first dataset, 15 from the second, and 15 from the third.

Figures 3, 4, and 5 show the results of the visual qualitative analysis of the first, second, and third datasets, respectively. The segmentation results for each method are displayed by applying a distinct color set to the base image to highlight the clusters obtained. Tables 2 (first dataset), 4 (second dataset), and 6 (third dataset) contain the peak locations of the three-dimensional histogram and the cluster centroids for each cluster achieved by FCM, FCM_FW [3], and FCM_FWCW. In the same way, Tables 3, 5, and 7 show the numerical analysis of the results of the tested methods.

Table 2 Cluster heads and centers of proposed and other methods (first dataset)


Table 3 Statistical results of the first dataset


In our tests, the number of peaks found by M3DHP is used as the number of clusters (c) for FCM, FCM_FW, and FCM_FWCW. However, with the FCM_FW and FCM_FWCW algorithms, some clusters may end up empty, with no pixels assigned to them; that is, the algorithm may converge to a solution in which one or more clusters have no associated data points. This issue can arise from various factors, including the selection of the initial cluster centers, the data distribution, or a specified number of clusters that does not align with the natural clustering in the data.

Based on Figs. 3, 4, and 5, the background is clearly distinguished in all images of all three datasets when using M3DHP. With FCM, on the other hand, the backgrounds of 11 of the 23 images in the first dataset, 5 of the 15 images in the second dataset, and 5 of the 15 images in the third dataset are mistakenly divided into many regions, that is, over-segmented.

For TCGA_HT_8105_19980826_26, the tumor can be distinguished more easily with M3DHP than with FCM, which mistakenly assigns many non-tumor pixels to the tumor cluster. Moreover, for TCGA_FG_6688_20020215_24, the tumor in the bottom center of the brain is very clearly distinguished when using M3DHP, whereas FCM is not successful in this case. For TCGA_DU_7014_19860618_30, the proposed algorithm segments the image correctly, while FCM represents both the brain and the tumor as a single region, so the image is under-segmented. Visual inspection reveals that the M3DHP method generally yields more homogeneous segmentation regions (Tables 4, 5, and 6).

Table 4 Cluster heads and centers of proposed and other methods (second dataset)


Table 5 Statistical results of the second dataset


Table 6 Cluster heads and centers of proposed and other methods (third dataset)


As shown in Tables 3, 5, and 7, according to F and F′, the proposed method outperforms FCM in 12 out of 23 cases. Furthermore, according to Q, the proposed method outperforms FCM in 17 out of 23 cases. For any given image, the values of F, F′, and Q obtained by the two approaches are close to each other, and all of them tend toward zero. Nevertheless, M3DHP produces reliable results for all three statistical metrics and offers better performance than FCM in most cases. The last three rows of Tables 3, 5, and 7 give the average rank of the algorithms according to the three performance indicators. M3DHP ranks first according to all three.

Table 7 Statistical results of the third dataset


In the comparative analysis of the algorithms across various MRI slices using the metrics F, F′, and Q, our algorithm consistently demonstrates superior performance. It achieves the top mean rank in all three metrics, indicating its robustness and effectiveness in clustering. FCM_FW generally ranks second, outperforming the standard FCM, which consistently ranks last. The consistent top ranking of our algorithm across all metrics underscores its potential as a preferred choice for clustering tasks in varied contexts.

5.4 Experiment 2: Numerical analysis of all images using internal metrics

In this section, we evaluate the average results obtained on the images of all datasets. As shown in Tables 8 and 9, M3DHP has the best results. In terms of both the F and F′ criteria, FCM_FWCW has the next-best results after M3DHP. In terms of the Q criterion, the next-best results after M3DHP come from FCM_FW on the first dataset and FCM on the second dataset. FCM_FWCW performs well on F and F′, with values similar to those of M3DHP; however, its Q values are much larger than those of M3DHP. On the third dataset, as on the other datasets, our model ranks first.

Table 8 The average performance of M3DHP and other state-of-the-art methods on all samples


Table 9 The mean rank of M3DHP and other state-of-the-art methods on all samples


These findings lead us to the conclusion that M3DHP can demonstrate competent performance during the segmentation of brain magnetic resonance images. The visual and numerical results show that the proposed M3DHP technique produces promising segmentation results. The method’s ability to automatically generate the desired number of clusters and cluster centroids proves this.

The thorough testing demonstrates that the proposed method performs well for image segmentation, surpassing the performance standards set by well-known methods like FCM, FCM_FWCW, and FCM_FW. The consistently superior outcomes across multiple evaluation criteria underscore the potential of the proposed method as a noteworthy contribution to the field.

5.5 Experiment 3: Analysis of all images with external metrics

In this experiment, to investigate the proposed method deeply and compare the obtained results with other state-of-the-art methods, the performance of the proposed method is evaluated with image ground truths and external evaluation metrics, such as accuracy, F1, precision, recall, and specificity. The statistical results are reported in Table 10. The visual segmentation results are also shown in Fig. 6. We test the proposed method only on the first dataset because the ground truth for the second one is not available.

Table 10 Statistical results of the first dataset with external metrics

Table 10 and Fig. 6 show that the proposed method has the best average performance on the tested images. For image TCGA_DU_5855_19951217_23, the proposed method segments the tumor and non-tumor areas well, whereas the other methods could not accurately detect the entire tumor area because of its low intensity, shape, and texture. In image TCGA_DU_7014_19860618_45, the compared methods segment the border of the skull as a tumor area, an error due mainly to the high brightness of the tissue surrounding the skull; the proposed method, however, detects the tumor area well. On average over all testing images, the proposed method achieves an accuracy of 95.52%, an F1-score of 79.68%, and a precision of 77.2257%. This result highlights the efficacy of the feature combination employed in our method. The recall and specificity of the proposed method are lower than those of the FCM_FW method. The reason is apparent: feature weighting schemes and efficiently extracted features can improve the results, whereas our method, like the FCM and FCM_FWCW methods, does not include a feature extraction phase.

6 Conclusion

In this paper, we propose a modified form of the 3D histogram-based segmentation technique that can choose an appropriate number of segments automatically. The number of segments is determined by detecting histogram peaks with a multimodal optimization algorithm. The proposed method has been applied to the segmentation of brain MRI. Because M3DHP does not require the number of clusters to be known in advance, it is more flexible in practice than other methods. To assess the efficiency of the proposed method, it has been compared with the well-known FCM clustering scheme. The experimental results demonstrate that the proposed method produces the desired outcomes and outperforms FCM. The method currently works on a single 2D MRI slice; the next step in our research will focus on extending it to 3D MRI segmentation.
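To make the three-step idea concrete (smooth the histogram, detect its peaks, assign pixels to peaks), the following is a deliberately simplified sketch and not the M3DHP implementation. It makes several assumptions that differ from the paper: it works on a 1D grayscale histogram rather than the 3D RGB histogram, it replaces the multimodal particle swarm optimizer with a plain local-maximum scan, and it assigns pixels by absolute intensity difference rather than the non-Euclidean distance. All function names and the default `sigma` are hypothetical.

```python
import math


def gaussian_kernel(sigma, radius):
    """Normalized 1D Gaussian weights on [-radius, radius]."""
    weights = [math.exp(-(k * k) / (2 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def smooth(hist, sigma=2.0):
    """Gaussian-smooth a histogram, clamping indices at the borders."""
    radius = int(3 * sigma)
    kernel = gaussian_kernel(sigma, radius)
    n = len(hist)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * hist[j]
        out.append(acc)
    return out


def find_peaks(smoothed):
    """Stand-in for the multimodal optimizer: indices of local maxima."""
    n = len(smoothed)
    peaks = []
    for i, value in enumerate(smoothed):
        left = smoothed[i - 1] if i > 0 else -math.inf
        right = smoothed[i + 1] if i < n - 1 else -math.inf
        if value > 0 and value > left and value >= right:
            peaks.append(i)
    return peaks


def segment(pixels, levels=256, sigma=2.0):
    """Return (peaks, labels); the number of clusters is len(peaks)."""
    hist = [0] * levels
    for v in pixels:
        hist[v] += 1
    peaks = find_peaks(smooth(hist, sigma))
    # Assign each pixel to the nearest peak (absolute difference, an assumption).
    labels = [min(range(len(peaks)), key=lambda c: abs(v - peaks[c]))
              for v in pixels]
    return peaks, labels


# Hypothetical two-intensity image flattened to a list of gray levels.
pixels = [50] * 100 + [200] * 100
peaks, labels = segment(pixels)
```

In this sketch the number of clusters is never supplied by the caller; it falls out of how many peaks survive the smoothing, which is the property the conclusion emphasizes. A larger `sigma` merges nearby peaks and therefore yields fewer segments.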

Data availability

The brain images supporting Fig. 1 are publicly available on Kaggle: https://www.kaggle.com/datasets/mateuszbuda/lgg-mri-segmentation

Funding

No funding for this study.

Author information

Authors and Affiliations

Department of Medicine, Louisiana State University Health Sciences Center, Shreveport, LA, 71103, USA
Taymaz Akan & Mohammad Alfrad Nobel Bhuiyan
Software Engineering Department, Istanbul Topkapi University, 34020, Istanbul, Turkey
Taymaz Akan
Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran
Amin Golzari Oskouei
Department of Software Engineering, İstinye University, İstanbul, Turkey
Amin Golzari Oskouei
Department of Artificial Intelligence Engineering, Trabzon University, 61335, Trabzon, Turkey
Sait Alp

Contributions

All authors contributed to the research process in various forms, including original draft preparation, writing, review, and editing. TA provided core concepts, drafted the manuscript, and carried out implementations for this manuscript. AGO provided core concepts and carried out simulations for this manuscript. SA carried out implementations and drafted the manuscript. MANB drafted and proofread the manuscript and approved the final manuscript.

Corresponding author

Correspondence to Mohammad Alfrad Nobel Bhuiyan.

Ethics declarations

Ethical approval and consent to participate

Not applicable.

Human ethics

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no potential competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Akan, T., Oskouei, A.G., Alp, S. et al. Brain magnetic resonance image (MRI) segmentation using multimodal optimization. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19725-4

Received: 01 December 2022
Revised: 13 June 2024
Accepted: 19 June 2024
Published: 02 July 2024
DOI: https://doi.org/10.1007/s11042-024-19725-4
