1 Introduction

Coronavirus Disease 2019 (COVID-19) is currently the world’s most severe epidemic, posing a global public health problem and challenge. Diagnosing COVID-19 early and accurately is essential to controlling the spread of the epidemic. The most popular biochemical diagnostic method for COVID-19 is Real-Time Polymerase Chain Reaction (RT-PCR) [1]. Although RT-PCR has a low cost, it is time-consuming and may produce false negatives [2]. In radiology, analyzing chest CT and X-ray images of COVID-19 patients is also a key technology. Studies have shown that chest CT scans are more sensitive than RT-PCR in the early detection of COVID-19 infection [3, 4]. When RT-PCR is negative, chest CT image analysis is critical for diagnosing COVID-19. However, CT cannot be used on a large scale because it is an expensive test that exposes patients to radiation. The American College of Radiology (ACR) warns that the CT disinfection required after screening COVID-19 individuals may disrupt radiological service availability, and recommends exploring portable chest radiography to minimize the likelihood of cross-infection [5]. Moreover, when clinical suspicion of COVID-19 is high, a positive chest X-ray may make CT unnecessary [6]. Therefore, chest X-ray imaging is a feasible solution for the early diagnosis and control of COVID-19.

Artificial intelligence (AI), which is concerned with imitating human reasoning computationally, has made tremendous advances in a variety of fields [7, 8]. AI-assisted healthcare systems have lately attracted interest, with the aim of making diagnostic technologies and clinical decision-making more immediate, autonomous, and efficient [9,10,11]. As we all know, image segmentation is crucial in computer image processing and analysis, and Multilevel threshold Image Segmentation (MIS) is a simple and efficient image segmentation technique [12]. Medical images are a particular class of images: their feature space is usually more complex, and more attention is paid to the accuracy and efficiency of their segmentation [13]. In recent years, many research results have been published on applying the Meta-Heuristic Algorithm (MHA) to optimize MIS on medical images [14,15,16], and the advantages of the MHA in medical image segmentation are becoming more and more prominent. Since the outbreak of COVID-19, many scholars have conducted extensive research on MIS based on the MHA [13, 17,18,19,20,21,22,23]. These results indicate that the advantages of using the MHA to optimize the segmentation of COVID-19 Chest X-ray Images (COVID-19-CXIs) are apparent. However, to achieve better segmentation results and improve the diagnostic quality of COVID-19, more research is still needed to propose better MHAs and segmentation schemes. As a result, this paper focuses on MIS based on the MHA and proposes a high-performance scheme to segment COVID-19-CXIs.

Image segmentation is the process of dividing an image into different areas based on specific rules. These areas do not intersect, and each region has a universal consistency. Formally, it can be defined as follows: if \(g\left(x,y\right)\) is the set of all pixels, where \(\left(x,y\right)\) denotes the spatial coordinate, then segmentation is a partitioning of the set \(g\) into a set of connected subsets \(\left({g}_{1},{g}_{2}\dots {g}_{n}\right)\). These subsets satisfy the following conditions [24]:

  • \({g}_{i}\) is a connected region, \(1\le i\le n\).

  • \(\bigcup_{i=1}^{n}{g}_{i}=g\).

  • \({g}_{i}\cap {g}_{j}=\mathrm{\varnothing },1\le i\le n,1\le j\le n,i\ne j\).

  • \({g}_{i}\) satisfies certain rules.

Image segmentation can be achieved using a variety of techniques. Pal [12] provided an overview of image segmentation methods, including gray-level thresholding [25,26,27,28], iterative pixel classification [29,30,31], surface-based segmentation [32, 33], segmentation of color images [34], edge detection [35], and methods based on fuzzy set theory [36]. In addition, Pham et al. [37] divided these methods into eight categories: thresholding approaches, region growing approaches, classifiers, clustering approaches, Markov Random Field (MRF) models, artificial neural networks, deformable models, and atlas-guided approaches. Regardless of the classification, thresholding is an old, simple, and popular image segmentation technique [12]. Thresholding is classified as Bilevel threshold Image Segmentation (BIS) or MIS according to the number of thresholds. BIS divides the image into two regions: the object and the background. When multiple thresholds are chosen, the image can be divided into several objects and backgrounds, a process known as MIS. MIS is required in the majority of real-world scenes.

There are many ways of MIS, such as histogram shape-based [38, 39], clustering-based [25, 40], similarity-based [41, 42], entropy-based [28], etc. Among these, the MIS based on entropy is widely used due to its ease of implementation and accurate results. For example, Kapur’s entropy [28], fuzzy entropy [43], Tsallis entropy [44], etc. Kapur proposed Kapur’s entropy image segmentation in 1985. The essence of Kapur’s entropy image segmentation is to determine a set of segmentation thresholds to maximize Kapur’s entropy. Determining a set of segmentation thresholds is the key to Kapur’s entropy image segmentation. The traditional exhaustive method is time-consuming and ineffective [17, 18]. Determining the threshold is equivalent to solving a constrained optimization problem, precisely what the MHA is designed to solve.

The MHA is an iterative search method that exploits the distribution information of a population of candidate solutions. This kind of search solves complex problems faster than traditional optimization algorithms and is more adaptable. Therefore, many researchers became highly interested in the MHA soon after it was first proposed. Various excellent MHAs have been continuously proposed, including the Multi-Verse Optimizer (MVO) [45], Whale Optimization Algorithm (WOA) [46], Grey Wolf Optimization (GWO) [46], Harris Hawks Optimization (HHO) [47], Bat Algorithm (BA) [46], Particle Swarm Optimization (PSO) [48], Hunger Games Search (HGS) [49], Colony Predation Algorithm (CPA) [50], Slime Mould Algorithm (SMA) [51], Weighted Mean of Vectors (INFO) [52], Runge Kutta Optimizer (RUN) [53], etc. They are widely used to solve optimization problems in many fields, such as image segmentation [54, 55], optimization of machine learning models [56], the economic emission dispatch problem [57], medical diagnosis [58, 59], scheduling problems [60,61,62], plant disease recognition [63], practical engineering problems [64, 65], solar cell parameter identification [66], feature selection [67, 68], bankruptcy prediction [69, 70], expensive optimization problems [71, 72], combinatorial optimization problems [73], and multi-objective problems [74, 75]. There have also been numerous achievements in image segmentation.

Li et al. [76] improved the barnacle mating optimizer using a logistic model and chaotic mapping and applied it, combined with Masi entropy, to multilevel color image segmentation. Li et al. [77] improved the flower pollination algorithm for Tsallis entropy image segmentation. Li et al. [78] proposed a Modified Artificial Bee Colony (MABC) optimizer and solved the MIS problem. In 2021, Li et al. [79] used an improved coyote optimization algorithm to achieve fuzzy MIS. Houssein et al. [80] improved manta ray foraging optimization for MIS. Khairuzzaman [81] applied the GWO to Otsu MIS in 2017. Furthermore, many studies based on Kapur’s entropy have also been published. Akay [82] applied PSO and the Artificial Bee Colony (ABC) to Kapur’s entropy MIS and compared their performance. Bhandari et al. [83] applied the Cuckoo Search (CS) algorithm and Wind Driven Optimization (WDO) to Kapur’s entropy MIS in 2014; the following year, they improved the ABC and combined it with the Kapur, Otsu, and Tsallis objective functions for satellite image segmentation [84].

In 2016, Mirjalili et al. [45] proposed a new MHA called MVO. Like other MHAs, MVO suffers from slow convergence and is prone to falling into local optima. However, its structure is simple and it has few parameters. Since MVO was proposed in 2016, it has drawn the attention of many scholars, and research results on MVO have been published continuously in recent years. In 2016, Faris et al. [85] proposed using MVO to train the Multi-Layer Perceptron (MLP) neural network; it was experimentally compared with PSO, Differential Evolution (DE), Firefly (FF), and CS, and then with two traditional gradient-based training methods (the BP and LM algorithms). The experimental results showed that using MVO to train the MLP neural network is competitive. In 2017, Faris et al. [86] applied MVO to machine learning, proposing to use MVO to optimize the Support Vector Machine (SVM) parameters and select the optimal features. The experimental results showed that MVO can effectively reduce the number of features and improve prediction accuracy. Ewees et al. [87] proposed the Chaotic MVO (CMVO) algorithm, which combines chaotic maps with MVO, effectively improves the algorithm’s performance, and was successfully used to solve the feature selection problem. Mirjalili et al. [88] proposed a Multi-Objective MVO (MOMVO) for solving problems in a multi-objective search space. Fathy et al. [89] applied MVO to determine the optimal parameter selection for Proton Exchange Membrane Fuel Cells (PEMFCs) under specific operating conditions. Yilmaz et al. [90] proposed IMVOSA, a hybrid algorithm mixing MVO and the simulated annealing algorithm. In [91], an enhanced version of MVO called EMVO was proposed and used as a task scheduler in the cloud computing environment. Experiments showed that it could effectively improve resource utilization and minimize the makespan. Pothiraj et al. [92] used MVO to optimize 3D IC floor planning.

It is clear that MVO can be improved and applied to real-world problems. However, to the best of our knowledge, there has not been much research on MVO for Kapur’s entropy MIS. Considering all this, this paper addresses the shortcomings of MVO and applies the improved algorithm to COVID-19-CXI segmentation.

This paper proposes the RDMVO algorithm to address the shortcomings of MVO. RDMVO is based on MVO and incorporates the Rosenbrock Method (RM) and the Diffusion Mechanism (DM). To test the performance of RDMVO, this paper first selected IEEE CEC2017 [93] as the benchmark functions and used the Wilcoxon signed-rank test [94] and the Friedman test [95] to compare RDMVO with some mainstream MHAs. The experimental results show that RDMVO effectively improves the global and local search capabilities of MVO as well as the convergence speed and accuracy on most test functions. Its comprehensive performance is better than that of the other MHAs involved in the experiment. Then, at different threshold levels, RDMVO was applied to Kapur’s entropy MIS experiments and compared with some MHAs. Evaluated with the Peak Signal-To-Noise Ratio (PSNR) [96], Structural Similarity Index (SSIM) [97], and Feature Similarity Index (FSIM) [98], three indicators of image segmentation quality, the results show that RDMVO’s performance is satisfactory. Berkeley images (BKIs) and COVID-19-CXIs were selected as the test images. The BKIs are widely used in image processing and are used here to test the comprehensive MIS performance of RDMVO. The COVID-19-CXIs are chest X-rays of patients with confirmed COVID-19, whose segmentation is the problem to be solved in this paper. The MIS experimental results are all satisfactory. In addition, to establish the generality of the algorithm, we use the Friedman test, and the test results also show that RDMVO’s advantage is statistically significant. The main contributions of this paper are as follows:

  • An improved MVO algorithm is proposed, called RDMVO, which significantly enhances the convergence speed, accuracy, and ability to jump out of the local optimum of MVO.

  • Performance comparison experiments between RDMVO and some mainstream MHAs were conducted on IEEE CEC2017. The experimental data reveal that RDMVO’s performance is better than that of the other MHAs.

  • A novel MIS scheme (RDMVO-MIS) is proposed. Kapur’s entropy is used as the objective function, RDMVO determines the threshold, and the image segmentation experiment was successfully carried out on BKIs and COVID-19-CXIs.

  • MIS comparison experiments with some MHAs were carried out. Kapur’s entropy MIS based on RDMVO achieves the best segmentation effect and generality.
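As an illustration of the statistical methodology mentioned above, a Wilcoxon signed-rank comparison between the paired benchmark results of two algorithms can be sketched as follows; the fitness values here are made up for illustration and are not the paper’s experimental data:

```python
# Hedged sketch: a Wilcoxon signed-rank test on the paired best-fitness
# results of two optimizers over ten benchmark runs. The numbers below
# are made-up illustrative values, not the paper's data.
from scipy.stats import wilcoxon

rdmvo_fitness = [1.02, 0.98, 1.10, 0.95, 1.01, 0.97, 1.05, 0.99, 1.03, 0.96]
mvo_fitness = [1.20, 1.15, 1.18, 1.12, 1.25, 1.10, 1.22, 1.17, 1.19, 1.14]

stat, p = wilcoxon(rdmvo_fitness, mvo_fitness)
print(f"statistic={stat}, p-value={p:.4f}")
# A p-value below 0.05 indicates a statistically significant difference
# between the two paired samples.
```

A small p-value here rejects the null hypothesis that the two algorithms perform equally on the paired runs.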

The rest of this paper is organized as follows: Sect. 2 reviews the related work on image segmentation and the improved Kapur’s entropy image segmentation method. Sect. 3 presents the original MVO algorithm. Sect. 4 describes the proposed RDMVO algorithm. In Sect. 5, comparative experiments on benchmark functions and image segmentation experiments are conducted to verify the performance of RDMVO and RDMVO-MIS. Section 6 gives a discussion. Finally, Sect. 7 summarizes the conclusions and directions for future work.

2 Multilevel Threshold Image Segmentation

Thresholding is the most commonly used image segmentation method. In essence, thresholding selects an attribute that divides the gray values of an image into two or more sets, and it is commonly performed on the histogram generated from the image’s gray levels. In an ideal situation, the image is not disturbed by noise: the histogram has two or more peaks, the thresholds are set at the valleys, and the image can be divided into multiple objects and backgrounds as needed. In a real picture, however, the image is disturbed by various kinds of noise, and the gray values no longer reflect the true data. The histogram is then distorted: it may show no clear peak, or several spurious peaks, and setting thresholds at the valleys yields poor or incorrect segmentation results. For this reason, many threshold determination methods have been proposed [99,100,101]. To evaluate threshold selection, Pun [102] introduced the concept of entropy in 1980 and determined the optimal threshold by finding the threshold corresponding to the maximum entropy. Building on Pun’s work, Kapur proposed an improved maximum-entropy threshold segmentation method in 1985, which is simple to calculate and has a good segmentation effect.

The traditional calculation of Kapur’s entropy is based on the image’s one-dimensional gray values and has poor resistance to noise interference. Buades et al. [103] proposed the non-local means denoising method in 2005; the 2D histogram built on it can effectively reduce noise interference and gives a good segmentation effect when the image is polluted by noise. Kapur’s entropy MIS in this paper is based on the non-local means 2D histogram. The specific process is as follows: obtain the grayscale image corresponding to the original image, perform non-local means denoising on it (the result is called the NLM image), and then build the non-local means 2D histogram from the grayscale and NLM images. The maximum of Kapur’s entropy is computed over this 2D histogram to obtain a set of thresholds, and finally the image is segmented according to these thresholds.

Figure 1 shows an image segmentation example in this paper. The specific related content is introduced in the following. In addition, three mainstream image segmentation evaluation methods used in this paper will also be introduced one by one.

Fig. 1 Image segmentation example in this paper

2.1 Kapur’s Entropy

Kapur’s entropy MIS is based on the gray values of the image. We assume the gray value of the image is stored in 8 bits, so it ranges from 0 to 255. Let \(L=256\) and let \({n}_{i}\) represent the number of pixels whose gray value is \(i\); then Kapur’s entropy \(H\) is defined by the following formulas:

$$\begin{array}{c}N=\sum\limits_{i=0}^{L-1}{n}_{i}\end{array}$$
(1)
$$\begin{array}{c}{p}_{i}=\frac{{n}_{i}}{N}\end{array}$$
(2)
$$\begin{array}{c}H=-\sum\limits_{i=0}^{L-1}{p}_{i}ln{p}_{i}\end{array}$$
(3)

where \({p}_{i}\) is the probability of occurrence of gray value \(i\) and \(H\) is Kapur’s entropy. For BIS, the image is divided into two subclasses, \({C}_{0}\) and \({C}_{1}\), with \({C}_{0}=\{0,1,2,\dots ,{t}_{1}-1\}\) and \({C}_{1}=\{{t}_{1},\dots ,L-1\}\). The Kapur’s entropy \({H}_{C}\) is described by the following formulas:

$$\begin{array}{c}{H}_{C}={H}_{{C}_{0}}+{H}_{{C}_{1}}\end{array}$$
(4)
$$\begin{array}{c}{H}_{{C}_{0}}=-\sum\limits_{j=0}^{{t}_{1}-1}\frac{{p}_{j}}{{\omega }_{0}}ln\frac{{p}_{j}}{{\omega }_{0}}\end{array}$$
(5)
$$\begin{array}{c}{\omega }_{0}=\sum\limits_{n=0}^{{t}_{1}-1}{p}_{n}\end{array}$$
(6)
$$\begin{array}{c}{H}_{{C}_{1}}=-\sum\limits_{j={t}_{1}}^{L-1}\frac{{p}_{j}}{{\omega }_{1}}ln\frac{{p}_{j}}{{\omega }_{1}}\end{array}$$
(7)
$$\begin{array}{c}{\omega }_{1}=\sum_{n={t}_{1}}^{L-1}{p}_{n}\end{array}$$
(8)
$$\begin{array}{c}{t}^{\star }=argMax\left({H}_{C}\right)\end{array}$$
(9)

where \({t}^{\star }\) is the threshold at which \({H}_{C}\) takes its maximum value. Similar to BIS, for MIS, suppose the image gray values are divided into \(m\) subsets, \({C}_{0}=\{0,1,\dots ,{t}_{1}-1\}\), \({C}_{1}=\{{t}_{1},\dots ,{t}_{2}-1\}\), \({C}_{2}=\{{t}_{2},\dots ,{t}_{3}-1\}\), …, \({C}_{m-1}=\{{t}_{m-1},\dots ,L-1\}\); then Kapur’s entropy is described by the following formulas:

$$\begin{array}{c}{H}_{C}=\sum\limits_{i=0}^{m-1}{H}_{{C}_{i}}\end{array}$$
(10)
$$\begin{array}{c}{H}_{{C}_{i}}=-\sum\limits_{j={t}_{i}}^{{t}_{i+1}-1}\frac{{p}_{j}}{{\omega }_{i}}ln\frac{{p}_{j}}{{\omega }_{i}}\end{array}$$
(11)
$$\begin{array}{c}{\omega }_{i}=\sum\limits_{j={t}_{i}}^{{t}_{i+1}-1}{p}_{j}\end{array}$$
(12)
$$\begin{array}{c}{{\varvec{t}}}^{\star }=argMax\left({H}_{C}\right)\end{array}$$
(13)
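The multilevel Kapur’s entropy of Eqs. (10)–(12) can be sketched as follows; this is a minimal one-dimensional illustration on a synthetic bimodal histogram, not the paper’s implementation (which operates on the 2D histogram described in the next subsections):

```python
import numpy as np

def kapur_entropy(hist, thresholds, L=256):
    """Sum of the class entropies H_{C_i} induced by `thresholds`
    (Eqs. (10)-(12)); `hist` is the L-bin gray-level histogram."""
    p = hist / hist.sum()                    # p_i, Eq. (2)
    bounds = [0] + sorted(thresholds) + [L]  # class boundaries t_i
    H = 0.0
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        w = p[lo:hi].sum()                   # omega_i, Eq. (12)
        if w <= 0:
            continue                         # empty class contributes 0
        q = p[lo:hi] / w
        q = q[q > 0]                         # convention: 0 * ln 0 = 0
        H -= np.sum(q * np.log(q))           # H_{C_i}, Eq. (11)
    return H

# Toy example: a synthetic bimodal histogram and a single threshold (BIS)
rng = np.random.default_rng(0)
samples = np.concatenate([rng.normal(60, 10, 5000), rng.normal(180, 12, 5000)])
hist = np.bincount(np.clip(samples.astype(int), 0, 255), minlength=256)
best_t = max(range(1, 256), key=lambda t: kapur_entropy(hist, [t]))
print(best_t)
```

Here the exhaustive `max` over 255 candidate thresholds stands in for the MHA; for multiple thresholds the candidate space grows combinatorially, which is exactly why the paper replaces exhaustive search with an optimizer.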

2.2 Non-local Means 2D Histogram

Buades et al. [103] proposed the non-local means denoising method in 2005; this innovative technique uses redundant information for denoising while maintaining the detailed features of the image as much as possible. Assume the original image is \(O\) and the non-local means denoised image is \(N\), and denote the gray value of pixel \(p\) in image \(O\) as \(O(p)\); then, after non-local means filtering, the gray value \(N(p)\) of pixel \(p\) is obtained by the following formulas:

$$\begin{array}{c}N\left(p\right)=\frac{\sum_{q\epsilon O}O\left(q\right)\omega \left(p,q\right)}{\sum_{q\epsilon O}\omega \left(p,q\right)}\end{array}$$
(14)
$$\begin{array}{c}\omega \left(p,q\right)=\mathrm{exp}\left(-\frac{{\left|\mu \left(p\right)-\mu \left(q\right)\right|}^{2}}{{\partial }^{2}}\right)\end{array}$$
(15)
$$\begin{array}{c}\mu \left(p\right)=\frac{\sum_{i\epsilon L\left(p\right)}O\left(i\right)}{m \times m}\end{array}$$
(16)
$$\begin{array}{c}\mu \left(q\right)=\frac{\sum_{i\epsilon L\left(q\right)}O\left(i\right)}{m \times m}\end{array}$$
(17)

where \(\omega \left(p,q\right)\) is the similarity weight of pixels \(p\) and \(q\); \(L(p)\) and \(L(q)\) are the local image patches of size \(m \times m\) centered on pixels \(p\) and \(q\), respectively; \(\mu \left(p\right)\) and \(\mu \left(q\right)\) are the local means of pixels \(p\) and \(q\), i.e., the means of \(L(p)\) and \(L(q)\); and \(\partial\) is the standard deviation.
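A direct, unoptimized transcription of Eqs. (14)–(17) can be sketched as follows; the patch size \(m\) and the parameter \(\partial\) (named `sigma` here) are assumed values, and the quadratic cost restricts it to tiny demo images:

```python
import numpy as np

def nlm_filter(O, m=3, sigma=10.0):
    """Naive transcription of Eqs. (14)-(17): each output pixel is a weighted
    average of all pixels, weighted by the similarity of local m-by-m patch
    means. Quadratic in the pixel count -- for tiny demo images only."""
    height, width = O.shape
    pad = m // 2
    Opad = np.pad(O.astype(float), pad, mode="reflect")
    # mu(p): mean of the m x m window centred on each pixel, Eqs. (16)-(17)
    mu = np.array([[Opad[i:i + m, j:j + m].mean() for j in range(width)]
                   for i in range(height)])
    flat_o, flat_mu = O.astype(float).ravel(), mu.ravel()
    N = np.empty_like(flat_o)
    for p in range(flat_o.size):
        w = np.exp(-((flat_mu[p] - flat_mu) ** 2) / sigma ** 2)  # Eq. (15)
        N[p] = (w * flat_o).sum() / w.sum()                      # Eq. (14)
    return N.reshape(height, width)

img = np.array([[10, 12, 200], [11, 10, 205], [10, 11, 198]], dtype=np.uint8)
print(nlm_filter(img).round(1))
```

Because every output pixel is a convex combination of input gray values, the filtered values always stay within the input range while pixels with similar neighborhoods are averaged together.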

So far, we have the grayscale image \(O\) and the corresponding image \(N\) filtered by the non-local means. Combining them, the abscissa is the gray value of a pixel in the original image \(O\), and the ordinate is the gray value of the same pixel in the filtered image \(N\). This yields the 2D view of the 2D histogram shown in Fig. 2. The non-local means 2D histogram is computed according to the following formula:

Fig. 2 2D view of the 2D histogram

$$\begin{array}{c}{P}_{ij}=\frac{{h}_{ij}}{m \times n}\end{array}$$
(18)

where \(i\) is the gray value of pixel \(O(x,y)\), \(j\) is the gray value of pixel \(N(x,y)\), \({h}_{ij}\) denotes the number of pixels whose gray value pair is \((i,j)\), and \(m \times n\) is the total number of pixels in the image.
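Eq. (18) can be sketched as follows, assuming 8-bit grayscale and NLM images of the same shape:

```python
import numpy as np

def histogram_2d(O, N, L=256):
    """Normalised joint histogram of original and NLM-filtered gray values,
    Eq. (18): P_ij = h_ij / (m * n)."""
    h = np.zeros((L, L))
    np.add.at(h, (O.ravel(), N.ravel()), 1)  # h_ij: count of pairs (i, j)
    return h / O.size

O = np.array([[0, 0], [255, 255]], dtype=np.uint8)   # original gray values
N = np.array([[0, 1], [254, 255]], dtype=np.uint8)   # pretend NLM output
P = histogram_2d(O, N)
print(P[0, 0], P.sum())  # 0.25 1.0
```

Since filtering barely moves most pixels, the mass of \(P\) concentrates near the main diagonal, which is why the entropy in the next subsection is computed over diagonal subregions.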

2.3 Kapur’s Entropy-based 2D Histogram

A 2D view of the 2D histogram is given in Fig. 2. The main diagonal of the 2D histogram contains sufficient image information. This paper therefore takes Kapur’s entropy as the objective function and computes Kapur’s entropy of the subregions along the main diagonal using Eq. (19). The threshold set \(\{{t}_{1},{t}_{2}\dots {t}_{n-1}\}\) that maximizes this entropy is the optimal threshold set.

$$H\left(s,t\right)=-\sum_{i=0}^{{s}_{1}}\sum_{j=0}^{{t}_{1}}\frac{{P}_{ij}}{{P}_{1}}ln\frac{{P}_{ij}}{{P}_{1}}-\sum_{i={s}_{1}+1}^{{s}_{2}}\sum_{j={t}_{1}+1}^{{t}_{2}}\frac{{P}_{ij}}{{P}_{2}}ln\frac{{P}_{ij}}{{P}_{2}}\dots -\sum_{i={s}_{L-2}+1}^{{s}_{L-1}}\sum_{j={t}_{L-2}+1}^{{t}_{L-1}}\frac{{P}_{ij}}{{P}_{L-1}}ln\frac{{P}_{ij}}{{P}_{L-1}}$$
(19)

And \({P}_{1}=\sum_{i=0}^{{s}_{1}}\sum_{j=0}^{{t}_{1}}{P}_{ij}\), \({P}_{2}=\sum_{i={s}_{1}+1}^{{s}_{2}}\sum_{j={t}_{1}+1}^{{t}_{2}}{P}_{ij}\), \({P}_{L-1}=\sum_{i={s}_{L-2}+1}^{{s}_{L-1}}\sum_{j={t}_{L-2}+1}^{{t}_{L-1}}{P}_{ij}\).

2.4 Image Segmentation Evaluation

Practical evaluation of image segmentation experiments is essential. Evaluation is a crucial step that requires benchmarks, sufficient data, and appropriate metrics to be credible [104,105,106]. Many methods for evaluating segmentation results have been proposed, each with advantages and disadvantages. The three most commonly used evaluation methods are PSNR [96], SSIM [97], and FSIM [98]; they will therefore be used to analyze the experimental results in this paper.

  • PSNR. PSNR is a full-reference image quality evaluation index, and it is the most commonly used image quality assessment metric. PSNR relies on MSE to measure the degree of distortion of the image. The calculation of PSNR is expressed as follows [96]:

    $$\begin{array}{c}PSNR\left(f,G\right)=10\times lo{g}_{10}\frac{{L}^{2}}{MSE\left(f,G\right)}\end{array}$$
    (20)
    $$\begin{array}{c}MSE\left(f,G\right)=\frac{{\sum }_{i=0}^{M-1}{\sum }_{j=0}^{N-1}{\left[f\left(i,j\right)-G\left(i,j\right)\right]}^{2}}{M\times N}\end{array}$$
    (21)

    \(M\) and \(N\) are the numbers of rows and columns of the image, respectively, \(f\) represents the original image, and \(G\) represents the segmented image. \(f\left(i,j\right)\) represents the pixel gray-scale value of the original image, and \(G\left(i,j\right)\) is the pixel gray-scale value of the segmented image. \(L\) is the scale range of the image. For an 8-bit image, \(L={2}^{8}-1=255\).

  • SSIM. SSIM is a full-reference image quality evaluation index. Furthermore, it is a widely used image quality evaluation metric based on the assumption that the human eye extracts structured information from an image when viewing it. It calculates the brightness, contrast, and structure comparison functions of the image, respectively, then makes a comprehensive evaluation. Its calculation process is as follows [97]:

    $$\begin{array}{c}SSIM\left(x,y\right)=\frac{\left(2{\mu }_{x}{\mu }_{y}+{C}_{1}\right)\left(2{\sigma }_{xy}+{C}_{2}\right)}{\left({\mu }_{x}^{2}+{\mu }_{y}^{2}+{C}_{1}\right)\left({\sigma }_{x}^{2}+{\sigma }_{y}^{2}+{C}_{2}\right)}\end{array}$$
    (22)

    where \(x\) is an image block of the original image and \(y\) is the corresponding block of the segmented image. \({\mu }_{x}\) and \({\mu }_{y}\) are the means of blocks \(x\) and \(y\), respectively, and reflect the brightness information of the image. \({\sigma }_{x}\) and \({\sigma }_{y}\) are the standard deviations of blocks \(x\) and \(y\), respectively, and reflect the contrast information of the image. \({\sigma }_{xy}\) is the covariance between blocks \(x\) and \(y\) and reflects the structural similarity information of the image.

  • FSIM. FSIM is a relatively new full reference image quality evaluation index. It is based on two major features, Phase Consistency (PC) and Gradient Magnitude (GM). The calculation of FSIM is expressed as follows [98]:

    $$\begin{array}{c}FSIM=\frac{\sum_{X\in\Omega }{S}_{L}\left(X\right)P{C}_{m}\left(X\right)}{\sum_{X\in\Omega }P{C}_{m}\left(X\right)}\end{array}$$
    (23)
    $$\begin{array}{c}{S}_{L}\left(X\right)={S}_{PC}{\left(X\right)}^{\alpha }{S}_{G}{\left(X\right)}^{\beta }\end{array}$$
    (24)
    $$\begin{array}{c}P{C}_{m}\left(X\right)=\frac{E\left(X\right)}{(\epsilon +\sum_{n}{A}_{n}(X))}\end{array}$$
    (25)
    $$\begin{array}{c}{S}_{PC}\left(X\right)=\frac{2P{C}_{1}\left(X\right)P{C}_{2}\left(X\right)+{T}_{1}}{P{C}_{1}^{2}\left(X\right)+P{C}_{2}^{2}\left(X\right)+{T}_{1}}\end{array}$$
    (26)
    $$\begin{array}{c}{S}_{G}\left(X\right)=\frac{2{G}_{1}\left(X\right){G}_{2}\left(X\right)+{T}_{2}}{{G}_{1}^{2}\left(X\right)+{G}_{2}^{2}\left(X\right)+{T}_{2}}\end{array}$$
    (27)
    $$\begin{array}{c}G=\sqrt{{G}_{X}^{2}+{G}_{Y}^{2}}\end{array}$$
    (28)

FSIM couples the PC term and the GM term. \({S}_{L}\) is the similarity score; \(\alpha\) and \(\beta\) are set to 1 by convention. \(P{C}_{m}\left(X\right)\) is \(Max\left(P{C}_{1}\left(X\right),P{C}_{2}\left(X\right)\right)\), and \(\epsilon\) is a very small positive number that prevents the denominator from being 0. \({T}_{1}\) and \({T}_{2}\) are constants, \(E\left(X\right)\) denotes the local energy, and \({A}_{n}\left(X\right)\) is the local amplitude value.
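As a minimal sketch of the first two metrics, PSNR (Eqs. (20)–(21), with the conventional squared peak value) and a single-window simplification of SSIM (Eq. (22)) can be written as follows; the real SSIM averages Eq. (22) over local blocks, and the test images here are synthetic:

```python
import numpy as np

def psnr(f, g, L=255):
    """Eqs. (20)-(21), using the conventional squared peak value L^2."""
    mse = np.mean((f.astype(float) - g.astype(float)) ** 2)
    return float("inf") if mse == 0 else 10 * np.log10(L ** 2 / mse)

def ssim_global(x, y, L=255):
    """Eq. (22) applied to the whole image at once; the full SSIM averages
    this over local blocks. C1, C2 follow the usual (0.01L)^2, (0.03L)^2."""
    C1, C2 = (0.01 * L) ** 2, (0.03 * L) ** 2
    x, y = x.astype(float), y.astype(float)
    mx, my = x.mean(), y.mean()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + C1) * (2 * cov + C2)) / (
        (mx ** 2 + my ** 2 + C1) * (x.var() + y.var() + C2))

rng = np.random.default_rng(1)
f = rng.integers(0, 256, (64, 64))                   # stand-in "original"
g = np.clip(f + rng.normal(0, 5, f.shape), 0, 255)   # mildly distorted copy
print(round(float(psnr(f, g)), 1), round(float(ssim_global(f, g)), 3))
```

Both metrics increase as the segmented image gets closer to the original; identical images give infinite PSNR and an SSIM of 1.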

3 Overview of Original MVO

MVO’s mathematical model and algorithm are based on the multi-verse theory of cosmology’s three concepts of white holes, black holes, and wormholes. According to cosmological theory, white and black holes are two amazing celestial bodies, and their properties are diametrically opposed. White holes only eject matter and energy to the outside and are considered the main part of the universe. Black holes only absorb matter and energy in the universe. Wormholes are tunnels that connect parallel universes, bridging the gap between white and black holes, and allowing objects to move instantly between universes and time-spaces.

In the MVO algorithm, the concepts of white holes and black holes are modeled to represent the exploration process in the search space, and the wormhole model simulates the exploitation process. MVO proposes the concept of the expansion rate and believes that the universe is constantly changing, and objects in the multi-verse are constantly evolving to the most stable universe according to the expansion rate, white holes, black holes, and wormholes. The MVO algorithm specifies the following rules [45]:

  • The greater the expansion rate, the greater the possibility of a white hole, and the lower the possibility of a black hole.

  • A universe with a higher expansion rate sends matter out through white holes.

  • Universes with lower expansion rates receive more matter through black holes.

  • Regardless of the expansion rate, all objects in the universe can move randomly through wormholes toward the optimal universe.

Therefore, MVO starts by generating random universes. In each iteration, objects move via white/black holes from universes with high expansion rates to universes with low expansion rates. Furthermore, objects in any universe can be randomly teleported to the best universe through a wormhole. These processes are repeated until the termination criteria are met.

Based on the above theory, MVO first randomly generates a multi-verse:

$$\begin{array}{c}X=\left[\begin{array}{ccc}{{\varvec{X}}}_{1}^{1}& {{\varvec{X}}}_{1}^{2}& \begin{array}{cc}\cdots & {{\varvec{X}}}_{1}^{d}\end{array}\\ {{\varvec{X}}}_{2}^{1}& {{\varvec{X}}}_{2}^{2}& \begin{array}{cc}\cdots & {{\varvec{X}}}_{2}^{d}\end{array}\\ \begin{array}{c}\vdots \\ {{\varvec{X}}}_{n}^{1}\end{array}& \begin{array}{c}\vdots \\ {{\varvec{X}}}_{n}^{2}\end{array}& \begin{array}{c}\begin{array}{cc}\vdots & \vdots \end{array}\\ \begin{array}{cc}\cdots & {{\varvec{X}}}_{n}^{d}\end{array}\end{array}\end{array}\right]\end{array}$$
(29)

where \(n\) represents the number of universes, each universe represents a candidate solution, \(d\) represents the amount of matter in each universe, and the matter in the universe represents the parameters in the solution. The update of the universe is based on the following formula:

$$\begin{array}{c}{{\varvec{X}}}_{i}^{j}=\left\{\begin{array}{c}\begin{array}{cc}{{\varvec{X}}}_{k}^{j},& {r}_{1}<NI\left({{\varvec{X}}}_{i}\right)\end{array}\\ \begin{array}{cc}{{\varvec{X}}}_{i}^{j},& {r}_{1}\ge NI\left({{\varvec{X}}}_{i}\right)\end{array}\end{array}\right.\end{array}$$
(30)

Among them, \({{\varvec{X}}}_{i}\) is the \(ith\) universe, \(NI\left({{\varvec{X}}}_{i}\right)\) is the normalized expansion rate of the \(ith\) universe, \({{\varvec{X}}}_{i}^{j}\) is the \(jth\) object in the \(ith\) universe, \({{\varvec{X}}}_{k}^{j}\) is the \(jth\) object in the \(kth\) universe, \(k\) is generated by the roulette wheel selection mechanism, and \({r}_{1}\) is a random number in \(\left[\mathrm{0,1}\right]\). As these equations show, white holes are selected by a roulette wheel based on the normalized expansion rate. The lower the expansion rate, the more likely it is that an object will pass through the tunnel of a white hole. Beyond that, there may be wormholes in the universe that randomly alter objects regardless of the expansion rate. This mechanism is expressed as follows:

$${\varvec{X}}_i^j = \left\{ {\begin{array}{*{20}{l}} {\left\{ {\begin{array}{*{20}{l}} {{\varvec{X}}_{best}^j + TDR \times \left( {\left( {ub - lb} \right) \times {r_4} + lb} \right),}&{{r_3} < 0.5} \\ {{\varvec{X}}_{best}^j - TDR \times \left( {\left( {ub - lb} \right) \times {r_4} + lb} \right),}&{{r_3} \geq 0.5} \end{array}} \right.}&{{r_2} < WEP} \\ {{\varvec{X}}_i^j,}&{{r_2} \geq WEP} \end{array}} \right.$$
(31)

Among them, \({{\varvec{X}}}_{i}^{j}\) is the \(jth\) object in the \(ith\) universe, \({{\varvec{X}}}_{best}^{j}\) is the \(jth\) object in the current best universe, \(ub\) and \(lb\) are the upper and lower bounds of the variable, and \({r}_{2}, {r}_{3}, {r}_{4}\) are random numbers in \(\left[0, 1\right]\). \(TDR\) is the travel distance rate, which defines the distance rate around the currently obtained best universe within which an object can be teleported by a wormhole; it decreases over the iterations. Moreover, \(WEP\) is the wormhole existence probability, which defines the probability that wormholes exist in the universe and increases linearly over the iterations. The adaptive formulas of \(\mathrm{TDR}\) and \(\mathrm{WEP}\) are as follows:

$$\begin{array}{c}WEP=min+l\times \left(\frac{\mathrm{max}-\mathrm{min}}{L}\right)\end{array}$$
(32)
$$\begin{array}{c}TDR=1-\left(\frac{{l}^{1/p}}{{L}^{1/p}}\right)\end{array}$$
(33)

where \(\mathrm{min}\) and \(\mathrm{max}\) are the minimum and maximum values of \(WEP\), \(l\) is the current iteration number, and \(L\) is the maximum number of iterations. \(p\) defines the exploitation accuracy over the iterations: the higher \(p\) is, the faster and more accurate the local search. Algorithm 1 shows the pseudo-code of MVO.

figure a
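As a concrete illustration of the wormhole mechanism in Eqs. (31)–(33), the following is a minimal Python sketch (the paper's implementation is in MATLAB; the function names and the default `wep_min`/`wep_max` values here are our own illustrative choices):

```python
import random

def wep(l, L, wep_min=0.2, wep_max=1.0):
    """Wormhole existence probability, Eq. (32): grows linearly with iteration l."""
    return wep_min + l * (wep_max - wep_min) / L

def tdr(l, L, p=6):
    """Travelling distance rate, Eq. (33): shrinks as iterations proceed
    (p = 6 is the value selected by the parameter test in Sect. 5.1.3)."""
    return 1 - (l ** (1 / p)) / (L ** (1 / p))

def wormhole_update(X_i, X_best, lb, ub, l, L, p=6):
    """Eq. (31): with probability WEP, each dimension of universe X_i is
    teleported to a point near the best universe; otherwise it is kept."""
    WEP, TDR = wep(l, L), tdr(l, L, p)
    new = []
    for j, x in enumerate(X_i):
        r2, r3, r4 = random.random(), random.random(), random.random()
        if r2 < WEP:
            step = TDR * ((ub - lb) * r4 + lb)
            new.append(X_best[j] + step if r3 < 0.5 else X_best[j] - step)
        else:
            new.append(x)
    return new
```

Note how \(TDR\) shrinks over iterations, so late-stage wormhole jumps stay close to the best universe, refining exploitation.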

4 Proposed RDMVO

4.1 Rosenbrock Method (RM)

RM is a local search method proposed by Rosenbrock [107] in 1960. It adapts both the search direction and the step size, and searches repeatedly until a relatively optimal solution is found in a limited region. To determine the step size, RM first chooses a random number \(\varepsilon\) as the step size and then checks, by calculation and comparison, whether a better solution is obtained with that step size. If a better solution is obtained, the next step size is \(\varepsilon \times \alpha\) with \(\alpha >1\); otherwise the next step size is \(\varepsilon \times \beta\) with \(0<\beta <1\). To determine the search direction, RM works with an orthonormal basis of the n-dimensional space rather than trying a single direction.
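The expand-on-success, contract-on-failure step-size rule just described can be sketched as follows (an illustrative Python fragment with names and default values of our own; a full RM would also cycle through the orthonormal basis directions rather than a single fixed direction):

```python
def rm_step_size_search(f, x, d, eps=0.1, alpha=3.0, beta=0.5, iters=60):
    """Sketch of the RM step-size rule: take a trial step of size eps along
    direction d; on success multiply eps by alpha (> 1), on failure by
    beta (in (0, 1))."""
    best = f(x)
    for _ in range(iters):
        trial = [xi + eps * di for xi, di in zip(x, d)]
        val = f(trial)
        if val < best:          # better solution found: expand the step
            x, best = trial, val
            eps *= alpha
        else:                   # no improvement: contract the step
            eps *= beta
    return x, best
```

On a simple quadratic such as \(f(x)=(x-2)^2\), the step size grows while progress is made and then collapses once the search overshoots, settling near the minimum along that direction.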

The basic RM is effective for unimodal functions but struggles to escape local optima on multimodal functions. Kang et al. [108] modified the basic RM in 2011 to solve this problem, and Li et al. [109] applied the modified RM to HHO in 2021, demonstrating the mechanism's performance. In this paper, the modified RM is used to enhance MVO's performance. Algorithm 2 shows the pseudo-code of the modified RM.

figure b

In Algorithm 2, \({{\varvec{\delta}}}_{i}=\sqrt{\frac{{\sum }_{k=1}^{n}{\left({{\varvec{x}}}_{ki}-avg{{\varvec{x}}}_{i}\right)}^{2}}{n}}+{\varepsilon }_{1}, i=1,2,\dots ,d\), where \(avg{{\varvec{x}}}_{i}=\frac{{\sum }_{k=1}^{n}{{\varvec{x}}}_{ki}}{n}\). Here, \(n\) is the number of prospect particles, \(avg{{\varvec{x}}}_{i}\) is the average of the prospect particles in the \(ith\) dimension, and \({{\varvec{x}}}_{ki}\) is the value of the \(kth\) prospect particle in the \(ith\) dimension. \({\varepsilon }_{1}=1.0\times {10}^{-150}\) is a very small constant that prevents the initial value from being 0.
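The step size \(\delta_i\) above is simply the per-dimension standard deviation of the prospect particles plus \(\varepsilon_1\); a minimal Python sketch (names are ours):

```python
def step_sizes(prospect, eps1=1e-150):
    """delta_i: standard deviation of the n prospect particles in each
    dimension, plus a tiny eps1 so the initial step size is never zero."""
    n, d = len(prospect), len(prospect[0])
    avg = [sum(p[i] for p in prospect) / n for i in range(d)]
    return [(sum((p[i] - avg[i]) ** 2 for p in prospect) / n) ** 0.5 + eps1
            for i in range(d)]
```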

4.2 Diffusion Mechanism (DM)

The DM is described in the literature [110, 111]; it effectively alleviates the local optimum problem and dramatically improves the chance of finding the global optimum. Diffusion refers to randomly scattering new particles at different locations around the original agent. There are two main methods for generating new particles, Levy flight and the Gaussian distribution [111], applied according to Eqs. (34) and (35), respectively.

$$\begin{array}{c}{{\varvec{x}}}_{i}^{q}={{\varvec{x}}}_{{\varvec{i}}}+{\boldsymbol{\alpha }}_{i}^{q}\otimes Levy\left(\lambda \right)\end{array}$$
(34)
$$\begin{array}{c}{{\varvec{x}}}_{i}^{q}={{\varvec{x}}}_{i}+\beta \times Gaussian\left({{\varvec{P}}}_{i},\left|{\varvec{B}}{\varvec{P}}\right|\right)-\left(\gamma \times {\varvec{B}}{\varvec{P}}-{\gamma }_{1}\times {{\varvec{P}}}_{i}\right)\end{array}$$
(35)

In Eq. (34), \({{\varvec{x}}}_{i}\) refers to the original agent, and \(q\) refers to the number of new particles obtained after diffusion. \(\alpha\) is a variable that controls the convergence speed, and the symbol \(\otimes\) denotes the Hadamard (element-wise) product. In Eq. (35), likewise, \({{\varvec{x}}}_{i}\) refers to the original agent and \(q\) to the number of new particles obtained after diffusion. \(\beta\) equals \((log(t))/t\), where \(t\) is the iteration number. \(\gamma\) and \({\gamma }_{1}\) are random numbers in \(\left[\mathrm{0,1}\right]\), and \({\varvec{B}}{\varvec{P}}\) is the best position of the new particles so far. \(Gaussian\left({{\varvec{P}}}_{i},\left|{\varvec{B}}{\varvec{P}}\right|\right)\) is a Gaussian distribution with mean \({{\varvec{P}}}_{i}\) and standard deviation \(\left|{\varvec{B}}{\varvec{P}}\right|\).

In this paper, following [112], we use the following mathematical model of the DM:

$$\begin{array}{c}{{\varvec{X}}}_{i}=Gaussian\left({\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}},{\varvec{\varepsilon}},{\varvec{m}}\right)+\sigma \times \left({\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}}-{{\varvec{X}}}_{i}\right)\end{array}$$
(36)

where \(Gaussian\left({\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}},{\varvec{\varepsilon}},{\varvec{m}}\right)\) generates a random matrix drawn from a Gaussian distribution. \({\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}}\) and \({\varvec{\varepsilon}}\) are the mean and standard deviation, respectively, and \({\varvec{m}}\) is a vector that determines the shape of the generated matrix. \({\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}}\) denotes the position of the global optimal solution, and \(\sigma\) is a random number in \(\left[\mathrm{0,1}\right]\). \({\varvec{\varepsilon}}\) is calculated as follows:

$$\begin{array}{c}\varepsilon =\frac{log\left(t\right)}{t}\times \left|{{\varvec{X}}}_{i}-{\varvec{B}}{\varvec{e}}{\varvec{s}}{\varvec{t}}{\varvec{p}}{\varvec{o}}{\varvec{s}}{\varvec{i}}{\varvec{t}}{\varvec{i}}{\varvec{o}}{\varvec{n}}\right|\end{array}$$
(37)
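A minimal Python sketch of the diffusion step in Eqs. (36) and (37) (illustrative only: the shape vector \(m\) is implicit in the per-dimension loop, and all names are our own):

```python
import math
import random

def diffuse(X_i, best, t):
    """DM sketch, Eqs. (36)-(37): draw a new candidate from a Gaussian
    centred at the best position, whose standard deviation shrinks as
    log(t)/t, plus a random pull towards the best position."""
    sigma = random.random()                      # sigma ~ U[0, 1]
    new = []
    for xi, bi in zip(X_i, best):
        eps = math.log(t) / t * abs(xi - bi)     # Eq. (37), per dimension
        new.append(random.gauss(bi, eps) + sigma * (bi - xi))
    return new
```

As the iteration counter \(t\) grows, \(\log(t)/t\) decays, so the diffusion radius contracts around the global best.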

The RDMVO algorithm introduces the RM and the DM into the standard MVO algorithm, enhancing the global search ability and local exploration ability of the MVO algorithm and effectively improving the convergence speed and accuracy. Algorithm 3 shows the pseudo-code of RDMVO. The flowchart of RDMVO is shown in Fig. 3.

Fig. 3
figure 3

Flowchart of RDMVO

figure c

4.3 Algorithm Complexity Analysis

Figure 3 shows the main process of RDMVO. The time complexity of RDMVO is determined mainly by the initialization, the RM, the DM, and the iterative updates of MVO, together with three parameters: the maximum number of iterations (\(T\)), the dimension (\(d\)), and the population size (\(N\)). Therefore, the main time complexity of RDMVO is O(RDMVO) = O(initialization) + O(fitness evaluation) + O(MVO iterative updates) + O(RM) + O(DM) ≈ O(\(n\times d\)) + O(\(T\times n\)) + O(\(n\times d\)) + O(\(n\times d\)) + O(\(n\)).

5 Experimental Results and Analysis

A series of experiments was conducted to verify RDMVO's performance. First, we chose the IEEE CEC2017 benchmark functions and selected several contemporary mainstream MHAs for an algorithm comparison experiment; the experimental data show that RDMVO has clear advantages in convergence speed and accuracy. We also conducted parametric experiments on IEEE CEC2017 to choose a better \(p\). Furthermore, we applied RDMVO in conjunction with Kapur's entropy to segment BKIs and COVID-19-CXIs, compared its performance with several mainstream MHAs, and selected PSNR, SSIM, and FSIM as evaluation indicators; these results are also positive. The following sections briefly introduce the experimental procedures and results. All code was written in MATLAB, and all experiments were performed on the same computer; the computer parameters and software versions are given in Table 14.

5.1 Experiment on IEEE CEC2017 Benchmark Functions

This section first introduces the IEEE CEC2017 benchmark functions and the related parameter settings. The parameter set has two parts: the public parameters and the parameters of the algorithms participating in the experiment. Public parameters are unified across all algorithms during the experiment to ensure fairness, as in other computational science works [113], so that only the differences in the algorithms themselves are compared. The public parameter settings are shown in Table 1, where \(N\) is the number of particles in the population, \(D\) is the dimension of the problem, \(MaxFEs\) is the maximum number of evaluations, and \(F\) is the number of independent runs of the experiment; we set \(F\) to 30 to reduce the influence of randomness on the experimental results. In addition, the relevant parameter settings of the MHAs involved in this experiment are given in Table 16.

Table 1 Unified parameter settings

We then analyzed the experimental results from multiple perspectives, starting with the mean and standard deviation. To reduce experimental error, we ran each experiment 30 times independently and recorded the minimum value of the test function obtained by each algorithm, giving 30 results per algorithm on each test function. We computed the mean and standard deviation of these values, recorded as Avg and Std respectively, and used them to evaluate each algorithm's performance. Both should be as small as possible, and the minimum values are shown in bold in the relevant tables. In addition, to verify whether the experimental results are statistically significant, we applied two nonparametric statistical tests with a significance level of 0.05, the Wilcoxon signed-rank test and the Friedman test, to the experimental results.

The symbol “+/−/=” in the related tables summarizes the Wilcoxon signed-rank test results, and ARV is the mean of the Friedman test results over all test functions for each algorithm.
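For reference, the ARV reported throughout can be sketched as the mean Friedman rank of each algorithm over all test functions (a hypothetical Python fragment of our own; ties receive the mean of the tied ranks, and smaller results rank better):

```python
def average_rank_values(results):
    """ARV sketch: results[f][a] is algorithm a's Avg on function f
    (smaller is better). Each function ranks the algorithms, then ARV
    is each algorithm's mean rank over all functions."""
    n_alg = len(results[0])
    totals = [0.0] * n_alg
    for row in results:
        order = sorted(range(n_alg), key=lambda a: row[a])
        ranks = [0.0] * n_alg
        i = 0
        while i < n_alg:
            j = i
            while j + 1 < n_alg and row[order[j + 1]] == row[order[i]]:
                j += 1                            # extend the tie group
            mean_rank = (i + j) / 2 + 1           # ranks are 1-based
            for k in range(i, j + 1):
                ranks[order[k]] = mean_rank
            i = j + 1
        for a in range(n_alg):
            totals[a] += ranks[a]
    return [t / len(results) for t in totals]
```

The algorithm with the lowest ARV has the best overall ranking across the benchmark set.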

5.1.1 Benchmark Functions

IEEE CEC2017 [93] is a relatively new test function set used to evaluate MHA performance, and it is well suited to measuring the benefit of algorithm improvement strategies. The IEEE CEC2017 test functions are listed in Table 15: F1 to F3 are unimodal functions, F4 to F10 are simple multimodal functions, F11 to F20 are hybrid functions, and F21 to F30 are composition functions.

5.1.2 Influence of Two Mechanisms

In Sect. 4, we propose using the RM and DM to enhance the performance of the MVO algorithm, and experiments are needed to determine whether these two mechanisms are effective. We added a single mechanism to MVO to obtain RDMVO1 and RDMVO2, which, together with the original MVO and the new RDMVO, gives four algorithms in total. Table 2 describes each algorithm in detail: the letter Y means the mechanism is included, while N means it is not. For convenience, RDMVO1, RDMVO2, and MVO are called variants of RDMVO. We tested these four algorithms on IEEE CEC2017; the experimental results are shown in Tables 17 and 18 and Fig. 4.

Table 2 Results of the combination of mechanisms
Fig. 4
figure 4

Convergence curves of 9 benchmark functions

The ARV indicator of the Friedman test in Table 17 (the best results are bolded) shows that RDMVO has the best overall ranking across all test functions, outperforming the other variants. The “+/−/=” indicators likewise show that RDMVO outperforms its variants on most test functions; in particular, it outperforms the original MVO on 27 of the 30 test functions. In general, the experimental results demonstrate that combining the RM and DM improves the performance of MVO the most.

Table 18 shows the P values of the experimental results under the Wilcoxon signed-rank test with 5% significance; P values ≥ 0.05 are shown in boldface. As can be seen from the P values in Table 18, RDMVO and RDMVO2 are statistically different on every test function, while RDMVO and MVO are statistically different on every test function except F9 and F17.

Figure 4 shows the convergence curves of RDMVO and its variants on some test functions. As shown in Fig. 4, the convergence speed and accuracy of the variant RDMVO2 are worse than those of the original MVO. RDMVO2 adds only the DM to MVO, which means that adding the DM alone does not improve MVO and may even have a negative effect. Comparing the variant RDMVO1 with MVO, both the convergence accuracy and the convergence speed of RDMVO1 are significantly improved; that is, the RM speeds up the overall global search of MVO and effectively enhances its ability to escape local optima. Finally, comparing RDMVO and RDMVO1, the convergence rate of RDMVO1 is further enhanced by the DM. From the above analysis, it can be concluded that the combined action of the RM and DM has the most powerful effect on MVO: it effectively enhances MVO's global search ability and local exploitation ability, mitigates MVO's tendency to fall into local optima, and improves MVO's search accuracy.

5.1.3 Parameter Test

Sect. 3 introduced the MVO algorithm in detail. The \(TDR\) in Eq. (31) directly affects the algorithm's exploitation efficiency and accuracy, and \(p\) in Eq. (33) is the only parameter that affects \(TDR\), so an appropriate \(p\) must be determined for RDMVO. This section therefore tests multiple values of \(p\) on IEEE CEC2017. For convenience, the algorithm name in this section is RDMVO-\(p\){\(x\)}, meaning that the \(p\) in RDMVO is set to \(x\). Table 3 shows the Friedman test rankings when \(p\) is 4, 5, 6, and 7. From Table 3, among these candidate values RDMVO performs best when \(p\) is 6. Accordingly, we set \(p\) to 6 in all subsequent experiments in this paper.

Table 3 Friedman test ranking on the benchmark functions

5.1.4 Comparison with Other Algorithms

In this part, RDMVO was compared with seven mainstream MHAs on IEEE CEC2017: DE [46], WOA, GWO, HHO [47], BA [46], PSO [46], and MVO [45]. All algorithms were compared under the same conditions, and the experimental results are shown in Tables 4, 19, and 20 and Fig. 9.

Table 4 Friedman test ranking of all algorithms on the benchmark functions

Table 19 displays the performance of RDMVO and the 7 other MHAs on the benchmark functions, with the best results bolded. The “+/−/=” indicators in Table 19 show that RDMVO outperforms its competitors on most test functions: it beats the efficient algorithm DE on 20 test functions and the traditional algorithm PSO on as many as 28. RDMVO outperforms the original MVO on 26 test functions, ties on two, and is weaker on two. According to the ARV metric of the Friedman test, RDMVO has the best overall ranking across all test functions, followed by the DE algorithm.

For a more detailed analysis, the Friedman test ranking of all algorithms on the benchmark functions is shown in Table 4. RDMVO takes first place on F5, F7, F8, F10, F12, F16, F17, F18, F19, F20, F21, and F22, and second place on F1, F4, F6, F9, F11, F13, F14, F15, F23, F24, F26, F27, F28, F29, and F30. This shows that RDMVO performs excellently on multimodal functions and performs well on some hybrid and composition functions.

Table 20 shows the P values of the experimental results under the Wilcoxon signed-rank test with 5% significance; P values ≥ 0.05 are shown in boldface. As can be seen from the P values in Table 20, RDMVO is statistically different from HHO and BA on every test function, and from MVO on every test function except F9. In most cases, RDMVO was statistically significantly different from its competitors.

To visualize the differences, the convergence curves on some test functions (F5, F7, F8, F10, F16, F18, F19, F20, F22) are given in Fig. 9, comparing the convergence speed and accuracy of RDMVO with those of its competitors. RDMVO converges faster than the other algorithms, achieving good convergence results early in the iterations.

5.2 Experiment on Image Segmentation

In this section, we applied RDMVO to image segmentation. First, RDMVO is combined with Kapur's entropy: Kapur's entropy is used as the fitness function, and the position of a universe is treated as a set of thresholds at a given threshold level. The RDMVO logic is slightly adjusted to find the maximum of Kapur's entropy, which is simple to accomplish. Image segmentation and evaluation are then performed with the resulting set of thresholds. The specific experimental process is shown in Fig. 5.

Fig. 5
figure 5

Flowchart of Kapur’s entropy image segmentation based RDMVO
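The fitness function itself can be sketched as follows, assuming a grey-level histogram and a candidate threshold set (an illustrative Python fragment, not the paper's MATLAB code):

```python
import math

def kapur_entropy(hist, thresholds):
    """Kapur's entropy fitness for a threshold set (the quantity RDMVO
    maximises). `hist` is the grey-level histogram; the thresholds split
    the grey range into classes whose entropies are summed."""
    total = float(sum(hist))
    probs = [h / total for h in hist]
    bounds = [0] + sorted(thresholds) + [len(hist)]
    entropy = 0.0
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        w = sum(probs[lo:hi])                    # class probability
        if w <= 0:
            continue                             # empty class contributes nothing
        entropy -= sum(p / w * math.log(p / w)
                       for p in probs[lo:hi] if p > 0)
    return entropy
```

At a threshold level of \(k\), a universe position is a vector of \(k\) grey levels, and this function scores it; the optimizer only needs the histogram, not the full image, which keeps each fitness evaluation cheap.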

Other AI-related publications propose that fair experimental comparisons of two or more computational approaches on a given dataset require allocating the same computing resources to each method [115,116,117,118]. As in Sect. 5.1, to ensure the fairness of the experiment and compare the differences in the algorithms themselves, we set the other experimental parameters uniformly, as shown in Table 5, where \(N\) is the number of particles in the population and \(T\) is the number of iterations. We set the number of experiments \(F\) to 20, so each algorithm was run independently 20 times to reduce the influence of randomness on the results. Table 16 shows the relevant parameter settings of the MHAs used in this experiment.

Table 5 Unified parameter settings

We ran three sets of experiments. The first set was a mechanism comparison experiment on BKIs to verify the effectiveness of the RM and DM. The second set was an algorithm comparison experiment on BKIs: we chose several MHAs that have been applied to image segmentation and compared their segmentation results with RDMVO's to determine whether RDMVO is more advantageous. The third set repeated the algorithm comparison experiments on COVID-19-CXIs; the only difference from the second set is the test image set. In addition, considering the complexity of real scenarios, all experiments used multiple threshold levels to verify the adaptability of the algorithms. Finally, we applied the Friedman test to the results and used PSNR, SSIM, and FSIM as evaluation indicators. All experiments are described in detail below.

5.2.1 Influence of the Two Mechanisms

As in Sect. 5.1.2, a mechanism comparison experiment on image segmentation was performed first. We selected nine complex, representative images from the BKIs as the research objects, shown in Fig. 10. Figure 11 depicts the grayscale images (first column), histograms (second column), and 2D histograms (third column) of these images. The experimental results are shown in Tables 6 and 21 and Fig. 6.

Table 6 Friedman test results for each algorithm on PSNR, FSIM and SSIM
Fig. 6
figure 6

Maximum fitness of RDMVO and variants at different thresholds

The fitness is the maximum Kapur's entropy obtained by segmentation, representing the amount of information in the segmented image. A suitable segmentation method yields a larger amount of information, and as the threshold level increases, the amount of information obtained by a suitable method should also increase. Table 21 gives the fitness values of RDMVO and the variant algorithms at different segmentation levels; the maximum fitness value for each experimental image at each threshold level is bolded. In addition, Fig. 6 shows the maximum fitness over all images for RDMVO and its variants at different threshold levels. In Table 21, as the threshold level increases from 2 to 4, the fitness obtained by RDMVO and the variants increases gradually, which can be observed more directly in Fig. 6, where the abscissa is the test image, the ordinate is the maximum fitness, and curves of different colors represent different threshold levels. The curve for a higher threshold level always lies above the curves for lower levels. This shows that RDMVO and its variants are suitable for image segmentation based on Kapur's entropy.

Table 6 shows the PSNR, FSIM, and SSIM evaluations of the segmentation results of RDMVO and the variant algorithms, together with the corresponding Friedman test results. Here, Avg is the average of the relevant evaluation across all images; the larger the average, the better the overall segmentation result. Rank is the corresponding ranking, and the best Avg and Rank values are bolded. RDMVO achieves the top result for all three evaluations at all threshold levels, except for the FSIM metric at threshold level 2.

In summary, RDMVO is more competitive in the image segmentation experiments; that is, introducing the RM and DM is meaningful.

5.2.2 Comparison with Other Algorithms on Berkeley

In this section, we compared RDMVO with the original MVO and several mainstream algorithms: ACOR, BA, WOA, SSA, DE, AVOA, CLPSO, and HHO. These MHAs have been successfully applied to image segmentation or are relatively new algorithms [18, 81, 119,120,121,122,123,124,125]. The nine test images are shown in Figs. 10 and 11. In addition, we set the threshold levels to 2, 3, 4, and 5 for the experiments.

Tables 22, 23, and 24 show three segmentation quality evaluations at different threshold levels: the mean and standard deviation of the PSNR, SSIM, and FSIM evaluations, respectively. The data come from 20 independent runs of the experiment: Avg is the average over the 20 runs (the higher, the better the corresponding evaluation result), and Std is the standard deviation over the 20 runs (the smaller, the more stable the algorithm). The best mean and standard deviation are bolded in the tables. From Tables 22, 23, and 24, we can see that RDMVO achieves a larger average value and a smaller standard deviation in most cases, showing that RDMVO performs better and is more stable than its competitors.

Further, Tables 22, 23, and 24 were also subjected to the Friedman test; Tables 7, 8, and 9 show the average test results.

Table 7 Overall Friedman test results for each algorithm on PSNR
Table 8 Overall Friedman test results for each algorithm on SSIM
Table 9 Overall Friedman test results for each algorithm on FSIM

PSNR measures the difference between the segmented image and the original image, specifically the degree of distortion: the higher the PSNR, the closer the segmented image is to the original. Table 22 shows the PSNR evaluation results of all algorithms; compared with its competitors, RDMVO obtains a higher PSNR in most cases. Moreover, for the same test image, the PSNR grows as the threshold level increases, indicating that the RDMVO segmentation becomes more accurate. Table 7 gives the average Friedman test result over Table 22, where Avg is the average over all test images and Rank is the ranking of that average; the best Avg and Rank values are bolded. Under the Friedman test, RDMVO achieves first place at all threshold levels, which means the RDMVO segmentation method based on Kapur's entropy is statistically significant and better than its competitors.
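For reference, PSNR can be computed as below (a minimal Python sketch of our own, assuming flattened grey-level images and a peak value of 255):

```python
import math

def psnr(original, segmented, max_val=255.0):
    """PSNR between the original and segmented image, given as flattened
    pixel lists: higher values mean less distortion."""
    mse = sum((o - s) ** 2 for o, s in zip(original, segmented)) / len(original)
    if mse == 0:
        return float("inf")                      # identical images
    return 10 * math.log10(max_val ** 2 / mse)
```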

SSIM evaluates the visual difference between the segmented and original images: the larger the SSIM, the closer the segmented image is to the original visually. Table 23 shows the SSIM evaluation results of all algorithms; compared with its competitors, RDMVO obtains a higher SSIM in most cases. Moreover, for the same test image, the SSIM grows as the threshold level increases, indicating that the RDMVO segmentation better matches human visual observation. Table 8 gives the average Friedman test result over Table 23, with the best results bolded. RDMVO again achieves first place at all threshold levels, so from the SSIM point of view the Kapur's-entropy-based RDMVO segmentation method is also statistically significant.

FSIM evaluates the feature similarity between the segmented image and the original image. The FSIM evaluation results of all algorithms are given in Table 24; as with the previous indicators, RDMVO obtains a higher FSIM than its competitors in most cases, and for the same test image the FSIM grows as the threshold level increases. Table 9 gives the average Friedman test result over Table 24, with the best results bolded. RDMVO again achieves first place at all threshold levels, so from the FSIM point of view RDMVO is also the best.

In addition, Tables 25, 26, and 27 give the Friedman test results of different images under different thresholds for all algorithms. These data are the data basis of Tables 7, 8, and 9.

Figures 12, 13, 14 and 15 show the convergence of all algorithms at threshold levels 2, 3, 4, and 5, respectively. The best value is the average fitness obtained by running each algorithm 20 times independently, and the abscissa is the number of iterations. At all threshold levels, RDMVO achieves the best convergence speed and the highest fitness; its performance is satisfactory. In particular, comparing RDMVO with the original MVO, RDMVO has obvious advantages in convergence speed and optimal value, showing that the proposed algorithm dramatically improves on MVO.

Figure 16 shows the specific thresholds obtained by all related algorithms at threshold level 3 for image 35070, and Fig. 17 shows the corresponding segmented images. Figure 18 shows the segmented images obtained by RDMVO at threshold levels 2, 3, 4, and 5 (first through fourth columns, respectively). Clearly, the segmentation improves as the threshold level increases.
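Producing such a segmented image from an optimized threshold set amounts to mapping each grey level to the index of the class it falls into (an illustrative Python sketch; names are ours):

```python
def apply_thresholds(pixels, thresholds):
    """Map each grey level to its class index: class 0 is below the first
    threshold, class k is at or above the k-th threshold."""
    ts = sorted(thresholds)

    def label(g):
        for k, t in enumerate(ts):
            if g < t:
                return k
        return len(ts)

    return [label(g) for g in pixels]
```

At threshold level 2, for instance, two thresholds yield three classes, which can then be rendered as grey bands or a jet colormap as in Figs. 17 and 18.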

5.2.3 Comparison with Other Algorithms on COVID-19

For the COVID-19 tests, we selected nine medical images: X-ray images of nine patients with COVID-19, clinically recorded between January 21 and March 10, 2020, at a tertiary level-A hospital in eastern China. Our study was approved by the Ethics Committee of the Affiliated Yueqing Hospital of Wenzhou Medical University (Yueqing, China; protocol number 202000002) and complied with the Declaration of Helsinki. Except for the different test image set, the experimental parameters and procedures are the same as in Sect. 5.2.2, so below we give the specific experimental results without further explanation; for the analysis approach, please refer to Sect. 5.2.2. Figure 19 shows the original test images, and Fig. 20 shows the grayscale image, grayscale histogram, and 2D histogram corresponding to each original image.

Figure 7 plots the fitness convergence curves of all algorithms on all images. Vertically, there are 4 columns corresponding to threshold levels 2, 3, 4, and 5, from left to right. The figure shows that at every threshold level, RDMVO converges fastest while also attaining the maximum fitness. Horizontally, there are 9 rows corresponding to the test images 03, 04, 05, 09, 12, 15, 20, 21, and 25, from top to bottom. For all test images, RDMVO far exceeds its competitors at every threshold level. Since the choice of threshold level generally depends on practical needs, this means RDMVO is highly adaptable.

Fig. 7
figure 7

Kapur’s entropy convergence curve of all algorithms at different thresholds

Table 10 shows the PSNR, SSIM, and FSIM results of the different algorithms under the Wilcoxon signed-rank test with 5% significance at different threshold levels. “+” denotes the number of images on which RDMVO is significantly better than the compared algorithm, “−” the number on which it is worse, and “=” the number on which there is no significant difference. At threshold levels 3, 4, and 5, with PSNR, SSIM, or FSIM as the evaluation indicator, RDMVO outperforms the other algorithms on almost all images; the exceptions are one image on which RDMVO is weaker than ACOR at threshold level 3 under PSNR, and one image on which RDMVO is weaker than DE at threshold level 5 under SSIM. At threshold level 2 under PSNR, RDMVO is weaker than the other algorithms on more images, but comparing the numbers of “+” and “−”, RDMVO still performs better than the comparison algorithms on more images.

Table 10 The result of the Wilcoxon signed-rank test with 5% significance at different thresholds

Table 11 contains the average PSNR, SSIM, and FSIM results of all algorithms at different threshold levels after the Friedman test, with the optimal data bolded. As Table 11 shows, under the Friedman test RDMVO ranks first at all threshold levels on all images. The thresholds obtained by all algorithms at all threshold levels on the test images are given in Table 28.

Table 11 PSNR, SSIM, FSIM results of all algorithms at different thresholds

Furthermore, we analyze test image 03 at threshold level 5; Fig. 8 and Tables 12 and 13 give the corresponding experimental results. Figure 8 compares the grayscale and jet-colormap images obtained when each algorithm segments the test image based on Kapur's entropy. Table 12 shows the mean, and Table 13 the standard deviation, of the PSNR, SSIM, and FSIM obtained from 20 independent segmentation runs of each algorithm, with the optimal data bolded. RDMVO has the largest means and the smallest standard deviations, so its all-around performance on image 03 is consistent with its overall performance.

Fig. 8
figure 8

The grayscale image and jet colormap image obtained by all algorithms on image 03

Table 12 PSNR, SSIM, FSIM mean of all algorithms on image 03
Table 13 PSNR, SSIM, FSIM standard deviation of all algorithms on image 03

6 Discussions

This paper proposes an innovative algorithm, RDMVO, based on MVO, and designs an image segmentation scheme, RDMVO-MIS, which performs Kapur's-entropy-based MIS with RDMVO to segment COVID-19-CXIs quickly and accurately. A series of experiments was then carried out to validate three aspects: the general performance of RDMVO, the feasibility of RDMVO-MIS, and the performance of RDMVO-MIS on BKIs and COVID-19-CXIs.

RDMVO introduces two mechanisms into MVO: the RM and the DM. The impact of the two mechanisms is demonstrated in the mechanism comparison experiment on IEEE CEC2017: combined, the RM and DM significantly enhance MVO's ability to escape local optima and improve its solution accuracy. According to the convergence curves on some test functions, the RM mainly improves MVO's ability to escape local optima. Adding the DM alone does not improve MVO and may even have a negative effect, but in the RM-improved MVO, the DM further improves the convergence speed. In conclusion, combining the RM and DM to improve MVO's performance makes sense.

Then, in the performance comparison experiment on IEEE CEC2017 against several mainstream MHAs, RDMVO performed admirably, ranking first among all compared algorithms. RDMVO outperforms the other algorithms particularly on multimodal and hybrid functions. Although RDMVO performs well on composition functions compared with other strong algorithms such as HHO, there is still much room for improvement.

Further, MIS experiments on the BKIs prove that RDMVO-MIS is feasible. Compared with the other MHAs, RDMVO consistently performs better in terms of convergence speed on Kapur’s entropy function, the maximum Kapur’s entropy obtained, and the algorithm’s stability. Moreover, RDMVO outperforms the other MHAs on the three most commonly used image segmentation evaluation indicators: PSNR, SSIM, and FSIM. Finally, we applied RDMVO-MIS to segment COVID-19-CXIs and compared it with the other MHAs; here, too, RDMVO achieves the best performance and segmentation effect. The experiments’ threshold levels were set to 2, 3, 4, and 5, which are relatively small compared with those used in [13, 19,20,21,22,23]; this choice is mainly intended to supplement existing studies with results at small threshold levels. At the same time, it should be noted that RDMVO’s performance at high threshold levels has not been validated, which is something to consider in future work.
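For readers less familiar with the objective that RDMVO-MIS maximizes, the following is a minimal NumPy sketch of Kapur’s entropy for a 256-bin grayscale histogram: the sum of the Shannon entropies of the classes induced by a candidate threshold vector. The histogram and thresholds below are illustrative, not taken from the paper’s experiments.

```python
import numpy as np

def kapur_entropy(hist: np.ndarray, thresholds) -> float:
    """Kapur's entropy of a 256-bin histogram under the given thresholds.

    Larger values correspond to more informative segmentations; a
    metaheuristic searches the threshold vector maximizing this value.
    """
    p = hist / hist.sum()                    # normalize counts to probabilities
    edges = [0, *sorted(thresholds), 256]    # class boundaries in gray levels
    total = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        w = p[lo:hi].sum()                   # probability mass of this class
        if w <= 0:
            continue
        q = p[lo:hi] / w                     # within-class distribution
        q = q[q > 0]
        total += -np.sum(q * np.log(q))      # Shannon entropy of the class
    return total

# Evaluate two candidate threshold vectors on a synthetic histogram.
rng = np.random.default_rng(1)
hist = np.bincount(rng.integers(0, 256, size=10000), minlength=256)
print(kapur_entropy(hist, [64, 128, 192]) > kapur_entropy(hist, [1, 2, 3]))
```

In the RDMVO-MIS scheme, each universe encodes one threshold vector, and this function serves as the fitness the optimizer maximizes (equivalently, the negated value is minimized).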

In summary, RDMVO has the best performance in all experiments, so it can also be applied to many other fields in the future, such as human activity recognition [126], information retrieval services [127,128,129], power flow optimization [130], location-based services [131, 132], endoscope imaging [106], image dehazing [113], image-to-image translation [133], autism spectrum disorder classification [134], disease identification and diagnosis [135, 136], dynamic module detection [137, 138], and pharmacoinformatic data mining [139, 140]. Unlike the paper [17], whose authors verified only the COVID-19-CXI segmentation performance of their improved algorithm and not the algorithm itself, so that its general-purpose performance is difficult to confirm, this paper comprehensively tests RDMVO on the IEEE CEC2017 benchmark functions and the BKIs in addition to the COVID-19-CXI segmentation experiments, and RDMVO’s performance is satisfactory. As a result, practitioners who need to segment image types other than COVID-19-CXIs are strongly encouraged to try RDMVO. Of course, the parameters used in this paper are not suitable for all problems, as every problem has its own character; the parameters should be adjusted to the specific problem to achieve maximum performance. In addition, according to the earlier analysis of RDMVO’s time complexity, the dimension of the problem directly affects its running time. For high-dimensional problems, whether the resulting time cost is acceptable is another question practitioners should consider when using RDMVO.

7 Conclusions and Future Works

As COVID-19 spreads throughout the world, its diagnosis is vital, and chest X-ray image analysis is considered an effective diagnostic means. Experts and scholars worldwide have invested much energy and time in improving the efficiency of COVID-19-CXI analysis. To further improve the effect of image segmentation, a novel improved MVO variant, RDMVO, was proposed and successfully applied to MIS of COVID-19-CXIs. Experimental results show that the proposed RDMVO enhances MVO’s ability to explore globally and escape local optima, and that it outperforms most MHAs on 2D Kapur’s entropy MIS.

There is much work to be done in the future. First, MHAs are a class of algorithms based on probability and randomness; although they are widely used in various fields, rigorous mathematical proofs of their behavior are currently relatively lacking. Such proofs are of great significance for the development of MHAs, and more research is needed to provide them, whether for MVO, RDMVO, or other MHAs. Second, our experiments on RDMVO cover only MIS of BKIs and COVID-19-CXIs. Although images of different diseases have different characteristics, they are all medical images, so RDMVO can still be tried for image segmentation of other diseases. In addition, MHAs have already produced many research results in feature selection and engineering optimization, so applying RDMVO to these tasks is also a future research direction. The most important and urgent work is to apply RDMVO to actual COVID-19 diagnosis as soon as possible, to help overcome COVID-19 and restore the regular operation of human social life.