Advanced RIME architecture for global optimization and feature selection

Abu Khurma, Ruba; Braik, Malik; Alzaqebah, Abdullah; Gopal Dhal, Krishna; Damaševičius, Robertas; Abu-Salih, Bilal

doi:10.1186/s40537-024-00931-8

Advanced RIME architecture for global optimization and feature selection

Research
Open access
Published: 18 June 2024

Volume 11, article number 89, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Big Data Submit manuscript

Advanced RIME architecture for global optimization and feature selection

Download PDF

Ruba Abu Khurma^1,2,
Malik Braik³^na1,
Abdullah Alzaqebah⁴^na1,
Krishna Gopal Dhal⁵^na1,
Robertas Damaševičius⁶^na1 &
…
Bilal Abu-Salih^7,8^na1

808 Accesses
Explore all metrics

Abstract

The article introduces an innovative approach to global optimization and feature selection (FS) using the RIME algorithm, inspired by RIME-ice formation. The RIME algorithm employs a soft-RIME search strategy and a hard-RIME puncture mechanism, along with an improved positive greedy selection mechanism, to resist getting trapped in local optima and enhance its overall search capabilities. The article also introduces Binary modified RIME (mRIME), a binary adaptation of the RIME algorithm to address the unique challenges posed by FS problems, which typically involve binary search spaces. Four different types of transfer functions (TFs) were selected for FS issues, and their efficacy was investigated for global optimization using CEC2011 and CEC2017 and FS tasks related to disease diagnosis. The results of the proposed mRIME were tested on ten reliable optimization algorithms. The advanced RIME architecture demonstrated superior performance in global optimization and FS tasks, providing an effective solution to complex optimization problems in various domains.

Binary Growth Optimizer: For Solving Feature Selection Optimization Problems

An Improved Gannet Optimization Algorithm Based on Opposition-Based Schemes for Feature Selection Problems in High-Dimensional Datasets

Article 10 January 2024

Integrating metaheuristics and artificial intelligence for healthcare: basics, challenging and future directions

Article Open access 12 July 2024

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Introduction

The pursuit of optimal solutions within the expansive and intricate realms of global optimization problems is a critical and central endeavor across a multitude of scientific and engineering domains [1, 2]. These domains, ranging from computer science and operations research to various branches of engineering and applied sciences, are continually faced with challenges that are high-dimensional and multifaceted. The inherent complexity and diversity of problems within these domains necessitate the development and implementation of innovative, sophisticated, and efficient algorithms. These algorithms must be capable of navigating through the vast landscapes of high-dimensional spaces, exploring a myriad of potential solutions, and ultimately converging to solutions that are optimal or near-optimal.

Global optimization problems are characterized by their extensive search spaces and the presence of numerous local optima, making the task of finding the global optimum a highly non-trivial endeavor. The challenges posed by these problems are further compounded by the increasing dimensionality and complexity of the search spaces, requiring algorithms with enhanced exploration and exploitation capabilities. The exploration and exploitation dichotomy is crucial in global optimization, where algorithms must balance between exploring new, unvisited regions of the search space and exploiting known promising regions to refine solutions.

The exploration of nature-inspired algorithms has emerged as a promising and fruitful avenue in addressing the challenges posed by global optimization problems. Nature-inspired algorithms draw insights, principles, and strategies from various natural phenomena, biological processes, and ecological interactions observed in the natural world [3]. By mimicking the adaptive, evolutionary, and cooperative behaviors exhibited by biological entities and ecosystems, these algorithms design sophisticated computational models and search strategies capable of solving complex optimization problems.

Nature-inspired algorithms encompass a diverse array of approaches, each inspired by different aspects of the natural world. Evolutionary algorithms, for example, are inspired by the principles of natural selection and evolution, simulating the processes of selection, crossover, mutation, and reproduction to evolve populations of solutions over generations [4]. Swarm intelligence algorithms, on the other hand, draw inspiration from the collective behaviors of social insects and animal groups, utilizing mechanisms of cooperation, communication, and self-organization to explore and optimize search spaces [5, 6].

The appeal of nature-inspired algorithms lies in their inherent adaptability, flexibility, and robustness. These algorithms are capable of adapting to dynamic and uncertain environments, adjusting their search strategies in response to changing landscapes and problem constraints. The flexibility of nature-inspired algorithms allows them to be applied to a wide range of optimization problems, with the potential for customization and hybridization to suit the specific characteristics and requirements of individual problems. Furthermore, the robustness of these algorithms enables them to handle noise, uncertainties, and imperfections in problem formulations and data, providing reliable and stable performance across different problem instances. The development of nature-inspired algorithms is driven by a continuous quest for innovation and improvement. Researchers and practitioners in the field are engaged in the design and analysis of new algorithms, the enhancement of existing algorithms, and the exploration of hybrid and multi-objective approaches. The advancements in nature-inspired algorithms are fueled by interdisciplinary collaborations, bringing together expertise from computer science, mathematics, biology, physics, and other disciplines to develop more effective and efficient algorithms. The integration of theoretical foundations, empirical studies, and computational experiments is essential in understanding the underlying mechanisms of nature-inspired algorithms and in validating their performance and applicability.

The application of nature-inspired algorithms extends beyond the realm of global optimization to various areas such as machine learning, medical image processing [7], human activity recognition [8], software defect prediction [9], network intrusion detection [10], power scheduling [11], and logistics. In machine learning and data mining, for example, nature-inspired algorithms are employed for FS [12], clustering [13], classification [14], and regression, contributing to the discovery of knowledge and insights from data. In image processing and computer vision, these algorithms are utilized for segmentation, edge detection, object recognition, and enhancement, aiding in the analysis and interpretation of visual information. The versatility and efficacy of nature-inspired algorithms in addressing diverse problems underscore their significance and potential in advancing the frontiers of science and technology.

The RIME (RIME-Ice) Algorithm, the focal point of this article, is a new contribution to the realm of nature-inspired algorithms, deriving its conceptual framework from the intricate processes of RIME-ice formation [15]. RIME-ice, a meteorological phenomenon, occurs when super-cooled water droplets freeze upon contact with surfaces, forming crystalline structures. The algorithm simulates the distinct behaviors of soft and hard RIME formations, leveraging their unique characteristics to optimize search strategies and enhance convergence and solution accuracy in diverse conditions.

In this study, we present RIME: A physics-based optimization approach as a new method for solving continuous optimization and FS problems. Our research aims to evaluate the effectiveness, robustness, and applicability of the RIME algorithm in improving solution quality and computational efficiency in these domains by examining it across continuous and discrete optimization search spaces. The main contributions of this article are summarized as follows:

The article introduces modified RIME (mRIME), which enhances the RIME algorithm’s by adopting chaotic maps for initialization of solutions, crossover for improving exploration phase of the search space and greedy search for enhancing the selection of the best solution and decreasing the bias of the optimizer.
To facilitate the transformation of a continuous search space into a binary one, four distinct types of TFs from the S-shaped, V-shaped, U-shaped, and X-shaped families were chosen for FS problems.
The effectiveness of RIME is examined for global optimization through the use of CEC2011, CEC2017, and FS tasks in relation to applications for disease diagnosis.
The suggested mRIME’s algorithm has been tested compared to ten of the most reliable optimization algorithms. The rival algorithms to the one being described fall into three categories: the most well studied EAs, SFS [16], DE [17], BBO [18], and GA [19]. Some SI that are well-known and reliable are PSO [20], ACO [21], and AMO [22]. The three HBOs that are the most current and successful are TLBO [23], GSK [24], and WSO [25].
The convergence curve, running time, number of selected features, fitness value, specificity, sensitivity, and accuracy are the six standard metrics that were used to assess the mRIME.
A range of statistical numbers were given, including the worst, average, median, standard deviation, and best values. To assess the outcomes and demonstrate the robustness of the proposed mRIME, test methods proposed by Friedman and Holm were applied. Through meticulous development and extensive evaluations, it demonstrates the superior performance and versatility of the algorithm, offering valuable insights and methodologies for researchers and practitioners in the fields of optimization and FS.

Literature review

The discussed articles provide a comprehensive overview of the advancements in optimization algorithms, focusing on nature-inspired algorithms, enhancements to existing algorithms, and their diverse applications in medical, biological, engineering, and energy domains. The development and enhancement of these algorithms are pivotal for navigating through high-dimensional spaces and converging to optimal or near-optimal solutions in various scientific and engineering domains. The applications of these algorithms in real-world problems demonstrate their potential in providing innovative solutions in diverse fields.

Numerous articles examine the creation of algorithms that are modeled after the traits and behaviors of animals and other natural occurrences. For instance, Mohapatra [26] discussed the Golden Jackal Optimization (GJO) algorithm, inspired by the collaborative hunting behaviors of golden jackals, and proposes an enhanced variant incorporating opposition-based learning (OBL) to overcome its disadvantages . Similarly, Houssein introduces the Liver Cancer Algorithm (LCA), a bio-inspired optimization algorithm mimicking liver tumor growth and takeover processes [27]. Abdel-Basset presents the Mantis Search Algorithm (MSA), inspired by the unique hunting behavior and sexual cannibalism of praying mantises [28]. A Chimp-inspired Optimization Algorithm (COA). Remora Optimization Algorithm (ROA) [29, 30]. Arithmetic Optimization Algorithm (AOA) [31, 32]. An efficient Equilibrium Optimizer (SLEO) [33]. These articles highlight the versatility of nature-inspired algorithms in navigating high-dimensional spaces and converging to optimal solutions.

Using a novel PRISMA technique, this research assesses the evolution of the World Wide Open Access (WOA) during the last 5 years and critically analyzes it [34]. Strict inclusion criteria and screening procedures are used to improve the evaluation stage. Effective methods for hybridizing WOA variants are outlined, and 59 enhanced WOA and 57 hybrid WOA variants were chosen. Along with highlighting the dearth of thorough comparisons with earlier WOA variants, the report also provides a graphic representation of the distribution of qualifying WOA variants. There are recommendations for future paths.

To get around some restrictions and boost efficiency, a number of papers suggest improving and changing current algorithms. Hassan introduces a multi-objective variant of the marine predator’s algorithm (MPA), incorporating concepts from Quantum theory to enhance the MPA’s ability to balance between exploration and exploitation [35]. Mehmood combines Archimedes’ optimization algorithm (AOA) with chaotic maps to optimize complex engineering problems [36]. Zhou introduces LASMA, a local dimensional mutation strategy, and an all-dimensional neighborhood search strategy for the slime mould algorithm (SMA) to improve the algorithm’s exploration and exploitation abilities [37]. These enhancements aim to address the limitations of the original algorithms, such as poor exploitation ability and susceptibility to local optima.

The use of optimization algorithms in the biological and medical fields is the subject of several articles. Painul [38], this examines the latest developments in deep learning and machine learning methods for the detection and diagnosis of six different types of cancer: pancreatic, skin, lung, liver, breast, and brain. It analyzes important performance measures on benchmark datasets, including precision, accuracy, area under the curve, sensitivity, and dice score; it ends with research challenges for the future. Yu presents a hybrid model, bERIME_FKNN, for early recognition and timely treatment of Pulmonary Hypertension (PH) [39]. Emam proposes an optimized residual learning architecture for classifying multiple brain tumors, utilizing an improved variant of the Hunger Games Search algorithm (I-HGS) [40]. Chen designs a new wrapper gene selection algorithm, ABHGS, integrating hunger Games search (HGS) with an artificial bee strategy for high-dimensional genetic data [41]. These applications demonstrate the potential of optimization algorithms in addressing real-world problems in medicine and biology, such as disease diagnosis and gene selection.

Optimization algorithms are applied in engineering design and energy areas, as discussed in several articles. Deng develops the snow ablation optimizer (SAO) for numerical optimization and engineering design, focusing on real-world constrained optimization issues in process synthesis and mechanical engineering [42]. Dong introduces the boosting kernel search optimizer (BKSO) to solve the combined economic emission dispatch (CEED) problem in power systems [43]. Zhou presents a boosted atomic search optimization (ASO) with a new anti-sine-cosine mechanism (ASCASO) for parameter estimation of photovoltaic (PV) models [37]. These studies illustrate the versatility of optimization algorithms in optimizing engineering designs and enhancing energy conversion efficiency.

For FS application, there were many metaheuristic algorithms that have been used to tackle this optimization challenge. Fatahi [44], an algorithm for FSS in medical data preparation called the Improved Binary Quantum-based Avian Navigation Optimizer Algorithm (IBQANA) is proposed in this study. To handle less-than-ideal results from binary metaheuristic algorithms, it makes use of the Hybrid Binary Operator (HBO) and the Distance-based Binary Search Strategy (DBSS). HBO transforms continuous values into binary solutions, and DBSS speeds up convergence and improves search agent performance. Twelve medical datasets are used to examine the efficacy of HBO with thresholding and five different TF families. Additionally, IBQANA outperforms all compared algorithms in the detection of COVID-19. The suggested approach offers a possible remedy for the FSS issue in the preparation of medical data.

Based on the starling murmuration optimizer (SMO), a novel binary optimizer technique named BSMO is presented by Nadimi [45]. BSMO is capable of finding the best characteristics and resolving intricate engineering difficulties. To find useful characteristics in medical datasets, it employs two methods: first, it maps each dimension to either 0 or 1, utilizing a configurable threshold; second, it creates binary versions by utilizing multiple S-shaped and V-shaped TFs. Four medical datasets were used to assess the performance of BSMO, and it was contrasted with popular binary metaheuristic algorithms. The BSMO performed better in choosing useful features than rivals such as ACO, BBA, bGWO, and BWOA.

Based on the No-Free-Lunch theorem which demonstrates that optimization problems can be solved using different optimization algorithms with different outcomes each time. Moreover, the same optimization algorithm can be enhanced using various operators and strategies and tested on the same optimization problem and produce new results each time. The stochastic nature of optimization algorithms motivated us to conduct experiments using the recently developed RIME optimizer to test its accuracy, robustness and validity when it is utilized to solve both continuous and discrete optimization problems. The first one was accomplished by testing RIME on global engineering problems and the second one was accomplished by testing RIME on feature selection optimization problems and specifically for disease diagnosis in medical applications. The conversion of the original RIME optimizer was done using different TFs that belong to four families: S-shaped, V-shaped, U-shaped, and X-shaped families. Different enhancement strategies were embedded in original RIME to improve its performance such as crossover operators to enhance exploration phase, positive greedy selection process to choose the best solution, and chaotic functions to initialize solutions.

Methodology

Original RIME algorithm background

The building up of evaporated water in the environment that hasn’t yet solidified leads to RIME-Ice. At temperatures below zero, it succumbs to freezing and adheres to things like branches from trees. Some areas form a winter RIME-Ice landscape because of their distinct topography and weather-related traits. The formation of RIME ice is influenced by a number of factors, including temperature, moisture levels, wind velocity, and atmospheric pressure. While RIME-Ice can grow and expand, its growth is not limitless. External factors and the inherent nature of its formation ultimately halt its expansion, leading to a state of relative stability. To better understand the dynamics of RIME ice formation, consider the hypothetical scenario depicted in Fig. 1. The plane “ABC” represents the RIME formation area, and points ($D_1, D_2, D_3$, and $D_4$) represent the birthplaces sites where RIME ice begins to form. The formation of RIME ice typically falls into two distinct categories: soft RIME and hard RIME. Soft RIME, often associated with gentle breezes, is characterized by its fine, granular structure. The formation of soft RIME is a gradual process, as the wind slowly deposits ice crystals onto the nucleation sites. This delicate RIME ice is often ephemeral, susceptible to the whims of wind and temperature changes. In contrast, hard RIME, a product of strong winds, exhibits a denser, more compact structure. The formation of hard RIME is a rapid process, as the wind forcefully impinges ice crystals onto the nucleation sites. This robust RIME ice can withstand harsher conditions, persisting long after the winds have subsided. Figure 1a depicts a breeze as having low wind speeds, variable direction changes and constant presence at every angle at an identical level. As such, its delicate RIME develops slowly and unpredictably whereas gale winds can be identified by their fast speeds with an almost uniform direction, producing hard RIME growth more rapidly in one or several directions simultaneously. Gale winds can also produce large rainfall amounts in any given location at one or more height levels, producing rapid rainfall amounts with hard RIME formation as in Fig. 1b.

The RIME algorithm is motivated by the Ice-RIME development mechanism and provides a Soft-RIME search approach by simulating the motion of Soft-RIME particles. A Hard-RIME puncture mechanism is also suggested to make use of the algorithm by imitating the behavior of hard-RIME agents in crossover situations. Finally, the metaheuristic algorithm’s selection mechanism is enhanced, and a positive greedy selection mechanism is suggested. By fusing the aforementioned three methods, the RIME algorithm is developed, which has improved performance. Modeling the RIME mathematically: The process of forming each RIME strip in the RIME algorithm involves a detailed examination of various factors such as wind speed, freezing coefficient, the cross-sectional area of the connected material, and the duration of growth. These factors collectively influence the development of each RIME strip.

In contrast, the process of forming a RIME agent from RIME particles is simulated by modeling the progressive activity of each particle. This progression leads to the formation of the final RIME agent, which is akin to a piece of crystal. This simulation approach is inspired by the diffusion-limited aggregation method, commonly used for simulating the aggregation of metal particles. The RIME algorithm is structured into four distinct stages:

1.
Initialization of RIME Clusters: This initial stage sets up the framework for the formation of RIME structures. It involves preparing the initial conditions and parameters that govern the growth and development of the RIME clusters.
2.
Suggested Soft-RIME Search Method: In this stage, a method is proposed for the searching and growth of soft-RIME. This might involve algorithms or mechanisms that simulate the accumulation and adhesion processes characteristic of soft RIME.
3.
Suggested Hard-RIME Puncture Mechanism: This stage deals with the transition of RIME from a soft to a hard state. It proposes a mechanism for the puncture process, which is a critical phase in the formation of hard RIME, known for its denser and more compact structure.
4.
Suggested Greedy Selection Mechanism Enhancement: The final stage focuses on enhancing the selection mechanism, potentially using a greedy approach. This could involve selecting the most optimal or favorable conditions or parameters that facilitate the efficient growth and formation of RIME.

1. Initialization of the RIME group:

The RIME algorithm, which draws its inspiration from reality, views the population of the algorithm as the complete RIME-population and treats each agent’s RIME as the algorithm’s sought agent. At the start, R’s full RIME population is initialized. According to Eq. (1), the population of RIME is composed of n RIME agents ($S_i$), each of which is composed of d RIME particles ($x_{ij}$). As a result, the RIME-particles $x_{ij}$ can accurately reflect the RIME-population R, as demonstrated in Eq. (2).

$$\begin{aligned} R= & {} \begin{pmatrix} s_{1} \\ s_{2} \\ \vdots \\ s_{n} \\ \end{pmatrix}; S_i= [x_{i1} x_{i2} \cdots x_{ij} ] \end{aligned}$$

(1)

$$\begin{aligned} R= & {} \begin{pmatrix} x_{11} &{} x_{12} &{} x_{13}\\ x_{21} &{} x_{22} &{} x_{23}\\ \vdots \\ x_{i1} &{} x_{i2} &{} x_{i3} \end{pmatrix} \end{aligned}$$

(2)

where (i) and (j) represent the ordinal numbers for a RIME agent and particle respectively, and $F(S_i)$ represents each agent’s growth state or fitness value in the metaheuristic algorithm.

2. Soft-RIME searching approach:

1.
Each particle, $x_{ij}$, follows its own set of laws before condensing into soft RIME agents; their ability to wander is subject to external influences and can vary accordingly.
2.
if free-state RIME particles migrate near Soft-RIME agents, they may condense with its particles and alter its stability.
3.
Given that each particle experiences variable condensation levels, the distance between centers of two particles that adhere is not constant.
4.
Inter-particle condensation does not occur if particles move straight outside their escape radius.
5.
As the random condensation process unfolds, the area to which each particle adheres expands, raising the probability of free particle condensation during soft RIME formation. Nevertheless, environmental factors ultimately lead the agent to reach a stable state, halting its growth.

Within the RIME algorithm, the estimation of RIME particle positions is calculated as outlined in Eq. (3), aligning with the five distinct motion traits of RIME particles.

$$\begin{aligned} R_{ij}^{new}=R_{best,j}+r_1 \times \cos \theta \times \beta \times h \times Ub_{ij}-Lb_{ij} +Lb_{ij}, r_2<E \end{aligned}$$

(3)

The RIME algorithm employs a particle movement strategy that incorporates both Soft-RIME and Hard-RIME dynamics. The position of each particle, denoted by $R_{ij}^{new}$, is updated based on its current position i and j, the best RIME-Agent particle $R_{(best,j)}$, and a random parameter h. The direction of particle movement is influenced by the cosine function, which is modulated based on the number of iterations. The environmental component, represented by the term after the number of iterations in Eq. (5), ensures algorithm convergence and mimics the influence of external factors. The degree of adhesion, denoted by h, is a random value between 0 and 1 that controls the spacing between RIME particles.

$$\begin{aligned} \theta =\pi \times \frac{t}{10\times T} \end{aligned}$$

(4)

This iterative process continues until the algorithm’s current iteration count, denoted by t, reaches the maximum allowed iteration count, represented by T.

$$\begin{aligned} \beta =1-\big [\frac{w\times t}{T}\big ]/w \end{aligned}$$

(5)

The default value of w is set to 5, which controls the number of segments in the step function. [.] denotes rounding in this context, where the step function is the mathematical model of the process. Referring back to Eq. (3), the upper and lower bounds of the escape space, represented by the letters $Ub_{ij}$ and $Lb_{ij}$, respectively, define the boundaries of the particle motion’s practical range. E represents the attachment coefficient, which influences an agent’s likelihood of condensing and increases with the number of iterations. It is represented in Eq. (6).

$$\begin{aligned} E =\sqrt{t/T} \end{aligned}$$

(6)

3. Technique for Hard-RIME pierces:

Hard-RIME growth is easier and more dependable in strong gale conditions than soft-RIME growth. The features of a hard RIME are as follows when the RIME particle condenses:

Because the gale is so powerful, other influences are insignificant, which causes several Hard-RIME agents to snowball in the same direction.
Because each RIME agent can readily cross across because the growth direction is the same, this is known as RIME puncture.
Hard-RIME agents, like Soft-RIME agents, become larger as they mature, increasing the likelihood of puncturing between agents under favorable conditions for growth.

Consequently, the puncturing phenomenon and the associated Hard-RIME puncture mechanism enable the algorithm to exchange particles between solutions, thereby improving convergence and enhancing the ability to escape local optima. The particle replacement formula is depicted in Eq. (7).

$$\begin{aligned} R_{ij}^{new}=R_{best,j}, r_3<F^{normr} (S_i) \end{aligned}$$

(7)

where $R_{ij}^{new}$ represents the updated position of the particle and $R_{best,j}$ represents the $j_{th}$ particle of the best RIME-Agent in the RIME population R. The normalized value of the current agent’s fitness value, denoted by $F^{normr} (S_i)$, represents the probability of selecting the $i_{th}$ RIME-Agent. $r_3$ is a random number between (− 1) and (1). Algorithm 2 presents the pseudo-code for the Hard-RIME puncture mechanism.

4. An efficient technique of greedy selection:

In conventional metaheuristic optimization algorithms, the greedy selection mechanism regularly updates and records the best fitness value and the corresponding solution. Following each update, the solution’s updated fitness value is typically compared to the global optimum. If the updated value surpasses the existing global optimum, the optimal fitness value is replaced, and the solution is designated as the new optimum. While this approach is simple and efficient, it does not directly contribute to exploring or exploiting the population as it primarily serves as a record-keeping mechanism.

An aggressive greedy selection strategy is often employed in optimization algorithms to increase global exploration’s effectiveness, using fitness values of agents before and after updates as an indication of effectiveness or proximity of optimal solutions. At each update step, the algorithm compares updated fitness values against their previous fitness values; fitness can serve as a barometer for measuring how effectively agents solve their assigned problem, with those surpassing previous values being replaced or updated both regarding solution and properties to reflect this newfound success.

The implications of this strategy are two folds:

By actively replacing agents with improved versions, the overall quality of the population is raised. Good agents, those with higher fitness values, are consistently maintained in the population, driving the collective towards better solutions.
While this method ensures the population is always stocked with high-performing agents, the dramatic shift in the positions of the population’s agents with each iteration carries a risk. Some agents, as a result of these shifts, may end up performing worse than they did before the update. This degradation can negatively impact the population in the following iteration, as not all changes lead to improvements.

Algorithm 3 displays the pseudo-code of the positive greedy selection mechanism for addressing the minimum value problem.

The recommended RIME algorithm:

The overall organization of the algorithm in pseudo-code is illustrated in Algorithm 4 The algorithm presented here integrates several groundbreaking techniques inspired by the natural phenomena of RIME formation. These techniques contribute to the optimization process in unique ways:

1.
Soft-RIME Search Strategy: This primary optimization strategy draws inspiration from the movement and accumulation of Soft-RIME particles. Characterized by its delicate, crystalline structure, soft RIME embodies a gentle and exploratory approach, particularly in the early stages of the optimization process. This method prioritizes exploration over exploitation, enabling the algorithm to comprehensively traverse the solution space.
2.
Inspired by the crossover behavior of Hard-RIME particles: this mechanism facilitates dimensional crossover exchange between ordinary and ideal agents. Hard-RIME, with its denser and more compact structure, represents a more focused and intensive search strategy. This crossover interchange plays a crucial role in improving the algorithm’s solution accuracy, leading to a more refined and precise optimization process.
3.
Improved Positive Greedy Selection Mechanism: Building upon the traditional greedy selection mechanism, this improved version is designed to expand the population diversity. By actively selecting optimal solutions and constantly refreshing the population, the mechanism aims to avoid premature convergence to local optima. This approach ensures that the algorithm does not settle for suboptimal solutions too early and continues to search for better options, thus maximizing the potential for finding the global optimum.

Each strategy contributes significantly to the algorithm’s overall performance. The soft-RIME search strategy facilitates extensive exploration of the solution landscape. Subsequently, the hard-RIME puncture mechanism implements a more concentrated approach to refine the solutions. Finally, the enhanced positive greedy selection mechanism preserves diversity and hinders stagnation, guaranteeing consistent progress toward the optimal solution. Collectively, these strategies establish a balanced and dynamic optimization process, effectively traversing intricate solution spaces.

Computational complexity of mRIME

The positive greedy selection procedure, the hard-rime puncture system, the soft-rime search approach, and the fitness value computation are the key components of the complexity of the RIME algorithm. First, the soft-rime search method has a complexity issue of ${\mathcal {O}} (n^2)$. Next, in the two extreme situations, ${\mathcal {O}} (n)$ and ${\mathcal {O}} (n^2)$ represent the complexity issue of the hard-rime puncture mechanism. The mechanism of positive greedy selection has a complexity issue of ${\mathcal {O}} (n)$. Ultimately, ${\mathcal {O}} (m *log n)$ represents the complexity issue of the fitness value computation. Hence, ${\mathcal {O}} (RIME) = {\mathcal {O}} ((n + log n) *n)$ represents the overall complexity issue of the RIME method.

Binary RIME for FS

The primary step in adapting the RIME method for a search approach in FS problems involves converting it into a binary format. This conversion is necessary because the original RIME is suited only for continuous optimization challenges. However, FS problems inherently require a binary search space, represented by values of “1” or “0”. This adaptation is crucial for enabling an algorithm initially designed for continuous optimization to tackle binary optimization issues effectively. To achieve this, certain operators within RIME must be modified to create its binary version. This new binary mRIME then outputs results in binary form. The process of transforming RIME into mRIME is detailed in subsection “RIME using different TFs”, and the corresponding objective function is described in subsection “Objective function of the proposed mRIME”.

RIME using different TFs

To maintain the underlying structure of the RIME method while creating a binary version, a TF is introduced. This function determines the probability that an element $y^i$ in RIME’s solution subset will be restricted to binary choices: either selected (“1”) or not selected (“0”). In essence, a value of “1” indicates that the corresponding feature has been included, while a value of “0” implies that the feature has been excluded.

Logistic transformation functions, characterized by their S-shaped curve, are particularly useful for mapping operations due to their ability to produce results within the desired range of [0, 1]. This range is crucial for representing the probability of switching an element in a binary solution between “1” and “0”. Kennedy et al. underscored the significance of this feature in their work [46]. Mirjalili et al. [47] further introduced the V-shaped family of transformation functions, which exhibit comparable performance to the S-shaped family in various tasks. The slope of the transformation function plays a pivotal role in determining the effectiveness of both exploitation and exploration. A flatter or less steep curve may lead to insufficient exploitation and a tendency to get stuck in local optima, while an excessively steep curve can hinder exploration [48].

Subsequently, Mirjalili et al. proposed a U-shaped TF, distinguished by two control parameters, $\eta $ and $\chi $, which govern the slope and the width of the function’s basin, respectively [49]. Recognizing the shortcomings of prevalent TFs in the literature, the study also implemented an X-shaped TF, initially introduced by Ghosh et al. ghosh2020binary to generate binary counterparts of continuous optimization algorithms. This diverse range of TFs facilitates the efficient conversion of continuous solutions into binary representations, effectively tackling various challenges and characteristics of the optimization problems.

This study investigates four distinct TFs from different categories: S-shaped, V-shaped, U-shaped, and X-shaped, to address the absence of a universally acknowledged best TF for FS problems Add a reference to support your claim. These TFs were adapted and evaluated to determine the most effective one when combined with the proposed mRIME and the basic RIME algorithms. Each TF plays a critical role in influencing the probability of updating elements in the binary solution, specifically toggling between “1” and “0”. The study includes visual representations of these TFs, providing a clearer understanding of their operation and impact on the binary solution. This in-depth analysis aims to improve the effectiveness and efficiency of the mRIME and RIME algorithms in the context of FS problems.

The efficacy of the TFs will be assessed in the study’s experimental results section. Four distinct types of TFs from the S-shaped, V-shaped, U-shaped, and X-shaped families were chosen to facilitate the conversion of a continuous search space into a binary one for addressing FS problems. These are briefly described as follows:

S-shaped TFs: This function is represented by the sigmoid function, as defined in Eq. (8) and originally discussed in the work of Kennedy et al. [46]. It is depicted in Fig. 2a and is part of the S-shaped family. The primary role of this function is to convert the search space from a continuous format to a binary one, effectively adapting it for FS tasks. The sigmoid function is known for its characteristic ’S’ shape, which provides a smooth and gradual transition between the binary states, making it a suitable choice for this transformation process.
$$\begin{aligned} T_s\left( v^{i, j}_{t+1}\right) = \frac{1}{1+e^{-v^{i, j}_{t}}} \end{aligned}$$
(8)
In this context, $T_s$ is the transmutation vector, and $T_s\left( v^{i, j}_{t+1}\right) $ represents the probability value generated by the S-shaped TFs. The variables $v^{i, j}_{t}$ and $v^{i, j}_{t+1}$ signify the current and subsequent velocities, respectively, of the ith search agent in dimension j at iterations t and $t+1$. The S-shaped TF, defined in Eq. (8), is a robust function that effectively transforms an unbounded input into a bounded output, mapping any input range to the interval [0, 1]. As depicted in Fig. 2a, the likelihood of modifying the position value within the search space increases as the slope of the S-shaped TF decreases. This characteristic can efficiently update the positions of search agents, thereby facilitating the discovery of optimal solutions. The effectiveness of the S-shaped TF is further enhanced due to its increasing computational speed for determining position values. In this framework, the S-shaped TF in Eq. (8) serves to transform the search agent’s velocity into a probability value. This probability value, in turn, guides the calculation of the next position, $y^{i, j}_{t+1}$, of the solution’s elements. During the subsequent iteration, these elements will either transition to “1” or maintain their current state of “0”. This transition is governed by a widely used stochastic threshold, ensuring that the output of the Sigmoid function preserves its binary nature, as explained in Eq. (9). This approach enables a refined and probabilistic adjustment of the solution’s elements, aligning with the requirements of binary optimization in feature selection (FS) tasks.
$$\begin{aligned} y^{i, j}_{t+1} ={\left\{ \begin{array}{ll} 1 &{} \qquad \;\; \text {if} \;\; r_1 < T_s\left( v^{i, j}_{t+1}\right) \\ 0 &{} \qquad \;\; \text {if} \;\; r_1 \ge T_s\left( v^{i, j}_{t+1}\right) \end{array}\right. } \end{aligned}$$
(9)
In the context provided, $r_1$ is a uniformly distributed random number within the range [0, 1]. The term $y^{i, j}{t+1}$ represents the position of the ith search agent in dimension j at iteration $t+1$, which adopts a new binary value corresponding to the jth dimension of the ith solution at iteration $t+1$. The function $T_s\left( v^{i, j}_{t}\right) $ generates a probability value based on the TF described in Eq. (8). According to Eq. (9), the velocity of the search agents is utilized to calculate the probability of changing their positions. Specifically, if the output from the TF in Eq. (9) is greater than the random value $r_1$, then the position $y^{i, j}{t+1}$ is set to “1”. This indicates that the corresponding feature is considered significant and is selected. Conversely, if the output is less than or equal to $r_1$, the position $y^{i, j}_{t+1}$ is set to “0”, suggesting that the feature is not essential and is hence excluded from consideration. The stochastic nature of the value $r_1$ plays a crucial role in this process. It introduces randomness into the decision-making process, determining whether the value of the solution $y^{i, j}_{t+1}$ will change. This randomness, in combination with the probability value $T_s\left( v^{i, j}_{t+1}\right) $ derived from Eq. (8), drives the update mechanism for the search agents’ positions, ensuring a balanced approach between exploration and exploitation in the FS process. In the scenario where the value of $T_s\left( v^{i, j}_{t+1}\right) $ is low, the likelihood of changing the subsequent iteration value $y^{i, j}_{t+1}$ is also low. However, a critical observation regarding the sigmoid TF, as outlined in Eq. (9), is that its current form may not provide an optimal balance between exploration and exploitation. Ideally, the exploration rate should be higher than the exploitation rate at the early stages of the optimization process. Without this balance, some promising areas of the search space might not be adequately explored, leading to the possibility that the proposed Binary mRIME could become trapped in local optima. This issue is also evident during the exploitation phase. One inherent limitation of the S-shaped family of TFs in some meta-heuristic algorithms is that the update of search agents is dependent on their velocity value. In situations where the velocity value is zero, the search agents should ideally not move. However, in practice, a zero velocity is converted to “1” or “0” with a probability of 0.5, as noted by Ghosh et al. [50]. Attempts have been made by various researchers to rectify this flaw, but they have not been entirely successful in preventing the entrapment in local optima.
V-shaped TFs: Next, we consider the V-shaped TFs, as specified in Eq. (10) and shown in Fig. 2 (b), which was developed by Rashedi et al. [51]. This function is utilized to calculate the probability of altering the position of search agents from a continuous to a binary search space in both the fundamental and proposed FS algorithms. The V-shaped TFs, as its name suggests, have a distinctive V-shaped profile that influences how the probability of changing position values is computed, potentially offering different characteristics in the exploration and exploitation phases compared to the S-shaped TFs.
$$\begin{aligned} T_v\left( v^{i, j}_{t+1}\right) = \left| \frac{2}{\pi } \arctan \left( \frac{\pi }{2}v^{i, j}_{t}\right) \right| \end{aligned}$$
(10)
where $T_v$ is the V-shaped TF, and $T_v\left( v^{i, j}_{t+1}\right) $ identifies the probability of the V-shaped TF of the velocity, $v^{i, j}_{t+1}y$, for the ith search agent at dimension j and iteration $t+1$. As illustrated in Fig. 2b, the V-shaped TF, as delineated in Eq. (10), distinguishes itself from the S-shaped TF outlined in Eq. (8) through its unique structure and rules. The V-shaped TF, characterized by its distinct ’V’ shape, offers a different approach to transforming the continuous solution into a binary one. The process of this transformation utilizes Eq. (11), which effectively converts the continuous solutions derived from Eq. (10) into binary values. This conversion is based on the probability outcomes obtained from the V-shaped TF. The design of this function is such that it addresses certain aspects of optimization that the S-shaped function may not fully capture, especially in terms of the balance between exploration and exploitation in the search space. This approach underscores the importance of selecting appropriate TFs in FS algorithms, as different functions can significantly influence the performance and effectiveness of the algorithm in navigating the search space and avoiding local optima. The V-shaped TF, with its unique characteristics, is thus a critical component in the study’s exploration of efficient FS methodologies.
$$\begin{aligned} y^{i, j}_{t+1} ={\left\{ \begin{array}{ll} \lnot y^{i, j}_{t} &{} \qquad \;\; \text {if} \;\; r_1 < T_v\left( v^{i, j}_{t+1}\right) \\ y^{i, j}_{t} &{} \qquad \;\; \text {if} \;\; r_1 \ge T_v\left( v^{i, j}_{t+1}\right) \end{array}\right. } \end{aligned}$$
(11)
In the given context, $y^{i, j}{t}$ represents the position of the ith search agent in dimension j at iteration t. The term $\lnot y^{i, j}{t}$ is the complement of the solution at this position, meaning it inverts the binary value of $y^{i, j}{t}$. The variable $r_1$ is a uniformly distributed random number between 0 and 1. The function $T_v\left( v^{i, j}{t+1}\right) $ denotes the probability value generated by the V-shaped TF. An important characteristic of the V-shaped TFs, as shown in Fig. 2b, is its symmetrical nature. This symmetry plays a role in how the positions of the search agents are updated. According to Eq. (11), the updating process for the search agents involves flipping their positions, rather than simply assigning them the values of “1” or “0” based on a threshold or probability. This method of position updating differs significantly from other TFs like the S-shaped one, potentially offering a more dynamic approach in the exploration and exploitation phases of the optimization process. This flipping mechanism within the V-shaped TF allows for a more nuanced and flexible response to the search space, as it does not strictly bind the agents to binary extremes but rather provides a probabilistic approach to toggling their positions. This aspect is particularly relevant in complex FS problems, where the ability to adaptively explore and exploit the search space can lead to more effective solutions. The operation of the V-shaped TF is such that if the velocity value of a search agent is high, the agent’s position is switched to its opposite value. Studies, including those by Mirjalili et al. [47], have shown that V-shaped TFs can sometimes outperform S-shaped TFs in terms of efficiency. A key feature of the V-shaped TFs is that they encourage search agents to maintain their current positions when the velocity value is low during an iteration, as noted by Ghosh et al. [50]. Conversely, when the velocity is high, the search agents are induced to switch to their complementary positions. This characteristic has a significant impact on the updating of the positions of search agents and consequently on the identification of the best solution. While the V-shaped TF effectively addresses the issue of meta-heuristics encountering a zero-position value, it still faces challenges about falling into local optima. In essence, problems similar to those observed with the S-shaped TF may persist, leading to a potential imbalance between exploration and exploitation phases in meta-heuristic algorithms. In recognition of these challenges, this work explores the use of other types of TFs to achieve a more favorable balance between exploration and exploitation. Alongside the S-shaped and V-shaped TF, additional U-shaped and X-shaped TFs are employed to transform continuous algorithms into binary formats. The inclusion of these varied TFs aims to enhance the algorithm’s ability to navigate the search space more effectively, reducing the likelihood of getting trapped in local optima and increasing the chances of finding optimal solutions in FS tasks.
U-shaped TFs: The U-shaped TF, as specified in Eq. (12) and developed by Mirjalili et al. [49], is another approach explored in this study. This function is visually represented in Fig. 2c. The U-shaped TF is employed to calculate the probability of altering the positions of search agents from a continuous to a binary search space in both fundamental and proposed algorithms within the study. The U-shaped TF is characterized by its U-shaped curve. This shape influences how the function processes and transforms velocity values into probabilities, which in turn determines the positional updates of the search agents. The distinctive aspect of the U-shaped TF is its ability to provide a different mechanism for balancing exploration and exploitation compared to the S-shaped and V-shaped functions. By implementing the U-shaped TF, the study aims to explore whether this function offers better performance in terms of avoiding local optima and effectively navigating the search space, particularly in complex FS problems. The incorporation of the U-shaped TF into the optimization algorithms reflects an effort to diversify the strategies employed in transforming continuous solutions into binary ones, thereby potentially enhancing the overall effectiveness of the FS process.
$$\begin{aligned} T_u\left( v^{i, j}_{t+1}\right) = \eta \left| \left( v^{i, j}_{t}\right) ^{\chi }\right| \;\;\;\;\; \eta = 1, \chi = 1.5, 2.0, 3.0, 4.0 \end{aligned}$$
(12)
In the U-shaped TF, as outlined in Eq. (12), two crucial control parameters play a significant role: $\eta $ and $\chi $. The parameter $\eta $ is responsible for defining the slope of the function, while $\chi $ determines the width of the curve’s basin. The function $T_u\left( v^{i, j}_{t+1}\right) $ represents the probability of velocity for the solution concerning the search agent i at dimension j and iteration t. The U-shaped TF, with its distinct shape and control parameters, offers a unique mechanism in the transformation process. The parameter $\eta $ adjusts the saturation point of the function, and $\chi $ sets the width of the trough. The speed at which the U-shaped function reaches its saturation point affects the likelihood of flipping a bit in the solution. This characteristic promotes exploration by allowing for rapid variation in variables. A broader U-shaped curve translates to reduced exploratory behavior, while a steeper curve, proportional to the value of $\eta $, enhances exploration, as can be more distinctly seen in Fig. 2c. The values of the continuous solution elements, as given in Eq. (12), can be converted into binary format using Eq. (13). This conversion mechanism leverages the properties of the U-shaped TF to navigate the search space effectively, balancing exploration and exploitation by adjusting the parameters $\eta $ and $\chi $. The U-shaped TF thus contributes to the overall strategy of optimizing FS algorithms by providing a distinct approach to managing the transformation of continuous solutions into binary ones.
$$\begin{aligned} y^{i, j}_{t+1} ={\left\{ \begin{array}{ll} \lnot y^{i, j}_{t} &{} \qquad \;\; \text {if} \;\; r_1 < T_u\left( v^{i, j}_{t+1}\right) \\ y^{i, j}_{t} &{} \qquad \;\; \text {if} \;\; r_1 \ge T_u\left( v^{i, j}_{t+1}\right) \end{array}\right. } \end{aligned}$$
(13)
In the context of the U-shaped TFs, as mentioned, $r_1$ signifies a uniformly distributed random number ranging between 0 and 1, and $T_u\left( v^{i, j}_{t+1}\right) $ denotes the probability value generated by the U-shaped TF. As per Eq. (13), the future position of a search agent, $y^{i, j}_{t+1}$, is determined based on the probability value $T_u\left( v^{i, j}_{t+1}\right) $, as derived from Eq. (12). The role of the random values generated by $r_1$ is pivotal in deciding whether the current solution’s value $y^{i, j}_{t}$ at a given iteration is inverted. Consequently, if the probability value $T_u\left( v^{i, j}_{t+1}\right) $ is small, the likelihood of inverting the value in the next iteration is also minimized. In the context of optimization, the initial phases of iteration prioritize exploration to ensure a comprehensive search of the available space. This stage is crucial for identifying various potential solutions. As the process transitions from exploration to exploitation, the latter becomes vital in the final iterations for pinpointing the most effective solutions. Comparing the U-shaped TF in Fig. 2c with the V-shaped TF in Fig. 2b, its observable that while both have similarities, the U-shaped TF might offer a higher rate of exploration compared to the V-shaped TF. This enhanced exploratory capability could potentially make the U-shaped TF more effective in certain scenarios, particularly where a broader search of the solution space is required. Thus, in the quest for optimal FS, the U-shaped TF could be a superior choice over other TFs, especially in cases where avoiding premature convergence to local optima is crucial.
X-shaped TFs: The study also incorporates an X-shaped TF to address the limitations of the more traditionally used TFs found in the literature. As depicted in Fig. 2d, the X-shaped TF is distinctive in its approach, utilizing two components to generate different outcomes. The process involves comparing these outcomes to the previous solution to determine the best result. If the newly generated solution surpasses the previous solution in terms of effectiveness, it is adopted as the next position. However, if it is not superior, a crossover operation is implemented between the new and former solutions. The crossover operation is designed to combine elements of both solutions, to retain advantageous properties from the previous iteration. The outcome that emerges as the most effective from this crossover process is then selected as the new position. This methodology introduces a new element to the optimization process, providing a mechanism for the new solution to inherit beneficial attributes from the solution of the previous iteration. Such an approach enhances both the exploration and exploitation capabilities of the proposed mRIME. To facilitate these operations, Eqs. (14) and (16) are employed. Notably, Eq. (16) acts as a mirror image of the first, as stated by Ghosh et al. [50]. This mirrored structure of the X-shaped TF allows for a more dynamic and adaptive optimization process, potentially leading to more effective exploration of the search space and a better balance between exploration and exploitation in the quest for optimal solutions.
$$\begin{aligned} T_{y1}\left( v^{i, j}_{t+1}\right) = \frac{1}{1+e^{-v^{i, j}_{t}}} \end{aligned}$$
(14)
$$\begin{aligned} y^{i, j}_{1_{t+1}} ={\left\{ \begin{array}{ll} 1 &{} \qquad \;\; \text {if} \;\; r_1 < T_{y1}\left( v^{i, j}_{t+1}\right) \\ 0 &{} \qquad \;\; \text {if} \;\; r_1 \ge T_{y1}\left( v^{i, j}_{t+1}\right) \end{array}\right. } \end{aligned}$$
(15)
$$\begin{aligned} T_{y2}\left( v^{i, j}_{t+1}\right) = \frac{1}{1+e^{v^{i, j}_{t}}} \end{aligned}$$
(16)
$$\begin{aligned} y^{i, j}_{2_{t+1}} ={\left\{ \begin{array}{ll} 1 &{} \qquad \;\; \text {if} \;\; r_2 > T_{y2}\left( v^{i, j}_{t+1}\right) \\ 0 &{} \qquad \;\; \text {if} \;\; r_2 \le T_{y2}\left( v^{i, j}_{t+1}\right) \end{array}\right. } \end{aligned}$$
(17)
where $y^{i, j}_{1_{t+1}}$ and $y^{i, j}_{2_{t+1}}$ are the binary versions of the solutions produced by Eqs. (14) and (16), respectively, and $r_1$ and $r_2$ are random numbers created within the range [0, 1]. As per Eqs. (15) and (17), the new solutions can be formed as follows:
$$\begin{aligned} \acute{y}^{i, j}_{t+1} ={\left\{ \begin{array}{ll} y^{i, j}_{1_{t+1}} &{} \qquad \;\; \text {if} \;\; fit(y^{i, j}_{1_{t+1}}) < fit(y^{i, j}_{2_{t+1}})\\ y^{i, j}_{2_{t+1}} &{} \qquad \;\; \text {if} \;\; fit(y^{i, j}_{1_{t+1}}) \ge fit(y^{i, j}_{2_{t+1}}) \end{array}\right. } \end{aligned}$$
(18)

where fit represents the fitness function of the FS problems.

In the optimization process involving the X-shaped TF, a critical step is the evaluation and comparison of fitness values of the current and newly generated solutions. Specifically, if the fitness of the new solution, denoted as $\acute{y}^{i, j}{t+1}$, is better than that of the current solution $y^{i, j}{t}$ (i.e., $fit(\acute{y}^{i, j}{t+1}) < fit(y^{i, j}{t})$), then the new solution $\acute{y}^{i, j}_{t+1}$ is adopted for the next iteration.

However, if this condition is not met, a crossover operation is performed between $\acute{y}^{i, j}{t+1}$ and $y^{i, j}{t}$. This crossover process generates two offspring, from which the one with the best fitness is selected as the subsequent solution. This approach allows the ’child’ solution to potentially retain beneficial attributes of the ’parent’ solution $y^{i, j}_{t}$, thereby preserving advantageous characteristics while exploring new possibilities.

The specific type of crossover used in this study is the uniform crossover, as described by Syswerda in 1993 [52]. In uniform crossover, each bit of the offspring is independently chosen from one of the corresponding bits of the parents, offering a more diverse and random mixing of parental features. This method is summarized in Algorithm 6.

The inclusion of the uniform crossover in the optimization process adds a layer of diversity and adaptability to the algorithm. This can be particularly beneficial in avoiding local optima and ensuring a more thorough exploration of the search space, thereby enhancing the overall efficacy of the FS process.

To summarize the process of transforming the continuous RIME into a binary version, the velocity of each search agent is mapped to a probability value within the range [0, 1]. This mapping is accomplished through the use of various equations that represent different TFs. Specifically, Eqs. (8), (10), (12), (14), and (16) correspond to the functions $T_s$, $T_v$, $T_u$, $T_{y1}$, and $T_{y2}$, respectively. Each of these equations defines a different method for transforming the agents’ velocities into probabilities, reflecting the distinct characteristics of the S-shaped, V-shaped, U-shaped, and X-shaped TFs.

Following this mapping, the newly obtained probability values are then utilized to update the positions of the search agents. This is achieved using the corresponding update equations for each TF: (9), (11), (13), (15), and (17). These equations determine the new positions for each agent, effectively converting the continuous search space of the RIME algorithm into a binary format suitable for FS tasks.

In the results section of the study, a comprehensive comparison is made between the different TFs as applied to the basic RIME and the proposed Binary mRIME. The goal of this comparison is to identify the best FS algorithm, considering the effectiveness of each TF in the conversion process. This step is crucial as the performance of the classifier used for FS problems significantly depends on how well the continuous search space is transformed into a binary one. The appropriate selection of a TF, therefore, has a direct impact on the accuracy and efficiency of the FS algorithm.

Objective function of the proposed mRIME

In FS methods, particularly those utilizing a wrapper-based approach, it’s essential to incorporate a learning algorithm to evaluate the efficacy of the selected feature subset. In this study, the

k-Nearest Neighbor (k-NN) classifier, as referenced in the work by Keller et al. [53], is employed for this purpose. The k-NN classifier provides a measure of classification accuracy for the solutions generated by the FS process.

When designing an FS method, two critical aspects must be considered:

1.
How to formulate the solution for the FS problem.
2.
How to assess the quality of this solution.

In this study, the feature subset is represented as a binary vector, with its length equal to the number of attributes in the dataset. This representation allows for a straightforward interpretation of which features are selected (denoted by 1) and which are not (denoted by 0).

FS is inherently a multi-objective optimization problem, striving to achieve two main goals: (i) reducing the number of selected attributes, and (ii) improving the classification accuracy as determined by the k-NN classifier. There is a natural trade-off between these goals: generally, decreasing the number of attributes enhances the model’s simplicity and potentially its generalizability, while maintaining or improving classification accuracy ensures the utility and effectiveness of the selected features.

The best solution in this context is one that balances these two objectives effectively—it should have the fewest number of attributes while achieving the highest possible classification accuracy. To address this multi-objective nature, the FS methods need to integrate these contradictory goals into a single objective function. In this study, this integration is accomplished by formulating a fitness criterion for each solution. This fitness criterion is calculated using the k-NN classifier, as shown in Eq. (19). It quantifies how well a given solution balances the trade-off between attribute reduction and classification accuracy, guiding the iterative process of the mRIME toward finding the most effective feature subset.

$$\begin{aligned} fitness=\alpha \zeta _{k}+\beta \frac{|R|}{|N|} \end{aligned}$$

(19)

In the context of the objective function described in Eq. (19) for FS, several key components and parameters are involved:

$\zeta _{k}$: This represents the classification rate achieved by the k-NN classifier. It is a measure of how accurately the classifier can predict the class labels of the dataset using the selected features. A higher classification rate indicates better predictive performance.
|R| and |N|: These symbols denote the number of selected attributes in the solution vector (|R|) and the total number of native attributes in the dataset (|N|), respectively. The objective is to reduce |R| while maintaining or improving the classification rate $\zeta _{k}$.
$\alpha $ and $\beta $: These are balancing parameters, both ranging from 0 to 1. They are used to assign relative weights to the two aspects of the objective function: the classification rate ($\alpha $) and the selection ratio of the selected attributes ($\beta $). The parameter $\alpha $ specifically weights the importance of the classification rate in the objective function, while $\beta $, being the complement of $\alpha $ (i.e., $\beta = 1 - \alpha $), weights the importance of minimizing the number of selected attributes.

The objective function thus formulated effectively balances the dual goals of maximizing the classification rate (a measure of the effectiveness of the selected features in predicting outcomes) and minimizing the number of features used (a measure of model simplicity and efficiency). The values of $\alpha $ and $\beta $ can be adjusted depending on the specific requirements of the FS task, allowing for flexibility in prioritizing either classification accuracy or model simplicity. This balance is crucial in creating an FS model that is not only accurate but also efficient and interpretable.

In the realm of classification tasks, a variety of classifiers are available, each with its strengths and applications. While the k-NN classifier is chosen in this study for its simplicity and effectiveness, especially in pattern recognition and FS contexts, other significant classifiers are also widely used in different scenarios. These include: Artificial Neural Networks (ANN) as discussed by Bishop [54], are powerful tools for pattern recognition and have the ability to learn complex nonlinear input–output relationships. Bayesian Classifier, as explained by Russell and Norvig [55], is effective in probabilistic classification and is known for handling uncertainty and incomplete data well. Support Vector Machines (SVM) as explored by Ding et al. [56], are renowned for their effectiveness in high-dimensional spaces and are widely used in applications such as image classification and bioinformatics. Decision trees, as described by Rokach and Maimon [57], are simple to understand and interpret, making them popular for tasks where the explanation of the model’s decision is important.

Despite their strengths, classifiers can struggle in situations where patterns from different classes are closely clustered or overlap under complex conditions. The k-NN classifier, as studied by Denoeux [58] and others, is selected in this study due to its non-parametric nature and straightforwardness, making it one of the easiest machine learning classification methods, as highlighted by Qin et al. [59]. Its effectiveness in pattern recognition and data mining has been recognized in various fields, as noted by Shakhnarovich et al. [60], and it often outperforms more advanced classifiers in practice [61].

In the specific context of FS, the k-NN classifier’s ability to provide a clear and direct evaluation of the quality of a feature subset makes it a popular choice. This is evidenced in the works of Braik et al. [62] and Khurma et al. [63], among others. The selection of a base classifier for FS tasks largely depends on the specific requirements and characteristics of the application at hand.

The objective function detailed in Eq. (19) plays a crucial role in evaluating the selected feature subsets in FS tasks. This objective function is designed to create a balance between the number of features selected in the solution vector. It operates under the principle that an effective feature subset should not only be compact (i.e., contain a smaller number of features) but also maintain or enhance the classification accuracy of the model. However, it’s important to note that there’s an inherent dichotomy like this problem. On one hand, the objective function is part of a minimization problem where the goal is to reduce the number of features in the solution vector. Fewer features can lead to a more streamlined and potentially more interpretable model, which is less likely to overfit the training data. This reduction is particularly important in contexts where computational efficiency or model simplicity is valued. On the other hand, the aspect of classification accuracy, which is crucial for the effectiveness of the model, represents a maximization problem. Higher classification accuracy implies that the model is better at making correct predictions, which is the primary goal of most classification tasks. The challenge, therefore, is to maximize classification accuracy while simultaneously minimizing the number of features used.

The objective function in Eq. (19) seeks to address this challenge by integrating both aspects—the minimization of feature count and the maximization of classification accuracy— into a single evaluative criterion. This integrated approach ensures that the selected feature subset is not only compact but also effective in terms of classification performance, striking a balance that is vital for the development of robust and efficient predictive models.

ExpeRIMEntal results

The RIME optimization algorithm’s study claims are generally focused on how well it works, how efficient it is, and how many different optimization problems it can solve. The RIME optimization algorithm makes the following frequent claims, which may be supported by experimental evaluations. We can offer proof of the efficiency and suitability of optimization algorithms for resolving practical issues by carrying out thorough experimental assessments that refute these assertions.

Claim: mRIME achieves higher solution quality than current approaches. ExpeRIMEntal evaluation: Conduct comparative experiments where the mRIME is compared against state-of-the-art methods using benchmark instances. Metrics such as solution quality, convergence rate, and computational time can be used to assess performance.
Claim: mRIME is more resilient to changes in the types of problems that arise. Experimental evaluation: Test the algorithm on a variety of problem instances with different characteristics (e.g., size, structure, complexity). Measure the stability of the algorithm’s performance across these instances and compare it with other methods.
Claim: mRIME can tackle complex optimization issues and is scalable. Experimental evaluation: Measure the algorithm’s scalability in terms of computational time, and solution quality as problem size rises. Assess the algorithm’s performance on progressively larger problem instances.
Claim: “Our algorithm is efficient and converges to near-optimal solutions quickly. Experimental evaluation: Analyze the algorithm’s convergence behavior by monitoring the evolution of the solution quality or objective function values over time. On benchmark examples, compare the convergence speed with alternative techniques.
Claim: mRIME can be used to solve optimization issues in the real world. Experimental evaluation: Test the algorithm using real-world datasets or examples of issues that are pertinent to particular fields such as engineering, and FS problems. Examine its functionality and usefulness in these real-world situations.
Claim: Better trade-offs between processing resources and solution quality are made by mRIME. Experimental evaluation: Test the trade-offs between computing resources, such as runtime, and solution quality for the suggested mRIME in comparison to other methods. This can entail examining the answers obtained by running the algorithm with different computational budgets.
Claim: mRIME is flexible and may be tailored to many variants of problem domains. Experimental evaluation: Test the algorithm with a range of problem samples from different domains, with varying constraints or problem formats. Examine how consistently and adaptably it performs in these different environments.

Evaluation metrics

In the proposed algorithm’s evaluation, six standard metrics were employed: accuracy, specificity, sensitivity, fitness value, running time, number of selected features, and convergence curves. With these measurements, we were able to gauge its efficiency and effectiveness for continuous grid optimization (global optimization) and binary optimization (FS) in the context of disease diagnosis applications.

TP: Correct classification as a disease.
TN: Correct classification as a non-disease.
FP: misclassification as a disease.
FN:misclassification as a non-disease.

Accuracy metric alone may not suffice when evaluating a classification algorithm’s performance when dealing with imbalanced datasets; in such cases, additional metrics like Specificity, Sensitivity, and F-measure, and Convergence Curves may provide a more comprehensive evaluation of its efficacy.

Accuracy in machine learning is often used as the standard metric to assess classification algorithms’ success, serving as an indication of the overall correctness of prediction made by these programs, such as representing the percentage of instances correctly classified by an algorithm. To calculate accuracy simply needs to divide the number of correctly classified instances by the total count in their dataset. Accuracy can be represented mathematically as in Eq. (20):
$$\begin{aligned} Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \end{aligned}$$
(20)
Sensitivity, referred to as True Positive Rate or Recall. It calculates the number of correctly classified attack files out of an entire dataset as in Eq. (21):
$$\begin{aligned} Sensitivity = \frac{TP}{TP+FN} \end{aligned}$$
(21)
Specificity, referred to as True Negative Rate. It measures the percentage of actual negatives identified correctly by the classifier as in Eq. (22):
$$\begin{aligned} Specificity = \frac{TN}{TN+FP} \end{aligned}$$
(22)
F-Measure is an often-utilized metric for gauging the overall performance of classification algorithms when the cost of false positives and false negatives differ significantly. A higher F-measure indicates better performance with 1 being representative of perfect precision and sensitivity. It measures precision and recall by taking their harmonic mean values of precision and sensitivity, representing it mathematically using Eq. (23):
$$\begin{aligned} F-measure = \frac{2 * precision * recall}{precision + recall} \end{aligned}$$
(23)
Fitness Values are used to evaluate the quality of solutions produced by optimization algorithms, particularly feature selection techniques. They serve as a gauge to indicate which solutions produce quality solutions; in terms of classification accuracy vs number of selected features this value represents a trade-off between classification accuracy and feature selection frequency, higher values indicate better solutions.
Running time is an invaluable metric of an algorithm’s computational efficiency, measured in seconds to represent its total execution period which includes FS and classification tasks.
Number of selected features serves as an important evaluation metric for FS algorithms, reflecting their ability to reduce dimensionality thereby having a direct bearing on model complexity, interpretability, and generalization performance.
Convergence curves provide insight into an algorithm’s optimization process and convergence behavior, representing its optimization iterations or generation number on one axis while fitness value on another (y-axis) remains constant over time (the x-axis represents several iterations or generation), helping analyze speed, stability, and ability to escape local optima on another (x-axis represents iterations or generation count on another axis and fitness value on third). The convergence curve helps evaluate speed stability and ability to escape local optima).

Controlling parameter setup

The suggested mRIME’s findings have been evaluated with 10 of the most reputable optimization algorithms in the appropriate field of research when examined on the aforementioned test suites to support its overall performance and thorough evaluation. The competing algorithms to the one that is being provided can be classified into three groups: (i) GA [19], DE [17], BBO [18] and SFS [16] as the most studied EAs, (ii) PSO [20], ACO [21] and AMO [22] as hot and reliable SI algorithms, WSO [25] and (iii) TLBO [23] and GSK [24] as efficacious and recent human-based optimizers. The control parameters and settings of RIME and other competing algorithms are shown in Table 1.

Table 1 Setting up the parameters for mRIME and other search algorithms

Advanced RIME architecture for global optimization and feature selection

Abstract

Similar content being viewed by others

Binary Growth Optimizer: For Solving Feature Selection Optimization Problems

An Improved Gannet Optimization Algorithm Based on Opposition-Based Schemes for Feature Selection Problems in High-Dimensional Datasets

Integrating metaheuristics and artificial intelligence for healthcare: basics, challenging and future directions

Explore related subjects

Introduction

Literature review

Methodology

Original RIME algorithm background

Computational complexity of mRIME

Binary RIME for FS

RIME using different TFs

Objective function of the proposed mRIME

ExpeRIMEntal results

Evaluation metrics

Controlling parameter setup

ExpeRIMEnts series1: global optimization using CEC 2017 test suit functions

Experiments series1: Applying mRIME for global optimization using CEC2017 test suit functions

ExpeRIMEnts series2: applying mRIME for global optimization using CEC2011 test suit functions

Statistical test analysis

Evaluation FS results

Convergence curves

Statistical test

Final remarks and upcoming projects

Availability of data and materials

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation