1 Introduction

With the rapid development of Internet technologies, vast amounts of data are being generated and accumulated in numerous fields such as social media (Hu et al. 2022a), financial services (Huang and Tsai 2009), and telecommunication applications (Abdel-Basset et al. 2021a). Because these data contain a considerable amount of irrelevant and redundant information, data preprocessing becomes necessary to improve the efficiency of data acquisition. As a common and effective data preprocessing method, feature selection has received wide attention and has been applied in several fields, such as data mining (Hichem et al. 2019), medical diagnosis (Georges et al. 2020), and pattern recognition (Hu et al. 2022a). Feature selection is defined as the process of eliminating redundant features and selecting the optimal feature subset to describe the problem (Sadeghian et al. 2021). This process involves two components: an evaluation criterion, which measures the quality of candidate feature subsets to guide the search, and a search approach, which explores the search space for the optimal feature subset. Concerning the evaluation criterion, three techniques are used, i.e., filter-based, wrapper-based, and embedded techniques (Senawi et al. 2017). The filter-based technique evaluates the quality of feature subsets based on statistical dependencies in the data (Sadeghian et al. 2021). The wrapper-based technique assesses feature subsets with a classifier (Xue et al. 2015). The embedded technique selects feature subsets during the training process of the classifier (Nguyen et al. 2020). Compared to the other two techniques, the wrapper-based technique achieves better results, although it requires additional computational resources (Lin et al. 2014).

From an optimization perspective, feature selection is a hard combinatorial optimization problem (Hu et al. 2020); the key task is to select the best feature subset using a search approach, such as complete, random, or heuristic search. With a complete search, the best feature subset is selected among the \(2^{N}\) possible subsets of an original dataset with \(N\) features, so the computational burden grows exponentially with the number of features (Guha et al. 2020). A second approach is random search, in which better subsets are progressively produced over iterations (Aljarah et al. 2018). The third approach is heuristic search, which mainly comprises the sequential forward method and the sequential backward method (Hu et al. 2020).

Recently, owing to their simplicity and easy implementation, meta-heuristic algorithms have aroused widespread interest. According to their design paradigms, these algorithms are categorized into four groups: (1) evolution-based algorithms, (2) physics-based algorithms, (3) swarm intelligence-based algorithms, and (4) human-based algorithms. The first category is inspired by natural selection. For example, the genetic algorithm (GA) is a well-known evolution-based algorithm that mimics the processes of biological evolution (Siedlecki and Sklansky 1993). Similarly, the differential evolution (DE) algorithm consists of three steps, i.e., mutation, crossover, and selection (Deng et al. 2021); the direction of the optimization search in DE is guided by cooperation and competition among individuals.

The second category simulates various physical phenomena and laws. For instance, the simulated annealing (SA) algorithm, a physics-based algorithm, is inspired by the melting and cooling processes of metallurgical materials (Chantar et al. 2021). The gravitational search algorithm (GSA) is inspired by the law of gravity (Mittal et al. 2020). Moreover, the momentum search algorithm (MSA) is based on two physical laws, i.e., the momentum conservation law and the kinetic energy law (Dehghani and Samet 2020). Other algorithms in this category are Henry gas solubility optimization (HGSO) (Hashim et al. 2019), the spring search algorithm (SSA) (Dehghani et al. 2020) and the flow direction algorithm (FDA) (Karami et al. 2021).

The third category is inspired by the movements and hunting behaviors of animals. For instance, the particle swarm optimization (PSO) algorithm is a famous optimization method that imitates the foraging behaviors of bird flocks (Sharkawy et al. 2011). The grey wolf optimizer (GWO) simulates the cooperative hunting behaviors of wolves (Panda et al. 2019). The whale optimization algorithm (WOA) mimics the bubble-net foraging behavior of humpback whales in three steps, i.e., encircling the prey, bubble-net attacking, and searching for the prey (Gharehchopogh and Gholizadeh 2019). Other swarm intelligence-based algorithms include the African vultures optimization algorithm (AVOA) (Abdollahzadeh et al. 2021), golden jackal optimization (GJO) (Chopra and Ansari 2022), the jellyfish search optimizer (JSO) (Chou and Truong 2021), the snake optimizer (SO) (Hashim and Hussien 2022), and the artificial hummingbird algorithm (AHA) (Zhao et al. 2022).

The final category is inspired by human behaviors. A representative example is the simple human learning optimization algorithm, which is based on human learning mechanisms (Wang et al. 2014). The poor and rich optimization algorithm simulates the behaviors of the rich and the poor as they accumulate wealth and improve their economic situations (Moosavi and Bardsiri 2019). Other algorithms in this category include the teamwork optimization algorithm (Dehghani and Trojovský 2021), past present future (Naik and Satapathy 2021) and the coronavirus herd immunity optimizer (CHIO) (Al-Betar et al. 2021).

Nevertheless, most meta-heuristic algorithms address only continuous optimization problems, whereas feature selection is a discrete optimization problem. Therefore, corresponding discrete algorithms, especially binary algorithms, must be designed to perform feature selection. As a novel meta-heuristic algorithm inspired by the basic arithmetic operators, the arithmetic optimization algorithm (AOA) performs well on continuous optimization problems (Abualigah et al. 2021). Consequently, binary arithmetic optimization algorithms (BAOAs) are proposed in this paper to perform feature selection. The main contributions of this paper are as follows:

  • Multiple BAOAs utilizing different strategies are proposed to perform feature selection.

  • Six algorithms are formed based on six different transfer functions that convert the continuous search space into a discrete one. Moreover, to enhance the search speed and the ability to escape from local optima, six further algorithms are developed by integrating the transfer functions with Lévy flight.

  • Based on 20 common UCI datasets, the performance of the proposed algorithms is evaluated, and the results illustrate that BAOA_S1LF is the best among all the proposed algorithms. Furthermore, the performance of BAOA_S1LF is compared with that of other meta-heuristic algorithms on 26 UCI datasets, and the results demonstrate that BAOA_S1LF outperforms them in feature selection.

The rest of this paper is organized as follows: Sect. 2 presents a literature review of the application of meta-heuristic algorithms for feature selection. Section 3 introduces the AOA. Section 4 presents the proposed BAOAs utilizing different strategies. Section 5 compares the results of our proposed algorithms and other meta-heuristic algorithms on the UCI datasets. Section 6 concludes the paper.

2 Literature review

Over the previous decades, meta-heuristic algorithms have been utilized as search strategies in feature selection and have demonstrated superior efficiency compared to exact methods. Among evolution-based algorithms, a binary DE integrating a mutation operator, a one-bit purifying search operator, and an efficient non-dominated sorting operator has been developed to tackle feature selection (Zhang et al. 2020). Xue et al. (2021) propose a multi-objective binary GA integrating an adaptive operator selection mechanism (MOBGA-AOS), and the experimental results on 10 datasets reveal that it performs well. Among physics-based algorithms, a binary version of the multi-verse optimizer (MVO) has been introduced to select the optimal feature subset (Hans and Kaur 2020b). Neggaz et al. (2020) adopt the HGSO to select significant features and improve classification accuracy. Guha et al. (2020) propose a new approach based on the GSA, where a clustering technique is used to overcome premature convergence.

Concerning swarm intelligence-based algorithms, a new variant of GWO with an improved spread strategy and a chaotic local search (CLS) mechanism is proposed in Hu et al. (2022b) to select the best feature subset; the spread strategy enhances the agents' ability to avoid local optima, their global exploration capability, and the randomness of individual movements, while the CLS mechanism accelerates the convergence of the evolving agents. Agrawal et al. (2020) propose a quantum WOA for feature selection, in which modified mutation and crossover operators are applied to the quantum-based exploration, shrinking, and spiral movements of whales. Ouadfel and Abd (2020) propose a new approach based on the crow search algorithm (CSA) with a global search strategy and an adaptive awareness probability. Hu et al. (2020) analyze the parameter ranges and introduce a new updating formula for these parameters to balance global and local search in the proposed binary GWO (BGWO). Mafarja et al. (2018) propose a wrapper-based feature selection approach based on the binary dragonfly algorithm (BDA), into which eight transfer functions are integrated; the experimental results on 18 datasets illustrate that BDA outperforms the compared approaches. Based on the sine cosine algorithm (SCA) and the ant lion optimizer (ALO), a hybrid sine cosine ant lion optimizer (SCALO) is proposed to tackle feature selection, and the experimental results indicate that SCALO performs better than the compared algorithms (Hans and Kaur 2020a). Chaudhuri and Sahu (2021) propose a novel feature selection approach based on the CSA, in which a time-varying flight length strategy is adopted to avoid being trapped in local optima. Djellali et al. (2018) investigate two hybrid versions of the artificial bee colony (ABC) algorithm, combined with PSO and GA, to solve feature selection; the use of particles contributes to the effectiveness of the ABC algorithm, and mutation operators are applied in the onlooker and scout stages. In Hu et al. (2022a), the slime mould algorithm (SMA) is embedded with a dispersed foraging strategy and a transfer function for feature selection.

For human-based algorithms, Allam and Nandhini (2022) propose a new feature selection approach based on the teaching–learning based optimization algorithm, and the experimental results show that this approach achieves high classification accuracy with a minimum number of features on the Wisconsin diagnosis breast cancer dataset. Alweshah et al. (2022) propose two feature selection approaches based on the CHIO and a greedy crossover operator, and the experimental results reveal that the adopted strategy improves the performance of the CHIO in feature selection. Furthermore, Table 1 summarizes some existing approaches, from which the following observations are made:

  • Previous studies are often inefficient at identifying the optimal feature subset with high classification accuracy.

  • The test datasets used in previous studies lack diversity.

  • Only a few, or outdated, algorithms are compared in previous studies.

Table 1 Comparative analysis of existing approaches for feature selection

Considering the aforementioned points, this study proposes multiple binary versions of the AOA for feature selection. First, six algorithms are formed by converting the continuous search space into a discrete one with different transfer functions. Second, six further algorithms are developed by integrating the transfer functions with Lévy flight to enhance the search speed and the ability to escape from local optima. The proposed algorithms are compared on UCI datasets, and the best-performing one is then compared with other meta-heuristic algorithms.

3 AOA

As a novel meta-heuristic algorithm, the AOA is motivated by the basic arithmetic operators, i.e., addition, subtraction, multiplication, and division (Khatir et al. 2021). The search process of the AOA consists of two phases, exploration and exploitation, and the phase executed at each iteration is chosen as follows:

$$ \begin{cases} \text{execute the exploration phase}, & r_{1} > MOA\left( t \right) \\ \text{execute the exploitation phase}, & \text{otherwise} \end{cases} $$
(1)

where \(r_{1}\) follows a uniform distribution in (0, 1), \(t\) is the current iteration, and the math optimizer accelerated (MOA) function is a coefficient calculated as follows:

$$ MOA\left( t \right) = Min + t \times \frac{Max - Min}{T_{\max}} $$
(2)

where \(T_{\max}\) denotes the maximum number of iterations, and \(Max\) and \(Min\) represent the maximum and minimum values of MOA, respectively.
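As a brief illustration of Eqs. (1)-(2), the Python sketch below computes the MOA schedule and the phase decision; the values Min = 0.2 and Max = 1.0 are assumptions, not fixed by this section.

```python
import random

def moa(t, t_max, moa_min=0.2, moa_max=1.0):
    """Eq. (2): MOA rises linearly from moa_min to moa_max over the iterations."""
    return moa_min + t * (moa_max - moa_min) / t_max

def choose_phase(t, t_max):
    """Eq. (1): explore while r1 > MOA(t); exploitation thus dominates late iterations."""
    r1 = random.random()
    return "exploration" if r1 > moa(t, t_max) else "exploitation"
```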

3.1 Exploration

In the exploration phase, the solutions are searched randomly with the division and multiplication operators, and are updated as follows:

$$ y_{i,j} \left( t + 1 \right) = \begin{cases} y_{j}^{*} \div \left( MOP + \delta \right) \times \left( \left( ub_{j} - lb_{j} \right) \times k + lb_{j} \right), & r_{2} < 0.5 \\ y_{j}^{*} \times MOP \times \left( \left( ub_{j} - lb_{j} \right) \times k + lb_{j} \right), & r_{2} \ge 0.5 \end{cases} $$
(3)

where \(y_{i,j}\left( t + 1 \right)\) refers to the j-th position of the i-th solution at iteration \(t + 1\), \(y_{j}^{*}\) is the j-th position of the best solution, \(ub_{j}\) and \(lb_{j}\) denote the upper and lower bounds of the j-th position, respectively, \(\delta\) is a small constant that prevents division by zero, \(k\) is a parameter that controls the search process of the exploration phase, and \(r_{2}\) follows a uniform distribution in (0, 1). Additionally, the math optimizer probability (MOP) is calculated as follows:

$$ MOP\left( t \right) = 1 - \frac{t^{1/\beta}}{T_{\max}^{1/\beta}} $$
(4)

where \(MOP\left( t \right)\) denotes the function value at iteration \(t\), and \(\beta\) is a sensitive parameter in the range (0, 10) that defines the exploitation accuracy over the iterations.

3.2 Exploitation

During the exploitation phase, the solutions are further refined with the subtraction and addition operators, and are updated as follows:

$$ y_{i,j} \left( t + 1 \right) = \begin{cases} y_{j}^{*} - MOP \times \left( \left( ub_{j} - lb_{j} \right) \times k + lb_{j} \right), & r_{3} < 0.5 \\ y_{j}^{*} + MOP \times \left( \left( ub_{j} - lb_{j} \right) \times k + lb_{j} \right), & r_{3} \ge 0.5 \end{cases} $$
(5)

where \(r_{{3}}\) follows a uniform distribution in (0,1).
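Eqs. (3)-(5) can be condensed into a single per-dimension update rule; the sketch below assumes k = 0.5 and β = 5 (the defaults examined later in Sect. 5.1) and an illustrative δ of 1e-6.

```python
import random

def mop(t, t_max, beta=5.0):
    """Eq. (4): math optimizer probability, decreasing over the iterations."""
    return 1.0 - (t ** (1.0 / beta)) / (t_max ** (1.0 / beta))

def update_dimension(best_j, lb_j, ub_j, t, t_max, explore, k=0.5, delta=1e-6):
    """Eqs. (3) and (5): move the j-th coordinate around the best solution."""
    scale = (ub_j - lb_j) * k + lb_j
    m = mop(t, t_max)
    if explore:                                   # Eq. (3): division / multiplication
        if random.random() < 0.5:
            return best_j / (m + delta) * scale
        return best_j * m * scale
    if random.random() < 0.5:                     # Eq. (5): subtraction / addition
        return best_j - m * scale
    return best_j + m * scale
```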

Consequently, the search process of the AOA is shown in Fig. 1 (Abualigah et al. 2021).

Fig. 1 The search process of the AOA

4 Proposed algorithms

Feature selection aims to select the optimal feature subset and thereby eliminate redundant features. Owing to its high flexibility and excellent computational performance, the AOA can solve continuous optimization problems well; however, discrete optimization problems, especially combinatorial ones, cannot be handled directly by existing continuous meta-heuristic algorithms, including the AOA. Therefore, multiple BAOAs with different strategies are proposed to perform feature selection: transfer functions are applied to convert the continuous search space into a discrete one, and Lévy flight is used to enhance the search speed and the ability to escape from local optima.

4.1 Transfer function

Because feature selection is a combinatorial optimization problem, it cannot be solved directly by the continuous AOA. According to Mirjalili and Lewis (2013), transfer functions are highly suitable for mapping continuous values to discrete ones. Therefore, six transfer functions belonging to two families (the S-shaped and V-shaped families) are applied to implement the discretization. For example, an S-shaped transfer function was applied to convert PSO into its binary form, with the transfer probability calculated as follows (Kennedy and Eberhart 1997):

$$ S\left( y_{i,j}^{k} \left( t \right) \right) = \frac{1}{1 + e^{- y_{i,j}^{k} \left( t \right)}} $$
(6)

where \(y^{k}_{i,j} \left( t \right)\) refers to the position in dimension \(k\) at iteration \(t\), and the position is updated as follows:

$$ y_{i,j}^{k} \left( t + 1 \right) = \begin{cases} 0, & rand < S\left( y_{i,j}^{k} \left( t + 1 \right) \right) \\ 1, & rand \ge S\left( y_{i,j}^{k} \left( t + 1 \right) \right) \end{cases} $$
(7)

On the other hand, a V-shaped transfer function is applied to convert the continuous form to the binary form, with the transfer probability calculated as follows (Mirjalili 2016):

$$ V\left( y_{i,j}^{k} \left( t \right) \right) = \frac{\left| y_{i,j}^{k} \left( t \right) \right|}{\sqrt{1 + \left( y_{i,j}^{k} \left( t \right) \right)^{2}}} $$
(8)

Then, the position is updated as follows:

$$ y_{i,j}^{k} \left( t + 1 \right) = \begin{cases} \neg y_{i,j}^{k} \left( t \right), & rand < V\left( y_{i,j}^{k} \left( t + 1 \right) \right) \\ y_{i,j}^{k} \left( t \right), & rand \ge V\left( y_{i,j}^{k} \left( t + 1 \right) \right) \end{cases} $$
(9)

where \(\neg y_{i,j}^{k} \left( t \right)\) represents the complement of \(y_{i,j}^{k} \left( t \right)\). The formulas of six transfer functions are presented in Table 2 and the corresponding curves are depicted in Fig. 2.

Table 2 Formulas of six transfer functions (Faris et al. 2018)
Fig. 2 Transfer functions: a S-shaped and b V-shaped
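For concreteness, a sketch of one representative from each family is given below, assuming the sigmoid of Eq. (6) and the form of Eq. (8) as the chosen functions from Table 2; the bit-update rules follow Eqs. (7) and (9) exactly as written above.

```python
import math
import random

def s_transfer(y):
    """Eq. (6): S-shaped (sigmoid) transfer probability."""
    return 1.0 / (1.0 + math.exp(-y))

def v_transfer(y):
    """Eq. (8): V-shaped transfer probability."""
    return abs(y) / math.sqrt(1.0 + y * y)

def s_update(y_cont):
    """Eq. (7): draw the new bit against the S-shaped probability."""
    return 0 if random.random() < s_transfer(y_cont) else 1

def v_update(y_cont, bit_prev):
    """Eq. (9): flip the previous bit with the V-shaped probability."""
    return 1 - bit_prev if random.random() < v_transfer(y_cont) else bit_prev
```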

4.2 Lévy flight

The Lévy flight is a non-Gaussian random walk whose steps take isotropic random directions in a multidimensional space (Liu and Cao 2020). Moreover, it has been reported that the drastic changes caused by Lévy flight help the search jump out of local optima and attain the optimal solution (Zhao et al. 2020). Therefore, to address the AOA's slow convergence and weak search efficiency, Lévy flight is adopted and integrated into our algorithms, and the corresponding solution is updated as follows:

$$ y_{i} (t + 1) = y_{i} (t) + rand \cdot sign\left[ {rand - 0.5} \right] \otimes Levy $$
(10)

where \(y_{i} (t + 1)\) represents the i-th solution at iteration \(t + 1\), \(sign\left[ {rand - 0.5} \right]\) takes one of the three values 1, 0, and −1, \(\otimes\) denotes entry-wise multiplication, and Levy denotes the random search path, which follows the Lévy distribution given below:

$$ {\text{Levy}}\sim u = S^{ - \lambda } ,\;\;1 < \lambda \le 2 $$
(11)

where \(\lambda\) determines the shape of the Lévy distribution, and \(S\) is the step length of the Lévy flight, given as follows:

$$ S = \frac{\mu}{\left| v \right|^{1/\eta}} $$
(12)

where \(\mu \sim N\left( 0,\sigma_{\mu }^{2} \right)\), \(v \sim N\left( 0,1 \right)\), and \(\sigma_{\mu }\) is given as follows:

$$ \sigma_{\mu } = \left\{ {\frac{\Gamma (1 + \eta )\sin (\pi \eta /2)}{{\eta \cdot \Gamma [(\eta + 1)/2] \cdot 2^{(\eta - 1)/2} }}} \right\}^{1/\eta } $$
(13)

where Γ is the standard Gamma function and \(\eta \in [0,2]\).
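Eqs. (10)-(13) amount to Mantegna's procedure for drawing a Lévy-distributed step plus a signed perturbation of each coordinate; a minimal sketch follows, with η = 1.5 as an assumed, commonly used setting within the stated range.

```python
import math
import random

def levy_step(eta=1.5):
    """Eqs. (12)-(13): one Lévy step length via Mantegna's procedure."""
    sigma_mu = (math.gamma(1.0 + eta) * math.sin(math.pi * eta / 2.0)
                / (eta * math.gamma((eta + 1.0) / 2.0)
                   * 2.0 ** ((eta - 1.0) / 2.0))) ** (1.0 / eta)
    mu = random.gauss(0.0, sigma_mu)      # mu ~ N(0, sigma_mu^2)
    v = random.gauss(0.0, 1.0)            # v  ~ N(0, 1)
    return mu / abs(v) ** (1.0 / eta)

def levy_perturb(y, eta=1.5):
    """Eq. (10): add a randomly signed, Lévy-distributed move to each coordinate."""
    out = []
    for y_j in y:
        sign = 1.0 if random.random() > 0.5 else -1.0   # sign[rand - 0.5]
        out.append(y_j + random.random() * sign * levy_step(eta))
    return out
```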

4.3 Fitness function

During the feature selection process, the solutions are represented as binary sequences of ones and zeros; in the encoding adopted here, ones indicate that the corresponding features are not selected and zeros indicate that they are selected. These solutions are then evaluated using the wrapper-based technique with the K-nearest neighbor (KNN) classifier, chosen for its ease of implementation (Altman 1992). Moreover, minimizing the number of selected features and maximizing the classification accuracy are the two crucial factors in evaluating a solution; therefore, both are integrated into the fitness function. Accordingly, the fitness function is formulated as follows (AbuKhurma et al. 2022):

$$ Fitness = \omega \cdot ERR_{R} \left( D \right) + \lambda \cdot \frac{\left| C \right|}{\left| N \right|} $$
(14)

where \({\text{ERR}}_{R} \left( D \right)\) represents the classification error rate obtained from the KNN classifier, \(\left| N \right|\) is the total number of features, \(\left| C \right|\) is the number of selected features, \(\omega \in \left[ {0,1} \right]\) weights the classification quality, and \(\lambda\) weights the feature reduction rate (Emine and Ülker 2020).
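A minimal sketch of Eq. (14) is given below, assuming scikit-learn's KNeighborsClassifier with K = 5 (K is an assumption, not fixed by the paper), NumPy arrays for the data, and the weights ω = 0.99 and λ = 0.01 used in Sect. 5.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def fitness(bits, X_train, y_train, X_test, y_test, omega=0.99, lam=0.01, k_neighbors=5):
    """Eq. (14): weighted sum of KNN error rate and feature-reduction ratio."""
    mask = np.asarray(bits) == 0          # zeros mark selected features (Sect. 4.3)
    if not mask.any():                    # guard: an empty subset gets the worst fitness
        return 1.0
    knn = KNeighborsClassifier(n_neighbors=k_neighbors)
    knn.fit(X_train[:, mask], y_train)
    err = 1.0 - knn.score(X_test[:, mask], y_test)       # ERR_R(D)
    return omega * err + lam * mask.sum() / len(bits)    # lam * |C| / |N|
```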

Consequently, three algorithms, denoted BAOA_S1, BAOA_S2, and BAOA_S3, are formed from the S-shaped transfer functions, and three algorithms, denoted BAOA_V1, BAOA_V2, and BAOA_V3, are formed from the V-shaped transfer functions. Moreover, by integrating the transfer functions with Lévy flight, six further algorithms, denoted BAOA_S1LF, BAOA_S2LF, BAOA_S3LF, BAOA_V1LF, BAOA_V2LF, and BAOA_V3LF, are developed; the corresponding flowchart is shown in Fig. 3.

Fig. 3 The flowchart of the proposed BAOA

5 Experimental results and discussions

The effectiveness of the proposed algorithms is evaluated on the UCI datasets detailed in Table 3 (Lichman 2013), which lists the size of features (Size), number of features (No. of features), number of instances (No. of instances) and type of each dataset (Type). All experiments are implemented in MATLAB R2021a on a PC with an Intel Core i7 2.9 GHz CPU and 8 GB RAM. To obtain statistically meaningful results, every algorithm is run independently 30 times. Moreover, for all the proposed algorithms, \(\omega\) and \(\lambda\) in (14) are set to 0.99 and 0.01 (Emine and Ülker 2020; Faris et al. 2018; Dhiman et al. 2021), and the population size and the number of iterations are set to 30 and 100, respectively.

Table 3 List of datasets

To verify the optimality of the results, each dataset is randomly divided by the hold-out strategy into 80% for training and 20% for testing (Faris et al. 2018). The evaluation metrics include the average fitness value (AVG), the standard deviation of the fitness value (STD), the average classification accuracy (ACC), the average number of selected features (FEA), and the average running time in seconds (TIME); the first four metrics are expressed as follows (Too et al. 2019):

$$ AVG = \frac{1}{R}\sum\limits_{n = 1}^{R} G_{n} $$
(15)
$$ STD = \sqrt{\frac{\sum\nolimits_{n = 1}^{R} \left( G_{n} - AVG \right)^{2}}{R - 1}} $$
(16)
$$ ACC = 1 - \frac{1}{R}\sum\limits_{n = 1}^{R} \frac{error\;predicted_{n}}{Total\;instances} $$
(17)
$$ FEA = \frac{1}{R}\sum\limits_{n = 1}^{R} \left| C_{n} \right| $$
(18)

where \(R\) is the total number of runs, \(n\) indexes the runs, \(G_{n}\) is the best fitness obtained in run \(n\), and \(\left| C_{n} \right|\) is the number of features selected in run \(n\).
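A small sketch of Eqs. (15)-(18) follows; the argument names are illustrative.

```python
import statistics

def summarize_runs(G, errors, total_instances, selected_counts):
    """Eqs. (15)-(18): AVG, STD, ACC and FEA over R independent runs.
    G: best fitness per run; errors: misclassified count per run;
    selected_counts: |C_n| per run."""
    R = len(G)
    avg = sum(G) / R                                          # Eq. (15)
    std = statistics.stdev(G)                                 # Eq. (16), R - 1 denominator
    acc = 1.0 - sum(e / total_instances for e in errors) / R  # Eq. (17)
    fea = sum(selected_counts) / R                            # Eq. (18)
    return avg, std, acc, fea
```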

5.1 Sensitivity analysis

In the BAOAs, β (used in Eq. (4)) is a sensitive parameter that defines the exploitation accuracy during the iterations, and k (used in Eqs. (3) and (5)) controls the search process of the exploration phase. These two parameters are inherited from the AOA, where they are set to 5 and 0.5, respectively (Abualigah et al. 2021). To obtain the most appropriate values of β and k for the BAOAs with different transfer functions, a sensitivity analysis is conducted in which β is varied from 2 to 9 and k from 0.25 to 0.9. As the CNAE_9 dataset is the most sensitive for this evaluation, the corresponding experiment is conducted on it (Emine and Ülker 2020). The results for the different parameter settings are presented in Table 4.

Table 4 The effects of β and k on performance of BAOAs with different transfer functions

As presented in Table 4, different values of β and k affect the performance of the BAOAs with different transfer functions on the CNAE_9 dataset, and all of them achieve their best performance when β and k are set to 5 and 0.5, respectively. Therefore, these values of β and k are used in the remaining experiments.

5.2 Evaluating BAOAs with transfer functions

The performance of the BAOAs with the different transfer functions (namely, BAOA_S1, BAOA_S2, BAOA_S3, BAOA_V1, BAOA_V2, and BAOA_V3) is investigated; the corresponding results are presented in terms of AVG, STD and TIME in Table 5, and in terms of ACC and FEA in Table 6.

Table 5 Comparison of BAOAs with the transfer functions in terms of AVG, STD and TIME
Table 6 Comparison of BAOAs with the transfer functions in terms of ACC and FEA

As shown in Table 5, BAOA_S2 and BAOA_S3 exhibit superior performance on 40% and 30% of the datasets, respectively, in terms of AVG. Concerning AVG and STD together, BAOA_S2 clearly performs better than the other algorithms. Moreover, BAOA_V2 obtains the smallest running time on 90% of the datasets.

From Table 6, it can be observed that BAOA_S1, BAOA_S2 and BAOA_S3 achieve better results on 25%, 40% and 30% of the datasets, respectively, in terms of ACC. With respect to FEA, BAOA_S1 and BAOA_S2 offer the best results on 25% and 30% of the datasets, while BAOA_V1 and BAOA_V2 obtain the best results on 15% of the datasets each.

Synthesizing the results in Tables 5 and 6, the S-shaped transfer functions outperform the V-shaped transfer functions in terms of AVG, STD, ACC, and FEA. Therefore, the S-shaped transfer functions are more suitable for the BAOA than the V-shaped ones.

5.3 Evaluating BAOAs with transfer functions and Lévy flight

The effectiveness of the BAOAs with transfer functions and Lévy flight is evaluated on the same twenty datasets. First, for the S-shaped functions, the results of BAOA_S1, BAOA_S2, BAOA_S3, BAOA_S1LF, BAOA_S2LF and BAOA_S3LF are compared in terms of AVG, STD, TIME, ACC and FEA in Tables 7 and 8. Second, for the V-shaped functions, BAOA_V1, BAOA_V2, BAOA_V3, BAOA_V1LF, BAOA_V2LF and BAOA_V3LF are compared on the same metrics in Tables 9 and 10. Furthermore, to select the best algorithm, the results of BAOA_S1LF, BAOA_S2LF, BAOA_S3LF, BAOA_V1LF, BAOA_V2LF and BAOA_V3LF are compared across the evaluation metrics in Tables 11 and 12.

Table 7 Comparison of BAOAs with the S-shaped functions and BAOAs with the S-shaped functions and Lévy flight in terms of AVG, STD and TIME
Table 8 Comparison of BAOAs with the S-shaped functions and BAOAs with the S-shaped functions and Lévy flight in terms of ACC and FEA
Table 9 Comparison of BAOAs with the V-shaped functions and BAOAs with the V-shaped functions and Lévy flight in terms of AVG, STD and TIME
Table 10 Comparison of BAOAs with the V-shaped functions and BAOAs with the V-shaped functions and Lévy flight in terms of ACC and FEA
Table 11 Comparison of BAOAs with the transfer functions and Lévy flight in terms of AVG, STD, the worst and best values, TIME
Table 12 Comparison of BAOAs with the transfer functions and Lévy flight in terms of ACC and FEA

As shown in Table 7, BAOA_S1LF outperforms BAOA_S1 on 80% of the datasets, BAOA_S2LF outperforms BAOA_S2 on 65% of the datasets, and BAOA_S3LF outperforms BAOA_S3 on 65% of the datasets in terms of AVG. Concerning TIME, the BAOAs with the S-shaped functions run faster than those that also use Lévy flight, because the Lévy flight strategy incurs additional running time in exchange for better results.

From Table 8, it can be observed that BAOA_S1LF, BAOA_S2LF, and BAOA_S3LF perform better than BAOA_S1, BAOA_S2, and BAOA_S3 on most of the datasets in terms of ACC and FEA, because the integration of Lévy flight enhances their search efficiency. Consequently, when premature convergence occurs, the BAOAs with the S-shaped functions and Lévy flight have a higher chance of escaping from local optima.

Synthesizing the results in Tables 7 and 8, the BAOAs with the S-shaped functions and Lévy flight perform better than those with the S-shaped functions alone in terms of AVG, STD, ACC, and FEA. In particular, BAOA_S1LF is superior to all other S-shaped variants, with or without Lévy flight.

Table 9 shows that BAOA_V1LF outperforms BAOA_V1 on all the datasets in terms of AVG and on 90% of the datasets in terms of STD. Meanwhile, BAOA_V2LF and BAOA_V3LF outperform BAOA_V2 and BAOA_V3 on 90% of the datasets in terms of AVG and STD. With respect to TIME, the BAOAs with the V-shaped functions require a shorter running time than those that also use Lévy flight, again because Lévy flight trades additional running time for better results.

From Table 10, it is observed that BAOA_V1LF, BAOA_V2LF and BAOA_V3LF obtain better results than BAOA_V1, BAOA_V2, and BAOA_V3 on 100%, 85% and 85% of the datasets, respectively, in terms of ACC. Furthermore, with respect to FEA, BAOA_V2LF outperforms BAOA_V2 on 60% of the datasets. The reason is that Lévy flight expands the search range and thereby helps the algorithms jump out of local optima, whereas the compared algorithms without it are easily trapped.

Overall, the BAOAs with the V-shaped functions and Lévy flight show superior performance to those with the V-shaped functions alone in terms of AVG, STD, ACC, and FEA. In particular, BAOA_V2LF performs best among all V-shaped variants, with or without Lévy flight.

Based on the results in Tables 7, 8, 9 and 10, the BAOAs with the transfer functions and Lévy flight outperform the BAOAs with the transfer functions alone in terms of AVG, STD, ACC, and FEA. These results further confirm that Lévy flight expands the search space and helps the algorithm escape from local optima.

Furthermore, BAOA_S1LF, BAOA_S2LF, BAOA_S3LF, BAOA_V1LF, BAOA_V2LF, and BAOA_V3LF are compared and the algorithm that shows the best performance in terms of AVG, STD, TIME, the worst and best values, ACC and FEA is selected.

Inspecting the results in Table 11, BAOA_S1LF achieves the best results on 85% of the datasets in terms of AVG. Concerning STD, BAOA_S3LF attains the best results on 40% of the datasets. Furthermore, BAOA_S1LF exhibits the best performance on most of the datasets in terms of the worst and best values, while BAOA_V2LF obtains the best TIME results on most of the datasets.

From Table 12, it is clear that BAOA_S1LF achieves the best results on 85% of the datasets in terms of ACC, and outperforms most of the algorithms in terms of FEA.

Based on the results in Tables 11 and 12, BAOA_S1LF is superior to the other BAOAs with transfer functions and Lévy flight in terms of AVG, STD, ACC, and FEA. Its performance is therefore compared with that of other meta-heuristic algorithms in the next subsection.

5.4 Comparison with other meta-heuristics algorithms

In this subsection, BAOA_S1LF is compared with other meta-heuristic algorithms, i.e., BWOA (Hussien et al. 2020), BSCA (Reddy et al. 2018), BPSO (Kushwaha and Pant 2018), BGWO (Hu et al. 2020), BMVO (Al-Madi et al. 2019), BGSA (Taradeh et al. 2019), binary GJO (BGJO) (Chopra and Ansari 2022), binary JSO (BJSO) (Chou and Truong 2021), the binary artificial electric field algorithm (BAEFA) (Chauhan and Yadav 2022), the binary chameleon swarm algorithm (BCSA) (Braik 2021), binary SO (BSO) (Hashim and Hussien 2022), and binary AHA (BAHA) (Zhao et al. 2022). Table 13 presents the main parameter settings used in the comparison.

Table 13 Parameter settings

The performance of BAOA_S1LF and the other meta-heuristic algorithms is reported in terms of AVG, STD, TIME, ACC and FEA in Tables 14 and 15.

Table 14 Comparison of BAOA_S1LF and other algorithms in terms of AVG, STD and TIME
Table 15 Comparison of BAOA_S1LF and other algorithms in terms of ACC and FEA

From Table 14, it can be seen that BAOA_S1LF obtains the best results on 92.3% of the datasets in terms of AVG, and concerning STD, it performs better than all other meta-heuristic algorithms on most of the datasets. With respect to TIME, however, the advantage of BAOA_S1LF is not obvious.

From Table 15, it can be observed that although BAHA is competitive with BAOA_S1LF on the Heart and CNAE_9 datasets, BAOA_S1LF obtains better results than the compared algorithms on 24 out of 26 datasets in terms of ACC. Additionally, concerning FEA, BAOA_S1LF is superior to the compared algorithms on most of the datasets. Overall, based on Tables 14 and 15, BAOA_S1LF outperforms all the compared algorithms in feature selection.

Furthermore, to illustrate the convergence performance, the convergence curves of BAOA_S1LF and the other meta-heuristic algorithms on all the datasets are shown in Fig. 4. From these curves, BAOA_S1LF shows superior performance on 24 out of 26 datasets, and its convergence exhibits an accelerating trend. On the Wine, Landsat, Ecolidata and Wdbc datasets, although some algorithms converge at a speed similar to that of BAOA_S1LF, the best final performance is obtained by BAOA_S1LF. On the Receptor dataset, although BWOA achieves better final performance, BAOA_S1LF shows the fastest convergence. On the Diabetes and LSVT datasets, although some algorithms converge slightly faster, the best final performance is again obtained by BAOA_S1LF. On the whole, the convergence speed of BAOA_S1LF is faster than that of most compared meta-heuristic algorithms, and its search speed and ability to escape from local optima are enhanced by the use of Lévy flight.

Fig. 4 Convergence curves of the algorithms

Although the results on the above datasets show the superiority of BAOA_S1LF to a certain extent, meta-heuristic algorithms are stochastic, so the differences between the algorithms must also be examined statistically. Therefore, an analysis of variance (ANOVA) and the Wilcoxon rank sum test are conducted at the 5% significance level; the p-values of the ANOVA based on fitness are reported in Table 16, and those of the Wilcoxon rank sum test based on fitness in Table 17, where 'NaN' means that the Wilcoxon rank sum test is not applicable. Furthermore, \(p \ge 0.05\) means that there is no significant difference from the other algorithm, and the corresponding values are marked in bold.
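For reference, the sketch below shows how such a per-dataset comparison could be carried out with SciPy's f_oneway and ranksums; the function name is illustrative, and the inputs are two algorithms' per-run samples (fitness or accuracy values).

```python
from scipy.stats import f_oneway, ranksums

def compare_algorithms(runs_a, runs_b, alpha=0.05):
    """Per-dataset significance check: one-way ANOVA and Wilcoxon rank-sum
    on the per-run results of two algorithms; p >= alpha means no
    significant difference at the chosen level."""
    _, p_anova = f_oneway(runs_a, runs_b)
    _, p_rank = ranksums(runs_a, runs_b)
    return p_anova, p_rank, (p_rank < alpha)
```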

Table 16 p-values of ANOVA for the classification accuracy results of BAOA_S1LF and other algorithms (p ≥ 0.05 are in bold)
Table 17 p-values of Wilcoxon rank sum test for the classification accuracy results of BAOA_S1LF and other algorithms (p ≥ 0.05 are in bold)

From Table 16, it can be seen that the performance of BAOA_S1LF is significantly different from that of the other compared algorithms on the Letter to SCADI datasets. Although BAOA_S1LF, BWOA, BPSO and BGSA show similar performance on the Segmentation dataset, BAOA_S1LF differs from BWOA, BPSO and BGSA on the remaining datasets. Likewise, although there is no significant difference between BAOA_S1LF and BGWO, BCSA, BSO and BAHA on the Heart dataset, BAOA_S1LF differs from them on the other datasets. Additionally, BAOA_S1LF, BSCA, BGJO and BSO show similar performance on the Bupa dataset, but BAOA_S1LF is significantly different from them on most of the datasets.

As can be noted from Table 17, the performance of BAOA_S1LF is significantly different from that of the other compared algorithms on the Zoo to Breast datasets. Although BAOA_S1LF performs similarly to BSCA and BGJO on the Ecolidata dataset, it differs from them on the remaining datasets. Meanwhile, no significant difference is noted between BAOA_S1LF and BJSO or BCSA on the Msplice dataset. In addition, BWOA, BGJO, BJSO and BAHA show less significant differences from BAOA_S1LF on the Receptor dataset, but BAOA_S1LF is significantly different from them on most of the datasets.

6 Conclusion

In this study, the performance of BAOAs is investigated and the following key conclusions are made:

  • Multiple BAOAs utilizing different strategies are proposed for performing feature selection.

  • Six algorithms are formed based on six different transfer functions by converting the continuous search space to the discrete search space.

  • By integrating the transfer functions and Lévy flight, six other algorithms are developed to enhance the speed of searching and the ability of escaping from the local optima.

  • Based on various evaluation metrics and 20 UCI datasets, the performance of the proposed algorithms is evaluated, and the results demonstrate that BAOA_S1LF shows the best performance among all the proposed algorithms.

  • The effectiveness of BAOA_S1LF is compared with that of other meta-heuristic algorithms on 26 UCI datasets, and the corresponding results show that BAOA_S1LF is superior to other meta-heuristic algorithms for performing feature selection.

In the future, further research will proceed as follows:

  • Although the use of Lévy flight improves the ability of BAOA_S1LF, it prolongs the running time. Therefore, additional strategies, such as opposition-based learning and crossover operators, will be adopted to improve the effectiveness of the BAOAs with different transfer functions.

  • To perform feature selection in wider applications, more algorithms will be compared to further demonstrate the advantages of the proposed algorithms.

  • In addition to feature selection, BAOA_S1LF will be applied to other practical optimization problems, such as vehicle routing, knapsack, and facility location problems.