1 Introduction

Recently, automatic recognition and classification of marine mammals from underwater audio signals has expanded internationally and has become a popular field among researchers [9]. Underwater acoustic signal processing is now the newest way to measure the presence, abundance, and migration patterns of marine mammals [5, 38].

Marine mammals can be recognized either by human operators or by intelligent methods. Initially, human operator-based methods were used; their advantages include simplicity and ease of use [32], but their main disadvantages are dependence on the operator's psychological state and inefficiency in environments where the signal-to-noise ratio is low [22]. Contour-based recognition was used next, but it suffered from high time complexity and a low identification rate [13, 14]. To eliminate these defects, automatic target recognition (ATR) based on intelligent systems is used [3].

The MLP-NN, owing to its simple structure, high performance, and low computational complexity, has become a useful tool for automatic target recognition [35]. In the past, MLP-NNs were trained with gradient-based backpropagation methods, but these algorithms converge slowly and become trapped in local minima [15, 25].

In recent years, meta-heuristic algorithms have increasingly been used for neural network training. The Genetic Algorithm (GA) [24], Simulated Annealing (SA) [20], Biogeography-Based Optimization (BBO) [28], the Grey Wolf Optimizer (GWO) [10], the Social Spider Algorithm (SSA) [27], the Dragonfly Algorithm (DA) [19], and the IWT algorithm [16] are examples of meta-heuristics that have been used as artificial neural network trainers. GA and SA reduce the risk of local optima but converge slowly, which makes them poorly suited to applications that require real-time processing. BBO involves time-consuming calculations. GWO and IWT, despite their low complexity and high convergence speed, fall into the trap of local optima, so they are not suitable for problems with many local optima. The SSA's many tuning parameters and high complexity are among its weaknesses. The DA, in addition to having many control parameters, involves time-consuming calculations that make it unsuitable for real-time use.

The main reason for getting stuck in local optima is an imbalance between the exploration and exploitation phases. Techniques for avoiding local optima and trading off exploration against exploitation include optimization with chaotic maps [21], non-linear coefficients [18], spiral shapes [16], and the linear control strategy (LCS) and arcsine-based non-linear control strategy (NCS-arcsine) [37]. On the other hand, the No Free Lunch (NFL) theorem logically states that no meta-heuristic algorithm performs equally well on all problems [33]. Following the NFL theorem, this paper uses a fuzzy system to create a trade-off between the exploration and exploitation phases for MLP-NN training. The proposed technique, which optimizes ChOA using a fuzzy system, is called Fuzzy-ChOA and is denoted FChOA in this article.

The rest of the article is organized as follows. Section 2 describes the experiment designed for data collection. Section 3 deals with pre-processing of the collected data and feature extraction. Section 4 discusses the design of an FChOA-MLPNN for automatically recognizing the sounds produced by marine mammals. Section 5 presents and discusses the simulation results, and Sect. 6 concludes.

2 The experiment design and data acquisition

To obtain a real dataset of sounds produced by dolphins and whales, a research ship called the Persian Gulf Explorer, a sonobuoy, a UDAQ-Lite data acquisition board, and three equally spaced hydrophones (Model B&K 8103) were used; the equal spacing increases the dynamic range. The test was performed in the port of Bushehr.

The raw data comprise 125 samples of Bottlenose Dolphin (10 sightings), 150 samples of Common Dolphin (18 sightings), 190 samples of Atlantic Spotted Dolphin (15 sightings), 106 samples of Sperm Whale (6 sightings), 115 samples of Northern Right Whale (10 sightings), and 172 samples of Killer Whale (8 sightings).

2.1 The ambient noise reduction and reverberation suppression

Suppose the sounds emitted by marine mammals (dolphins and whales) and recorded by the hydrophone array are denoted x(t), y(t), and z(t), and the original dolphin and whale sound is denoted s(t). The mathematical model of the hydrophone outputs is given in Eq. 1.

$$x(t) = \int_{-\infty}^{t} h(t-\tau)\,s(\tau)\,d\tau, \quad y(t) = \int_{-\infty}^{t} g(t-\tau)\,s(\tau)\,d\tau, \quad z(t) = \int_{-\infty}^{t} q(t-\tau)\,s(\tau)\,d\tau$$
(1)

In Eq. 1, the environment response functions (ERFs) are denoted by h(t), g(t), and q(t). The ERFs are unknown, and their tails are considered uncorrelated [4]; naturally, the first frame of sound produced by marine mammals does not reach all hydrophones of the array at the same time. Given the sound pressure level (SPL) of the B&K 8103 hydrophone and the reference (Hi and Testa n.d.), which deals with the underwater audio standard, the recorded sounds must be pre-amplified by a factor of \(10^6\).
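To make the measurement model concrete, the following minimal NumPy sketch simulates Eq. 1 with hypothetical sparse impulse responses standing in for the unknown ERFs h, g, and q; the source signal and the delay/gain values are placeholders for illustration, not measured data:

```python
import numpy as np
from scipy.signal import fftconvolve

# Sketch of the measurement model in Eq. 1: each hydrophone output is the
# source signal s(t) convolved with an unknown environment response function.
fs = 8192                                  # sampling rate (Hz), as used in Sect. 3
t = np.arange(0, 1.0, 1.0 / fs)
s = np.sin(2 * np.pi * 2000.0 * t)         # placeholder source; real whistles vary

def erf(delays_s, gains):
    """Build a sparse multipath impulse response (hypothetical ERF)."""
    h = np.zeros(fs // 2)
    for d, g in zip(delays_s, gains):
        h[int(d * fs)] += g
    return h

h = erf([0.000, 0.013, 0.031], [1.0, 0.5, 0.2])   # direct path plus two echoes
g = erf([0.002, 0.020], [0.9, 0.4])
q = erf([0.004, 0.025], [0.8, 0.3])

x = fftconvolve(s, h)[: len(s)]            # x(t) = (h * s)(t), as in Eq. 1
y = fftconvolve(s, g)[: len(s)]
z = fftconvolve(s, q)[: len(s)]
```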

Then, a Hamming window and the Fast Fourier Transform (FFT) are applied to obtain the frequency-domain SPL. Next, each band level is reduced to a 1 Hz bandwidth by Eq. 2.

$$\mathrm{SPL}_{1} = \mathrm{SPL}_{m} - 10\log \Delta f$$
(2)

\(\mathrm{SPL}_m\) is the SPL obtained at each band center frequency in dB re 1 μPa, \(\mathrm{SPL}_1\) is the SPL reduced to a 1 Hz bandwidth in dB re 1 μPa, and Δf represents the bandwidth of each 1/3-octave band filter. A Wiener filter has been used to minimize the mean square error (MSE) between the ambient noise and the marine mammal sound [8]. After that, Eq. 3 was used to detect sounds with low SNR, i.e., less than 3 dB, which were deleted from the database.

$$\mathrm{SPL}_{V} = 10\log \left[ 10^{\mathrm{SPL}_{T}/10} - 10^{\mathrm{SPL}_{A}/10} \right]$$
(3)

where the subscripts T, V, and A denote the total received signal, the target sound, and the ambient noise, respectively. After that, the SPLs were recalculated at a standard measuring distance of 1 m as in Eq. 4 (Fig. 1).

$$\mathrm{SPL} = \mathrm{SPL}_{1} + 20\log(r)$$
(4)
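The band-level corrections of Eqs. 2–4 translate directly into code. The sketch below uses hypothetical level, bandwidth, and range values purely for illustration; only the formulas themselves come from the text:

```python
import numpy as np

def spl_1hz(spl_m, delta_f):
    """Eq. 2: reduce a band level to a 1 Hz bandwidth (dB re 1 uPa)."""
    return spl_m - 10.0 * np.log10(delta_f)

def spl_signal(spl_total, spl_ambient):
    """Eq. 3: subtract the ambient-noise level from the total level (dB)."""
    return 10.0 * np.log10(10.0 ** (spl_total / 10.0) - 10.0 ** (spl_ambient / 10.0))

def spl_at_1m(spl_1, r):
    """Eq. 4: refer the level back to the 1 m standard distance."""
    return spl_1 + 20.0 * np.log10(r)

# Hypothetical numbers: a 125 Hz third-octave band (~29 Hz wide), a 120 dB
# total level against 110 dB ambient noise, measured at 50 m range.
spl_v = spl_signal(120.0, 110.0)           # ~119.5 dB (SNR ~9.5 dB, so kept)
print(spl_at_1m(spl_1hz(spl_v, 29.0), 50.0))
```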
Fig. 1

The ambient noise reduction and reverberation suppression block diagram

Next, the effect of reverberation must be removed. For this purpose, the coherent components of each band are added in phase (reducing the phase shifts using the delays between the coherent parts of the original sound is called common-phasing) [29]. A cross-correlation pass function then adjusts each frequency band's gain so that uncorrelated signals are rejected and correlated signals are passed. Finally, the output signals of all frequency bands are combined to generate the estimated signal \(\widehat{S}\). The reverberation removal block diagram is shown in Fig. 2. Typical dolphin and whale sounds and their spectrograms are shown in Fig. 3.

Fig. 2

The block diagram of the reverberation removal section

Fig. 3

Typical sounds produced by dolphins and whales and their spectrograms

It must be noted that the behavior of the gain controller depends on the inputs. For example, when x and y are identical, \(\widehat{S} = x = y\); when one of them is delayed by a few milliseconds, the delay is compensated automatically so that \(\widehat{S}(t) = x(t-T) = y(t)\), where T is the estimated delay correction. If the delay is greater than the window length L, x and y become uncorrelated and the gain goes to zero. Finally, when x and y are totally uncorrelated, \(\widehat{S}(t)\) is zero.

All of the terms are true for each frequency band. Thus if x and y are uncorrelated at one frequency and correlated at another, then the uncorrelated part is removed and the correlated part remains. Since all of the operations are defined on a moving average basis, everything may change as a function of time.
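A minimal sketch of such a correlation-gated gain is given below. It assumes STFT-domain processing and exponential moving-average smoothing; both are implementation choices of this sketch rather than details specified in the paper:

```python
import numpy as np

def coherence_gain(X, Y, alpha=0.9):
    """Per-band correlation gate (sketch): pass correlated content, block the rest.

    X, Y are STFT frames (freq x time) of two phase-aligned hydrophone channels.
    A smoothed normalized cross-spectrum approximates the per-band correlation;
    the gain approaches 1 where the channels agree and 0 where they do not.
    """
    Sxy = np.zeros(X.shape, dtype=complex)
    Sxx = np.zeros(X.shape)
    Syy = np.zeros(X.shape)
    for m in range(1, X.shape[1]):
        # moving-average smoothing, so the gain varies as a function of time
        Sxy[:, m] = alpha * Sxy[:, m - 1] + (1 - alpha) * X[:, m] * np.conj(Y[:, m])
        Sxx[:, m] = alpha * Sxx[:, m - 1] + (1 - alpha) * np.abs(X[:, m]) ** 2
        Syy[:, m] = alpha * Syy[:, m - 1] + (1 - alpha) * np.abs(Y[:, m]) ** 2
    return np.abs(Sxy) / np.sqrt(Sxx * Syy + 1e-12)   # coherence gain in [0, 1]

# S_hat = gain * 0.5 * (X + Y), combined over the bands, yields the estimate.
```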

3 Average cepstral features and cepstral liftering features

After the pre-processing stage, the effects of ambient noise and reverberation on the detected audio signal frames are reduced. The detected signal frames then enter the feature extraction stage. Sounds made by dolphins and whales are emitted at a distance from the hydrophone and undergo changes in amplitude, phase, and density. The time-varying multipath effect of the fluctuating channel complicates the recognition task. Cepstral coefficients with cepstral liftering can significantly alleviate the multipath effects, while average cepstral coefficients can notably eliminate the time-varying effects of shallow underwater channels [6]. Therefore, this section proposes cepstral features, comprising average cepstral features and cepstral liftering features, to build a valid dataset. The cepstrum of the channel response and the cepstrum of the original dolphin and whale sound occupy higher and lower cepstrum indices, respectively [5]; they lie in non-overlapping parts of the liftered cepstrum, so the quality of the features is improved by low-time liftering. After noise and reverberation removal, the frequency-domain SPL frames S(k) are transferred to the cepstrum feature extraction section. The following equation calculates the cepstral features of the dolphin and whale sound signals.

$$c(n) = \sum_{l=1}^{M} \log \left( \sum_{k=0}^{N-1} \left| S(k) \right|^{2} H_{l}(k) \right) \cos \left( n\,(l-0.5)\,\frac{\pi}{M} \right)$$
(5)

where S(k) represents the frequency-domain frames of the dolphin and whale SPLs, N denotes the number of discrete frequencies used for the FFT, and \(H_l(k)\) is the transfer function of the lth Mel-scaled triangular filter, l = 1, …, M. Finally, the cepstral coefficients are transferred back to the time domain as c(n) by the Discrete Cosine Transform (DCT).

As mentioned earlier, the sound produced by dolphins and whales is extracted by a low-time liftering process. Thus, the low-time liftering window of Eq. 6 is proposed to separate the source sound from the whole signal.

$$w_{e}[n] = \begin{cases} 1 & 0 \le n \le L_{c} \\ 0 & L_{c} < n \le N/2 \end{cases}$$
(6)

\(L_c\) is the length of the liftering window, typically set to 15 or 20. The final features are calculated by multiplying the cepstrum c(n) by the liftering window and then applying the DFT and logarithm functions, as in Eqs. 7 and 8, respectively.

$$c_{e} (n) = w_{e} [n] \, c(n)$$
(7)
$$c_{L}(n) = \log(\mathrm{DFT}(c_{e}(n)))$$
(8)

Finally, the feature vector would be represented using Eq. 9.

$$\mathbf{X}_{m} = [c_{L}(0), c_{L}(1), \ldots, c_{L}(P-1)]^{T}$$
(9)

The first 512 cepstrum points (out of 8192 points per frame at a sampling rate of 8192 Hz, excluding the zeroth index \(C_{y}^{m}[0][0]\)) correspond to 62.5 ms of liftering coefficients; they are windowed from the N indices of one frame length to reduce the liftering coefficients to 44 features. The sub-frames are five seconds long before averaging. During the averaging process, the ten previous frames comprise 50 s of average cepstral features, and the final average cepstral features are calculated by smoothing those ten frames. Consequently, the average cepstral feature vector consists of 44 features. In the next phase, the \(X_m\) vector is used as the input vector of an MLP-NN.
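The liftering chain of Eqs. 5–8 can be sketched per frame as follows. The Mel filterbank construction is omitted, the DCT realizes the cosine sum of Eq. 5 up to normalization, and the magnitude of the DFT is taken in Eq. 8 so the logarithm stays real; these are implementation assumptions of the sketch, and it assumes the number of filters M is at least P:

```python
import numpy as np
from scipy.fft import dct

def cepstral_liftering_features(frame, mel_fb, Lc=20, P=44):
    """Sketch of Eqs. 5-8: Mel log-energies -> cepstrum -> low-time liftering.

    frame  : one time-domain frame (N samples)
    mel_fb : Mel triangular filterbank, shape (M, N//2 + 1) -- H_l(k) in Eq. 5
    """
    N = len(frame)
    spectrum = np.abs(np.fft.rfft(frame * np.hamming(N))) ** 2   # |S(k)|^2
    log_energy = np.log(mel_fb @ spectrum + 1e-12)               # inner sum of Eq. 5
    c = dct(log_energy, type=2, norm='ortho')                    # cosine sum -> c(n)
    w = np.zeros_like(c)                                         # Eq. 6 liftering window
    w[:Lc] = 1.0
    ce = w * c                                                   # Eq. 7
    cl = np.log(np.abs(np.fft.fft(ce)) + 1e-12)                  # Eq. 8 (magnitude keeps
                                                                 # the logarithm real)
    return cl[:P]                                                # Eq. 9 feature vector X_m
```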

In summary, the number of neural network inputs equals P. The whole feature extraction process is shown in Fig. 4, and its output is illustrated in Fig. 5.

Fig. 4

The block diagram of the cepstrum liftering feature extraction process

Fig. 5

Correlation matrix for extracted features

The correlation matrix shows the correlation coefficients between the variables (features). Each cell shows the correlation between two variables, which measures how well the values of one can be predicted from the other: the higher the correlation coefficient, the better one variable predicts the other. Pearson and Spearman correlation coefficients lie between −1 and 1. Thus, if a correlation coefficient is close or equal to 1, there is a strong correlation between the two variables, and both change in the same direction.
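For reference, a correlation matrix like the one in Fig. 5 can be computed in a single call; the random matrix below is only a placeholder for the extracted 645 × 44 feature set:

```python
import numpy as np

# X: feature matrix of shape (645, 44) from Sect. 3 (placeholder values here).
X = np.random.randn(645, 44)
R = np.corrcoef(X.T)                # 44 x 44 Pearson correlation matrix, as in Fig. 5
print(R.shape, R.min(), R.max())    # off-diagonal entries near 0 => weakly correlated
```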

As shown in Fig. 5, the extracted features are not highly correlated with each other. Therefore, relatively good features have been extracted (Fig. 6).

Fig. 6

A typical visualization of the cepstrum features of a produced sound (Northern Right Whale)

4 Design of an FChOA-MLPNN for automatic detection of sound produced by marine mammals

The MLP-NN is the simplest and most widely used neural network [23], and automatic target recognition is among its important applications; for this reason, this article uses an MLP-NN as the recognizer [2]. The MLP-NN is one of the most robust neural networks for highly non-linear systems; it is a feed-forward network capable of high-precision non-linear fitting. Nevertheless, a persistent challenge with the MLP-NN is training, i.e., adjusting the connection weights and biases [7].

The steps for training an MLP-NN with a meta-heuristic algorithm are as follows. The first phase determines how the connection weights are represented. The second phase defines the fitness function used to evaluate these connection weights, which for recognition problems can be the Mean Square Error (MSE). In the third phase, the evolutionary process minimizes this fitness function.

The technical design of the evolutionary strategy of connection weights training can be presented in Fig. 7 and Eq. 10.

$$\vec{V} = \left\{ \vec{W}, \vec{\theta} \right\} = \left\{ W_{11}, W_{12}, \ldots, W_{ih}, b_{1}, b_{2}, \ldots, b_{h}, M_{11}, M_{12}, \ldots, M_{hm} \right\}$$
(10)

where i represents the number of input nodes, h denotes the number of hidden nodes, \(W_{ih}\) indicates the connection weight from the ith input node to the hth hidden node, \(b_h\) denotes the bias (threshold) of the hth hidden neuron, and \(M_{hm}\) indicates the connection weight from the hth hidden node to the mth output node.
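A common way to realize this encoding is to flatten all weights and biases into one candidate vector and decode it inside the fitness evaluation. The sketch below assumes a tanh hidden layer and a linear output layer, since the paper does not state the activation functions:

```python
import numpy as np

def unpack(v, n_in, n_hidden, n_out):
    """Decode a candidate vector V (Eq. 10) into MLP weight matrices and biases."""
    i = n_in * n_hidden
    W = v[:i].reshape(n_in, n_hidden)              # W_ih: input -> hidden
    b = v[i:i + n_hidden]                          # b_h : hidden biases
    M = v[i + n_hidden:].reshape(n_hidden, n_out)  # M_hm: hidden -> output
    return W, b, M

def forward(v, X, n_in, n_hidden, n_out):
    """One MLP forward pass for a whole batch X (rows = samples)."""
    W, b, M = unpack(v, n_in, n_hidden, n_out)
    H = np.tanh(X @ W + b)          # tanh hidden activation (assumption)
    return H @ M                    # linear output layer (assumption)
```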

Fig. 7

Introduction of MLP-NN as a search agent for a meta-heuristic algorithm

As previously stated, the MSE is a common criterion for evaluating an MLP-NN, as in Eq. 11.

$$MSE = \mathop \sum \limits_{i = 1}^{m} \left( {O_{i}^{k} - d_{i}^{k} } \right)^{2}$$
(11)

where m is the number of output neurons of the MLP, \(d_i^k\) is the desired output of the ith output node when the kth training sample is used, and \(O_i^k\) denotes the actual output of the ith output node when the kth training sample is presented at the input. To be effective, the MLP must be adapted to a whole set of training samples; therefore, MLP performance is calculated as the mean MSE over all training samples:

$$E = \mathop \sum \limits_{k = 1}^{T} \frac{{\mathop \sum \nolimits_{i = 1}^{m} \left( {O_{i}^{k} - d_{i}^{k} } \right)^{2} }}{T}$$
(12)

T denotes the number of training samples, \(d_i^k\) denotes the desired output of the ith output node for the kth training sample, m denotes the number of outputs, and \(O_i^k\) indicates the actual output of the ith output node for the kth training sample. In the final stage, the recognizer needs a meta-heuristic algorithm to tune the parameters mentioned above. The next section proposes a chimp optimization algorithm (ChOA) enhanced with fuzzy logic, called FChOA, as the trainer.
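Eq. 12 then becomes a short fitness function over the whole training set. This sketch reuses the hypothetical `forward` helper from the Eq. 10 sketch and assumes one-hot desired outputs:

```python
def fitness(v, X, D, n_in=44, n_hidden=89, n_out=6):
    """Eq. 12: mean MSE over all T training samples (lower is better).

    X : (T, n_in) training features,  D : (T, n_out) desired one-hot outputs.
    """
    O = forward(v, X, n_in, n_hidden, n_out)   # actual outputs O_i^k
    return ((O - D) ** 2).sum() / len(X)       # sum over outputs, mean over T
```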

4.1 Fuzzy-ChOA

In this section, a fuzzy system is proposed as a ChOA developer. The first subsection introduces the ChOA algorithm, and the second subsection describes how to use the fuzzy system to adjust the ChOA control parameters.

4.1.1 Chimp optimization algorithm

This algorithm was introduced in 2020 by Khishe and Mosavi [17], inspired by the way chimps hunt. The behavior of the first two roles in team hunting, driving and chasing, is modeled mathematically as follows:

$$d = \left| c \cdot x_{\mathrm{prey}}(t) - m \cdot x_{\mathrm{chimp}}(t) \right|$$
(13)
$$x_{\mathrm{chimp}}(t + 1) = x_{\mathrm{prey}}(t) - a \cdot d$$
(14)

t represents the current iteration number; a, m, and c are coefficient vectors; \(x_{\mathrm{prey}}\) is the prey position vector; and \(x_{\mathrm{chimp}}\) is the chimp position vector. The vectors a, c, and m are calculated by Eqs. (15), (16), and (17), respectively.

$$a = 2 \cdot f \cdot r_{1} - f$$
(15)
$$c = 2 \cdot r_{2}$$
(16)
$$m = \mathrm{Chaotic\_value}$$
(17)

where f decreases non-linearly from 2.5 to zero over the iterations (covering both the exploration and exploitation stages), while \(r_1\) and \(r_2\) are random vectors in the interval [0, 1]. The vector m is calculated from chaotic maps and models the effect of the chimps' sexual motivation on the hunting process.

The stochastic population generation of chimps is the first step in the ChOA algorithm. Next, the chimps are randomly arranged into four independent groups: driver, barrier, chaser, and attacker. Each group strategy will determine the location updating method of individual chimps by determining the f vector, while all of the groups aim to estimate the potential prey’s positions. The c and m vectors are tuned adaptively and will enhance the local minima avoidance and convergence rate.

Chimps (driver, barrier, and chaser) search for prey and then surround it. The hunt itself is usually carried out by the attacker chimps, while the drivers, barriers, and chasers sometimes participate. To model this behavior mathematically, it is assumed that the attacker (the best available solution), the driver, the barrier, and the chaser are better informed about the potential prey's location. Thus, the four best solutions obtained so far are stored, and the other chimps are forced to update their positions according to the best chimps' locations. This relationship is expressed by Eqs. (18), (19), and (20).

$$\begin{gathered} d_{\mathrm{Attacker}} = \left| c_{1} x_{\mathrm{Attacker}} - m_{1} x \right|,\; d_{\mathrm{Barrier}} = \left| c_{2} x_{\mathrm{Barrier}} - m_{2} x \right| \hfill \\ d_{\mathrm{Chaser}} = \left| c_{3} x_{\mathrm{Chaser}} - m_{3} x \right|,\; d_{\mathrm{Driver}} = \left| c_{4} x_{\mathrm{Driver}} - m_{4} x \right| \hfill \\ \end{gathered}$$
(18)
$$\begin{gathered} x_{1} = x_{{{\text{Attacker}}}} - a_{1} \left( {d_{{{\text{Attacker}}}} } \right),\,x_{2} = x_{{{\text{Barrier}}}} - a_{2} \left( {d_{{{\text{Barrier}}}} } \right) \hfill \\ x_{3} = x_{{{\text{Chaser}}}} - a_{3} \left( {d_{{{\text{Chaser}}}} } \right),x_{4} = x_{{{\text{Driver}}}} - a_{4} \left( {d_{{{\text{Driver}}}} } \right) \hfill \\ \end{gathered}$$
(19)
$$x\left( {t + 1} \right) = \frac{{x_{1} + x_{2} + x_{3} + x_{4} }}{4}$$
(20)

The m parameter mathematically models chimps' chaotic behavior in the hunt's final stage to obtain more meat and, subsequently, more social favors such as grooming.
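The full position update of Eqs. 13–20 can be sketched for a whole population as follows; the chaotic-map generator is left abstract, since the paper only states that m comes from chaotic maps:

```python
import numpy as np

def choa_step(positions, leaders, f, chaotic):
    """One ChOA position update (Eqs. 13-20), sketched for a whole population.

    positions : (n_agents, dim) current chimp positions
    leaders   : list of the 4 best positions [attacker, barrier, chaser, driver]
    f         : scalar decreasing from 2.5 to 0 over the iterations
    chaotic   : callable returning a chaotic value in [0, 1] (the m vector)
    """
    n, dim = positions.shape
    new_pos = np.empty_like(positions)
    for i in range(n):
        candidates = []
        for leader in leaders:
            a = 2 * f * np.random.rand(dim) - f          # Eq. 15
            c = 2 * np.random.rand(dim)                  # Eq. 16
            m = chaotic()                                # Eq. 17
            d = np.abs(c * leader - m * positions[i])    # Eq. 18
            candidates.append(leader - a * d)            # Eq. 19
        new_pos[i] = np.mean(candidates, axis=0)         # Eq. 20
    return new_pos
```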

4.1.2 Proposed fuzzy controlled system to adjust ChOA control parameters

All optimization algorithms have essential, influential parameters [30] that play a significant role in the search process: early convergence, convergence rate, escaping local optima, exploration, and exploitation all depend on the correct and dynamic adjustment of these parameters. As shown in the previous section and in Eqs. (15) and (16), a and c are two critical parameters of ChOA; in classical ChOA they are driven by the random components \(r_1\) and \(r_2\). In reference [31], fuzzy inference has been used to adjust control parameters automatically, and according to these studies a fuzzy system is a suitable candidate for intelligent control of the control parameters. It should be noted that efficiency and complexity are directly related in most intelligent systems based on meta-heuristic algorithms: the higher the efficiency, the higher the complexity. Therefore, using an additional intelligent (fuzzy) controller increases the computational complexity but improves performance in high-dimensional feature spaces. Accordingly, this paper uses fuzzy inference to develop ChOA.

The proposed fuzzy system has three stages: fuzzification, fuzzy inference, and defuzzification. The first step, fuzzification, converts the inputs to the fuzzy domain for processing by the fuzzy system. In the second step, the fuzzy system evaluates and infers the rules using the Mamdani inference algorithm. The last step, defuzzification, converts the results of fuzzy inference, which are fuzzy sets, into crisp values.

Figure 8 shows the proposed fuzzy model for adjusting the control parameters of the chimp optimization algorithm.

Fig. 8

A proposed fuzzy model for setting parameters a and c

In this model, the membership functions for fuzzification (shown in Fig. 9) are:

Fig. 9

The membership functions for fuzzification

\(U_n\): the number of iterations in which the fitness function has not changed. The \(U_n\) membership function is shown in Fig. 9(a).

The parameter f: f decreases non-linearly from 2.5 to zero over the iterations. Any continuous function that decreases during the iterations can be used to update the independent groups. The membership function of f is shown in Fig. 9(b).

After fuzzification of the input data, a set of rules is evaluated using the Mamdani algorithm according to Table 1. The Mamdani inference machine's output is then defuzzified through the two membership functions of parameters a and c. The defuzzification stage outputs the two parameters a and c, which, if properly adjusted, define the boundary between the exploration and exploitation phases. The defuzzification membership functions for the two parameters a and c (shown in Fig. 10) are:

Table 1 Fuzzy rules used
Fig. 10

The membership functions for defuzzification

The parameter a: a is a random variable in the range [−5, 5]. This parameter drives the convergence or divergence of the search agents toward the prey. Figure 10(a) shows the membership function of parameter a.

The parameter c: c is a random variable in the range [0, 2]. The vector c must keep generating random values to drive the exploration process in both the initial and the final iterations; this is beneficial for avoiding local minima. Figure 10(b) shows the membership function of parameter c.
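A minimal Mamdani controller in the spirit of Fig. 8 is sketched below. The triangular membership shapes and the two rules shown are illustrative stand-ins; the actual membership functions and rule base are those of Figs. 9 and 10 and Table 1:

```python
import numpy as np

def tri(x, a, b, c):
    """Triangular membership function with feet a, c and peak b."""
    return np.maximum(np.minimum((x - a) / (b - a + 1e-12),
                                 (c - x) / (c - b + 1e-12)), 0.0)

def fuzzy_tune(u_n, f, u_max=30.0):
    """Hedged Mamdani sketch: tune a and c from the inputs U_n and f."""
    # Fuzzification: 'low'/'high' degrees for both inputs (illustrative shapes).
    un_low, un_high = tri(u_n, -u_max, 0, u_max), tri(u_n, 0, u_max, 2 * u_max)
    f_low, f_high = tri(f, -2.5, 0, 2.5), tri(f, 0, 2.5, 5.0)

    # Two illustrative rules: stagnation with small f -> large |a| (explore);
    # otherwise -> small |a| (exploit). Min-implication, max-aggregation.
    ys = np.linspace(-5, 5, 501)                       # output universe of a
    agg_a = np.maximum(np.minimum(min(un_high, f_low), tri(ys, 2, 5, 8)),
                       np.minimum(min(un_low, f_high), tri(ys, -1, 0, 1)))
    a = (ys * agg_a).sum() / (agg_a.sum() + 1e-12)     # centroid defuzzification

    cs = np.linspace(0, 2, 201)                        # output universe of c
    agg_c = np.maximum(np.minimum(min(un_high, f_low), tri(cs, 1.0, 2.0, 3.0)),
                       np.minimum(min(un_low, f_high), tri(cs, -0.5, 0.5, 1.5)))
    c = (cs * agg_c).sum() / (agg_c.sum() + 1e-12)
    return a, c
```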

5 Simulation results and discussion

To demonstrate the power and efficiency of MLP-FChOA, the reference dataset [36] is used in addition to the sounds obtained in Sects. 2 and 3. As already mentioned, the \(X_m\) vector is the input of the MLP-FChOA. The dimension of \(X_m\) is 645 \(\times 44\), i.e., the dataset contains 645 samples with 44 features each. The dimension of the benchmark dataset is 435 \(\times 44\).

In MLP-FChOA, the number of input nodes equals the number of features. 70% of the data was used for training and 30% for testing. For a fair comparison between the algorithms, a stopping condition of 300 iterations is used. There is no definitive formula for the number of hidden-layer neurons, so Eq. 23 is used [26].

$$H = 2 \times N + 1$$
(23)

where N is the number of inputs and H is the number of hidden nodes. The number of output neurons equals the number of marine mammal classes, i.e., six neurons.
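Putting these sizes together yields the network dimensions and the resulting search-space dimension for the trainer (a quick sanity check of Eq. 23 and Eq. 10):

```python
n_features = 44                  # P features from Sect. 3
n_hidden = 2 * n_features + 1    # Eq. 23: H = 2N + 1 = 89
n_classes = 6                    # one output neuron per mammal class
dim = n_features * n_hidden + n_hidden + n_hidden * n_classes
print(n_hidden, dim)             # 89 hidden neurons; a 4539-dimensional vector V
```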

For a fair evaluation of FChOA, its performance is compared against five benchmark algorithms: ChOA [17], CVOA [34], HHO [12], KF [1], and BWO [11]. The initial parameters and primary values of these benchmark algorithms are given in Table 2. The classifiers' performance is then assessed in terms of classification rate, local-minimum avoidance, and convergence speed. Each algorithm is run 33 times; the classification rate, the mean and standard deviation of the minimum error, and the P-value are reported in Table 3. The mean and standard deviation of the minimum error and the P-value indicate each algorithm's strength in avoiding local optima. Figures 11 and 13 provide a comprehensive comparison of convergence speed and the classifiers' final error rates, while Figs. 12 and 14 show ROC curves for evaluating the classifiers. All evaluations were carried out in MATLAB R2020a on a PC with an Intel Core i7-4500U 2.3 GHz processor and 5 GB of RAM, running Windows 10.

Table 2 The initial parameters and primary values of the benchmark algorithms
Table 3 Results were obtained from different algorithms
Fig. 11

Convergence diagram of different training algorithms for benchmark dataset

Fig. 12

The ROC curves and precision-recall curves for different training algorithms for benchmark dataset

Figures 11, 12, 13 and 14 show the simulation results of MLP training with the different meta-heuristic algorithms. For a more comprehensive study and a fairer comparison, Table 3 also includes the results of training the MLP with the traditional gradient descent (GD) method in addition to the meta-heuristic methods. Judging by the measured metrics, GD performs very poorly for MLP training in this investigation.

Fig. 13

Convergence diagram of different training algorithms for the datasets obtained in Sects. 2 and 3

Fig. 14

The ROC curves and precision-recall curves of different training algorithms for the datasets obtained in Sects. 2 and 3

Adding a subsystem to a meta-heuristic algorithm increases its complexity. However, a comparison of the convergence curves in Figs. 11 and 13 shows that FChOA reaches the global optimum faster than the other algorithms, which either became stuck in local optima or converged slowly. In particular, comparing ChOA and FChOA in Fig. 13 shows that the auxiliary fuzzy subsystem is necessary for ChOA to avoid local optima. In general, using a fuzzy system to improve ChOA increases complexity, but the convergence curves and the better performance of FChOA relative to the other algorithms show that this cost is offset. Further evidence that FChOA's performance improves despite the added complexity is its lower MSE compared to the other algorithms.

As shown in Figs. 11 and 13, among the benchmark algorithms used for MLP training, FChOA attains the highest convergence speed by correctly locating the boundary between the exploration and exploitation phases, while KF has the lowest convergence speed. As shown in Table 3, MLP-FChOA has the highest and MLP-KF the lowest classification rate among the classifiers. The STD values in Table 3 show that MLP-FChOA ranks first on both datasets, confirming that FChOA performs better than the other standard training algorithms and demonstrating its ability to avoid getting caught in local optima. A P-value of less than 0.05 indicates a significant difference between FChOA and the other algorithms. As shown in Figs. 12 and 14, the ROC curves also confirm the algorithm's better performance. In general, the precision-recall plot shows the trade-off between recall and precision at various threshold levels; a large area under the precision-recall curve represents both high precision (a low false-positive rate) and high recall (a low false-negative rate). As the curves in Fig. 12 show, FChOA has the largest area under the precision-recall curves.

6 Conclusion

This paper classified marine mammals with an intelligent system that used an MLP-NN as the classifier. The network was trained using ChOA, FChOA, CVOA, BWO, HHO, and KF. The simulation results showed that the fuzzy system adjusts and controls the key parameters of FChOA well, making the algorithm effective at locating the boundary between the exploration and exploitation phases, finding the global optimum, and avoiding getting stuck in local optima. The convergence and ROC curves indicate that MLP-FChOA, MLP-HHO, MLP-CVOA, MLP-ChOA, MLP-BWO, and MLP-KF, in that order, perform best at classifying marine mammals from their emitted sounds, with MLP-FChOA also showing the fastest convergence speed.

The research topics for the future are as follows:

  • Using deep learning to classify marine mammals

  • Using RBF networks and meta-heuristic algorithms to classify marine mammals

  • Investigating the role of chaotic maps in improving the training of ANNs using meta-heuristic algorithms

  • Implementing FChOA on an FPGA to classify marine mammals

  • Design of optimal neural network structure for marine mammal classification