1 Introduction

Various methods like adsorption, chemical precipitation, membrane filtration and electrodialysis were used to treat wastewater. Adsorption is one of the most efficient techniques for treating wastewater because of its advantages of high removal efficiency and ease of operation [1]. Heavy metals and colors are removed from wastewater using several adsorbents such as activated carbon, agricultural wastes, sawdust, zeolite, and clay [2]. However, these adsorbents frequently need additional separation stages with high, which limited their applicability. Nano zero-valent iron (nZVI) gained a lot of attention for environmental applications, because of its high reactivity, low cost, availability, ease of separation, and presence of many active sites [3]. NZVI has various physical features that enable it to remove heavy metals and colors from industrial effluent. It was discovered that nZVI outperformed both mZVI and iron oxide particles when evaluating how well they remove dyes. This was illustrated in depth using a critical physical feature, the available active surface area. Where nZVI has a surface area of roughly (29.67 m2/g) and mZVI has a substantially smaller sur- face area (2.55 m2/g). Using these particles, it was revealed that the efficiency of nZVI was 90%, but the efficiency of mZVI was just 25% [4]. Additionally, nZVI particles have a strong tendency to destroy textile colors [5, 6] and also have a number of chemical features that make them more efficient at degrading colors found during industrial wastewater. The nZVI particles have a vital reaction that degraded the dyes and formed of other simpler chemicals or related compounds. The connection between nZVI and dyes formed when the nZVI (electron donor) lost electrons, which were subsequently taken up by the dye molecules (electron acceptor) [7]. In aqueous environments, the nZVI particles transformed to ferrous Fe2+ and ferric Fe3+ ions, which subsequently reacted with hydroxyl ions in those solutions as a result of oxidation and reduction re- actions between them and also reacted with the dyes to break the chromophore (− N = N −) link [8]. The auxochrome connection was likewise broken by the nZVI particles, which removed the color of dye molecules [9].

Bimetallic nanoparticles were in a nanoscale, which consisted of mixtures of two different metals. When two metals were combined, they can retain the properties of their constituents while also gaining better attributes as a result of the joining process. Bimetallic nanoparticles’ properties were influenced by their size, structure, and morphology [10]. The presence of a small fraction of transition metals with nZVI acted as a reducing agent, speeding up the surface reaction to remove organic molecules and other contaminants [11]. Researchers used different transition elements in recent years to see the most efficient for re-moving colors and heavy metal ions from industrial effluent. Bimetallic Nano zero-valent iron-copper (Fe0-Cu) attracted much attention due to the high efficiency seen in their performance to remove colors [12].

The initial concentration of dyes was one of the most important aspects for selecting the adsorption technique used to treat industrial wastewater. The adsorption efficiency decreased with increasing the dye content [13]. Pores, charges, hydrophilic or hydrophobic character, size, and distribution all affected adsorption effectiveness. It also depended on the surface area and whether functional groups were present or not [14, 15]. The magnetic property inherent in particular adsorbents provided outstanding and remarkable efficiency. This found in various treatments, such as using Nano ilmenite FeTiO3 to remove cationic dyes like methylene blue with high effectiveness of up to 71.9 mg/g [16]. The contact time of the adsorbent materials with the industrial wastewater was critical in removing as many contaminants as feasible. By extending the contact time, the adsorption capacity of the adsorbent increased [17, 18].

Fava bean was widely consumed in several regions, including the Mediterranean region of Europe and Africa, as well as Latin America, China, and India. Fava bean peels were thrown out in large amounts. Just the small portion of such waste was used as animal feed by farmers [19]. Fava bean is one of Egypt’s most important winter crops, being planted from the north to the south [20]. According to Egypt’s Ministry of Agriculture and Land Reclamation, 194.259 tons of fava bean seeds were produced across 41.975 hectares in 2012/2013 [21]. For instance, the number of Egyptian beans imports was expected to in- crease from nearly 410 thousand tons in 2018 to about 500 thousand tons in 2025. The domestic production would decrease from about 144 thousand tons to about 16 thousand tons. Hence, beans consumption was expected to decrease from approximately 281 thousand tons in 2018 to about 72.5 thousand tons in 2025 [22]. Different activation techniques were used to produce activated carbon with different properties. In prior research, fava bean activated carbon powder was employed to remove heavy metal ions with high removal efficiencies [23]. Neural networks are considered one of the most important deep learning techniques, which are a subset of machine learning and are also known as artificial neural networks (ANNs) or simulated neural networks (SNNs) [24]. Their structure and nomenclature are modeled after the human brain, mirroring organic neuron communication. Each node, or artificial neuron, is linked to others and has its own weight and threshold. Any node whose output exceeds the defined threshold value is activated and begins sending data to the network’s top layer. Otherwise, no data is transferred to the next network layer.

The artificial neural network (ANN) node layer is divided into three main types of layers, an input layer, an output layer, and one or more hidden layers. The concentration, dose, and time are considered the three main neurons of the input layer. The removal efficiency is considered a desired value and the only neuron inside the output layer. The hidden layer contains (3 to 15) neurons, the tenth neuron detected the least error of the most adsorption processes.

The input data are analyzed to be in the 0 to 1 range. The network’s input variables were dosage, contact time, and temperature, while the network’s output layer was removal efficiency. The experimental data set obtained by using the RSM was appropriate for evaluating the ANN model. The data set was split into three categories: 70% of the data is used to train the network, 15% for validation, and the balancing for testing the results. The network’s performance was evaluated independently by the test set. The validation set altered the network’s bias, variance, and generalization.

Response Surface Methodology (RSM) and Artificial Neural Network (ANN) combined with optimization algorithm are considered more effective for the opti- mization process [24]. The ANN model can learn linear, nonlinear and complex relationships between process variables [25]. ANN model is implemented to predict removal efficiency. The Moth search algorithm (MSA) is a metaheuristic algorithm applied to the ANN model to get optimum conditions that achieve maximum removal efficiency. Methyl Orange (M.O) removal by polyaniline- based nano-adsorbent was investigated in [26]. RSM was implemented to predict removal efficiency. Furthermore, ANN was integrated with differential evolution optimization (DEO) for the prediction of removal efficiency. ANN-DEO had better accuracy in prediction than RSM. However, the maximum removal efficiency was not evaluated. Furthermore, the polyaniline nano-adsorbent was used to adsorb M.O in [27]. ANN was implemented to predict the removal efficiency. OFAT was implemented to get the maximum removal efficiency. The M.O removal by lignin-derived zeolite templated carbon materials was optimized by RSM with Box–Behnken design [28]. RSM was used to find the most influential factor on the removal efficiency.

This study aims to produce low-cost adsorbents such as nZVI, Bimetallic Fe0-Cu, and Fava Bean Activated Carbon-Supported Bimetallic AC-Fe0-Cu to treat the industrial wastewater from the anionic methyl orange (M.O) dye. RSM and ANN models predict the removal efficiency for different values for the factors. The maximum removal efficiency is evaluated by optimizing these mod- els. ANN is optimized by Moth Search Algorithm (MSA). These optimization techniques predict the maximum removal efficiency and the corresponding factor values. The optimization results are validated experimentally. Isotherm and kinetic models are implemented to understand the chemical characteristics. They provide information about the type of adsorption and the adsorbent surface. Furthermore, the maximum amount adsorbed can be estimated by them.

2 Experimental Work

2.1 Chemicals and Reagents

All used chemicals and reagents in the present work are high-grades, including ferric chloride hexahydrate (FeCl3.6H2O 97% pure, LOBA Chemie), sodium borohydride (NaBH4 95% pure, ADVENT), copper chloride dihydrate (CuCl2.2H2O, BIOCHEM), charcoal activated (LOBA Chemie), ethanol (C2H5OH 96% pure), and methyl orange. Deionized H2O is used for all experiments Fig. 1.

Fig. 1
figure 1

The graphical abstract illustrates the main steps of the brand-new manufacture technology of the low-cost adsorbents (nZVI)

2.2 Preparation of Nano Zero Valent Iron (nZVI)

As shown in Eq. 1, the preparation procedure is based on the chemical reduction of ferric chloride hexahydrate FeCl3.6H2O by sodium borohydride NaBH4. Equivalent volumes of 0.1 M FeCl3.6H2O solution and 0.5 M NaBH4 solution are prepared in the laboratory for this approach. The ferric chloride solution in the conical flask is stirred constantly at 250 rpm. In contrast, a sodium borohydride solution is added drop by drop from the burette. After mixing, the solution is stirred for another 20 min. To ensure that the reduction reaction for the iron ions in the solution is finished. Following that, filtration is used to separate the produced precipitate, as shown in Fig. 2. The isolated precipitate is rinsed with deionized water three times and then with ethanol once. Finally, the product is dried at 70 and stored at room temperature under an ethanol layer [29].

$$\begin{aligned} & 2\left( {FeCl_{3} .6H_{2} O} \right) + 6NaBH_{4} + 6H_{2} O \\ & \quad \to 2Fe^{0} + 21H2 + 6B\left( {OH} \right)_{3} + 6NaCl\end{aligned}$$
(1)
Fig. 2
figure 2

Preparation steps of nZVI a Materials, b Titration & Stirring, c Filtration

2.3 Preparation of Bimetallic (Fe0–Cu)

Using the co-precipitation method, a 0.002 M copper chloride dihydrate CuCl2.2H2O solution is added to the FeCl3.6H2O solution before adding the NaBH4 solution from the burette under the same conditions [30]. The redox reaction between Cu2+ and nZVI is mostly determined by the reaction Eq. 2.

$$ Fe^{0} + Cu^{2 + } \to Fe^{2 + } + Cu^{0} $$
(2)

2.4 Preparation of Fava Bean Activated Carbon (AC)

Fava bean peel waste is collected from local markets in Egypt. It is then washed, dried overnight at 105 °C, and ground into tiny pieces. Physical and chemical activation techniques are both accessible. Physical activation consists of a two-step process that begins with high-temperature carbonization and ends with activation. Chemical activation is a combined carbonization and activation process that takes one step at a low temperature.

2.4.1 Physical Activation

The fava bean peel waste is heat-treated at a high tem- perature after grinding thoroughly in the physical activation technique. 50 g of fava bean is placed in a muffle at 400 °C for one h, then at 600 °C for two h. Finally, for 15 min., the temperature is lowered to 300 °C. It is kept in a sealed bottle until it is used to protect the activated carbon (AC) from humid- ity [31, 32].

2.4.2 Chemical Activation

Different chemical activators are used in the chemical activation process. H3PO4 or H2SO4 are used as activators in two forms of activated carbon. 250 ml of 85% H3PO4 is added to 50 g fava bean peel waste, which is then mixed for 4 h at 85 °C on a magnetic stirrer before drying for 24 h at 110 °C [33]. The sample was then placed in a muffle for one hour at 600 °C, washed many times to obtain pH 7, and dried in an oven at 110 °C [34].

In the case of H2SO4 activation, 150 ml of 4 M H2SO4 is added to 50 g of fava bean peel waste and mixed at room temperature to make a homogeneous mix- ture [35]. After that, the mixture is placed for two h at 110 °C, then carbonized for three h at 400 °C, washed with deionized water until pH  7 is obtained, and dried in an oven at 110 °C [36].

2.5 Preparation of Fava Bean Activated Carbon-Supported Bimetallic (AC-F e0- Cu)

Before adding sodium borohydride, activated carbon-supported bimetallic (AC-F e0-Cu) is made by adding activated carbon to a ferric chloride solution (1 g/L of AC per a solution of 0.1 M FeCl3.6H2O and 0.002 M CuCl2.2H2O) [37]. Physically activated carbon, H2SO4 chemically activated carbon, H3PO4 chemically activated carbon, and commercial charcoal are used to make the (AC-F e0-Cu); moreover, the bimetallic particles are supported on the raw fava bean after grinding.

2.6 Adsorption Experiments

The efficiency of the produced compounds is tested using 1000 ppm methyl orange (MO) dye. The (MO) concentration is measured using a UV-V spec- trophotometer set at 465 nm. The removal efficiency is investigated using a pH range of 3 to 9, a contact time between the adsorbent material and MO ranging from 15 to 180 min., and methyl orange concentrations ranging from 25 to 1000 ppm. Shaking at 120 rpm and room temperature is used to carry out all adsorption processes [38]. The percentages of removal are computed using the equation below (Eq. 3):

$$ Sorption\left[ \% \right] = \left[ {C_{0} - C_{e} } \right]/C_{0} \times 100 $$
(3)

where C0 represents the initial concentration of M.O solution (mg/L) and Ce represents the equilibrium concentration of it (mg/L). The following Eq. 4 is used to compute the quantity of M.O dye extracted by the adsorbent material:

$$qe[mg/g] = [[C0 - Ce] \times V ]/m$$
(4)

where \({q}_{e}\) represents the equilibrium adsorption capacity (mg/g), V repre- sents the dye solution volume (L), and m represents the adsorbent mass (g).

2.7 Characterization of nZVI, Fe0-Cu and AC-Fe0-Cu

The size and charge of adsorbent material particles are determined using dynamic light scattering (DLS). The functional groups are identified using Fourier transition infrared (FT-IR). The structural groups of the adsorbent materials are demonstrated using X-ray diffraction (XRD) analysis. A scanning electron microscope (SEM) shows the surface morphology.

2.8 Adsorption Isotherms and Adsorption Kinetics Optimization

Models are built using response surface methodology (RSM) by setting up a series of experiment runs and fitting empirical models to data received under the specified design. Equation 5 is used to derive the anticipated responses.

$$ Y = \beta_{0} + \sum\limits_{i = 1}^{k} {\beta_{i} x_{i} } + \in $$
(5)

where Y represents the expected response, xi represents the input variable, k represents the number of variables, β0 represents the constant term, βi repre- sents the linear coefficient, and ϵ represents the experimental residual.

MATLAB (R2019a) fits the kinetics using the pseudo-first-order, pseudo- second-order, Elovich, and intra-particle diffusion models. The isothermal mod- els of diverse materials are investigated by Langmuir, Temkin, Freundlich, Du- binin Radushkeivch, and Harkins–Jura.

3 Results and Discussion

3.1 Adsorption STUDIES and EFFECT of Operating Parameters

The removal efficiency of the produced adsorbents tests against 1000 ppm M.O dye. In addition, the removal efficiency is investigated at pH levels ranging from 3 to 9, contact times between the adsorbent and M.O ranging from 15 to 180 min., and M.O concentration ranging from 25 to 1000 ppm. Shaking at 120 rpm and room temperature is used to carry out all adsorption processes.

3.1.1 Screening Test and Effect of pH

In this section, the screening experiment is combined with the effect of pH to get the best adsorbent materials, as shown in Fig. 3. The various produced materials of nZVI, Fe0-Cu, and different AC-Fe0-Cu are screened by treating 10 ml of 1000 ppm of the synthetic M.O solution with 0.01 g of the adsorbent. Different pH values of 3, 5, 7, and 9 are used simultaneously to determine the best pH value of which adsorbent. The obtained results indicate that as the pH value is decreased, the removal efficiency of M.O increases. That is due to the positive surface charges and more robust interactions with anionic M.O dye (pH = 3). At pH = 3, the removal efficiency of the adsorbent materials varied from 93.44% to 97.34%

Fig. 3
figure 3

Screening experiment with applied different pH values for Methyl orange removal using different prepared adsorbent materials

3.1.2 Effect of Contact Time

The bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu are selected from the screening test, which has the best removal efficiencies among the produced adsorbent materials, with removal capacities of 678.66, 666.65, and 679.47 mg/g, respectively. The impact of contact time is investigated by combining the same dose of 0.01 g of adsorbent material and 10 ml of 1000 ppm dye solution at room temperature with shaking at 120 rpm and various contact times of 15, 30, 60, 90, 120, 150, and 180 min, as shown in Fig. 4. The optimum residence time to reach the highest removal efficiency by Bimetallic F e0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0- Cu is 180 min. science, 97.2%, 94.3%, and 99.1% are the achieved removal efficiency by Bimetallic F e0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu, respectively. The contact time tests show that as the contact time increases, the removal efficiency increases. Bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu have equilibrium removal capacities of 779.62, 759.59, and 786.03 mg/g, respectively.

Fig. 4
figure 4

Contact time experiment for Methyl Orange removal using bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu

3.1.3 Effect of Concentration

The chosen adsorbent materials are tested to absorb different concentrations of M.O dye of 25, 50, 100, 250, 500, 750, and 1000 ppm at the optimal time and pH values (3 h and pH 3), as shown in Fig. 5. The Adsorption capacity increases as the initial concentration of dye increases. Raising the initial concentration increases the moving power of the dye molecules from the bulk to the adsorbent material surface.

Fig. 5
figure 5

Effect of different Methyl orange concentrations on the adsorption capacity using bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu

3.2 Characterization of Adsorbent Materials

DLS, FT-IR, XRD, and SEM are used to Characterize the selected adsor- bents (bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC- Fe0-Cu) as follows:

3.2.1 Dynamic Light Scattering (DLS) Results

The particle size and charge of the chosen adsorbents are determined using DLS, as illustrated in Fig. 6 and summarized in Table 1. At pH = 3, these charges neutralize and become positive, which may be owing to the protonation of the adsorbents’ amino groups to be more suitable to adsorb and remove the anionic M.O dye. All prepared adsorbents are in nanoscale, with dimensions of 37.84, 458.66, and 341.9 nm for Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu, respectively.

Fig. 6
figure 6

DLS results of a bimetallic Fe0-Cu, b H2SO4 chemical AC-Fe0-Cu, and c commercial AC-Fe0-Cu

Table 1 DLS potential of the prepared adsorbent materials

3.2.2 Fourier Transition Infrared (FT-IR) Results

FTIR is considered one of the most important characterization methods used to detect functional groups on the surface of adsorbent materials: Fig. 7 and Table 2 represent the FTIR analysis and the copious functional groups detected on the surface of the three types of adsorbents bimetallic Fe0-Cu, H2SO4 chemical AC-\(F{e}^{0}\) -Cu, and commercial AC-\(F{e}^{0}\)-Cu. A copious functional group detected for both bimetallic \(F{e}^{0}\)-Cu, and commercial AC-F e0 -Cu adsorbent materials state their highest capacity for M. orange removal than H2SO4 chemical AC-\(F{e}^{0}\)-Cu as represented in Fig. 6.

Fig. 7
figure 7

FT-IR spectrum of a bimetallic Fe0-Cu, b H2SO4 chemical AC-Fe0-Cu, and c commercial AC-Fe0-Cu

Table 2 FT-IR spectrum of (a) bimetallic Fe0-Cu, (b) H2SO4 chemical AC-Fe0-Cu, and (c) commercial AC-Fe0-Cu

The spectra of the bimetallic Fe0 -Cu showed a strong peak at 3383.1 \(C{m}^{-1}\) associated with OH and NH stretching polyphenols, and the one at 2975.43 \(C{m}^{-1}\) is assigned to CH-CH2 stretching aliphatic group. The peak that appeared within the range 2100–2550 \(C{m}^{-1}\) is associated with the C = C conjugated C–C. Another beak that appeared at 1923.41 \(C{m}^{-1}\) is associated with Carboxyl acid. The peak that appeared at 1650.28 \(C{m}^{-1}\) is associated with the C = O Amide I band. The peak that appeared within the range (1400–1460) \(C{m}^{-1}\) is associated with the stretching inorganic carbonate C = O. Another beak that appeared at 1381.87 \(C{m}^{-1}\) is associated with CH and CH2 aliphatic bending groups. The peak that appeared within the range (1240–1340) \(C{m}^{-1}\) is associated with the C-N Amide III band. The peak that appeared at 1088.01 \(C{m}^{-1}\) is associated with the P = O from phospholipids. The peak that appeared at 1048.24 \(C{m}^{-1}\) is associated with the Alkyl amine. Another beak that appeared within the range (800–900) \(C{m}^{-1}\) is associated with C = C, C = N, C–H in the ring structure. The peak that appeared at 655.75 \(C{m}^{-1}\) is associated with the C = O bending. Another beak that appeared at 434.58 \(C{m}^{-1}\) is associated with Alkyl halides. The spectra of the H2SO4 chemical AC-F e0 -Cu showed a strong peak at 3442.54 \(C{m}^{-1}\) is associated with the OH of carbohydrates. The peak that appeared at 2984.81 \(C{m}^{-1}\) is associated with the CH of the aromatic ring. The peak that appeared at 2090.19 Cm1 is associated with the C = C conjugated and C ≡ C. The peak that appeared at 1635.13 Cm1 is associated with the 1635.13. Another beak that appeared within the range (1400–1460) \(C{m}^{-1}\) is associated with stretching –C = O inorganic carbonate. The peak that appeared at 1166.88 Cm1 is associated with C–O–C polysaccharide a. The peak that appeared at 1082.34 \(C{m}^{-1}\) is associated with the P = O from phospholipids. The peak that appeared at 1044.54 \(C{m}^{-1}\) is associated with the Alkyl amine group. Another beak that appeared at 568.54 \(C{m}^{-1}\) is associated with the Alkyl halides.

The spectra of commercial AC-\(F{e}^{0}\) -Cu showed a strong peak at 3650.45 Cm1 is associated with the OH stretching vibration. The peak that appeared at 3423.58 \(C{m}^{-1}\) is associated with the OH of carbohydrates. The peak that appeared at 2962.25 \(C{m}^{-1}\) is associated with the CH3 stretching asym. The peak that appeared at 2923.5 \(C{m}^{-1}\) is associated with CH and CH2 aliphatic stretching groups. The peak that appeared at 2853.53 \(C{m}^{-1}\) is associated with the C-H stretching sym. of > CH2 in fatty acid chains, -CH2 stretching sym. methylene chains in membrane lipids Saturated fatty acids. Another beak that appeared at 1662.59 Cm1 and 1637.73 \(C{m}^{-1}\) is associated with Amide I of proteins. The peak that appeared at 1379.63 \(C{m}^{-1}\) is associated with the C = O stretching sym. of COO-: fatty acids and amino acids. The peak that appeared at 1166.79 \(C{m}^{-1}\) is associated with the ester C–O–C str., asym. -C–O ester str., single bond mainly phospholipids. Another beak that appeared at 1044.97 \(C{m}^{-1}\) is associated with the Glycogen band due to OH str. coupled with -C-O str.-C-O of C OH groups of carbohydrates. Another beak that appeared at 839.81 \(C{m}^{-1}\) is associated with an Aromatic plane ring with 2 neighboring C-H groups. A small peak that appeared at 491.73 \(C{m}^{-1}\) is associated with the Alkyl halide groups.

3.2.3 X-Ray Diffraction (XRD) Analysis Results

The amorphous shape appears in different images of Fig. 8. The peak at 2θ = 44.5 proves the existence of zero-valent iron Fe0 [39]. The peak of activated carbon is at 2θ = 24. However, the nZVI surface peaks at 2θ = 35, which corresponds to the iron oxides Fe2O3 and Fe3O4. The presence of Fe2O3 and Fe3O4 is induced by partial oxidation of nZVI during processing and transfer, which produces oxides on the surface of adsorbents [40].

Fig. 8
figure 8

XRD analysis of a bimetallic Fe0-Cu, b H2SO4 chemical AC-Fe0-Cu, and c commercial AC-Fe0-Cu

3.2.4 Scanning Electron Microscope (SEM) Results

The surface morphology of the adsorbents is investigated by a scanning elec- tron microscope (SEM), as shown in Fig. 9. The synthesized bimetallic Fe0-Cu particles (Fig. 8a) have a semi-spherical shape with an average size of 434.03 nm. It looks like a huge nanocluster, which might be owing to magnetic forces between the nanoparticles. Many pores in the bimetallic Fe0-Cu nanoparticles allow the M.O dye impurities to be transported better. (Fig. 8b) The H2SO4 chemical AC has a huge semi-continuous surface with a spraying of Fe0-Cu nanoparticles with numerous fractures to absorb the M.O dye impurities. (Fig. 8c) shows a commercial AC that has been covered with Fe0-Cu nanoparticles with an average size of 314.23 nm. There is a broad surface area with numerous holes to facilitate the absorption of MO dye contaminants.

Fig. 9
figure 9

SEM images of a bimetallic Fe0-Cu, b H2SO4 chemical AC-Fe0-Cu, and c commercial AC-Fe0-Cu

3.3 Kinetics Study

The Pseudo-first-order, Pseudo-second-order, and Intra-particle diffusion mod- els are applied to fit the results of bimetallic Fe0-Cu, H2SO4 chemical AC- Fe0-Cu, and commercial AC-Fe0-Cu adsorption using MATLAB(R2019a) as illustrated in Tables 3, 4, and 5. By identifying characteristics and employing kinetic models, the adsorption kinetics study seeks to characterize and quantify the adsorption process. The kinetics of adsorption is determined using various kinetic models that are tailored to the experimental data. To determine which kinetic model best fits the experimental data, the computed results from the kinetic models are contrasted with the data, and the highest correction factor is detected. For example, for adsorption kinetic, pseudo-first-order kinetic, and intraparticle diffusion models, which are implemented for each type of adsorbent [41]. The experimental data were fit by pseudo-first-order (PFO), pseudo-second-order (PSO), and intra-particle models. As shown in Tables 3, 4, and 5, The pseudo-second-order is the best fit for bimetallic F e0–Cu with R2 = 0.9913. The pseudo-first-order is the best fit for H2SO4 chemical AC-F e0-Cu with R2 = 0.902. The Intra-particle diffusion is the best fit for commercial AC-F e0 -Cu with R2 = 0.94. Table 6 shows the evaluation of the kinetic model parameters.

Table 3 Comparison between nonlinear kinetic models for M.O removal by bimetallic Fe0 Cu
Table 4 Comparison between nonlinear kinetic models for M.O removal by H2SO4 chemical AC − Fe0 − Cu
Table 5 Comparison between nonlinear kinetic models for M.O removal by commercial AC − Fe0 − Cu
Table 6 Kinetic parameters for different models for the adsorption process of M.O using bimetallic Fe0 − Cu, H2SO4 chemical AC−Fe0–Cu, and commercial AC−Fe0 − Cu

3.3.1 The Pseudo-First-Order and The Pseudo-Second-Order Adsorption Kinetic Models

Most of the adsorption kinetic research conducted over the last two decades modeled their kinetic datasets using the traditional PFO and PSO rate laws. These two models have been used to investigate a variety of adsorption systems. As contaminants or adsorbates, biomass, nanomaterials, heavy metals, and medicines are all possible. Ho and McKay (1999) evaluated several literature datasets using the linear versions of these two models. Since then, the models have grown in popularity, PSO more so than PFO because it can fit most kinetic datasets and is thus regarded as the better of the two. Because these models are so widely used, they have generated a modeling practice cul- ture that introduces uncertainty and inaccuracies that are frequently ignored or dismissed. The pseudo-first-order and pseudo-second-order Adsorption kinetic models are introduced mathematically [42].

3.3.2 Intra-Particle Diffusion Model

The absorption mechanism is either particle diffusion control or film diffusion control. Before adsorption, several diffusion mechanisms are known to influence the adsorption process to occur. Before entering the sorbent’s micropores and macrospores, the sorbate must diffuse through most of the solution to the film surrounding the adsorbent. The first is bulk diffusion resistance, which naturally decreases with enough agitation to reduce the concentration gradient. The resistance to external mass transfer is the second, and intraparticle mass transfer is the third. When the rate-limiting step is the last, intraparticle diffusion controls the sorption mechanism. The Intra particle diffusion model’s mathematical form is introduced [43].

3.4 Isotherm Models

Isothermal models include Temkin, Freundlich, Dubinin-Radushkeivch (DR), and Nonlinear Langmuir models. The state of equilibrium of an adsorption isotherm describes the process of a substance being adsorbed on a surface while the temperature remains constant. It shows how much substance is bonded to the surface of the substance in the solution. The adsorbate is the substance that must be removed during the adsorption process, and the adsorbent is the material onto which it is adsorbed. Adsorption isotherms are used to calculate the affinity of the adsorbate for the adsorbent. To provide experimental data, the nonlinear Langmuir, Temkin, Freundlich, and Dubinin-Radushkeivch(DR) models are chosen [44]. The Dubinin-Radushkeivch (DR) model is the best fit for the bimetallic Fe0Cu, as shown in Table 7. The concentration experiments’ results are consistent with the isotherm models. As illustrated in Tables 7, 8, and 9. Langmuir, Freundlich, Temkin, and Dubinin Radushkeivch (DR) mod- els are used to fit the data for M.O removal by bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu. Freundlich and DR models are the best fit results for bimetallic Fe0-Cu and H2SO4 chemical AC-Fe0- Cu, with R2 = 0.9367 and R2 = 0.9908 for bimetallic Fe0-Cu, and with R2 values of 0.9996 and 0.9995, respectively. Freundlich is the best fit model for M.O removal by commercial AC-Fe0-Cu, with R2 = 0.9674. The adsorption is physisorption for the three different adsorbents. The isotherm parameters are evaluated as shown in Table 10. The Freundlich and the DR models have a high R-square; however, the mean free energy (E) is higher than 8, as shown in Table 10. Therefore, adsorption is Physisorption. The maximum amount adsorbed \({q}_{e}\) equals 1174 calculated from the DR model for the bimetallic \(F{e}^{0}\) Cu. The maximum amount adsorbed \({q}_{e}\) equals 1200 calculated from the Langmuir model for the bimetallic \(F{e}^{0}\) Cu, as shown in 10.

Table 7 Comparison between Isothermal models for M.O removal by bimetallic Fe0 − Cu
Table 8 Comparison between Isothermal models for M.O removal by H2SO4 chemical AC − Fe0 − Cu
Table 9 Comparison between Isothermal models for M.O removal by commercial AC-\(F{e}^{0}-Cu\)
Table 10 Isothermal adsorption parameters for different models for the adsorption process of M.O using bimetallic Fe0 − Cu, H2SO4 chemical AC − Fe0 − Cu, and commercial AC- Fe0- Cu

3.4.1 Temkin Model

Temkin’s isotherm model considers the effect of indirect adsorbate/adsorbate interactions on the adsorption process and assumes that increasing surface coverage results in a linear decrease in the heat of adsorption for all molecules in the layer. The Temkin isotherm is only applicable at intermediate ion concentrations. The Temkin model’s mathematical form is introduced [45].

3.4.2 Freundlich Model

The Freundlich isotherm governs adsorption activities on heterogeneous sur- faces. This isotherm provides an equation that characterizes the exponential distribution of energy and surface heterogeneity at active sites. The linear form of the Freundlich isotherm, as well as the adsorption capacity and intensity, show how the energy is distributed relatively and how heterogeneous the adsorbate sites are [45].

3.4.3 Dubinin-Radushkeivch (DR) Model

The Dubinin-Radushkeivch isotherm model, an empirical adsorption model, is frequently used to represent adsorption mechanisms with Gaussian energy distribution onto heterogeneous surfaces. This isotherm is only suitable for middle ranges of adsorbate concentrations due to its unrealistic asymptotic behavior and failure to anticipate Henry’s laws at low pressure. In a semiempirical equa tion, adsorption is modeled as a pore-filling process. It is a fundamental equation that characterizes the adsorption of gases and vapors on microporous sorbents qualitatively. It has a multilayer structure that involves Van Der Waal’s forces, which are important in physical adsorption processes. Langmuir adsorption, which was developed to characterize gas–solid phase adsorption, is used to measure and compare the adsorption capacity of various adsorbents. By balancing, the Langmuir isotherm explains surface coverage [45].

3.5 Response Surface Methodology (RSM) Optimization for M.O Removal by Bimetallic Fe0-Cu, H2SO4 Chemical AC-Fe0-Cu, and Commercial AC-Fe0- Cu

3.5.1 RSM for the Adsorption Using the Bimetallic Fe0-Cu

The data in Table 11 fits the quadratic model, and the model is generated as follows:

$$ y = - 28.45 + 0.0673x_{1} + 200.852x_{2} + 0.30356x_{3} - 0.0156x_{1} x_{2} - 0.000018x_{1} x_{3} - 0.12091x_{2} x_{3} - 0.000015x_{1}^{2} - 115.97x_{2}^{2} - 0.000944x_{3}^{2} $$
(6)

whereby, x1, x2, x3 are the removal efficiency, concentration, dose, and time, re- spectively. The quadratic and linear models are suggested due to the results of the fit statistics, as shown in Table 12. The quadratic model is not aliased as in the cubic model, and it has R2 = 0.8371, as shown in Fig. 10.

Table 11 The FCCD design matrix for the adsorption using bimetallic Fe0-Cu
Table 12 The fit summery for the adsorption using bimetallic Fe0-Cu
Fig. 10
figure 10

The predicted removal efficiency versus the actual for the adsorption using bimetallic Fe0-Cu

The ANOVA analysis is used to identify the essential factors that affect the removal efficiency, as shown in Table 13. Model terms with P-values less than 0.0500 are significant. In this situation, A is an effective model term, and A is the concentration. The model terms are not important if the value is bigger than 0.1000. Model reduction may enhance the model if there are numerous inconsequential model terms.

Table 13 ANOVA analysis for the adsorption using bimetallic Fe0-Cu

3.5.2 RSM for the Adsorption using H2SO4 Chemical AC-Fe0-Cu

The data in Table 14 fits the quadratic model, and the model is generated as follows:

$$ y = {3}.{478} + 0.{2498}x_{{1}} + {3}.{8515}x_{{2}} - 0.0{225}x_{{3}} - 0.00{988}x_{{1}} x_{{2}} - 0.000067x_{1} x_{3} + 0.08882x_{2} x_{3} - 0.000151x_{1}^{2} - 1.8982X_{2}^{2} - 0.00008X_{3}^{2} , $$
(7)

whereby, x1, x2, x3 are the removal efficiency, concentration, dose, and time, re- spectively. The quadratic model is recommended based on the fit statistics, as illustrated in Table 15. The quadratic model is not aliased as in the cubic model, and it has R2 = 0.9984, as shown in Fig. 11.

Table 14 The FCCD Centered design matrix for the adsorption using H2SO4 chemical AC-Fe0-Cu
Table 15 The fit summery for the adsorption using H2SO4 chemical AC-Fe0-Cu
Fig. 11
figure 11

The predicted removal efficiency versus the actual for the adsorption using H2SO4 chemical AC-Fe0-Cu

The ANOVA analysis is used to identify the essential factors that affect the removal efficiency, as shown in Table 16. Model terms with P-values less than 0.0500 are significant. A, C, AC, BC, and A2 are crucial model terms in this scenario, where A denotes concentration, B denotes dosage, and C denotes time. The model terms are not important if the value is bigger than 0.1000. Model reduction may enhance the model if there are numerous inconsequential model terms.

Table 16 ANOVA analysis for the adsorption for H2SO4 chemical AC-Fe0-Cu

3.5.3 RSM for the Adsorption Using the Commercial AC-Fe0-Cu

The data in Table 17 fits the quadratic model, and the model is generated as follows:

$$ y = - {1}.{656} + 0.{2243}x_{{1}} + {15}.{274}x_{{2}} + 0.{1282}x_{{3}} + 0.00{21}x_{{1}} x_{{2}} - 0.00005x_{1} x_{3} + 0.0133x_{2} x_{3} - 0.000138x_{1}^{2} - 10.6036x_{2}^{2} - 0.000413x_{3}^{2} $$
(8)

whereby, x1, x2, x3 are the removal efficiencies with different concentration, dose, and time, respectively. The quadratic model is recommended based on the fit statistics, as illustrated in Table 18. The quadratic model is not aliased as in the cubic model, and it has R2 = 0.999, as shown in Fig. 12.

Table 17 The FCCD Centered design matrix for the adsorption using commercial AC-Fe0-Cu
Table 18 The fit summery for the adsorption using commercial AC-Fe0-Cu
Fig. 12
figure 12

The predicted removal efficiency versus the actual for the adsorption using com- mercial AC-Fe0-Cu

The ANOVA analysis is used to identify the essential factors that affect the removal efficiency, as shown in Table 19. Model terms are significant if their P-values are less than 0.0500. A, C, AC, A2, and C2 are important model variables in this situation, where A represents concentration and C represents time. The model terms are not significant if their values exceed 0.1000. Model reduction may enhance the model if there are a lot of inconsequential model terms.

Table 19 ANOVA analysis for the adsorption using commercial AC-Fe0-Cu

3.6 Artificial Neural Network (ANN)

Deep learning techniques depend on neural networks, also known as artificial neural networks (ANNs) or simulated neural networks (SNNs), which are a subset of neural networks. AI stands for machine learning. Their structure and nomenclature are modeled after the human brain, mirroring organic neuron communication. Each node, or artificial neuron, is linked to others and has its own weight and threshold. Any node whose output exceeds a predefined threshold value is activated and sends data to the network’s top layer. If not, information from the next network level is retained. An artificial neural network (ANN) node layer consists of an input layer, one or more hidden layers, and an output layer.

Three neurons make up the input layer: concentration, dose, and time. The removal efficiency is represented by one neuron in the output layer. The hidden layer’s neuron count ranged from 3 to 15, and in most adsorption techniques, ten is the best number that achieves a minor error. The input data is normalized to be in the 0 to 1 range. The network’s input variables were dosage, contact time, and temperature, while the network’s output variable was removal efficiency. the layer from which the network output. It was claimed that the experimental data set obtained by using the RSM was enough to properly assess the ANN model. ANN is applied to the same data of RSM that is shown in Tables 11, 14, and 17. ANN is implemented using MATLAB (R2019a), and the feed-forward network is selected. The network is made up of three layers: an input layer, a hidden layer, and an output layer, where the input and output layer contain three neurons and one neuron, respectively. The number of neurons in the hidden layer is ten after experimenting with other values. That has the smallest mean square error. Tansig is the transfer function between the input and hidden layers, while a pure line (Purline) is the transfer function between the hidden and output layers. The ANN has a higher R2 values than the RSM models, as confirmed in the regression plots shown in supplementary figures (S-1,S-2,S-3).

The data set was split into three groups; 70% of them were used to train the network, 15% to validate it, and the remaining 15% to test the results. The test set independently assessed the performance of the network. The validation set modified the network’s bias, variance, and generalization. R2019b MATLAB was utilized as the programming language, and TensorFlow was used for the ANN calculations. The training, validation, testing, and overall correction fac- tors are obtained in supplementary figures (S-1,S-2,S-3).for bimetallic F e0 -Cu, H2SO4 chemical AC-F e0 -Cu, and commercial AC-F e0–Cu, respectively.

Table 11 indicates experimental removal efficiency, RSM predicted removal efficiency, RSM residual, ANN predicted removal efficiency, and ANN residual for the bimetallic F e0–Cu. Tables 14 and 17 obtain experimental removal efficiency, RSM predicted removal efficiency, RSM residual, ANN pre dicted removal efficiency, and ANN residual for H2SO4 chemical AC-F e0 -Cu, and commercial AC-F e0–Cu, respectively. The previous values in all these Tables 11, 14, and 17 are obtained through 20 runs with three different space types (factorial, axial, and center). Five neurons are the best number in the case of using Bimetallic F e0–Cu. In Table 11 for the Bimetallic F e0 -Cu, the best experimental and predicted removal efficiency is 96.8% RE (through runs 1, and 14). In Table 14 for H2SO4 chemical AC-F e0 -Cu, the best experimental and predicted removal efficiency is 96.25% RE (through run 18). In Table 17 for the commercial AC-F e0–Cu, the best experimental and predicted removal efficiency is 94.93%RE (through run 5).

The use of artificial neural networks (ANN) for mapping, regression, modeling, clustering, classification, and multivariate data analysis is gaining popularity. The ANN is a multivariate statistical technique used to describe a wide range of mathematical objects and processes. Nonlinearity, which allows for a better fit to the data; noise insensitivity, which provides accurate prediction in the presence of uncertain data and measurement errors; high parallelism, which implies fast processing and hardware failure tolerance; learning and adaptability, which allows the system to update (modify) its internal structure in response to changing environment and generalization are the main advantages of ANN. artificial neural networks (ANNs) have been routinely used to predict dye adsorption. This research applies the uses of ANN approaches for dye adsorption, including multilayer feedforward neural networks (MLFNN). These ANNs models are gaining prominence as techniques that can be used successfully for dye adsorption with acceptable accuracy.

3.7 Optimization Results

The maximum removal efficiency is achieved at optimum conditions, which are evaluated by optimizing RSM quadratic model and the ANN model. First, numerical optimization is applied to the RSM model using Design-Expert soft- ware. Then, the Moth Search Algorithm (MSA) is applied to the ANN model to get the optimum conditions that maximize the removal efficiency. A metaheuris- tic algorithm is used to optimize ANN. Metaheuristic combines one main word (heuristic) and suffix (meta) that have roots in Greek. The term (heuristic) is an old Greek word that means discovering new rules in dealing with different problems, and the term (meta) means some upper-level methodologies in na- ture. The metaheuristics use higher-level approaches to implement a searching process that can avoid the optimum local result and find the global optimum [46]. Many applications use these algorithms, such as extracting optimized bio- impedance model parameters using different typologies of oscillators and Plant stem tissue modelling and parameter identification using metaheuristic opti- mization algorithms [47, 48]. Moth Search Algorithm (MSA) is selected to be combined with ANN (ANN + MSA). The nocturnal behavior of moths inspires MSA [49]. The results of these optimization methods are shown in supplementary table (S-1), which are validated experimentally as shown in the confirmation column. The maximum removal efficiency is 98.66% by using Bimetallic Fe0-Cu according to the confirmation of ANN + MSA results. The error of ANN + MSA is 1.8%, and the RSM model error is 4.2% in the case of using Bimetallic Fe0-Cu. RSM and ANN + MSA give almost the same result when using the chemical AC-Fe-Cu. However, RSM gives better results than ANN + MSA when using commercial AC-Fe-Cu.

4 Conclusion

The anionic M.O dye was used to screen the prepared adsorbents of nZVI, bimetallic Fe0-Cu, physical AC-Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, H3PO4 chemical AC-Fe0-Cu, commercial AC-Fe0-Cu, and raw fava bean-Fe0-Cu. The most significant removal efficiencies were obtained by bimetallic Fe0-Cu, H2SO4 chemical AC-Fe0-Cu, and commercial AC-Fe0-Cu. The most efficient adsorbent materials were characterized using DLS, FT-IR, XRD, and SEM.

Bimetallic Fe0-Cu has a removal capacity of 995.96 mg/g at (pH = 3, 180 min. contact time under shaking at 120 rpm at room temperature, 1000 ppm of M.O, and 1 g/l dose of bimetallic Fe0-Cu adsorbent). The pseudo-second-order is the best fit for bimetallic Fe0-Cu with R2 = 0.9913. H2SO4 chemical AC-Fe0-Cu adsorbent has a removal capacity of 990.35 mg/g at (pH = 3, 180 min. contact time under shaking at 120 rpm at ambient temperature, 1000 ppm of M.O, and 1 g/l dose of H2SO4 chemical AC-Fe0-Cu adsorbent).

The pseudo-first-order is the best fit for H2SO4 chemical AC-Fe0-Cu with R2 = 0.902. At (pH = 3, 180 min. contact time under shaking at 120 rpm at room temperature, 1000 ppm M.O, and 1 g/l dose of commercial AC-Fe0-Cu adsorbent), the removal capacity of commercial AC-Fe0-Cu adsorbent is 977.53 mg/g. Intra-particle diffusion is the best fit for commercial AC-Fe0-Cu with R2 = 0.94. The isothermal results show that physisorption is involved in adsorption and that multi-layers develop on the adsorbent surface.

The experimental data were fit by pseudo-first-order (PFO), pseudo-second-order (PSO), and intra-particle models. The pseudo-second-order is the best fit for bimetallic F e0 -Cu with \({R}^{2}\) = 0.9913. The pseudo-first-order is the best fit for H2SO4 chemical AC-\(F{e}^{0}\)-Cu with \({R}^{2}\)= 0.902. The Intra-particle diffusion is the best fit for commercial AC- \(F{e}^{0}\)-Cu with \({R}^{2}\) = 0.94. The Dubinin-Radushkeivch (DR) model is the best fit for the bimetallic Fe0Cu. The Freundlich and the DR models have a high R-square; however, the mean free energy (E) is higher than 8 Therefore, adsorption is Physisorption.

The maximum amount adsorbed \({q}_{e}\) equals 1174 was calculated from the DR model for the bimetallic Fe0Cu. The maximum amount of adsorbed material \({q}_{e}\) equals 1200 calculated from the Langmuir model for the bimetallic Fe0Cu. The Bimetallic \(F{e}^{0}\)-Cu, the best experimental and predicted removal efficiency is 96.8% RE (through runs 1, and 14). For the H2SO4 chemical AC-\(F{e}^{0}\)-Cu, the best experimental and removal efficiency is 96.25% RE (through run 18). The commercial AC-\(F{e}^{0}\)–Cu, the best experimental and predicted removal efficiency is 94.93%RE (through run 5). That is clear low-cost adsorbents such as Bimetallic Fe0-Cu, and Fava Bean Activated Carbon-Supported Bimetallic AC-Fe0-Cu can remove the Methyl orange dye with a high removal efficiency which is competed for the commercial and chemical adsorbents.