Introduction

The presence of Arsenic (As, Z = 33) in terms of most plentiful element in the human body, seawater and earth’s crust is 12th, 14th and 20th position, respectively (Mandal and Suzuki 2002). As an ultra-trace nutrient, arsenic deficiency leads to inhibited growth, although beyond its necessary level it causes toxic effect on plants, animals, and human beings. Inorganic arsenics are proven to be carcinogens (Ng 2005). A worldwide recognized problem related to arsenic contamination in drinking water connected with 21 countries; the largest population being risked are Bangladesh followed by West Bengal in India (Jiang et al. 2012). World Health Organization (WHO) recommended that 10 µg L−1 is the limit for utmost permissible intensity of arsenic in drinking water. Geological source such as natural reaction and volcanic emissions are the main causes of arsenic pollution where other factors such as the burning of fossil fuels, mining, smelting, and pesticide use by human activities are also responsible for arsenic toxicity (Bissen and Frimmel 2003).

Till date, several methods have been attempted such as coagulation/precipitation (Song et al. 2006), coagulation/filtration (Wickramasinghe et al. 2004), adsorption (Asadullah et al. 2014), oxidation (Borho and Wilderer 1996), ion exchange (Urbano et al. 2012), membrane/reverse osmosis (Fogarassy et al. 2009), and electro-coagulation (Hansen et al. 2007) to eliminate arsenic from water. Among these technologies, adsorption is preferably a good choice due to its operational easiness, cost effectiveness, eco-friendly nature and accessibility of various types of adsorbent (Ranjan et al. 2009).

For the last couple of decades, activated carbons as an adsorbent has been widely used for arsenic removal from water (Kalderis et al. 2008; Mohan and Pittman 2007; Mondal et al. 2008) as it possesses significant surface area with micro/mesopore structure along with extensive adsorption capacity. Commercial activated carbons are expensive and it is the main drawback of using this adsorbent in all the application sectors (Roy et al. 2013). Recently, some of the low cost compounds such as zirconium oxide-coated marine sand, chir pine sawdust, bagasse fly ash, clay and magnetic chitosan-bamboo sawdust composites were employed while some of the biomasses were also used to remove arsenic from water (Khan and Sing 2010; 2013, 2014, Khan and Nazir 2015; Ali et al. 2014). Chemical activated carbon (linseed cake activated carbon and waste tyre activated carbon) was also used to eliminate arsenic from aqueous solutions (Khan et al. 2016, 2017). Here, in this paper we have tried to prepare a low cost agricultural residues mung bean husk as a precursor to prepare potential carbonized adsorbent that can deal with waste disposal problem as well as the possibility of recycling the waste towards its application in removal of toxic contaminant from aqueous solution (Budinova et al. 2009; Mondal et al. 2016a, c). Vigna radiate (Mung bean) seeds are an inhabitant from the Indian subcontinent and 90% of world’s mung bean is produced in Asia, and India is the top producer of this crop (Mondal et al. 2015, 2016b).

The typical optimization of a removal process requires changing one variable while fixing the other variables at a certain level. This sort of optimization process is impractical as it requires a large number of experiments to be performed due to the possibility of different factorial combinations of the test variables (Chang et al. 2011). This sort of problem can be overcome by using appropriate optimization process like RSM and ANN.

The aims of the present study includes: (1) synthesis of steam activated mung bean carbon as an adsorbent (2) characterization of adsorbent material, (3) investigation of the effects of process parameters such as adsorbate solution pH, temperature, adsorbent dosage and speed of agitation on the adsorption of As(V) onto adsorbents, (4) Isotherm, kinetics and thermodynamics of the adsorption process and (5) finding an optimum condition of adsorption process using RSM and ANN and their comparison.

Materials and methods

Preparation of adsorbate solution

Sodium arsenate dibasic heptahydrate (Na2HAsO4·7H2O) as arsenic sourced compound was purchased from Fisher scientific (USA) and was dissolved in deionized water (18.2 MΩ) to prepare adsorbate solution. Arsenic stock solution of 3 ppm equivalent was prepared and further diluted to the desired concentration with deionised water (Millipore Corp., USA). The adsorbate pH was adjusted using 0.1 N NaOH and 0.1 N HCl.

Preparation of sorbents and its characterization

The precursor mung bean husk used for preparation of activated carbon was purchased from Durgapur (India) local market. Distilled water was used to wash MBH several times followed by drying to obtain the desired precursor. In the first step of operation in a stainless steel cylindrical shelled furnace chamber, the precursor was gradually carbonized with the rate of heating of 55 °C per 15 min at 550 °C for 1 h. After carbonization, temperature was heated up to 650 °C into the furnace chamber and superheated steam was introduced at a pressure of 1.5–2.0 kg cm−2 for 1 h for activation of carbon (Mondal et al. 2015). The prepared activated carbon was then placed to desiccators to cool down to room temperature. Characterization of the sorbent was done properly using different instrumental techniques such as SEM, BET, FTIR and point of zero charge.

Batch mode adsorption studies

The appropriate quantity of mung bean husk derived activated carbon for the adsorption of arsenic was estimated by the batch equilibrium study. Arsenic concentration of 2 mg L−1 was taken in a fixed volume (100 ml) of liquid sample in all the tested batch sorption experiments in an incubator shaker (Model Innova 42, New Brunswick Scientific, Canada) at 310 K. Process optimization was done by altering the dosage (50 mg–1 g), pH (3–9) and agitation speed (40–180) for a preset contact time interval. The concentration of arsenic in a flask was measured by taking adsorbate solution at a specific time interval. Arsenic solution from flask was taken at a specific time interval and then centrifuged at 4000 rpm for 20 min. Supernatant solution collected at different time interval was analyzed for the presence of arsenic using arsenic kit (Merck, USA). Each batch experiment was conducted in triplicate and for data analysis purposes mean values were used. At equilibrium q e (mg g−1) conditions, the quantity of arsenic compound adsorbed can be estimated according to the mass balance from the initial and final equilibrium concentrations by means of the following equation:

$$q_{\text{e}} = \frac{{\left( {C_{\text{i}} - C_{\text{e}} } \right)V}}{m},$$
(1)

where C i is the initial arsenic concentration (mg L−1), C e is the equilibrium arsenic concentration (mg L−1), V is the volume of the arsenic solution (L) in contact with the adsorbent, m is the mass of the adsorbent (g).

The percent removal of arsenic was calculated using the following relation:

$${\text{Sorption}}\left( \% \right) = \frac{{\left( {C_{\text{i}} - C_{\text{e}} } \right)}}{{C_{\text{i}} }} \times 100.$$
(2)

Control experiments were conducted in each batch study to confirm that arsenic was not adsorbed by the flask wall.

Models for kinetic and equilibrium studies

The evaluation of the adsorption kinetics was tested by subsequent models, which are pseudo-first order (Kumar et al. 2003), pseudo-second order (Ho et al. 2000) and Elovich (Chien and Clayton 1980).

The assessment of the equilibrium adsorption isotherm was checked by different models. These are: Langmuir (Mohan et al. 2011a), Freundlich (Mohan et al. 2011a), Temkin and Dubinin–Radushkevich (Dubinin 1960).

Removal process modeling

The optimization of As(V) elimination from water was done through RSM by the same process variables used in batch mode, viz. adsorbent dosage, pH and agitation speed. Concentration of As(V) was taken 2 mg L−1 in this modeling process. CCD was chosen from RSM to form experimental matrix to build up a suitable model for explaining the process of arsenic removal. Table 1 shows the variables range and levels used in this experiment. This modeling process had taken removal percentage of arsenic as the response of the system.

Table 1 CCD with experimental and predicted values

Quadratic equation to correlate response function and variables process can be mathematically done by the following equation:

$$x_{j} \quad \gamma = \beta_{0} + \sum\limits_{i = 1}^{k} {\beta_{i} x_{i} } + \sum\limits_{i = 1}^{k} {\beta_{ii} x_{i}^{2} } + \sum\limits_{i = 1} {} \sum\limits_{j = 1 + 1} {\beta_{ij} x_{i} x_{j} + \varepsilon } ,$$
(3)

where \(\beta_{0}\) is the constant coefficient, \(\beta_{i}\), \(\beta_{ii}\) and \(\beta_{ij}\) are linear, quadratic and interaction coefficients effect, respectively, \(x_{i}\), \(x_{j}\) are independent variables and \(\varepsilon\) is the error.

In CCD, three-dimensional response plots were generated that explain the interactive effect of independent variables with their corresponding effect.

Artificial neural network model works in a very complicated manner like the neurons of the nervous system of human to make a refined optimal decision based on the data or stimuli. Currently for engineering application, so many ANN models have been developed such as feed forward, multilayer perception (MLP), and radial basis function. The present investigation studied As(V) removal modeling using feed-forward three-layer back propagation algorithm with a linear transfer function.

ANN model essentially consists of three layers, i.e., input layers, hidden layers and an output layer. The independent variables are input and hidden layers, while dependent variables are output layer. ANN model is structured in such a fashion that all layers are interconnected by a processing unit called neuron, while the neurons are connected to each other by links (Fig. 1). Input layer takes the information from outer source and pass the information to the next layer (hidden) for processing of data. Input values were weighted individually before it passes through the hidden layer. The hidden layer processes the data and analyzes to produce the output based on the sum of the weighted input values (Khataee and Khani 2009). The mathematical equation to denote the net input Y ij of the neuron j in the layer i is represented by

$$Y_{ij} = \sum\limits_{k = 1}^{{n_{i - 1} }} {W_{ijk} } V_{i - 1,k} + \theta_{ij} ,$$
(4)
$$V_{ij} = g\left( {Y_{ij} } \right),$$
(5)

where V ik is the input of neuron, W ijk is the weight of connection and θ ij is the bias of neuron. The neuron output V ij is estimated using an activation function g(Y ij ). Equations (4) and (5) can be used to model nonlinear relationships between a set of input and output variables. It has to be noted that the training is performed using the input as well as output data taken from the system under study.

Fig. 1
figure 1

Topology of the ANN architecture

In this present study, same input, viz. adsorbent dosage, pH and agitation was taken from RSM to model the ANN process. CCD matrix generated from RSM along with the output results was used in ANN modeling to train output and input layer. Matlab (version 7.12) toolbox of Neural Network was used to calculate the ANN data.

Results and discussion

Characterization of activated carbon

The BET-specific surface area and micro-pore volume were estimated to be 405 m2 g−1 and 0.2853 cm3 g−1, respectively. Adsorbents surface charge can be estimated by the help of point of zero charge (pHpzc) and experimentally it was observed to be 8.6. FTIR spectrum of SAC-MBH and SAC-MBH along with adsorbed As(V) molecule was shown in Fig. 2a, b, respectively. The process of adsorption has been influenced by the presence of functional groups such as carboxyl, hydroxyl, sulfate, and amino groups on the surface of an adsorbent. The SAC-MBH exhibited different bands at 770 cm−1 (C–H bend), 1068 cm−1 (C–N stretch), 1215 cm−1 (C–H bend) and 1384 cm−1 (nitro group). Sifting of some bands such as 770.39–769.25 cm−1, 1637.6–1629.22 cm−1 and 3393.85–3390.59 cm−1 suggests that C–H bend, N–H bend stretch and N–H stretch may be participating for the adsorption of arsenic onto SAC-MBH.

Fig. 2
figure 2

a Before and b after adsorption of As(V) onto SAC-MBH

One of the vital parameters in the process of adsorption is specific surface area of adsorbent, and pore size is helpful to know adsorbates binding chances with the functional group into the activated carbon. The surface structure of the raw activated carbon was found to be porous, irregular and uneven shaped where porous structure was highly heterogeneous in nature associating with high surface area in SEM micrograph as depicted in Fig. 3a. It was shown from the Fig. 3b that the pores number was reduced, and the heterogeneous structure helps to capture the As(V) molecule inside the activated carbon surface to facilitate absorption. The surface texture became smooth due to filling up of the pores by As(V) molecule adsorption. The point of zero charge of the adsorbent was estimated to be 8.6 that point out the ‘H’ category of carbon. H category carbons have a lesser density of oxygen-containing functional group, more hydrophobic and showing strong acid adsorption capacity (Mohan et al. 2011b).

Fig. 3
figure 3

a Before and b after adsorption of As(V) onto the surface of SAB-MBH on SEM at ×500 magnification

Effect of adsorbent dosage

Experimental results suggest that the percentage removal of the As(V) species increased (Fig. 4a) with increased dosages of adsorbent (0.5–0.75 g). Increase in adsorbent dosage means the availability of more surface area which, in turn, indicates the presence of more active binding sites to adsorb the adsorbate molecules (Lewinsky 2007). Further increase in dosages beyond 0.75 g mass did not cause any significant increment in the elimination tendency of As(V). This is due to the adsorption of almost all As(V) onto the adsorbent surface and attaining of equilibrium between the As(V) adsorbed to the adsorbent and those As(V) left in the solution. The results of this study are the following similar type of pattern reported as findings by other researchers (Li et al. 2012; Roy et al. 2013). The optimum value of adsorbent dosage was 0.75 g L−1 and for the rest of the experiment this value was used.

Fig. 4
figure 4

Effect of a adsorbent dosage, b pH, c agitation speed, and d temperature on the adsorption of As(V) by SAB-MBH

Effect of solution pH

The solution pH is a significant factor for wastewater treatment process. Figure 4b presents the activated carbon behavior over the pH range of 3–9 and it is easily understood that the optimum arsenic elimination is achieved at pH 3.

The solution pH affects the activated carbon by dissociating the functional groups on the adsorbent surface active site by changing the surface charge (pHpzc) of the activated carbon. On the other hand, the solution pH affects the target pollutant by changing degree of ionization (pKa) and speciation of the arsenic. Since the pHpzc of SAC-MBH surface was 8.6, below the pHpzc value the adsorbent surface becomes positive in a solution. On the other side, As(V) has dissociation constant pKa = 6.94, thereby at pH > 6.97 it is protonated where at pH < 6.97 it is deprotonated. Thus, it can be said that at less than pH 6.97, As(V) remains in solution in anionic form (H2AsO4 ), so at this stage arsenate becomes negatively charged while SAC-MBH becomes positively charged, so electrostatic attraction occurs and this was most likely the adsorption mechanism (Liu et al. 2012). As the pH increases, adsorbent surface becomes less positive (pH < pHpzc) or more negative (pH > pHpzc). These two factors cumulatively decrease electrostatic attraction while increasing repulsion that leads to decrease in As(V) removal at higher pH.

Effect of agitation speed

Agitation speed is a significant parameter for As(V) removal by SAC-MBH. The adsorption study was carried out at variable agitation speed ranging from 40 to 200 rpm that has been depicted in Fig. 4c. Increasing agitation speed causes increased removal of As(V) and reached a maximum removal value at 160 rpm (95%). This optimum agitation speed confirms usability of all the surface active sites of adsorbent to uptake arsenic. It was an optimum speed of agitation to assure the availability of all the surface binding sites for arsenic uptake. During slow agitation speed, SAC-MBH did not spread in the solution properly, instead it conglomerated in the solution; thus, beneath the apex layers of adsorbent it hide many active sites so apex layer is only available for adsorption and in this process hidden layers are not present to participate in the adsorption process, since they had no proper contact with As(V). In case of higher agitation speed, As(V) particles have not enough time to make a bond with SAC-MBH due to random collisions among the participant particle. Similar type of agitation speed pattern was also found by the other researchers (Liu et al. 2012; Shafique et al. 2012).

Effect of temperature

In general, the adsorption process is affected by the temperature. Influence of temperature on the arsenic adsorption by the sorbent was investigated at four different temperatures (298, 303, 308 and 313 K).

As(V) removal was observed to increase from 95 to 98.75% with increase in temperature as depicted in Fig. 4d. It indicates that As(V) removal by adsorption onto SAC-MBH was encouraging at elevated temperatures. Since by further increasing the temperature the percentage removal of the adsorbate remained constant. Increasing temperature speed up the As(V) molecule while slower the retarding forces on the molecules; thereby, percentage removal of the sorption capacity of the adsorbent had been enhanced. Percentage removal of arsenic molecule was also increased due to pore size changes and increase in kinetic energy of the adsorbents. This type of trend was found by other authors (Pokhrel and Viraraghavan 2008).

Kinetic studies

In the process of adsorption, kinetics was done to understand two important factors: one is equilibrium time and other is adsorption mechanism. To study the mechanism of arsenic adsorption, three kinetic models were fitted. The kinetic parameters of the fitted model are presented in Table 2.

Table 2 Kinetic parameters for As(V) removal using activated MBH carbon

Adsorption kinetics of arsenic was investigated in the present investigation using three models, namely Lagergren pseudo-first order, pseudo-second-order and Elovich equation. Fitting and evaluation of the three models into the adsorption kinetics study ultimately give the finest model to explain the adsorption mechanism at different temperature.

Theory of kinetics

Lagergren pseudo-first-order kinetics and second-order kinetics models are mathematically explain in their linear form by the following Eqs. (6) and (7), respectively:

$$\log \left( {q_{\text{e}} - q_{t} } \right) = \log q_{\text{e}} - \frac{k1}{2.303}t,$$
(6)
$$\frac{t}{{q_{t} }} = \frac{1}{{k_{2} q_{\text{e}}^{2} }} + \frac{1}{{q_{\text{e}} }}t,$$
(7)

where \(q_{\text{e}}\) and \(q_{\text{t}}\) are the quantity of arsenic adsorption at an equilibrium and time t (mg g−1); k 1 (min−1) and \(k_{2}\) (g mg−1 min−1) are the pseudo first order and pseudo second order rate constants, respectively.

The linear plot of \(\log \left( {q_{\text{e}} - q_{\text{t}} } \right)\) versus \(t\) and \({t \mathord{\left/ {\vphantom {t {q_{t} }}} \right. \kern-0pt} {q_{t} }}\) versus \(t\) gives model applicability of pseudo first order and pseudo second order kinetics (Fig. 5a), respectively. The kinetic parameters k 1 (min−1) and \(k2\) (g mg−1 min−1) were obtained from the slope and intercept of the above-mentioned linear plot by applying Eqs. (4) and (5).

Fig. 5
figure 5

a Pseudo-second-order kinetics, b Langmuir isotherm and c Gibbs free energy change versus temperature of As(V) onto SAB-MBH

Elovich kinetic model assumes that the active sites present on the solid surface are heterogeneous in nature and have different activation energy for chemisorptions. Generally, this model is one of the most practical to understand chemisorptions process.

The following mathematical equation helps to understand the model

$$q_{t} = \frac{1}{\beta }\ln (\alpha \beta ) + \frac{1}{\beta }\ln t$$
(8)

where α is the initial sorption rate constant (mg g−1 min−1) and β is the desorption constant (g mg−1). The linear plot of q t versus \(\ln t\) gives the slope and intercept, from where the value of the parameters α and β is obtained. The value of 1/β indicates the sites available for adsorption.

Prediction of best kinetic model for As(V) adsorption

The suitability of the kinetics model was judged by their correlation coefficients (R 2) which has been tabulated in Table 2. At first, the experimental equilibrium data were analyzed by pseudo-first-order equation. The experimental data were found to be very unfavorable for first-order kinetic model to all temperatures (Figure not shown). Therefore, it was not justifiable to use the model of As(V) kinetics to superheated steam activated carbon.

The experimental data were then fitted into pseudo-second-order equation. The correlation coefficients value was found to be close to unity or unity where the calculated q e values were close to experimental q e values at all temperature tested in this experiment. Therefore, the excellent fit of the data in the model as well as the kinetic parameters suggests that As(V) adsorption kinetics followed pseudo-second-order kinetics.

Using last experimental data, Elovich equation model was verified. The correlation coefficient did not show good linearity but the constant α and β increase with increasing temperature; thus, it can be said that both the rate of chemisorptions and availability of adsorption surface would increase.

Equilibrium studies

The equilibrium adsorptive studies were done by applying different adsorption isotherm to find the relationship between the adsorption of adsorbate by adsorbent and remaining adsorbate concentration in the solution. The parameters found from the equilibrium model show some adsorption mechanism insight, properties and affinity of the adsorbent surface.

The isotherm of Langmuir model provides information about the adsorptive capacity and broad equilibrium behavior. The following equation used to describe the model

$$\frac{{C_{\text{e}} }}{{q_{\text{e}} }} = \frac{{C_{\text{e}} }}{{Q_{\text{m}} }} + \frac{1}{{K_{\text{L}} Q_{\text{m}} }},$$
(9)

where \(C_{\text{e}}\) is the equilibrium adsorbate concentration at liquid phase (mg L−1), q e is the equilibrium adsorbate concentration at solid phase (mg g−1), Q m is the maximum adsorption capacity (mg g−1), and \(K_{\text{L}}\)is the adsorption equilibrium constant (L mg−1).

The plot between \(\frac{{C_{\text{e}} }}{{q_{\text{e}} }}\) vs \(C_{\text{e}}\) of Langmuir isotherm indicates the isotherm constant value of \(K_{\text{L}}\) and \(Q_{\text{m}}\) from the slope and intercept of the graph, respectively (Fig. 5b).

Freundlich equilibrium model is associated with heterogeneous surface where low to intermediate concentration ranges are functional. The following equation can satisfactorily describe the model

$$\log q_{\text{e}} = \log K_{\text{F}} + \left( {\frac{1}{n}} \right)\log C_{e} ,$$
(10)

where \(K_{\text{F}}\) and \(n\) are Freundlich constants linked to adsorption capacity and adsorption intensity, respectively. These two constants can be estimated from the intercept and slope of the linear plot of \(\log q_{\text{e}}\) vs \(\log C_{\text{e}}\).

The indication of adsorption favorability depends on the value of n.

Temkin isotherm model signifies the heat of the adsorption and the adsorbent–adsorbate interaction. For the Temkin isotherm model, following equation is used

$$q_{\text{e}} = B_{\text{T}} {\text{In}}K_{\text{T}} + B_{\text{T}} {\text{In}}C_{\text{e}} ,$$
(11)

where q e is the quantity of As(V) adsorbed at equilibrium (mg g−1), B T is the Temkin adsorption isotherm constant or heat of adsorption, where B T is the (RT/b), R is the universal gas constant and T is the temperature in kelvin, K T is the Temkin equilibrium binding constant or the maximum binding energy (L mg−1), and \(C_{\text{e}}\) is the adsorbate concentration at equilibrium liquid phase (mg L−1).

The values of B T and K T were calculated from the plot of q e against lnC e.

The Dubinin–Radushkevich (D–R) model is generally expressing the adsorption mechanism. It is applied to differentiate between physical and chemical adsorption. It is expressed by the subsequent equation

$$\ln q_{\text{e}} = \ln q_{\text{d}} - 2\beta \varepsilon^{2} ,$$
(12)

where q d  is the Dubinin–Radushkevich isotherm constant related to the degree of sorbate sorption by the sorbent surface (mg/g), \(\beta\) is the mean free energy coefficient of adsorption, \(\varepsilon\) = Polanyi potential = RTIn(1 + 1/C e), where R is the Gas constant (8.314 J mol−1 K−1) and T is the temperature (K).

The mean free energy of the each adsorbate molecule can be mathematically expressed by the following equation (Kundu and Gupta 2006):

$$E = \frac{1}{{\sqrt {2\beta } }},$$
(13)

E is the mean free energy of the adsorption (E).

The comparative analysis of the four isotherm models was done on the basis of linear coefficient R 2 value and the isotherm was validated by Chi square test.

The Chi square test (χ 2) was calculated by the following equation:

$$\chi^{2} = \sum {\frac{{\left( {q_{\exp } - q_{\text{mea}} } \right)^{2} }}{{q_{\text{mea}} }}}$$
(14)

where \(q_{\exp }\) is the experimental amount of arsenic concentration at equilibrium and \(q_{\text{mea}}\) is the measured or calculated amount of arsenic concentration at equilibrium.

Table 3 shows the diverse isotherm parameter with their corresponding values and from the Langmuir isotherm it was seen that the R 2 value is close to unity or unity as compared to the other three isotherm models. The R 2 value indicates the fit of experimental data into the liner isotherm equation; the higher the R 2 value the better is the goodness of fit.

Table 3 Isotherm parameters for As(V) removal using activated MBH carbon

The error factor when considering, it is shown that low χ 2 value in Langmuir model has better represented the equilibrium data into the equation. An outstanding fit of the adsorption data into the Langmuir isotherm model established that the adsorption is monolayer where each molecule of adsorption had identical activation energy and negligible sorbate–sorbate interaction (Kannan and Sundaram 2001). Lastly to confirm the results, Langmuir adsorption model was checked by the R L value which is a dimensionless constant or separation factor or equilibrium parameter. R L is computed by means of the subsequent equation

$$R_{\text{L}} = \frac{1}{{1 + K_{\text{L}} C_{0} }},$$
(15)

where C 0 is the initial As(V) concentration (mg L−1).

The value of R L indicates the isotherm nature which may be irreversible (R L = 0), favorable (0 < R L < 1) or unfavorable (R L = 1). Investigational data suggest the value of R L was in between 0 and 1; which confirm adsorption was favorable and excellent fit of Langmuir model to describe the investigational data. Apart from Langmuir model, the parameters of D–R model have also some significance due to its R 2 value (0.984) and low Chi square value (0.378). From D–R model, the magnitude of E (21.218 kJ mol−1) indicates the process followed chemisorptions adsorption. The maximum adsorption capacity of As(V) by SAC-MBH has been compared with other adsorbents found in different literature (Table 4).

Table 4 Comparative adsorptive capacity of prepared SAC-MBH and other different adsorbent found in the literature towards the adsorption of As(V)

Thermodynamic studies

Thermodynamic behavior of a system can be understood by the help of the following equation:

$$\Delta G^{0} = - RT\ln K_{\text{c}} ,$$
(16)
$$K_{\text{c}} = \frac{{C_{\text{a}} }}{{C_{\text{e}} }},$$
(17)
$$\Delta G^{0} = \Delta H^{0} - T\Delta S^{0} ,$$
(18)

where K c is adsorption distribution coefficient, C a and C e are the equilibrium arsenic concentrations on the adsorbent and in the solution, respectively. ∆G 0 is the Gibbs free energy change, ∆H 0 is the enthalpy change and ∆S 0 is the entropy change (Yao et al. 2010).

A plot of ∆G 0 versus T gives the slope and intercept, from that the values of ∆S 0 and ∆H 0 can be measured (Fig. 5c).

Gibbs free energy for the adsorption of As(V) by SAC-MBH was obtained from the Eq. (15) and presented in Table 5.

Table 5 Thermodynamics parameters for As(V) removal by using SAC-MBH

The negative value of ∆H 0 strongly supports the exothermic nature of adsorption while the negative value of ∆S 0 suggests that the process was enthalpy driven. At all temperatures, the negative values of ∆G 0 confirmed the spontaneous and feasible adsorption process. By increasing the temperature, the value of ∆G 0 becomes more negative that suggests that the higher temperature makes the adsorption favorable.

Modeling of the process

Response surface methodology

The experimental design of the modeling of removal process was carried out and results were obtained according to CCD matrix (Table 1). Experimental data of As(V) removal process were fitted into different models (linear, interactive, quadratic and cubic) using two tests, i.e., the sequential model sum of squares and lack of fit test were carried to find the best fitted model and the model summary statistics are specified in Table 6. It was clear from the results that high value of determination of coefficient (R 2), adjusted determination of coefficient (R 2adj ), predicted determination of coefficient (R 2pre ) while low value of p values in quadratic model proved its competency among the different models. In addition to this, the experimental data were successfully covered in this model. The quadratic model was expressed in terms of coded variables (Eq. 19) and in terms of actual variables (Eq. 20).

$$\begin{aligned} \% {\text{ Removal of As }}\left( {\text{V}} \right) = & + 9 4. 8 5- 1. 9 5\times A + 0.60 \times B + 0.80 \times C - 0.63 \times A \times B + 0.75 \times A \times C \\ & + 0.50 \times B \times C - 1.39 \times A^{2} - 2.27 \times B^{2} - 3.33 \times C^{ 2} \\ \end{aligned}$$
(19)
$$\begin{aligned} \% {\text{ Remvoal of As }}\left( {\text{V}} \right) = & + 6 5. 5 3 4 8 9+ 1. 6 6 1 5 3\times {\text{pH}} + 0. 1 8 1 7 9\times {\text{Agitation speed}} \\ & + 0.0 2 8 2 2 4\times {\text{Dosage}} - 3. 8 4 6 1 5 {\text{E}} - 00 3\times {\text{pH}} \times {\text{Agitation speed}} \\ & + 7. 50000{\text{E}} - 00 4\times {\text{pH}} \times {\text{Dosage}} + 1. 9 2 30 8 {\text{E}} - 00 5\\ & \times {\text{Agitation speed}} \times {\text{Dosage}} - 0. 2 2 2 2 6\times {\text{pH}}^{ 2} - 5. 3 7 9 90{\text{E}} - 00 4\\ & \times {\text{Agitation speed}}^{ 2} - 2.0 8 3 5 4 {\text{E}} - 00 5\times {\text{Dosage}}^{ 2} \\ \end{aligned}$$
(20)
Table 6 RSM Model fit summary statistics and ANOVA analysis of RSM model

Quadratic model’s statistical significance was checked using ANOVA (analysis of variance). The results of ANOVA for second order response surface model are shown in Table 6. Different parameters associated with quadratic model proved that the model is significant such as 1. High fishers F value (94.43). 2. Very low probability value (<0.0001). 3. Multiple correlation coefficient (R 2) value (0.9946) was fairly high so significant for Goodness of fit test. 4. Predicted multiple correlation coefficient value (0.9277) was in a reasonable good agreement with the adjusted multiple correlation value (0.9778). 5. The lack of fit value is less than 0.05. 6. Coefficient of variance (CV) value (0.66) is low suggesting precision and reliability of the data. 7. The values of “probability >F” less than 0.0500 indicate that the model term is significant and 8 coefficients of the main, square as well as interaction effects of model terms are significant.

Effect of adsorbent dosage and pH on As(V) removal

Prediction of As(V) removal by the combined effect of adsorbent dosage and solution pH is described in three dimensional plot (Fig. 6a). Increasing the adsorbent dosage while decreasing the pH of the solution increases the % removal of As(V). This phenomena occurs as more adsorbent dosage indicates more adsorbent mass so increases adsorbent surface area that ultimately leads to more sorption site for adsorption. The adsorbed As(V) amount increases with decrease in solution pH. The explanation of the fact is that when the solution pH is less than pHpzc, then the adsorbent surface (pHpzc = 8.6) becomes positively charged where the surface of arsenate (pKa = 6.97) becomes more negative at pH; thus, negatively charged anionic particles (arsenic molecules) strongly attract positively charged cationic surface of adsorbent and, thus, adsorption became more at low pH.

Fig. 6
figure 6

Three-dimensional plot showing the interaction effect of a pH and dosage, b agitation speed and dosage and c pH and agitation speed of SAB-MBH on removal of As(V)

Effect of adsorbent dosage and agitation speed on As(V) removal

The combined effect of adsorbent dosage and agitation speed was shown in the three-dimensional plot as indicated in Fig. 6b. Both independent variables have positive effect on the As(V) removal so increasing the both variables has synergistic effect on this process.

Effect of pH and agitation speed As(V) removal

Figure 6c demonstrates the interactive effects of two independent variables, i.e., pH and agitation speed. Regression analysis indicates the model term pH and agitation speed has net positive effect on As(V) removal. Collective p value (0.0393) of these two parameters is lesser than 0.1000 signifying the combined model terms.

Artificial neural network training for the optimization of the As(V) removal model

Input data were feed into the ANN architectures where algorithm (error back propagation) was used to train the process. The same experimental matrix of CCD was used to train the input and output data of ANN modeling of the process. Independent variables, i.e., dosage, pH and rpm, are selected as input nodes where dependent variables response or the removal of As(V) is the output nodes. In the hidden layer, nonlinear transformations were done to test different types of algorithm to select the best network which gave R 2 (coefficient of correlation) that is very close to one.

Training of ANN network to optimize the removal process is a crucial step. The selection of POSLIN (positive linear transfer function) for input data where PURELIN was selected for output data to train the ANN. The best ANN network was selected by optimizing neuron present within the hidden layer. After feeding the data and running the software in ANN, we found the best correlation coefficient value (0.99) from the regression plot of the trained network (Fig. 7a). A high correlation coefficient that is closer to 1 signified the neural network modeling reliability with the experimental data. Among the different algorithm used in ANN (Table 7), Levenberg–Marquardt back propagation (Trainlm) showed the best fitted algorithm to explain the ANN networking for the removal process.

Fig. 7
figure 7

a Regression plot of ANN model. b Residuals distribution patterns of RSM and ANN

Table 7 Trial and error method using transfer function (poslin and purelin) for % removal of As(V) by ANN model

Comparison of RSM and ANN network for the removal of As(V)

Utilizing RSM and ANN modeling process, linear and nonlinear multivariate regression problems are solved. The difference between the experimental value and model predicted value (RSM and ANN) is compared in Table 1. The residuals distribution pattern of RSM and ANN is shown in Fig. 7(b). Compared to RSM model, the residual fluctuation of ANN models is less diverge, normal and minute. Further both models are compared using three statistical estimators, i.e., root mean squared error (RMSE), coefficient of determination (R 2) and absolute average deviation (AAD)

$${\text{RMSE}} = \left( {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {T_{\text{\% removal, predict}} - T_{\text{\% removal, exp}} } \right)^{2} } } \right)^{1/2} ,$$
(21)
$$R^{2} = \frac{{\sum\nolimits_{i = 1}^{n} {\left( {T_{\text{\% removal, experimental}} - \overline{{T_{\text{\% removal, cal}}}} } \right)^{2} } }}{{\sum\nolimits_{i = 1}^{n} {\left( {T_{\text{\% removal, experimental}} - \overline{{T_{\text{\% removal, cal}}}} } \right)^{2} + \left( {T_{\text{\% removal, experimental}} - T_{\text{\% removal, cal}}} \right)^{2} } }},$$
(22)
$${\text{AAD}} = \left( {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {\frac{{T_{\text{\% removal, predict}} - T_{\text{\% removal, exp}} }}{{T_{\text{\% removal, exp}} }}} \right)} } \right) \times 100,$$
(23)

where n is the number of points, \(T{\text{\% removal,}}\;{\text{predict}}\) is the predicted value, \(T{\text{\% removal,}}\exp\) is the actual value, and the symbol '–' is the average of the related values.

Statistical comparison (Table 8) showed that both models have good quality of data fitting and estimating capability, although ANN has some slight edge over RS. The advantage of ANN is its flexibility and non-requirement of standard experimental design. Its disadvantages include no combined parametric effect on the process. In general, ANN approach is a trustworthy, rational and simulates non-linear form while overcoming the limitations of quadratic non-linear correlation assumption of RSM.

Table 8 Comparative statistical analysis of RSM and ANN

Conclusion

Isotherm, kinetics, thermodynamics and optimization of the process parameter associated with adsorption of pentavalent arsenic onto mung bean husk derived activated carbon have been studied in detail. Experimentally the maximum adsorptive removal of arsenic has been observed to take place at an adsorbent dosage 0.75 g, pH 3.0, agitation speed 160 and temperature of 308 K in case of 2 ppm As(V). Out of four isotherm models tested, the fitting order with respect to coefficient of correlation was Langmuir > D–R model > Temkin > Freundlich. The kinetic aspect of the data showed that the adsorption followed pseudo-second-order model. The heat of sorption was found to be (21.218 kJ mol−1) which indicates that the sorptive removal process was chemisorption. Thermodynamic parameters suggest that the adsorption process was exothermic, spontaneous, and feasible in nature and higher temperature was favorable for adsorption process. An effort has been made to model the removal process utilizing RSM and ANN network approaches. The process modeling done by ANN (R 2 = 0.9969) has some minute edge over RSM (R 2 = 0.9883) although in general both the models have some good optimizing capability.