Introduction

The Philippines is one of the largest phosphate fertilizer producers in the Southeast Asia, processing phosphate ore imported from different locations to wet phosphoric acid, an intermediate product in fertilizer production, and phosphogypsum (PG). PG is a powdery byproduct of which roughly 40% are presently used in the cement industry and as soil conditioners in the Philippines. The Mines and Geosciences Bureau of the Philippines is leaving no stone unturned in its quest to locate rare earth elements (REEs) that could support the country’s production sector while reducing metal imports from China [1, 2]. Ramirez et al. [3] recently pointed to the approximately 10.1 million t PG that are dry-stacked and accessible in the Philippines as a potential secondary resource of REEs.

Phosphate ore, the main raw material in phosphate fertilizer production is known for its elevated content of valuable trace elements, most notably REEs [4,5,6,7,8,9,10,11] and uranium [12,13,14,15,16]. During wet phosphoric acid production with sulfuric acid as it is done at most phosphate fertilizer plants worldwide, the majority of REEs (> 80%) transfers to the solid PG while most of the uranium (> 80%) transfers to the liquid phosphoric acid [17,18,19,20]. There is increased research activity to fully utilize PG stacks worldwide instead of just managing them [21,22,23,24,25,26,27,28,29,30]. Not processing PG but stacking the material could result in potential present and future environmental risks [31,32,33,34,35,36,37]. Recovering REEs that occur in relevant concentrations in PG [38,39,40,41,42,43] and ideally also removing actinides from the PG before using the remaining gypsum matrix as an inexpensive raw material in construction seems to be an attractive option [23, 44,45,46,47]. Although there have been considerable efforts to determine the concentrations and quantities of REEs in Philippine PG [3, 48] no work ever attempted to recover these resources. The present study aims at filling this gap by providing a simple as well as potentially economic process and optimization for REE leaching from Philippine PG that can be upscaled and applied in a next step.

Materials and methods

PG samples and elemental composition

PG samples were collected from 2-m-deep trenches in the tailing ponds of the main fertilizer plant in the Philippines as described in a previous work [3]. Although, the Philippines is one of the largest producers of fertilizer in Southeast Asia it has no domestic source of phosphate rock (PR). The PG in the Philippines is produced from a combination of sedimentary PR imported from China, Egypt, Israel, Jordan, Peru, Tunisia, the USA and Vietnam and igneous PR from Russia and South Africa [49, 50]. There are around 10.1 million t PG in the tailings ponds in the Philippines that have been accumulated since 1984 [48].

Five (5) of the samples with the highest total REE (TREE) concentrations from a previous study of the tailings ponds [3] were pulverized using mortar and pestle, and subsequently mixed to form a composite. To ensure homogeneity, the composite, weighing approximately 10 kg, was mixed in a Thermo Scientific bottle/tube roller for 24 h at 80 rpm. Samples were then sent to a third-party testing laboratory (Intertek Testing Services Philippines, Muntinlupa City, Philippines) for analysis. The laboratory is accredited by the Philippine Accreditation Bureau and also ISO/IEC 17025:2017-certified. Approximately 1 g of the composite was digested using a combination of analytical grade 37% HCl, 70% HNO3, 50% HF and 69–72% HClO4 and then analyzed for REEs using a combination of Inductively Coupled Plasma Mass Spectrometry (ICP-MS Agilent 7700x) and inductively coupled plasma optical emission spectrometry (ICP-OES Agilent 5100). Blank solutions and certified reference materials (i.e., OREAS 501c, 600, 623 90, and 44P) were used to ensure the accuracy of the results. The detection limits ranged from 0.05 to 0.1 mg kg−1. The REE composition of the PG composite is presented in Table 1.

Table 1 REE composition of the PG composite analyzed by ICP-MS

REE leaching procedure

The leaching experiments were conducted following the patterns proposed by Al-Thyabat and Zhang [51, 52], Cánovas et al. [41], Lütke et al. [53], Rychkov et al. [54], and Walawalkar et al. [55] for other than Philippine PG. It is well known that PG from different locations shows different trace-element concentrations, so that leaching experiments successfully conducted at one PG location may lead to different results when a different PG stack is considered. The differences can be attributed to the different phosphate ore processed, the different processing conditions, as well as different qualities of the sulfuric acid used for wet-phosphoric acid processing [56]. The REE leaching optimization followed four succeeding steps: (1) optimizing acid concentration (C), (2) optimizing temperature (T), (3) optimizing time (t), and (4) optimizing solid-to-liquid ratio (S/L ratio) as summarized in Table 2.

Table 2 Experimental matrix of the leaching experiment

There are a number of acids that are technically promising for leaching of REEs from PG [57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72]. In this work, H2SO4 (< 10% vol/vol) was chosen for its comparable leaching efficiency with HCl and HNO3, low solubility of PG in H2SO4 (i.e., resulting PG residue from leaching will enable secondary applications), and most importantly for onsite availability and economic reasons which could be beneficial for large scale extraction of REEs from PG in the Philippines [54].

Step 1: determination of optimum acid concentration

10 g PG was added to 50 mL (1:5 S/L ratio) of acid of varying concentrations in a 250 mL beaker. The mixture was leached at 380 rpm for 2 h at ambient temperature using 1% to 10% H2SO4. This concentration range was used to avoid common ion effect and formation of less soluble bisulfates that could inhibit the leaching of REEs from PG [73].

Step 2: determination of optimum temperature

10 g PG was added to 50 mL (1:5 S/L ratio) of the optimum acid concentration obtained in Step 1 in a 250 mL beaker. The mixture was leached at 380 rpm for 2 h at temperatures 40 to 80 °C.

Step 3: determination of optimum time

10 g PG was added to 50 mL (1:5 S/L ratio) of optimum acid concentration in a 250 mL beaker. The mixture was leached at 380 rpm at optimum temperatures obtained in Step 2 and at varying leaching times from 5 to 60 min.

Step 4: determination of optimum S/L ratio

10 g PG was added to varying volumes of optimum acid concentrations in 250 mL beaker to form 1:2, 1:3, 1:4, 1:5, and 1:10 S/L ratios. The mixture was leached for 30 min at 380 rpm at optimum temperatures.

The leaching experiment was performed in a hot bath (Fig. 1a). The temperature of the acid was stabilized prior to the addition of the PG. The acid-PG mixture was covered with a watch glass to prevent acid evaporation. After each leaching experiment, the PG and acid mixture was filtered using 125 mm Whatman ™ filter papers (Cat No 1440 125) and washed with 100 mL of distilled water (Fig. 1b). The collected residue was then dried in an oven at 105 °C for 24 h. Each experimental setup was repeated three (3) times which resulted in a total of 54 experiments. Due to the complexity of analyzing metals dissolved in sulfuric acid matrix [74], the dried residues were instead analyzed for REEs by the four-acid digestion method using ICP-MS and ICP-OES.

Fig. 1
figure 1

Experimental setup of the leaching procedure showing the A hot bath with the PG and acid mixture, and B filtering of the mixture after the leaching experiment

REE leaching efficiency

The efficiency of the leaching procedure was determined using the following formula:

$${\text{Leaching }}\,{\text{Efficiency }}\,\left( \% \right) = \frac{{C_{i} - C_{f} }}{{C_{i} }} \times 100$$
(1)

where Ci and Cf are the REE concentrations in PG composite and PG residue, respectively.

Taguchi method

Taguchi method is an engineering technique used for process optimization which involves system design, parameter design, and tolerance design procedures [75, 76]. The signal/noise (S/N) ratio is used to examine the response in each experiment and the corresponding variance in the Taguchi method. The S/N ratio is a measure of deviation of quality characteristics from the ideal values [77]. There are usually three types of S/N ratios:

$${\text{Nominal value is better:}}\,\frac{S}{N} = - 10\log \mathop \sum \limits_{i = 1}^{n} \frac{{\left( {y_{i} - m} \right)^{2} }}{n}$$
(2)
$${\text{Smaller value is better}}:\frac{S}{N} = - 10\log \frac{1}{n}\left( {\mathop \sum \limits_{i = 1}^{n} y_{i}^{2} } \right)$$
(3)
$${\text{Larger value is better}}:\frac{S}{N} = - 10\log \frac{1}{n}\left( {\mathop \sum \limits_{i = 1}^{n} \frac{1}{{y_{i}^{2} }}} \right)$$
(4)

where m is the desired nominal value, n is the number of experiments, and y is the experimental result [78].

Multiple linear regression

A simple linear regression evaluates the relationship between the explanatory variable x and the response variable y. If there are multiple explanatory variables, Multiple linear regression (MLR) is utilized [79, 80]. Regression is normally used to make predictions. MLR assumes that the explanatory and response variables have a linear relationship, that the data has a normal distribution, that there are no extreme values, and that there are no multiple ties between the explanatory variables. MLR also synchronically accounts for the variance of the explanatory variable in the response variables [81].

Stepwise regression

Stepwise regression is also a multivariate modelling technique in which an explanatory variable is added or removed from the linear model at each step. In each step, the variable that increases the R2 coefficient the most is added to the model [82, 83]. In contrast to MLR, stepwise regression does not incorporate all the explanatory variables into the model but instead evaluates their statistical significance one at a time. It is typically used when investigating numerous explanatory variables. In this work, the regression models were performed in IBM SPSS Statistics version 25.

Artificial neural network (ANN)

Artificial neural network (ANN) is a machine learning technique that is now extensively used in mineral processing to identify complex relationships between input and output data using a series of nonlinear functions [84,85,86]. Unlike regression, ANN can be trained to learn and recognize patterns between the inputs and outputs [87]. One of the many benefits of using ANN is that it tolerates data noise [88]. ANN has been employed for optimization of leaching and extraction processes of precious metals (i.e., Cu, REEs, etc.) in several studies [75, 86, 88, 89].

ANN is essentially a computer model that simulates the brains learning mechanism. ANN consists of nodes or neurons which are processors that operate in a parallel way. The neurons are arranged in layers including an input layer, one or more hidden layers, and an output layer. The neurons are interconnected to one another through connection links carrying specific weights.

In this work, the feed-forward ANN using back-propagation algorithm was used to model the relationship between the explanatory variables and the REE leaching efficiency using MATLAB R2021b. Back-propagation algorithm is a method of reducing the error between output and input data by altering the weighted connections between neurons [90]. The architecture of the 4-9-5 neural network used in this work is shown in Fig. 2.

Fig. 2
figure 2

ANN structure for the optimization of REE leaching from PG in the 4-9-5 form

Results and discussion

Experimental leaching output

Acid concentration, temperature, time, and S/L ratio were optimized to maximize the efficiency of REE leaching from PG. The variables used in this work are based on previous experimental works [41, 51,52,53, 55]. Strong inorganic acids (i.e., HCl, HNO3, and H2SO4) were extensively used for the investigation of REE leaching from other than Philippine PG [91, 92]. H2SO4 was also used for leaching experiments from Florida PG by Gaetjens et al. [93] and Liang et al. [94], Russian PG by Lokshin et al. [95, 96], and Brazilian PG by Lütke et al. [53].

We performed a total of 18 leaching setups, repeated 3 times each to guarantee high accuracy of the results. The total experimental design matrix and the obtained REE leaching efficiencies are presented in Table 3. The leaching efficiencies for La, Ce, Nd, Y, and TREE ranged from 17 to 75%, 12 to 72%, 13 to 71%, 14 to 68%, and 14 to 71%. The setup with the highest TREE leaching efficiency was test number 6 which used 10% H2SO4, at 50 °C, a leaching time of 120 min, and a 1:5 S/L ratio.

Table 3 Experimental matrix and the resulting La, Ce, Nd, Y, and TREE leaching efficiencies (%)

For all the REEs, the leaching efficiencies followed the same trend in each of the leaching steps as shown in Figs. 3A–D. The efficiency of REE leaching increased with higher acid concentrations (Fig. 3A). This can be attributed to bisulfate formation that causes an increase in Ca2+ concentration after the reaction between H+ with SO42− as a result of increased gypsum solubility that controls the REE leaching efficiency since the gypsum hosts the REEs [32]. The temperature has a catalytic effect so that the leaching efficiency increased as the temperature increased from 40 °C to 50 °C (p < 0.05) as shown in Fig. 3B. The results for 50 °C to 80 °C are not significantly different (p > 0.05), although 50 °C leached the most REEs in the experiment. Generally, leaching efficiencies decrease at higher temperatures due to dissolution of fluoride precipitates which then reacts with the REEs and forms an insoluble precipitate [92]. The majority of the REEs leached from the PG after 15 min (Fig. 3C) although the setup with leaching time of 120 min leached the most REEs. Considering the economics in an industrial scale, we used 30 min for the final step of the optimization procedure. Leaching kinetic studies also show that the maximum REEs were leached from the PG after 20 min [53, 55]. Lastly, the most diluted mixture (1:10 S/L ratio) leached most of the REEs although the results of 1:3, 1:4, and 1:5 were not very different (Fig. 3D). The slight decrease in leaching efficiency observed for 1:5 is not significantly different (p > 0.05) with the results of 1:3 and 1:4. In some cases studied, a decrease in leaching efficiency could be explained with reaching the gypsum solubility limit [55]. In general, it is not desirable to have increased Ca2+ concentration in the solution because it can compete with REEs for available binding sites on the leaching agents. This means that if there is an excess of Ca2+ ions in the solution, they may bind to the leaching agents instead, which reduces the efficiency of the REE leaching process.

Fig. 3
figure 3

Effects of A acid concentration, B temperature, C time, and D solid-to-liquid ratio on the leaching efficiency of REEs in H2SO4

The Pearson correlation coefficients r between the independent variables (C H2SO4, T, t, and S/L ratio) and the dependent variables (La, Ce, Nd, Y, and TREE) is shown in Table 4. Among the explanatory variables, C H2SO4 has the highest r 0.918 to 0.956 (p < 0.01), followed by T with r 0.681–0.739 (p < 0.01). It is noteworthy that t has a negative r while the S/L ratio has a small positive r. Both were not significant (p > 0.05). Although the variables that should be used in regression models are for r > 0.3, we still used t and the S/L ratio in the regression models.

Table 4 Pearson correlation coefficient between the explanatory variables and specific REE leaching efficiency

Determining the optimum leaching conditions using Taguchi method

Although the design of the experiment is not based on the orthogonal array suggested by Taguchi, we still used the Taguchi method to determine the optimum combination of variables. The result of the REE leaching efficiency was converted to signal-to-noise (S/N) ratios. The S/N ratio is a measure of deviation of quality characteristics from the ideal values [77]. There are three different types of S/N ratios (i.e., nominal value is better, smaller is better, and larger is better) depending on the data characteristics [78]. In this work, the larger is better type was used building on the work of Brest Kasongo and Mwanat [75]. Thus, the levels of the explanatory variables with the highest average S/N ratios are considered optimal. The result of this method can therefore determine the optimum levels and combination of the variables to maximize REEs leaching from PG. The square of responses, inverse of the square of responses, and S/N for the REE yield for each of the experiments is shown in Table 5.

Table 5 Square of responses, inverse square of responses and S/N ratios of the REEs for each experimental setup

The average S/N ratios of the explanatory variables at different levels for specific REE leaching is shown in Table 6. For all the REEs, the highest average S/N ratios corresponded to level 4 (10% H2SO4) for the acid concentration, level 6 (80 °C) for the temperature, level 3 (30 min—Y and TREE) to level 4 (45 min—La, Ce, and Nd) for the time, and level 1 (1:10) for the S/L ratio. Therefore, the optimum combination of the variables for the maximum leaching of REEs in PG is 10% H2SO4, 80 °C, 30–45 min, and 1:10 for the acid concentration, temperature, time, and S/L ratio, respectively.

Table 6 Response for signal/noise ratio of the explanatory variables and their corresponding levels

Aside from finding the optimum combination of the variables, the Taguchi method also ranks the variables according to their overall importance to REE leaching efficiency. Also shown in Table 6 are the deltas and ranks of the explanatory variables which compare the relative magnitude of their effects. The delta is the difference between the highest and lowest average S/N ratio of each variable. And for this work, the higher the delta, the greater is the influence of the variable which assigns their rankings. Based on the delta, the ranking of the variables according to their overall importance to REE leaching in PG is as follows: acid concentration > temperature > time > S/L ratio.

Modelling of REE leaching efficiency using MLR

Multiple linear regression (MLR) models the linear association between the independent/explanatory variables (i.e., C, T, t, S/L ratio) and the dependent/response (i.e., Laeff, Ceeff, Ndeff, Yeff, and TREEeff) variables. We used the first order MLR to model the leaching efficiencies of REEs in PG using the explanatory variables. The MLR model used in this work follows the form proposed by Uyanık and Güler [81]:

$$y = \beta_{0} + \mathop \sum \limits_{i = 1}^{k} \beta_{i} \cdot X_{i}$$
(5)

where \({\beta }_{0}\) is the coefficient of the intercept or the constant and \({\beta }_{i}\) is the slope or the coefficient of the explanatory variable \({X}_{i}\) [89]. The significance of the explanatory variable for inclusion in the linear model was validated using p values (p ≤ 0.05) or Sig. For all the REEs, the regression statistics show an R2 of 0.938–0.961, adjusted R2 of 0.919–0.949, standard error of 3.534–4.025, and an overall p value of 0.000. The coefficients, p values, and the 95% confidence intervals of the explanatory variables are shown in Table 7.

Table 7 First order MLR model of the explanatory variables

Based on these values, the forms of the MLR models for the leaching efficiencies of the specific REEs are:

$${\text{La}}_{{{\text{eff}}}} \left( \% \right) = {5}.{889} + {4}.{7}0{1}\,C\,{\text{H}}_{2} {\text{SO}}_{4} + \, 0.{194}T$$
(6)
$${\text{Ce}}_{{{\text{eff}}}} \left( \% \right) = - {1}.{943} + {5}.{119}\,C\,{\text{H}}_{2} {\text{SO}}_{4} + \, 0.{181}\,T + \, 0.0{58}\,t$$
(7)
$${\text{Nd}}_{{{\text{eff}}}} \left( \% \right) = - {1}.{873} + {4}.{434}\,C\,{\text{H}}_{2} {\text{SO}}_{4} + \, 0.{216}\,T + \, 0.0{74}\,t$$
(8)
$${\text{Y}}_{{{\text{eff}}}} \left( \% \right) = {6}.{183} + {4}.{518}\,C\,{\text{H}}_{2} {\text{SO}}_{4} + \, 0.0{48}\,t$$
(9)
$${\text{TREE}}_{{{\text{eff}}}} \left( \% \right) = {2}.{138} + {4}.{63}0\,C\,{\text{H}}_{2} {\text{SO}}_{4} + 0.0{56}\,t$$
(10)

The regression models confirmed the results of the Pearson correlation and Taguchi method that the S/L ratio is not a particularly important variable in determining the leaching efficiency of REEs. T is significant for Laeff, Ceeff, and Ndeff whereas t is significant for all except for Laeff. For all the REEs, C H2SO4 is the most significant variable.

We validated the models using the experimental parameters in Table 3. Using the models, we found very good correspondence between the experimental and predicted values (Fig. 4A) with rExpt-Predicted (p < 0.01) of 0.983, 0.989, 0.990, 0.977, and 0.966 for Laeff, Ceeff, Ndeff, Yeff, and TREEeff models, respectively. We also computed the deviation of the predicted values from the experimental values using the % error. The performance of each model based on the average % error computed from the 18 experimental tests (Fig. 4B) is in the order of 7.08 ± 3.59% (Laeff) < 7.38 ± 9.66% (Ndeff) < 8.17 ± 8.90% (Ceeff) < 11.02 ± 8.48% (Yeff) < 13.53 ± 6.72 (TREEeff).

Fig. 4
figure 4

A Comparison of the experimental REE leaching efficiencies and predicted REE leaching efficiencies using the MLR model, and the B the deviation of the predicted values from the experimental values using the % error

Modelling of REE leaching efficiency using stepwise regression

Leaching efficiency of TREEs can also be affected by an interaction among the different parameters. To determine such effects, we developed a first order multiple linear regression model with interaction effects between the explanatory variables using stepwise regression. The model follows the general form of:

$$y = \beta_{0} + \mathop \sum \limits_{i = 1}^{k} \beta_{i} \cdot X_{i} + \mathop \sum \limits_{i = 1}^{k - 1} \mathop \sum \limits_{j = i + 1}^{k} \beta_{ij} \cdot X_{i} \cdot X_{j} + \mathop \sum \limits_{i = 1}^{k - 2} \mathop \sum \limits_{j = k + 1}^{k - 1} \mathop \sum \limits_{l = j + 1}^{k} \beta_{ijl} \cdot X_{i} \cdot X_{j} \cdot X_{k}$$
(11)

where k is the number of explanatory variables [89]. A total of 18 interactions were analyzed including C H2SO4, T, t, S/L, C H2SO4·C H2SO4, C H2SO4·T, C H2SO4·t, C H2SO4·S/L, T·T, T·t, T·S/L, tt, t·S/L, S/L·S/LI, C H2SO4·T·t, C H2SO4·T·S/L, C H2SO4·t·S/L, and T·tS/L. The stepping method criteria used for inclusion in the model is the probability of F (entry ≤ 0.05 and removal ≥ 0.10). The coefficients, p values, 95% confidence interval, and other relevant statistics of the stepwise regression models of the possible interactions between the explanatory variables are presented in Table 8. For each of the REEs, two regression models were produced but the second model was selected for its non-multicollinearity such that Tolerance > 0.1 and VIF < 10 [89].

Table 8 Stepwise regression model of the interaction effects with significant p (p < 0.05) including their coefficients, p values, and the 95% confidence intervals

Among these possible interactions, we found that only C H2SO4 is significant for modelling the leaching efficiencies of all the REEs. The stepwise regression verifies the previous result of the MLR that the S/L ratio and its possible interaction with the other explanatory variables is not significant for the REE leaching efficiency from PG. And like the previous MLR models, there were very high accuracies in the regression models with R2 = 0.936–0.960, adjusted R2 = 0.928–0.955, a standard error of 3.3121–3.7875, and an overall p value of 0.000–0.011.

The analytical forms of the leaching models based on the interaction between the explanatory variables are therefore:

$${\text{La}}_{{{\text{eff}}}} \left( \% \right) = {14}.{686} + {4}.{613}\,C\,{\text{H}}_{{2}} {\text{SO}}_{{4}}$$
(12)
$${\text{Ce}}_{{{\text{eff}}}} \left( \% \right) = {9}.{517} + {4}.{82}0\,C\,{\text{H}}_{{2}} {\text{SO}}_{{4}}$$
(13)
$${\text{Nd}}_{{{\text{eff}}}} \left( \% \right) = {11}.{7}0{3} + {3}.{989}\,C\,{\text{H}}_{{2}} {\text{SO}}_{{4}}$$
(14)
$${\text{Y}}_{{{\text{eff}}}} \left( \% \right) = {14}.{785} + {4}.{157}\,C\,{\text{H}}_{{2}} {\text{SO}}_{{4}}$$
(15)
$${\text{TREE}}_{{{\text{eff}}}} \left( \% \right) = {12}.{8}0{4} + {4}.{324}\,C\,{\text{H}}_{{2}} {\text{SO}}_{{4}}$$
(16)

Validation of the model showed a very high correlation between the experimental and predicted values (Fig. 5A) with rExpt-Predicted (p < 0.01) = 0.914, 0.915, 0.918, 0.970, and 0.946 for La, Ce, Nd, Y, and TREE, respectively. Meanwhile, the average % error of the models is in the following order: Y (6.54 ± 9.24%) < L (7.58 ± 5.61%) < Ce (8.81 ± 6.63%) < TREE (8.89 ± 7.23%) < Nd (10.75 ± 8.03) as shown in Fig. 5B.

Fig. 5
figure 5

A Comparison of the experimental TREE leaching efficiency and predicted TREE using the stepwise regression model of interaction effects, and the B the deviation of the predicted values from the experimental values using the % error

The regression models consistently eliminate the significance of the S/L ratio. This makes sense since the leaching efficiency only increased by 3–5% as the mixture became more diluted in Step 4 of the experimental procedure (Fig. 2D). Unlike the previous regression model, the stepwise regression of possible interaction between the parameters eliminated the significance of T and t in the leaching of REEs. To validate the role of the S/L ratio that may probably not observe linear patterns, the ANN was used to find hidden patterns that the regression models were not able to identify.

Modelling of REE leaching efficiency using ANN

The modelling was carried out using a 4-9-5 architecture based on the recommendation by the improved version of the Kolmogorov theorem called the Kolmogorov Mapping Neural Network Existence Theorem. This theorem recommends a three-layer neural network composing of n inputs, 2n + 1 hidden layers, and m outputs [97]. Thus, the 4-9-5 ANN architecture corresponds to 4 neurons in the input layer, 9 neurons in one hidden layer, and 5 neurons in the output layer. The nntool was used to perform the computation in MATLAB R2021b. By default, this tool uses 70% of the data for training, 15% for validation, and 15% for testing.

The Levenberg–Marquardt backpropagation algorithm that updates the values of weights according to the Levenberg–Marquardt optimization was used to train a feed-forward ANN. In a backpropagation algorithm, the network continues until convergence or a maximum number of iterations is reached [98]. The pureline function was used as the activation function in the ANN structure. The number of epochs was set at an initial of 100 but the training process stopped after 3 iterations. The training stops when the model with the lowest root mean squared error on single test points is found. The correlation coefficient R for the training of the final model was 0.98073. The R values for the testing and the overall model were 0.98039 and 0.97379, respectively. The results are shown in Figs. 6A–C. The leaching efficiencies of REEs from PG can be predicted at very high accuracy using ANN. In contrast to the results of the regression models, the ANN was able to accurately predict REE leaching efficiency with very high R values even after considering the S/L ratio.

Fig. 6
figure 6

Leaching efficiency predicted by the neural network in the A training, B testing, and C the overall model versus the experimental leaching efficiency showed very high R using the 4-9-5 ANN architecture

Conclusions

This work investigated for the first time, the leaching efficiency of the REEs from Philippine PG with H2SO4 and optimized the relevant parameters: acid concentration > temperature > time > solid-to-liquid ratio using Taguchi method, regression, and ANN analysis. A TREEs leaching efficiency of 71% (La 75%, Ce 72%, Nd 71% and Y 63%) was realized and it could be shown that the modelling approaches are powerful tools to predict and optimize the leaching efficiencies of REEs from Philippine PG. The experiments described here, though very successful, were all conducted at laboratory scale, and it is recommended to conduct larger pilot plant scale experiments next, to better understand the potential of REE recovery from Philippine PG on larger scale.