Machine learning insights in predicting heavy metals interaction with biochar

Wei, Xin; Liu, Yang; Shen, Lin; Lu, Zhanhui; Ai, Yuejie; Wang, Xiangke

doi:10.1007/s42773-024-00304-7

Machine learning insights in predicting heavy metals interaction with biochar

Perspective
Open access
Published: 25 January 2024

Volume 6, article number 10, (2024)
Cite this article

Download PDF

You have full access to this open access article

Biochar Aims and scope Submit manuscript

Machine learning insights in predicting heavy metals interaction with biochar

Download PDF

Xin Wei^1,2,
Yang Liu³,
Lin Shen⁴,
Zhanhui Lu²,
Yuejie Ai³ &
…
Xiangke Wang ORCID: orcid.org/0000-0002-3352-1617³

1783 Accesses
Explore all metrics

Abstract

The use of machine learning (ML) in the field of predicting heavy metals interaction with biochar is a promising field of research, mainly because of the growing understanding of how removal efficiency is affected by characteristic variables, reaction conditions and biochar properties. The practical application in biochar still faces large challenges, such as difficulties in data collection, inadequate algorithm development, and insufficient information. However, the quantity, quality, and representation of data have a large impact on the accuracy, efficiency, and generalizability of machine learning tasks. From this perspective, the present data descriptors, the efficiency of machine learning-aided property and performance prediction, the interpretation of underlying mechanisms and complicated relationships, and some potential ways to augment the data are discussed regarding the interactions of heavy metals with biochar. Finally, future perspectives and challenges are discussed, and an enhanced model performance is proposed to reinforce the feasibility of a particular perspective.

Graphical Abstract

Highlights

A high growth rate of studies on the application of machine learning (ML) in biochar in recent years.
ML interpretability of heavy metals (HMs) interaction mechanisms with biochar is explicated emphatically.
Challenges and perspectives of ML application in the removal of HMs by biochar.
Combining an advanced machine learning technique to achieve better predicted performance.

Machine learning assisted adsorption performance evaluation of biochar on heavy metal

Article 20 January 2024

Enhancing lead adsorption capacity prediction in biochar: a comparative study of machine learning models and parameter optimization

Article 10 November 2023

Aqueous Adsorption of Pharmaceutical Pollutants on Biochar: a Review on Physicochemical Characteristics, Classical Sorption Models, and Advancements in Machine Learning Techniques

Article 23 October 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Heavy metals (HMs), which are serious toxic pollutants to water and soil, have negative effects on human health and the ecological environment. There are various treatment technologies available for removing HMs from solution, such as adsorption, photocatalysis, electrochemistry, and membrane separation (Hao et al. 2023; Gu et al. 2022; Liu et al. 2023a; Uliana et al. 2021). Biochar is an attractive absorbent material, which is carbon-rich and generated via pyrolysis of biomass under no oxygen or oxygen-limited conditions (Huang et al. 2019). Moreover, it is environmentally friendly and cost-efficient and has many predominant physical and chemical properties, such as high surface area, unique pore structure, abundant functional groups, and stable framework. Hence, the application of biochar to remove HMs in the environment is extensive (Liu et al. 2022). Moreover, the adsorption capacity can be improved through various modification techniques, including magnetizing biochar, metal impregnation, and plasma-treated biochar, which increase the utilization of biochar adsorbents (Fang et al. 2023).

The adsorption capacity of biochar for HMs is related to many factors, including various adsorption mechanisms (e.g., surface complexation, electrostatic interaction, precipitation), biochar properties (e.g., surface area, functional groups, pore size), and experimental conditions (e.g., time, temperature, concentration) (Qiu et al. 2022). However, many possible combinations of these methods result in traditional controlled variable experimental techniques, which are not ideal routes for rapid screening of high-performance biochar. Hence, machine learning, as a pop and data-driven research paradigm for exploring complex relationships and determining features of adsorption performance, is warranted (Wei et al. 2023, 2024). We searched published articles with machine learning and biochar as keywords on the Web of Science website up to 2023. Using SiteSpace software (citespace.podia.com), the obtained articles are visualized in Fig. 1. The number of related studies has increased, and more attention has been given to HMs in recent years. Cao et al. (2016) pioneered the employment of the artificial neural network (ANN) model and least squares support vector machine (LSSVM) model to predict biochar yield during the pyrolysis of cattle manure. Although several correlation analyses, meta-analyses, and interpretability analyses have been carried out to evaluate the adsorption capacity of biochar for HMs, machine learning model-based prediction studies are still lacking (Wang et al. 2023). Zheng et al. (2022) and Almalawi et al. (2022) developed hybrid deep learning models to improve the predictive performance of various descriptors for the adsorption capacity of HMs. A better strategy for building adsorption models was to use hybrid models and optimization algorithms. Additionally, extensive studies have focused more on obtaining deeper insights into the prominent characteristics of adsorption properties (Leng et al. 2022; Sun et al. 2022b). Although the factors influencing biochar adsorption capacity are complex and diverse, the interpretability of machine learning allows one to understand exactly the causes of biochar adsorption in association with model predictions. Black-box models (e.g., ANNs and SVMs) are considered to lack transparency and reliability. As an explainable artificial intelligence method, rough set machine learning is used to improve the interpretability of the predicted biochar surface properties (Ang et al. 2023). The interpretability of models could also improve through sensitivity analysis of experimental features (Chen et al. 2022). The most common methods involve the use of a partial dependence plot (PDP), local surrogate (LIME) and Shapley value (SHAP), which can be used to determine the importance of input-influencing features on adsorption capacity. In turn, these methods further add to the interpretability of biochar adsorption mechanisms and are instructive for future development directions. For example, an adaptive network-based fuzzy inference system (ANFIS) model was used to predict the adsorption removal efficiency of arsenate (As(III) by Al-Yaari et al. (2022). The statistical parameter R% was used to interpret the relative importance of input features, and the results established that the most dominant featurea were pH, initial As concentration, and contact time. Da et al. (2022) showed that the multilayer perceptron model with two hidden layers had the best prediction effect on biochar for uranium adsorption capacity by comparing four machine learning methods. With the permutation feature importance method, they found that the uranium adsorption capacity was highly dependent on the specific surface area (SA), and the optimal range of SA was 500 ~ 1200 m²g⁻¹. Zhu et al. (2022) developed models with interpretable random forest algorithms and obtained dependent relationships between target properties and input critical descriptors via PDP analysis. The results suggested that iron impregnation increased the C–O and C=O ratios of the iron-biochar composite, which in turn better facilitated Cr(VI) removal by the iron-biochar. Nevertheless, readily available macroscopic properties as descriptors have received increased attention in existing machine learning research, while the exploitation of more informative adsorbent descriptors (e.g., atomic number, topological structure, molecular fragment, surface area, energy) for building model processes is still lacking. In addition, the great economic cost of these experiments and the complexity of biochar properties have led to the building machine learning models with insufficient data or low-quality data. In these cases, the prediction results can be biased, non-convergent, unstable, or overfitted. This is also a hurdle to jump of current machine learning models. It is suggested that surface functional groups are more important than elemental composition for evaluating the adsorption capacity of biochar for HMs. This research has several limitations and may lead to uncertainty in the results owing to the lack of data directly relevant to HM adsorption (Palansooriya et al. 2022). Zhang et al. (2023) comprehensively and systematically reviewed recent works on the application of biochar as an adsorbent for pollutant removal via machine learning. The data scale, the effectiveness of the datasets, and the construction of corresponding databases need to be carefully considered in future research.

2 Descriptors employed in machine learning for data-driven adsorption studies

The first stage in applying machine learning to chemistry research is to determine the characteristics and information of chemistry that are acceptable to machine learning. The selection of suitable input descriptors (features) plays a critical role in improving the accuracy of machine learning prediction models and uncovering the key features that influence adsorbent capacity and selectivity. Thousands of different descriptors have been mined depending on the objective application and machine learning algorithm. For example, biochar yield is closely related to biomass characteristics and pyrolysis conditions, the proximate composition and elemental composition of biomass materials can be regarded as input descriptors, and experimental conditions such as temperature, heating rate, and retention time are also considered (Li et al. 2023a). As shown in Fig. 2, descriptors can be classified into different categories. According to the data source, the descriptors can be classified into experimental-based descriptors, theory-guided descriptors, and descriptors for combining experimental data and theoretical calculations. Descriptors can also be divided into qualitative and quantitative descriptors. Qualitative descriptors, also known as molecular fingerprints, encode molecular characteristics using MACCS keys, Morgan fingerprints, daylight fingerprints, etc.. The latter abstracts the molecular structure into descriptors by field or graph theory methods. According to the different data types, the descriptors can be divided into integers (i.e., atomicity), real numbers (i.e., molecular weight), and vectors (i.e., dipole moment). The dimensions of the molecular structure required to calculate descriptors include zero-dimensional descriptors, one-dimensional descriptors, two-dimensional descriptors, and three-dimensional descriptors. These descriptors can also be divided into topological descriptors, geometric descriptors, composition descriptors, and molecular property descriptors according to the difference in physical meaning. Given that there are no hard or absolute rules to govern this selection, developing and adopting new descriptors to enhance the usability of machine learning algorithms and promote the process of rationalizing in the field of chemistry with a more open attitude is necessary.

3 Machine learning applications in the removal of HMs by biochar

Using machine learning can effectively predict the properties and capacities of biochar, discover their underlying reaction mechanisms and complex relationships, and help design new materials. For the removal of HMs by biochar, machine learning-aided prediction helps with the rapid innovation and screening of high-performance materials. The experimental remediation ratios and the corresponding prediction results of the ANN model are shown in Fig. 3a, and further sensitivity analysis of the ANN model is shown in Fig. 3b. The ANN model performance is acceptable, and the contributions of the input descriptors for modelling are were compared. The contributions of biochar properties, especially biochar pH, were greater for immobilization efficiency than for soil physiochemical properties and other factors (Sun et al. 2022a). Indeed, as different machine learning models are being used to correlate input descriptors to HM removal interactions with biochar, selecting appropriate descriptors for modelling is a challenge. For example, genetic programming yields good predictions and further yields a simple mathematical expression for the adsorption process to determine the quantitative relationship between biosorption capacity and input descriptors. The descriptors in this study were grouped into biochar characteristics, biosorption conditions, initial concentration ratio of HMs to biochar, and heavy metal characteristics. Finally, the initial concentration ratio of heavy metals/biochar and the carbon content of the biochar were found to be the most influential descriptors by sensitivity analysis (Dashti et al. 2023). As mentioned above, discussions on the sensitivity analysis of machine learning models are always used to guide model understanding and parameter importance. Machine learning methods have grown to become powerful tools for uncovering hidden relationships. The sensitivity of machine learning models, that is, the parameters in the model may vary under different input conditions and are vital in the performance and output results of models. Sensitivity analysis is a popular method for evaluating the performance of a model when parameters change. This approach can provide insight into the impact of input parameters on the target; for instance, by calculating the gradient, relevancy factor, partial derivative, or other relevant covariates, one can determine the responses of the model to variations in different parameters. In addition, there are more common methods for evaluating the sensitivity of models, which can be divided into parameter sensitivity analysis, feature importance analysis, local sensitivity analysis (LSA), global sensitivity analysis (GSA), the gradient method, backpropagation sensitivity analysis, and Monte Carlo simulation. Zhao et al. (2021) demonstrated a new approach through the application of a kernel extreme learning machine (KELM), kriging models, and local sensitivity analysis to identify sensitive parameters influencing the adsorption process. The LSA usually studies a single input parameter once, but the GSA concurrently studies the variations across the entire spectrum of all input parameters. When a global understanding or consideration of the interactions between parameters is needed, GSAs, such as Sobol indices, can be used to measure the sensitivity of the whole input space (Sun et al. 2022a).

In addition, for modified biochar, suitable descriptors can be used to investigate adsorption mechanisms and aid engineered biochar design. For example, high-efficiency removal is often difficult to achieve with pristine biochar due to the electrostatic repulsion between the negative charge on the biochar surface and oxyanions. Fe possesses strong affinities for As and is an attractive modification metal for use in decorating biochar. Biomass characteristics, As species, initial concentrations, and adsorption conditions were used as input descriptors for machine learning modelling. Among them, As species were the first to be considered input descriptors. However, As species are not important because As(III) and As(V) have similar removal mechanisms on biochar. The As(V) adsorption capacity as a function of the Fe content is positively correlated according to the partial dependence plot analysis. Statistical comparisons revealed that the Fe content, as a direct factor in As adsorption capacity, was relatively limited. The possible interactions between As(III), As(V), and Fe-modified biochar through FeOH and FeOH₂⁺ groups may be the dominant mechanism (Liu et al. 2023b). As shown in Fig. 3c, the partial least squares path model (PLS-PM) also quantified the direct and indirect effects of key descriptors on the HM immobilization ratio. The electrical conductivity (EC_BC of biochar, EC_soil of soil), cation exchange capacity (CEC_BC of biochar, CEC_soil of soil), organic carbon (OC), and biochar application rate (Rate_BC) were considered input descriptors. Soil pH and OC content directly positively influence the immobilization of Cd(II). At low soil pH, H⁺ and Cd²⁺ undergo electrostatic repulsion by competing with adsorption sites, and Cd²⁺ precipitates and reacts easily with abundant oxygen-containing functional groups in an alkaline environment. For the immobilization ratio of Zn(II), higher C and N contents are better for Zn(II) immobilization, and an increase in the surface area of biochar (SSA) will provide more activated adsorption sites. Pb(II) immobilization can occur through precipitation and cation exchange. The OC content promoted Pb(II) immobilization because the surface complex reaction occurred with C–π and –COO–. Similarly, the Cu(II) immobilization ratio should increase with increasing C and N contents because the complex binding sites are provided by carbon-containing functional groups. The results of the statistical analysis showed that cation exchange is important for Cu(II) immobilization (Guo et al. 2023). The metals can be divided into cationic metals and anionic metals according to their species. Investigations of the influence of input descriptors on modelling have shown some differences between them.

A meta-analysis, a scientific statistical analysis algorithm, was used to explain the immobilization of different anionic metals. As shown in Fig. 3d, the mechanism underlying the immobilization of four anionic metals on biochar in soil mainly includes the following steps: (1) first, surface complexation of anionic metals with functional groups (e.g., C–O groups, O=C–O groups, C–OH groups, etc.) occurs on biochar; (2) second, electrostatic interactions are caused by positive and negative charges on metal ions and adsorbents, which depend on pH and speciation of ions. Explanatory variable analysis also indicated that biochar pH and soil pH are the key factors influencing HM immobilization; (3) third, precipitation or coprecipitation can occur via the synergistic effect of cations and anions (Zhang et al. 2022b). In addition, Zhang et al. (2022a) used X-ray micro-CT imaging to establish a novel 3D in situ visualization method for Pb(II) adsorption on biochar particles. The images were reconstructed and subsequently segmented via the K-means clustering unsupervised machine learning algorithm. Semiquantitative 3D in situ visualization analysis of the rendered images revealed the mechanism of Pb(II) adsorption on the different biochar particles. Coconut shell-activated char had a low adsorption capacity for Pb(II), which was mainly due to its neutral pH value, thus limiting precipitation and π electronic interactions. Micro-CT showed that the lowest Pb(II) concentration in the core of the particles was inseparable from the smallest pore diameter and largest micropore volume. There was sufficient Pb(II) diffusion in the rice husk biochar with a thin-walled porous morphology. Based on the typical Crank model, the intraparticle diffusion of adsorbed particles may be explained by a function of time and the radial distance from the surface to the centre of a particle. For the wheat straw biochar, the concentration was the highest in the outer layer of the particles, and the concentration decreased outside of the ellipse. This was attributed to the relatively uniform microstructure. This 3D in situ visualization provides new insight into the adsorption mechanism via image representation.

These studies built machine learning models based on biochar properties, experimental conditions, and pollutant characteristics to predict the HM adsorption capacity of biochar or engineered biochar. Then, the direct and indirect influences of the input descriptors are revealed via interpretability analysis, which contributes significantly to developing and exploring novel viewpoints on material design, mechanism analysis, and process optimization. However, many more likely contributing descriptors, such as pH_pzc, metal content, surface functional group, reaction energy, and experimental spectral data, have yet to be detected. For constructing a reliable and accurate model, the relevant data reported in the published literature are very sparse. Therefore, for a more comprehensive and greater understanding of the HM interaction mechanism, implementing related experiments and encouraging additional feature analyses of the modelling process in further studies are urgently needed. Compared with practical big data problems, the currently collected dataset has relatively sparse, discrete, and noise data, which is a common difficulty that exists in the intersection between machine learning and information on physicochemical properties of materials, possibly due to high experimental costs and errors. Another key challenge in machine learning applications is choosing and building a suitable model, especially for small datasets. Although many studies have demonstrated the feasibility of using machine learning algorithms with relatively small datasets, increasing the amount of data is still effective. Chen et al. (2023) utilized data augmentation as a powerful auxiliary modelling tool to compensate for the lack of data and build an optimal RF model to predict the characteristics of hydrothermal biochar. The sensitivity analysis was subsequently used for RF model interpretation. The findings showed that temperature affects the hydrothermal reaction intensity and subsequently affects the mass yield of organic carbon and the total P content in biochar, which are the main key features of biochar preparation by hydrothermal carbonization. Compared with that of traditional machine learning algorithms, the accuracy of biochar property prediction greatly improved on average from 5.8% to 15.8% after data enhancement.

More data and factors can be considered to improve the prediction accuracy of models and provide a clearer understanding of the underlying mechanism. Adding related data from the additives in the modelling of Cr(III) and Cd(II) migration during pyrolysis cannot be ignored (Li et al. 2023b). The addition of biomass waste additives (BWA) increases the carbon and volatile matter contents of sludge and manure, which are harmful to the total concentration (TC) of heavy metals. The different types of inorganic additives (IAs) had different effects on the TC concentration of heavy metals and the retention rate (RR) of Cr(III) in biochar. Ca-based IAs have high thermal stability and are left in solid to increase biochar yield, which decreases the Cr(III) content while increasing its retention in biochar. Na- and Al-based IAs both increase Cr(III) content and retention, while K-based IAs have the opposite effect. The TC and RR of Cd(II) decreased with increasing BWA. Due to the low thermal stability of Cd(II), it might transfer from the solid to the gas or liquid phase during pyrolysis through decarboxylation, dehydration, and demethylation. Adding BWA can increase the oxygen and hydrogen content, thereby accelerating the pyrolysis process, which also decreases the feedstock nitrogen content to lower the RR of biochar. In terms of adding IA, K-IA, e.g., KOH and K₂CO₃, has a strong binding capacity and high specific surface area. Thus, Cd(II) may be selectively adsorbed and trapped easily by carbon-based materials with K-IA. However, not every publication has all the information we want; it is an enormous challenge to construct a satisfactory database if we consider all the relevant factors. Many studies have limitations in real applications, which are related to the quality and quantity of the collected data. Due to the variety of research methods, research objectives and experimental conditions, the input features selected according to the output targets are indeterminate. For example, the efficiency of immobilization is determined based on a wide array of features, such as bioavailability, the exchangeable fraction, the labile fraction, leaching, and the water-soluble fraction of HMs. There are many ways to determine effective HM concentrations in soil using deionized water, diethylenetriaminepentaacetic acid, toxicity characteristic leaching procedures, and calcium chloride extraction methods. These limitations may cause uncertainty in the prediction results and prevent us from precisely mirroring real-world conditions (Palansooriya et al. 2022). Therefore, studies should emphasize increasing the uses of effective datasets in the modelling process, even when constructing a comprehensive database that considers all HMs and important descriptors to improve machine learning models. This approach can provide a full understanding of environmental density during biochar application, which will be more meaningful and realistic.

Herein, we compare the accuracy of our model with that of a previous model (Dashti et al. 2023). Our model proposes an integrated approach based on the Gaussian noise-based data augmentation method and the LSSVM model to predict the sorption efficiency of biochar. In general, directly using the training set for developing the machine learning model may yield unacceptable results in simulations due to the overfitting problem and poor generalization ability. To achieve this, an equal amount of Gaussian noise is calculated from a normal distribution and randomly assigned to the variables of the dataset. The formula is as follows:

$$\begin{array}{c}N\left(x\right)=\frac{1}{\sigma \sqrt{2\pi }}\text{exp}\left(-\frac{{\left(x-\mu \right)}^{2}}{2{\sigma }^{2}}\right)\end{array}$$

(1)

where the expectation of Gaussian distribution is $\mu$, the variance of Gaussian distribution is $\sigma$, from which we can obtain:

$$\begin{array}{c}G\left(x\right)=g\left(x\right)+N\left(x\right)\end{array}$$

(2)

where $g\left(x\right)$ is the original data distribution, and $G\left(x\right)$ is the function of augmented data. The experimental and comparable datasets were obtained from Dashti’s study. As shown in Fig. 4a, when we added Gaussian noise to the dataset, the R² of the testing set increased from 0.94 to 0.97. In addition, we selected 30% of the experimental data as the training set. As shown in Fig. 4b, the R² of the LSSVM model on the testing set was 0.88, which increased to 0.93 after Gaussian noise-based data augmentation. As shown in Fig. 4c and d, compared with the distribution of sorption values predicted by the LSSVM, the results obtained by combining the Gaussian noise-based data augmentation method were more consistent with the experimental data, especially at the extremum points. The data augmentation method uses virtual samples generated from a small dataset to improve the generalization ability and performance of the prediction model, which also helps the process of further analysis to some extent.

Similarly, the relevancy factor (r) is also counted in the sensitivity analysis. The sensitivity analysis provides a quantitative description for measuring the importance level of input features on the output labels. The relevancy factor results are calculated from the predicted values. As shown in Fig. 5a), as the absolute value of r increases, the corresponding feature becomes more important for determining the sorption efficiency. Our model prediction results obtained consistent feature importance with the experimental data (Dashti et al. 2023). Next, we discuss the Sobol method that is used to compare the models’ stability on testing data. The Sobol sensitivity indices can be divided into the first-order, second-order, total‐order, and higher‐order sensitivity indices. The total‐order sensitivity indices are appropriate for evaluating a full range of feature spaces and the influence of different disturbance values on the output label values. In this study, minimum and maximum feature values were set after rescaling to [0, 1]. In Fig. 5b, the sensitivity indices determine the contribution of the features’ interactions to the overall model output label. The results indicate that the model had low sensitivity to variations in features within a limited range, and the stability of the model was verified via data augmentation.

4 Conclusion, perspectives and challenges

In conclusion, the latest applications of machine learning as an advanced tool for determining the adsorption performance of biochar are summarized. Using sensitivity and interpretability analyses of models has become the pursuit of researchers. Thus, there is a demand for exploring novel, efficient descriptors, high-quality databases, and practical techniques to accelerate intelligent experimental control. Based on the related literature, the following aspects deserve special mention as perspectives and challenges for promoting the operational application of machine learning in the removal of HMs by biochar in the environment and minimizing the disparities in knowledge between dissimilar subjects:

1.
The input feature space should be expanded to obtain more closely correlated or more important properties; some studies have used a strategy that adopts graph data as molecular representations and combined deep learning methods. In addition, the removal efficiency of HMs by biochar depends heavily on the molecular structure characteristics of the biochar, and there is little related research and discussion. In future research, more input descriptors need to be explored in detail.
2.
Machine learning is essentially a statistical method that has certain requirements for the amount of data, especially in deep learning, which has higher demands on data than traditional machine learning. Using active learning, Bayesian optimization, or other algorithms to obtain more valuable data is effective, e.g., combined with Gaussian noise-based data augmentation could improve the accuracy and generalizability of the model in this study.
3.
The application of machine learning techniques to biochar cultivation is scalable and practical, but several challenges remain in terms of building and normalizing design databases. Standardization of the data format and data conventions is a key avenue for increasing data accessibility to researchers and benefiting transdisciplinary applications. However, significant obstacles are waiting to be overcome, datasets with comprehensive and unified information are expected to be obtained through either the addition of generalized descriptors or the combination of quantum computing.
4.
Neural network algorithms are extensively applied in materials science, but a notable challenge lies in their inherent lack of interpretability. This lack of transparency hinders the understanding of the underlying relationships between input features and output predictions, limiting the trust and adoption of these models in critical decision-making processes within materials science. Addressing the interpretability gap of the existing models remains a crucial area of research, and it is also necessary and urgent to exploit grey-box or white-box models with high interpretability to promote the acquisition of clear physiochemical laws.
5.
Impacted by the big data-driven “fourth paradigm”, the application of machine learning in the optimization of HM removal processes is both an opportunity and a challenge. It is necessary to concentrate on mitigating the existing constraints through further research and enable more researchers to utilize biochar materials efficiently to protect the environment.

Data availability

Authors can confirm that all relevant data are included in the article.

References

Al-Yaari M, Aldhyani THH, Rushd S (2022) Prediction of arsenic removal from contaminated water using artificial neural network model. Appl Sci 12(3):999. https://doi.org/10.3390/app12030999
Article CAS Google Scholar
Almalawi A, Khan AI, Alqurashi F, Abushark YB, Alam MM, Qaiyum S (2022) Modeling of remora optimization with deep learning enabled heavy metal sorption efficiency prediction onto biochar. Chemosphere 303:135065. https://doi.org/10.1016/j.chemosphere.2022.135065
Article CAS Google Scholar
Ang JC, Tang JY, Chung BYH et al (2023) Development of predictive model for biochar surface properties based on biomass attributes and pyrolysis conditions using rough set machine learning. Biomass Bioenergy 174:106820. https://doi.org/10.1016/j.biombioe.2023.106820
Article CAS Google Scholar
Cao HL, Xin Y, Yuan QX (2016) Prediction of biochar yield from cattle manure pyrolysis via least squares support vector machine intelligent approach. Bioresour Technol 202:158–164. https://doi.org/10.1016/j.biortech.2015.12.024
Article CAS Google Scholar
Chen C, Liang R, Ge YD et al (2022) Fast characterization of biomass pyrolysis oil via combination of ATR-FTIR and machine learning models. Renew Energy 194:220–231. https://doi.org/10.1016/j.renene.2022.05.097
Article CAS Google Scholar
Chen C, Wang Z, Ge YD et al (2023) Characteristics prediction of hydrothermal biochar using data enhanced interpretable machine learning. Bioresour Technol 377:128893. https://doi.org/10.1016/j.biortech.2023.128893
Article CAS Google Scholar
Da TX, Ren HK, He WK, Gong SY, Chen T (2022) Prediction of uranium adsorption capacity on biochar by machine learning methods. J Environ Chem Eng 10(5):108449. https://doi.org/10.1016/j.jece.2022.108449
Article CAS Google Scholar
Dashti A, Raji M, Harami HR, Zhou JL, Asghari M (2023) Biochar performance evaluation for heavy metals removal from industrial wastewater based on machine learning: application for environmental protection. Sep Purif Technol 312:123399. https://doi.org/10.1016/j.seppur.2023.123399
Article CAS Google Scholar
Fang L, Huang T, Lu H et al (2023) Biochar-based materials in environmental pollutant elimination, H₂ production and CO₂ capture applications. Biochar 5:42. https://doi.org/10.1007/s42773-023-00237-7
Article CAS Google Scholar
Gu H, Liu X, Wang S et al (2022) COF-based composites: extraordinary removal performance for heavy metals and radionuclides from aqueous solutions. Rev Environ Contam Toxicol 260:23. https://doi.org/10.1007/s44169-022-00018-6
Article Google Scholar
Guo GM, Lin LY, Jin FM, Mašek O, Huang Q (2023) Application of heavy metal immobilization in soil by biochar using machine learning. Environ Res 231:116098. https://doi.org/10.1016/j.envres.2023.116098
Article CAS Google Scholar
Hao M, Liu Y, Wu W et al (2023) Advanced porous adsorbents for radionuclides elimination. EnergyChem 5:100101. https://doi.org/10.1016/j.enchem.2023.100101
Article CAS Google Scholar
Huang Q, Song S, Chen Z et al (2019) Biochar-based materials and their applications in removal of organic contaminants from wastewater: state-of-the-art review. Biochar 1:45–73. https://doi.org/10.1007/s42773-019-00006-5
Article Google Scholar
Leng LJ, Yang LH, Lei XN et al (2022) Machine learning predicting and engineering the yield, N content, and specifc surface area of biochar derived from pyrolysis of biomass. Biochar 4:63. https://doi.org/10.1007/s42773-022-00183-w
Article CAS Google Scholar
Li HL, Ai ZJ, Yang LH, Zhang WJ, Yang ZQ, Peng HY, Leng LJ (2023a) Machine learning assisted predicting and engineering specific surface area and total pore volume of biochar. Bioresour Technol 369:128417. https://doi.org/10.1016/j.biortech.2022.128417
Article CAS Google Scholar
Li J, Pan LJ, Li ZW, Wang Y (2023b) Unveiling the migration of Cr and Cd to biochar from pyrolysis of manure and sludge using machine learning. Sci Total Environ 885:163895. https://doi.org/10.1016/j.scitotenv.2023.163895
Article CAS Google Scholar
Liu ZX, Xu ZY, Xu LF et al (2022) Modified biochar: synthesis and mechanism for removal of environmental heavy metals. Carbon Res 1:8. https://doi.org/10.1007/s44246-022-00007-3
Article Google Scholar
Liu X, Li Y, Chen Z et al (2023a) Recent progress of COFs membranes: design, synthesis and application in water treatment. Eco-Environ Health 2:117–130. https://doi.org/10.1016/j.eehl.2023.07.001
Article Google Scholar
Liu JX, Xu ZL, Zhang WJ (2023b) Unraveling the role of Fe in as(III & V) removal by biochar via machine learning exploration. Sep Purif Technol 311:123245. https://doi.org/10.1016/j.seppur.2023.123245
Article CAS Google Scholar
Palansooriya KN, Li J, Dissanayake PD et al (2022) Prediction of soil heavy metal immobilization by biochar using machine learning. Environ Sci Technol 56(7):4187–4198. https://doi.org/10.1021/acs.est.1c08302
Article CAS Google Scholar
Qiu MQ, Liu LJ, Ling Q, Cai YW, Yu SJ, Wang SQ, Fu D, Hu BW, Wang XK (2022) Biochar for the removal of contaminants from soil and water: a review. Biochar 4(1):19. https://doi.org/10.1007/s42773-022-00146-1
Article CAS Google Scholar
Sun Y, Zhang YY, Lu L, Wu YJ, Zhang YC, Kamran MA, Chen BL (2022a) The application of machine learning methods for prediction of metal immobilization remediation by biochar amendment in soil. Sci Total Environ 829:154668. https://doi.org/10.1016/j.scitotenv.2022.154668
Article CAS Google Scholar
Sun ZY, Feng L, Li YQ, Han YM, Zhou HJ, Pan JT (2022b) The role of electrochemical properties of biochar to promote methane production in anaerobic digestion. J Clean Prod 362:132296. https://doi.org/10.1016/j.jclepro.2022.132296
Article CAS Google Scholar
Uliana AA, Bui NT, Kamcev J, Taylor MK, Urban JJ, Long JR (2021) Ion-capture electrodialysis using multifunctional adsorptive membranes. Science 372(6539):296–299. https://doi.org/10.1126/science.abf5991
Article CAS Google Scholar
Wang RP, Zhang SY, Chen HL et al (2023) Enhancing biochar-based nonradical persulfate activation using data-driven techniques. Environ Sci Technol 57(9):4050–4059. https://doi.org/10.1021/acs.est.2c07073
Article CAS Google Scholar
Wei X, Peng D, Shen L, Ai YJ, Lu ZH (2023) Analyzing of metal organic frameworks performance in CH₄ adsorption using machine learning techniques: a GBRT model based on small training dataset. J Environ Chem Eng 11(3):110086. https://doi.org/10.1016/j.jece.2023.110086
Article CAS Google Scholar
Wei X, Lu Z, Ai Y, Shen L, Wei M, Wang X (2024) Implementing and understanding the unsupervised transfer learning in metal organic framework toward methane adsorption from hypothetical to experimental data. Sep Purif Technol 330:125291. https://doi.org/10.1016/j.seppur.2023.125291
Article CAS Google Scholar
Zhang HH, Li YF, Xie RY, Zhu Y, Shi S, Yang ZL, Han LJ (2022a) A particle scale micro-CT approach for 3D in-situ visualizing the pb (II) adsorption in different crop residue-derived chars. Bioresour Technol 344:126269. https://doi.org/10.1016/j.biortech.2021.126269
Article CAS Google Scholar
Zhang YJ, Ren M, Tang YM et al (2022b) Immobilization on anionic metal(loid)s in soil by biochar: a meta-analysis assisted by machine learning. J Hazard Mater 438:129442. https://doi.org/10.1016/j.jhazmat.2022.129442
Article CAS Google Scholar
Zhang WT, Chen RH, Li J, Huang TY, Wu BD, Ma J, Wen QQ, Tan J, Huang WG (2023) Synthesis optimization and adsorption modeling of biochar for pollutant removal via machine learning. Biochar 5(1):25. https://doi.org/10.1007/s42773-023-00225-x
Article CAS Google Scholar
Zhao Y, Li YL, Fan D, Song JP, Yang F (2021) Application of kernel extreme learning machine and kriging model in prediction of heavy metals removal by biochar. Bioresour Technol 329:124876. https://doi.org/10.1016/j.biortech.2021.124876
Article CAS Google Scholar
Zheng XL, Nguyen H (2022) A novel artificial intelligent model for predicting water treatment efficiency of various biochar systems based on artificial neural network and queuing search algorithm. Chemosphere 287:132251. https://doi.org/10.1016/j.chemosphere.2021.132251
Article CAS Google Scholar
Zhu XZ, Xu ZB, You SM et al (2022) Machine learning exploration of the direct and indirect roles of Fe impregnation on cr(VI) removal by engineered biochar. Chem Eng J 428:131967. https://doi.org/10.1016/j.cej.2021.131967
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank the valuable comments of anonymous reviewers and editor.

Funding

Financial support from NSFC (22076044, U2267222) was acknowledged.

Author information

Authors and Affiliations

School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, People’s Republic of China
Xin Wei
School of Mathematics and Physics, North China Electric Power University, Beijing, 102206, People’s Republic of China
Xin Wei & Zhanhui Lu
MOE Key Laboratory of Resources and Environmental Systems Optimization, College of Environment and Chemical Engineering, North China Electric Power University, Beijing, 102206, People’s Republic of China
Yang Liu, Yuejie Ai & Xiangke Wang
Key Laboratory of Theoretical and Computational Photochemistry of Ministry of Education, College of Chemistry, Beijing Normal University, Beijing, 100875, People’s Republic of China
Lin Shen

Authors

Xin Wei
View author publications
You can also search for this author in PubMed Google Scholar
Yang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Lin Shen
View author publications
You can also search for this author in PubMed Google Scholar
Zhanhui Lu
View author publications
You can also search for this author in PubMed Google Scholar
Yuejie Ai
View author publications
You can also search for this author in PubMed Google Scholar
Xiangke Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XW: Investigation, writing-original draft and editing; YL: Writing-original draft and editing; LS: Investigation; ZL: Investigation; YA: Investigation; XW: Writing, review and editing. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Zhanhui Lu, Yuejie Ai or Xiangke Wang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Agree.

Competing interests

Xiangke Wang is an Associate editor of Biochar and was not involved in the editorial review, or the decision to publish this article. All authors declare that there are no competing interests in this manuscript.

Additional information

Handling editor: Wenfu Chen.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wei, X., Liu, Y., Shen, L. et al. Machine learning insights in predicting heavy metals interaction with biochar. Biochar 6, 10 (2024). https://doi.org/10.1007/s42773-024-00304-7

Download citation

Received: 31 October 2023
Revised: 25 December 2023
Accepted: 11 January 2024
Published: 25 January 2024
DOI: https://doi.org/10.1007/s42773-024-00304-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine learning insights in predicting heavy metals interaction with biochar