Abstract
Due to large specific surface area, abundant functional groups and low cost, biochar is widely used for pollutant removal. The adsorption performance of biochar is related to biochar synthesis and adsorption parameters. But the influence factor is numerous, the traditional experimental enumeration is powerless. In recent years, machine learning has been gradually employed for biochar, but there is no comprehensive review on the whole process regulation of biochar adsorbents, covering synthesis optimization and adsorption modeling. This review article systematically summarized the application of machine learning in biochar adsorbents from the perspective of all-round regulation for the first time, including the synthesis optimization and adsorption modeling of biochar adsorbents. Firstly, the overview of machine learning was introduced. Then, the latest advances of machine learning in biochar synthesis for pollutant removal were summarized, including prediction of biochar yield and physicochemical properties, optimal synthetic conditions and economic cost. And the application of machine learning in pollutant adsorption by biochar was reviewed, covering prediction of adsorption efficiency, optimization of experimental conditions and revelation of adsorption mechanism. General guidelines for the application of machine learning in whole-process optimization of biochar from synthesis to adsorption were presented. Finally, the existing problems and future perspectives of machine learning for biochar adsorbents were put forward. We hope that this review can promote the integration of machine learning and biochar, and thus light up the industrialization of biochar.
Graphical Abstract
![](http://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs42773-023-00225-x/MediaObjects/42773_2023_225_Figa_HTML.png)
Highlights
-
All-round regulation of biochar by machine learning (ML) was reviewed for the first time.
-
Optimization of pyrolysis process and cost assessment by ML were summarized.
-
Modeling of pollutant adsorption via ML was reviewed.
-
General guidelines of ML for biochar pyrolysis and pollutant adsorption were proposed.
-
Challenges and perspectives of ML for biochar adsorbents were put forward.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Biochar is the product of biomass pyrolysis such as agricultural waste (Yu et al. 2016), sludge (Nair et al. 2020; Krueger et al. 2020; Rathnayake et al. 2022; Wu et al. 2016), animal manure (Ro 2016; Guo et al. 2019) and forest leaves (Kim et al. 2020) under low or no oxygen conditions. The main element of biochar is carbon, but it also contains oxygen, hydrogen, sulfur, nitrogen and other elements (Xie et al. 2022; Pan et al. 2021; Yang et al. 2020). Due to its large specific surface area, uniformly distributed pore size and rich functional groups, biochar can be applied as an adsorbent for the treatment of pollutants in wastewater (Luo et al. 2022; Liu and Chen 2022; Isaeva et al. 2021; Katiyar et al. 2021). Moreover, thanks to its low cost, biochar is gradually becoming a popular adsorbent (Boraah et al. 2022; Alsawy et al. 2022; Tan et al. 2022). Biochar could be applied to remove many kinds of pollutants, such as organic pollutants represented by antibiotics (Masrura et al. 2022), pesticides (An et al. 2021), dyes (Choudhary et al. 2020; Ortiz-Monsalve et al. 2020) and drugs (Jung et al. 2015) and inorganic pollutants represented by arsenic (Cha et al. 2021), cadmium (Tyagi 2022; Liu et al. 2021b), chromium, lead (Din et al. 2021; Lee et al. 2019), copper and nickel (An et al. 2019).
The adsorption performance of biochar depends on two factors, namely, internal factors and external factors (Zhang et al. 2021b; Tan et al. 2020; Wu et al. 2019). Internal factors refer to the physicochemical properties of biochar itself, while external factors refer to the setting of corresponding adsorption parameters. The physicochemical properties of biochar include specific surface area, pore size distribution, the type and quantity of surface functional groups, which depend on the source of raw materials, pyrolysis conditions and pyrolysis method (Wu et al. 2022b; Tomczyk et al. 2020). Specially, the source of raw materials and pyrolysis conditions have a significant impact on the specific surface area and pore structure of biochar (Liu et al. 2022c). In detail, different sources of biochar, such as industrial waste and urban organic waste, have important influences on biochar formation. Among various pyrolysis conditions, pyrolysis temperature is a key parameter for biochar formation. The physicochemical properties of biochar can be improved by modifying the surface of biochar with functional groups (Wang et al. 2022b; Yang et al. 2022b; Zuo et al. 2021). Further, the experimental settings of adsorption process, such as adsorbent dosage (Li et al. 2021a), initial concentration of adsorbate (Dalhat et al. 2021; Li et al. 2020c), adsorption time, stirring speed (Jabar and Odusote 2021; Komnitsas et al. 2017), pH, coexisting ions (Tang et al. 2022; Wang et al. 2020a) and temperature (Zeng et al. 2018), have an important impact on adsorption efficiency. The adsorption performance of biochar is closely related to the synthesis and adsorption process of biochar, but there are many influencing factors. It is obviously unrealistic to enumerate through experimental methods.
The development of computational science, especially machine learning (Onyekwena et al. 2022; Ertugrul 2020; Li et al. 2020a), provides a powerful tool to reveal the complex nonlinear relationship in the process of biochar synthesis (Paula et al. 2022; Narde and Remya 2022; Zhu et al. 2019b) and adsorption (Medeiros et al. 2022; Lakshmi et al. 2021). As a new research paradigm, machine learning has been gradually employed in biochar synthesis and adsorption for more efficient pollutant removal. Selvam et al. applied various machine learning algorithms to reveal the influence and interaction of various factors on biochar yield during microwave pyrolysis (Selvam and Balasubramanian 2022). The results exhibited that microwave power was the most important factor affecting the yield and properties of biochar. Zhu et al. trained the adsorption data of biochar for six kinds of heavy metals (lead, cadmium, nickel, arsenic, copper and zinc) with artificial neural network (ANN) and random forest (RF) algorithm, and found that the characteristics of biochar had the greatest impact on adsorption efficiency (Zhu et al. 2022b). Further, Zhu et al. found that the content of oxygen-containing functional groups on the surface of biochar played an important role in Cr (VI) removal by employing machine learning (Zhu et al. 2019a). In a word, machine learning provides a powerful tool for optimizing biomass synthesis and adsorption processes, thus increasing biochar yield and adsorption performance.
In recent years, a number of machine learning applications have emerged for biochar adsorbents (Fig. 1) (Ascher et al. 2022; Yang et al. 2022a; Zhang et al. 2022e; Jeyasubramanian et al. 2021), but there is no comprehensive and systematic review from the perspective of the whole process regulation of biochar adsorbents. Summarizing machine learning of biochar adsorbents from the whole process regulation in time is beneficial to promote the integration of machine learning and biochar adsorbents. This review article comprehensively and systematically summarized the application of machine learning in biochar adsorbents from the perspective of the whole process regulation for the first time, including synthesis optimization of biochar adsorbents and modeling of adsorption process. The workflow of machine learning and the mainstream machine learning algorithms were introduced. Then, the latest progress of machine learning in the optimization of biochar adsorbent synthesis conditions was summarized, including prediction of biochar yield and physicochemical properties, optimal preparation conditions and economic evaluation. The application of machine learning in pollutant adsorption via biochar was reviewed covering prediction of adsorption efficiency, optimization of experimental parameters and revelation of adsorption behavior. General guidelines for optimizing synthesis process and modeling adsorption process of biochar via machine learning were proposed. Finally, the existing problems and future perspectives of machine learning in the research of biochar adsorbents were put forward. We hope that this review can promote the interpretability of machine learning and biochar adsorbents, and thus light up the industrialization of biochar adsorbents.
a Number of publications and citations of machine learning for synthesis (Left) and pollutant adsorption (Right) of biochar between 2014 and 2021 in Web of Science Core Collection; b The frequency of keywords in machine learning for synthesis (Left) and pollutant adsorption (Right) of biochar; c Some milestone events in machine learning for optimizing pyrolysis process and modeling adsorption process of biochar for pollutant removal (Cao et al. 2016; Chowdhury et al. 2019; Hai et al. 2021; Li et al. 2022a; Li et al. 2022c; Palansooriya et al. 2022; Zhang et al. 2022a)
2 The overview of machine learning
2.1 General concepts of machine learning
Machine learning refers to the prediction or judgment of another part of the data after computer obtains the corresponding knowledge and rules by learning part of the data (Masood and Ahmad 2021; Nandy et al. 2021; Schleder et al. 2019; Da et al. 2022). According to the structure of data, machine learning is divided into supervised, unsupervised, semi-supervised and reinforcement learning (Zhang et al. 2022d; Tran and Ha 2022). The characteristic of supervised learning is that training data has corresponded labels (Lian et al. 2019; Zhang et al. 2018). According to the learning strategy, supervised learning is divided into generative method and discriminant method. Supervised learning is usually used for classification or regression. The dilemma of supervised learning is that it is expensive to acquire training data with target value, or even impossible to obtain training data with corresponding target value (Liu et al. 2022b; Wang et al. 2021). The characteristic of unsupervised learning lies in that there is no corresponding target value for training data (Cheng et al. 2021; Sohail and Arif 2020; Li et al. 2020b). Therefore, unsupervised learning is widely applied in clustering analysis and dimensionality reduction (Mizikovsky et al. 2022; Rovira et al. 2022; Wu et al. 2022a). The challenge of unsupervised learning is that model performance could not be well evaluated. Semi-supervised learning could be applied to these situations with a large amount of unlabeled data and a small amount of labeled data (Song et al. 2022; Ligthart et al. 2021; Tseng et al. 2020; Li and Liang 2019). Reinforcement learning realizes the interaction between model and environment, and can select the next action according to the current state and signal of environment (Padakandla 2021; Yau et al. 2017). The goal of reinforcement learning is to achieve dynamic adjustment of parameters, obtain knowledge from environment, and improve model behavior to achieve the maximum reinforcement signal. To facilitate reader understanding, the general workflow of machine learning is summarized in Fig. 2a. Feature engineering refers to selecting appropriate descriptors as model inputs to accurately predict model outputs (Zhang et al. 2022b, 2022c). The employed descriptors usually include pyrolysis conditions, adsorption conditions, and physicochemical properties of biochar (Fig. 2b). The commonly employed machine learning algorithms for optimizing synthetic conditions and modeling adsorption processes of biochar are presented in Fig. 2c. Further, the application of machine learning in the modeling and optimization of biochar pyrolysis and adsorption processes is shown in Fig. 3.
a General workflow of machine learning for optimizing synthesis and pollutant adsorption processes of biochar; b The variety of descriptors for optimizing biochar pyrolysis (Left) and adsorption (Right) processes; c Mainstream algorithms for optimizing synthesis and adsorption processes of biochar for pollutant removal
2.2 Evaluation metrics for machine learning
For machine learning, error is the difference between the predicted value of the model and the true value of the sample data. According to the difference in sample data, error is divided into training error and test error. Among them, training error is called empirical error and testing error is called generalization error (Tanaka 2015). Training error reflects the learning difficulty of a given problem, and test error reflects the prediction performance of the model for unknown data sets. Over-fitting and under-fitting are common fitting problems for model training (Dahan and Keller 2021; Graham et al. 2020; Yuan et al. 2020). Over-fitting takes the local features of training data set as the whole features, and the reason is that too many parameters of model make the training error small and the test error large (Yildirim and Ozkale 2021; Avrutskiy 2020). When under-fitting occurs, the learning ability of model is weak and the basic features of training data set cannot be learned (Handelman et al. 2019; Ahmed and Isa 2017; Van Calster and Vickers 2015). In general, the relationship between test error and model complexity is parabolic. Cross-validation is commonly employed to evaluate test error (Liu et al. 2022d; Yan et al. 2020; Pardakhti et al. 2017). Generally, the scale of adsorption data set is relatively small. In order to make accurate and reliable predictions, cross-validation is necessary, and the common cross-validation method is k-fold cross-validation (Xia et al. 2020; Yang et al. 2019). In addition, for the regression problem, the evaluation indexes of the model include: coefficient of determination (R2) (Uddin et al. 2022; Suvarna et al. 2022), mean absolute error (MAE) (Zhang et al. 2021a; Lamoureux et al. 2021; Chang and Medford 2021), mean square error (MSE), root mean square error (RMSE) (Wei et al. 2022; Kim et al. 2021) and mean absolute percentage error (MAPE) (Mir et al. 2022; Ke et al. 2021b; Bhagat et al. 2021). For classification problems, the evaluation indicators of the model are accuracy (Liu et al. 2022d), error rate, recall rate (Somarowthu et al. 2011), balanced F score (F1 score) (Avakyan et al. 2022; Findlay et al. 2018), receiver operating characteristic (ROC) curve (Xu et al. 2022; Razavi-Termeh et al. 2021) and the area under ROC curve (AUROC or AUC) (Wan et al. 2022; Ding et al. 2022; Cashman et al. 2017).
3 Synthesis optimization of biochar
The physicochemical properties of biochar depend on synthetic conditions, which play a key role on adsorption performance. However, due to many influencing factors involved, it is expensive and unacceptable to optimize synthetic conditions through trial and error method. Machine learning can quickly and efficiently accelerate experimental screening, so it is widely applied in the modeling and optimization of biochar synthesis. Because there are many products in the low oxygen or anaerobic pyrolysis of biomass (Egbosiuba 2022; Liu et al. 2021a; Thiruvengadam et al. 2021), covering biochar, bio-oil and biogas, accurate prediction of biochar production is the key. Secondly, the specific surface area, pore size distribution and the type and number of surface functional groups of biochar are determined by the composition of raw materials, pyrolysis temperature and pyrolysis mode. Effective optimization of synthetic parameters is the prerequisite for the preparation of biochar with excellent adsorption properties.
The synthesis process of biochar is a high-dimensional, complex and nonlinear system, thus making machine learning applicable. Employing machine learning can quickly determine the core factors of the synthesis process of biochar and provide a useful reference for the preparation of biochar adsorbents. Taking the raw material source, carbon content, oxygen concentration, process temperature and other variables related to synthesis process as the inputs of model training, the mapping relationship between the synthetic conditions and the physicochemical properties of biochar adsorbents can be quickly constructed. In addition, in the industrialization of biochar, life cycle and economy must be considered (Azzi et al. 2022; Zhu et al. 2022c). By employing machine learning, the life cycle and economic costs of different types of biochar can be effectively evaluated. To get an intuitive understanding, successful cases of machine learning applied to optimize the pyrolysis process of biochar are summarized in Additional file 1: Table S1, and a specific case of machine learning used to understand the pyrolysis process of biochar was presented (Fig. 4).
![figure 4](http://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs42773-023-00225-x/MediaObjects/42773_2023_225_Fig4_HTML.png)
(Reproduced with permission from ref Dong et al. 2022). Copyright Elsevier 2022
A specific case of machine learning used to understand the pyrolysis process of biochar
3.1 Prediction of biochar yield
When discussing the synthetic conditions of biochar, some key input variables cover pyrolysis temperature, pyrolysis time, oxygen concentration, carbon content, ratio of carbon to hydrogen, ratio of carbon to nitrogen, and ash content. Compared with the existing models, the advantage of machine learning algorithm model is that it does not exclude non numerical parameters (Shima et al. 2022), and can construct a variety of solutions for input variables. Commonly employed machine learning algorithms include multiple regression, extreme gradient boosting (XGBoost), support vector machine (SVM), RF, and ANN. ANN has been widely applied because of its powerful ability to establish nonlinear mapping in high dimensional space (Harrington et al. 2022). However, RF and SVM are more cost-effective when considering data size, model convergence speed. Modeling and predicting the pyrolysis process of biochar has attracted extensive attention, which promotes the large-scale synthesis of biochar.
Generally, the performance of ANN algorithm is better than that of simple algorithm, such as multiple linear regression (MLR) and multiple nonlinear regression (MnLR). For example, multiple-input single output-ANN (MISO-ANN) and MLR were employed simultaneously to predict the mass yield of biochar, and the models were evaluated by employing R2, MAE and mean deviation error (MBE) (Abdulsalam et al. 2020). The performance of MISO-ANN model was satisfactory, while MLR model exhibited unacceptable prediction ability (Additional file 1: Fig. S1). MnLR and ANN were applied to predict the yield of biochar (Li et al. 2021b). The R2 of the optimized MnLR model was only 0.5579, while the optimized ANN model possessed higher accuracy, with R2 reaching 0.785. In order to accurately predict biochar yield, the optimal feedforward ANN (FANN) model was developed (Tee et al. 2022). The FANN model was trained employing data mined from literature. The correlation coefficient (R) of the model was greater than 0.96, R2 was greater than 0.92, and MAE, MAPE and RMSE were low, suggesting that the FANN model possessed excellent prediction performance. Merdun et al. employed two different ANN algorithms, feedforward network (FFN) and cascaded feedforward network (CFN), to predict the yield of biochar. Generally speaking, FFN and CFN exhibited similar performance, and both possessed excellent prediction ability (Merdun and Sezgin 2018). A response surface approach (RSM) and an ANN were employed to model the biochar preparation process (Gupta et al. 2022). For ANN model, R2 was close to 1 and MSE value was low, indicating that ANN achieved accurate prediction of biochar production. Interestingly, ANN was superior to RSM model in predicting biochar yield.
Because ANN algorithm is complex and its interpretability is not high, some relatively simple algorithms with high interpretability are applied to predict biochar yield. Li et al. discussed the co-pyrolysis of biochar and bio-oil via machine learning (Alabdrabalnabi et al. 2022). The hyperparameters of XGBoost were determined and optimized by grid search and cross-validation. It was found that XGBoost could accurately predict biochar yield, with RMSE of 1.77 and R2 of 0.96. Furthermore, RF algorithm can also be employed to predict the yield of biochar (Zhou et al. 2022). Zhu et al. proved that RF can respond to biomass characteristics and pyrolysis conditions, and then accurately predict biochar production (Zhu et al. 2019b).
Some relatively simple machine learning algorithms sometimes even show better model performance than ANN algorithm. Cao et al. compared the traditional ANN model with the least squares support vector machine (LS-SVM) model in order to predict the biochar generated from cow manure pyrolysis, employing laboratory-scale fixed bed adsorption data as samples (Cao et al. 2016). ANN and LS-SVM both could be applied to predict biochar production, but the prediction performance and robustness of LS-SVM model were superior to those of ANN model. However, LS-SVM algorithm may fall into local optimization. In order to overcome these shortcomings, an adaptive neuro fuzzy inference system (ANFIS-SSO) employing social-spider optimization algorithm was designed to predict biochar production (Ewees et al. 2017). Compared with classical ANFIS, particle swarm optimization (PSO), artificial bee colony and LS-SVM, ANFIS-SSO model possessed better prediction performance. The hybrid model integrating ANFIS and Grey Wolf optimization algorithm can also avoid falling into local optimum (Ewees and Elaziz 2018).
Ensemble learning, especially when combined with optimization algorithms, may possess significantly better model accuracy than other algorithms including ANN. Haq et al. developed a series of machine learning models combining genetic algorithm (GA) and PSO to evaluate biochar yield (Haq et al. 2022). Specifically, GA and PSO algorithms were coupled with ensemble learning, decision tree, SVM and ANN respectively. The results exhibited that the integrated learning tree (ELT-PSO) model was superior to all other models, with R2 of 0.99 and RMSE of 2.33, achieving the successful prediction of biochar yield. A user-friendly software based on ELT-PSO model was developed, which is beneficial to avoid a lot of expensive experiments. The integration of Taguchi method into ANN can maximize the prediction of biochar yield in pyrolysis reaction (Albalasmeh et al. 2020). Taguchi method was applied to set multi-factor and horizontal experimental parameters as input variables, and the improved ANN exhibited satisfactory performance in predicting the maximum yield of biochar, indicating that the ANN combined with fast propagation algorithm is an appropriate method to predict the yield of biochar. It should be pointed out that metaheuristic algorithm was reported to be integrated into ANN models, significantly improving prediction accuracy (Khan et al. 2022). Further, combining the ANFIS-SSO and PSO algorithm, the prediction performance of model could be improved (Abd El Aziz et al. 2017). The optimized ANFIS-SSO took pyrolysis temperature, heating rate, holding time, water content and sample quality as input parameters, and achieved satisfactory prediction performance of biochar quality and yield.
3.2 Intelligent design of biochar surface structures
Machine learning can not only predict the yield of biochar, but also be used for intelligent design of the surface properties of biochar, such as element composition, specific surface area, surface functional groups, etc. A 2 × 5 × 1 ANN was developed to predict the pyrolysis behavior of groundnut shell at heating rates of 5, 10 and 20 °C min−1 (Hai et al. 2021). The values predicted by ANN were highly consistent with the experimental values (R2 > 0.999), accurately fitting the pyrolysis behavior of GNS. Tee et al. developed a FANN model to predict the BET surface area of biochar (Tee et al. 2022). It was found that the relative importance of variables affecting BET specific surface area was the ratio of C and H, ash content, residence time, heating rate, pyrolysis temperature, O, fixed carbon, volatile matter, moisture content and N. It could be stated that ultimate analysis had the most significant impact on BET specific surface area, followed by pyrolysis conditions and proximate analysis. The ultimate analysis significantly affected the structure of biochar. For example, the removal of hydrogen and oxygen at a pyrolysis temperature of > 500 °C promoted the aromatization and carbonization of the biochar structure (Kwiatkowski and Kalderis 2020). Pyrolysis conditions possessed important effects on BET specific surface area. For example, the rise of pyrolysis temperature led to the decomposition of organic substances in biomass raw materials, thus effectively removing volatile substances, producing more pores and increasing specific surface area. The heating rate also affected the BET specific surface area of biochar (This expression is a bit puzzling here.). The specific surface area first increased and then decreased with the increase of residence time. However, it should be noted that ash content may cause the reduction of BET specific surface area, because the tiny ash content may occupy the pores and voids on the surface of biochar. Liao et al. developed a multi-layer FANN model to predict the surface area of biochar (Liao et al. 2019). The trained ANN showed excellent model accuracy (R2 > 0.9) and was highly consistent with independent experimental data. Contribution analysis was carried out to understand the effects of different technological factors on the surface area of biochar. Liew et al. constructed a FANN with heating rate, residence time, pyrolysis temperature, ultimate and proximate analysis as descriptors. The FANN with backpropagation algorithm was trained to simulate the pyrolysis process of biomass and successfully predict the element ratio and surface area of various biochar types (Liew et al. 2022). In addition to ANN algorithm that could be used for predicting the physicochemical properties of biochar, some simple algorithms are also applicable. For example, Wani et al. successfully employed a k-nearest neighbor (kNN) algorithm to predict the surface area, hydrogen-carbon ratio, and oxygen-carbon ratio of different types of biochar from volatile matter content, pyrolysis conditions, and pH (Wani et al. 2020). It was found that BET surface area increased with increasing pH or temperature, but decreased with increasing volatile substance content. Volatile substance content was the most important parameter affecting hydrogen-carbon ratio, and adsorption was the most important parameter affecting oxygen-carbon ratio.
3.3 Optimization of process parameters
In the preparation of biochar, the selection of raw materials and process parameters is the core step. In particular, to obtain biochar with high adsorption performance, the large specific surface area and abundant surface functional groups are essential. The key factors affecting biomass pyrolysis to biochar include the elemental composition, structural composition, particle size and pyrolysis conditions of biomass (Lakshmi et al. 2021). The elemental composition includes the elemental proportion of C, N, O and H. The structural composition refers to the content of moisture and ash, and the relative content of lignin, hemicellulose and cellulose. The pyrolysis conditions are determined by pyrolysis rate, heating temperature and residence time. Machine learning algorithm could quickly determine the importance of raw materials and process parameters to the yield and properties of biochar, which is conducive to the purposeful production of high-yield biochar with excellent adsorption performance. Linear regression, RF, SVM, ANN and improved ANN are the mainstream algorithms employed to analyze the preparation process of biochar, and provide meaningful references for the preparation of biochar.
MLR model exhibited that volatile substances, ash content and temperature were significant factors affecting the yield of biochar (Narde and Remya 2022). A machine learning model was established by polynomial regression to predict the pyrolysis product yield of microwave-assisted sawdust (Potnuri et al. 2022). It was found that increasing pyrolysis temperature decreased the production of biogas and bio-oil, and significantly increased the production of biochar. However, the scale of the problem that can be handled by linear regression is small, and the information obtained is limited. Some advanced machine learning algorithms were employed to predict the yield and characteristics of biochar. Through XGBoost, it was found that pyrolysis temperature, N/C, H/C, reaction time and ash content were the core parameters that determined the yield of algae biochar (Pathy et al. 2020). Pearson correlation coefficient matrix covered the correlation between input variables and biochar production. The analysis of feature importance revealed that pyrolysis temperature was the most important factor determining the yield of biochar. Li et al. constructed a machine learning model to predict the physicochemical properties of biochar employing pyrolysis conditions as input parameters (Li et al. 2019b). The machine learning model was expected to quantitatively evaluate the effects of raw material type and pyrolysis temperature on the yield and properties of biochar. The results showed that although the yield, specific surface area, cation exchange capacity, pH value, volatile and ash content and elemental composition of biochar were limited by many factors, the characteristics of biochar depended largely on the type of raw materials and pyrolysis temperature.
RF algorithm is proved to be an ideal method for evaluating biomass pyrolysis products. Evaluating the yield of pyrolysis products by RF algorithm is beneficial to optimize pyrolysis process. Dong et al. employed RF algorithm to predict the yield of biochar, bio-oil and biogas under co-pyrolysis conditions (Dong et al. 2022). It was found that the most critical factors to predict the yield of pyrolysis products were moisture content, carbon content and final heating temperature, and the characteristics of biomass were more central than pyrolysis conditions. Carbon content had a negative effect on biochar yield, and a positive effect on bio-oil yield. The final temperature possessed a negative effect on biochar yield, and a positive effect on pyrolysis gas yield. RF algorithm also proved that pyrolysis conditions determined the yield of biochar, and the relative contribution was higher than the biomass characteristics represented by element composition and structure information (Zhu et al. 2019b).
ANN could be applied to evaluate the importance of operating parameters in the preparation of biochar, so as to synthesize biochar adsorbents with ideal adsorption properties (Abdulsalam et al. 2020). Model results exhibited that the most important factors affecting the physical and chemical properties of biochar were reaction temperature and retention time. When the temperature was set to 180 °C and the retention time was determined to be 11.5 h, the adsorption capacity of biochar for Pd2+ reached 89.52 mg g−1. Also, the best operating parameters for pyrolysis of banana peel into biochar were determined by training ANN (Bong et al. 2022). ANN was applied to construct a comprehensive data-driven model based on pyrolysis conditions and biomass raw material composition to predict the preparation process of biochar (Li et al. 2022b). It was found that pyrolysis temperature, nitrogen content and ash content were the core parameters affecting the yield and composition of biochar (Additional file 1: Fig. S2). In addition, pyrolysis conditions also significantly affected the BET surface area of biochar. By testing ANN with 48 different network structures and combining sensitivity analysis, it was found that gas flow rate was the most critical factor in the generation of biochar, followed by holding time and pyrolysis temperature (Altikat and Alma 2022a). The deep artificial neural network (DNN) model showed that the effect of holding time on the yield of biochar and bio-oil was higher than other parameters under co-pyrolysis conditions (Altikat and Alma 2022b). It should be pointed out that ANN was proved to be able to predict the pyrolysis kinetics of biochar (Mayol et al. 2018).
3.4 Evaluation of economic cost
Although the pyrolysis of biochar has developed into a relatively mature commercial technology (Li et al. 2022a; Shahbaz et al. 2021; Masek et al. 2013), the large-scale production of biochar involves many factors, covering the source of raw materials and pyrolysis conditions, so it is difficult to conduct a comprehensive economic and environmental assessment. In the economic and environmental evaluation of biochar synthesis process, there are two major limitations: (1) Under the premise of a single type of biomass, evaluate the economic and environmental effects of biochar under different pyrolysis conditions; (2) Under the same pyrolysis condition, carry out economic and environmental analysis of pyrolysis for different types of biomass into biochar. By employing machine learning, the economic, energy and environmental effects of various raw materials under different pyrolysis conditions could be evaluated. Employing the existing experimental data as the training set, the RF model was employed to predict the biochar yield and characteristics of different raw materials under various pyrolysis conditions (Cheng et al. 2020). Further, integrating RF algorithm, economic analysis and life cycle assessment, the net global warming potential (GWP), energy return on investment (EROI) and minimum product selling price (MPSP) in the process of biochar preparation were obtained (Additional file 1: Fig. S3). The results exhibited that the slow pyrolysis of wood waste and crop residues was an economic and commercial technology. The fluctuation range of MPSP was 774–1256 US dollars ton−1, which is determined by pyrolysis temperature and the type of raw materials. It is necessary to balance the economic effect and environmental impact of biomass. Pyrolysis sludge at low temperature corresponded to the best MPSP, while pyrolysis lignocellulose at high temperature covered the best overall energy and environmental performance. The potential income of biochar from existing pyrolysis plants in the world was evaluated through life cycle analysis (Han et al. 2021). The results exhibited that the net income of the existing 144 pyrolysis plants increases with the increase of plant capacity and biochar price. In Europe and the United States, the potential income from the construction of new pyrolysis plants is optimistic. Inconvenient transportation, poor land and low availability of crop reduce reduces the potential income brought by biochar.
4 Modeling of pollutant adsorption with biochar
Biochar is an effective material to remove heavy metals and organic pollutants from wastewater. The characteristics and operating conditions of biochar could affect the effectiveness of treatment process. The characteristics of biochar include specific surface area, surface functional groups, pH and element composition of biochar, etc. The operating conditions cover the initial concentration of metal, contact time, temperature and the presence of competitive ions. The adsorption process of pollutants on biochar is a highly complex nonlinear system related to the pyrolysis conditions, physicochemical properties and adsorption conditions of biochar, which naturally makes machine learning applicable. Machine learning has been widely reported to model pollutant adsorption processes on biochar, involving predicting adsorption efficiency, optimizing process parameters, and revealing adsorption mechanism. Furthermore, machine learning is more effective in modeling the adsorption process of pollutants on biochar, compared with the existing kinetic equation and isotherm equation (Reddy et al. 2021). In order to facilitate readers to better deploy machine learning to biochar for modeling and optimizing the adsorption process of pollutants, successful cases of machine learning are systematically summarized in Additional file 1: Table S2. A special case that clearly describes the design of biochar for pollutant adsorption via machine learning was presented (Additional file 1: Fig. S4).
4.1 Prediction of adsorption efficiency
Existing models, such as MLR, can only assess adsorption processes at specific equilibrium concentrations or under certain adsorption isotherm models. Compared with MLR, advanced machine learning algorithms possess significantly excellent prediction accuracy and robustness, among which ANN algorithm is widely applied (Chowdhury et al. 2019; Praveen et al. 2021; Yadav and Jagadevan 2021; Kang et al. 2022). For example, as a black box model with strong fitting, ANN can predict the equilibrium concentration of organic compounds on the surface of biochar (Zhang et al. 2020). Hanandeh et al. employed ANN combining several radial basis functions and gradient enhancement algorithms to model multiple input and multiple output processes, and achieved highly accurate prediction (R2 > 0.99) (El Hanandeh et al. 2021). ANN provides the best match with the experimental data, allowing accurate prediction of how the adsorption system responds to changes in operation. Yadav et al. performed an ANN modeling to predict the adsorption removal of copper ions (Yadav et al. 2021). Under optimized conditions, MAE, RMSE and R2 provided by ANN model were 2.63, 4.60 and 0.91, respectively, which well evaluated the adsorption process of copper ions on biochar surface.
However, the dilemma faced by ANN algorithm is that the size of adsorption data set is generally small, resulting in over-fitting. Moreover, the availability of most ANN models is low. In order to overcome the limitations faced by ANN, some non-ANN algorithms have been applied to predict the adsorption capacity and adsorption efficiency of biochar and the equilibrium concentration of pollutants (Li et al. 2019a). RF, generalized linear models and SVM were proved to be able to accurately predict the performance of vertical flow constructed wetlands using biochar to remove organic wastes, and the R2 and the RMSE of whole fitting data achieved 74.0% and 5.0 mg L−1, 80.0% and 0.3 mg L−1, 90.1% and 2.9 mg L−1, and 48.5% and 0.5 mg L−1 for BOD5_VF1, NH4- N_VF1, BOD5_VF2, and NH4-N_VF2, respectively (Nguyen et al. 2021). With pH, amount of adsorbent, contact time and initial lead concentration as inputs and adsorption capacity as outputs, it was found that SVM could accurately evaluate the adsorption process of lead ions on biochar (Talebkeikhah et al. 2020). Yang et al. developed a data-driven machine learning tool to successfully predict the maximum adsorption capacity (Qm) of 19 pharmaceutical compounds on biochar (Yang et al. 2022a). It was found that k-nearest neighbor (kNN) was the most ideal algorithm, with RMSE of 23.48, followed by RF and cubist, accompanied by RMSE of 26.91 and 29.56, respectively (Additional file 1: Fig. S5).
Some non-ANN algorithms even exhibit better model accuracy than ANN. Based on 353 adsorption experimental data in literature, the ability of RF and ANN to predict the adsorption of six heavy metals (lead, cadmium, nickel, arsenic, copper and zinc) on biochar was compared (Zhu et al. 2019a). Compared with ANN model (R2 = 0.948), RF model possessed better model prediction performance (R2 = 0.973). Further, the generalization ability of RF model was better than that of ANN model. It should be emphasized that RF possesses a higher tolerance for data loss than ANN (Sun et al. 2022). Kriging and multi parameter linear free energy relationship (LFER) models were employed to predict the adsorption capacity of organic pollutants on biochar (Zhao et al. 2022). The prediction accuracy of Kriging-LFER model (R2 up to 0.940 and 0.976, respectively) was 7% and 9.6% higher than that of ANN-LFER model (R2 up to 0.870 and 0.880, respectively).
The advantage of ANFIS lies in the combination of neural network and fuzzy logic (Adedeji et al. 2020; Karaboga and Kaya 2018). In theory, ANFIS may possess more advantages than ANN in predicting the adsorption of pollutants on biochar, and thus ANFIS is widely applied (Ohale et al. 2022; Bisaria et al. 2022). For example, the adsorption efficiency data of Cu(II) ions on biochar were constructed through laboratory batch studies, and the model efficiencies of ANFIS, ANN and MLR were compared (Wong et al. 2020). ANFIS model exhibited the best model performance, with the accuracy of 90.24% and 87.06% for training and testing data sets, respectively. ANN model exhibited a weaker predictive performance than ANFIS model. Due to the complex relationship between variables, the MLR model was not applicable. ANFIS, ANN, GA and RSM were employed to evaluate the simultaneous adsorption and removal of chemical oxygen demand (COD) and total organic carbon (TOC) on tea waste biochar (TWBC) (Alhothali et al. 2022). The applied model exhibited the following order of performance: ANFIS > ANN-GA > RSM-GA > RSM. ANFIS also showed better model performance than RSM and ANN for decolorization of activated yellow 81 (RY81) in aqueous solution via biochar (Ravindiran et al. 2022).
The integration of different machine learning algorithms can sometimes overcome the problem of poor prediction accuracy of a single algorithm (Talebkeikhah et al. 2020; Yu et al. 2021). The queue search algorithm (QSA) based on human activities was employed to optimize the parameters of ANN model, which has significantly improved the prediction ability of ANN (Additional file 1: Fig. S6) (Zheng and Nguyen 2022). SVM, RF, ANN, M5Tree and Gaussian process algorithms were employed as key algorithms, and each model was packaged to generate a new integrated model; thus 20 intelligent models were developed and evaluated (Ke et al. 2021a). Among them, SVM-ANN model possessed the most accurate and reliable prediction performance. The above shows that the integrating multiple models is conducive to improving the prediction accuracy of a single model.
Further, the type of descriptor proved to significantly affect model accuracy. Zhu et al. applied RF algorithm to model the adsorption process of pharmaceuticals and personal care products (PPCP) on biochar (Zhu et al. 2022a). Employed descriptors included PPCP and biochar properties and adsorption conditions. It was found that the RMSE was reduced by 18–24% in the model that employed biochar properties such as surface functionality and hierarchical porous structure as descriptors. Specifically, the use of functional group information, surface element composition and specific structural properties collected from XPS data as descriptors could significantly improve the accuracy of the model.
It should be pointed out that although the common descriptors applied to predict the adsorption performance of biochar are the physicochemical characteristics of biochar and the operating conditions of adsorption process, however, the experimental parameters of the pyrolysis process of biochar may also be employed to predict the adsorption performance of biochar, because the pyrolysis conditions of biochar are directly related to the characteristics of biochar. For example, Albalasmeh et al. successfully predicted the adsorption capacity of biochar for MB using the operating conditions of biochar pyrolysis (particle size, parent biomass type, pyrolysis temperature) as descriptors and the adsorption capacity of methylene blue (MB) as evaluation indicators (Albalasmeh et al. 2020).
4.2 Optimization of experimental conditions
Establishing the nonlinear relationship between operating conditions and the adsorption performance of biochar for pollutant removalgreatly facilitates the design of adsorption devices with excellent performance, such as adsorption column and adsorption bed. Thanks to the economy, efficiency and strong prediction ability of machine learning, it is possible to optimize the adsorption process of pollutants on biochar. Common algorithms applied to optimize operating parameters for improving the adsorption performance of biochar include RF, SVM, ANN and ANFIS.
Linear regression is a simple machine learning model, which can be applied to optimize the adsorption process of pollutants on biochar. Nguyen et al. employed linear regression to optimize the adsorption of 19 drugs on biochar (Nguyen et al. 2022). It was found that the equilibrium adsorption capacity of biochar was strongly affected by the characteristics of biochar (including pore volume, specific surface area and pore structure), but not by the characteristics of pollutants and experimental conditions except the initial concentration of drugs.
Due to the inherent limitations of linear regression, such as the low robustness of the model, some advanced machine learning algorithms with high explanatory power are employed to optimize the adsorption process of pollutants on biochar, such as RF, SVM and Kernel extreme learning machine (KELM). For example, RF was used to analyze 559 separate Langmuir adsorption isotherm data covering the adsorption capacity and affinity of biochar for 17 different metals, revealing the effectiveness of several parameters in the preparation process of biochar for the adsorption of heavy metals (Additional file 1: Fig. S7a and b) (Thomas et al. 2020). It was found that the key factor affecting the adsorption capacity and affinity of biochar for metal pollutants was the raw materials used (Additional file 1: Fig. S7c and d). Nutrient intensive raw materials (animal biological waste, sludge and feces) were considered to be the best way to prepare biochar. Beigzadeh et al. believe that RF model is so accurate because it can capture the nonlinear relationship between the input data and its related removal ability (Beigzadeh et al. 2020). In addition, Palansooriya et al. applied RF and SVM to optimize the immobilization efficiency of heavy metals in biochar modified soil (Additional file 1: Fig. S8) (Palansooriya et al. 2022). It was found that the most significant factors affecting the immobilization of heavy metals were the nitrogen content in biochar (0.3–25.9%) and the application rate of biochar. Furthermore, the relative importance order of selected influencing factors was: biochar property > experimental conditions > soil property > heavy metal property. Zhao et al. proposed the KELM and Kriging model for the adsorption of heavy metals on biochar (Zhao et al. 2021). It was found that adsorption efficiency was strongly related to pH and temperature.
ANN is widely applied to optimize the process parameters of pollutant adsorption on biochar (Salawu et al. 2022; Mondal et al. 2016, 2017; Mukherjee et al. 2022). For example, Dalhat et al. employed a FANN coupled with nonlinear regression generalized decay function (GEDF) to optimize the adsorption process of o-cresol and phenol on a fixed bed of biochar (Dalhat et al. 2021). The ANN model combined with contour method, Garson algorithm and performance decomposition was applied to determine the relative importance of operating variables. The operating variables follow the sequence: initial concentration > flow rate > time > bed depth or activated date palm biochar (DPBC) quality. Specifically, increasing the flow rate resulted in a linear or even exponential increase in the concentration of adsorbate effluents. The higher DPBC bed depth caused a slight linear decrease in the concentration of adsorbate effluents. When the initial concentration was 100 mg L−1, the bed depth was 13.3 cm, and the flow rate was 5 mL min−1, the bed capacity reached the maximum. Further, coupling ANN and other algorithms, such as PSO, can further improve the applicability of ANN for optimizing adsorption conditions (Wang et al. 2022a).
It should be pointed out that there are some disputes about the importance of the chemical and structural characteristics of biochar for the pollutant adsorption performance of biochar. The adsorption of uranium by biochar was optimized, and the results revealed that compared with the chemical characteristics of biochar, the structural characteristics of biochar, such as specific surface area, played a more critical role in uranium adsorption (Da et al. 2022). Furthermore, when the average pore diameter of the adsorbent was 2.5–32.5 times of the maximum diameter of PPCP molecule, the adsorption of PPCP was obviously promoted (Zhu et al. 2022a). On the contrary, Zhu et al. optimized the adsorption of six heavy metals on biochar based on the model output of metal types, environmental conditions (such as temperature and pH), biochar characteristics and the initial concentrations of heavy metals and biochar (Sun et al. 2022). The characteristics of biochar were considered to be the most important factors for adsorption efficiency, and the relative importance of the cation exchange capacity (CEC) and pHH2O of biochar even accounted for 66% of the characteristics of biochar while that of the surface area of biochar was only 2%. The reasons for these inconsistencies need to be further clarified to obtain high adsorption capacity and fast adsorption rate. In general, compared with pollutant concentration and other adsorption parameters, the adsorption capacity of biochar is strongly controlled by the physicochemical properties of biochar, including surface properties, pore structure, pore volume and specific surface area. Furthermore, the model built by detailed biochar properties (such as surface functionality and graded porous structure) was more accurate than that built by general information (such as volume element composition and total pore volume) (i.e., RMSE was reduced by 18–4%) (Zhu et al. 2022a). In the prediction of adsorption capacity, the order of relative importance of surface carbon functional groups was C–O bond > C–O bond > nonpolar carbon. This indicates that the surface properties of biochar may be crucial in the adsorption capacity.
4.3 Revelation of adsorption mechanism
The adsorptive removal of pollutants in wastewater on the surface of biochar follows many mechanisms, such as ion exchange, precipitation, cation-π interaction, complexation and electrostatic interaction, which is a highly complex system (Goswami et al. 2022; Kabir et al. 2022). As a new research method, machine learning is a powerful tool to reveal the adsorption mechanism of biochar for pollutant removal. Liu et al. systematically revealed the adsorption mechanism of antibiotics on biochar (Liu et al. 2022a). It was found that increasing the concentration of antibiotics was conducive to enhancing the adsorption of antibiotics on biochar. The higher the initial concentration of antibiotics, the stronger the adsorption capacity, which may be due to the traction effect caused by concentration gradient. In addition, the larger specific surface area provided more adsorption sites for antibiotics. When specific surface area was less than 300 m2 g−1, increasing specific surface area meant promoting adsorption. When specific surface area exceeded 1500 m2 g−1, it exhibited an inhibition trend, which may be due to the fact that the increase of specific surface area hindered other physicochemical properties of biochar, thereby reducing the overall adsorption capacity of biochar to antibiotics. Furthermore, the pH of the solution (pH solution) affected adsorption process by the surface charges of biochar and antibiotics. When the pH of the solution was lower than zero charge point, the surface of biochar was negatively charged. With the increase of pH, the dependence of adsorption on solution pH first increased, then became constant, and finally decreased, due to the electrostatic interaction between biochar and antibiotics. The influence of EtOH on As(III) adsorption mechanism was quantitatively elucidated by machine learning (Zafar et al. 2017). EtOH (concentration 40 μg L−1) could enhance the adsorption of As(III) on biochar through ligand–metal binding complexation mechanism. When the pH increased from 2 to 7, it was observed that the adsorption capacity of As(III) increased first and then decreased, due to the electrostatic repulsion between the main univalent non-ionic species of As(II) and the surface negative charge of biochar. The partial dependence graph of Cr(VI) removal rate on the physicochemical properties of biochar was applied to reveal the adsorption mechanism of Cr(VI) (Additional file 1: Fig. S9) (Zhu et al. 2022b). It was found that the adsorption process of Cr(VI) on iron modified biochar composite was divided into three stages (Additional file 1: Fig. S9b). In the first stage, the Cr(VI) removal capacity of iron modified biochar composite reached a local peak, which may be attributed to the carbon defect of electron donor on the condensed polyaryl carbon matrix. In the second stage, Cr(VI) could be effectively adsorbed through direct complexation, electron donor and indirect electron mediation. In the third stage, increasing the O/C ratio reduced the adsorption capacity of iron modified biochar composite, which may be attributed to the poor structure of biochar and the active adsorption sites on the surface of biochar that were difficult to access.
Generally, there is a linear dependence between reaction barrier and adsorption energy, that is, the Bronsted Evans Polanyi (BEP) relationship (Cao 2022). On the one hand, reaction barrier can be applied as an evaluation index of adsorption energy. On the other hand, lowering reaction barrier may be conducive to the weak adsorption of pollutants, which facilitates the subsequent disposal of pollutants. However, it should be pointed out that there is a lack of deployment of machine learning in predicting the adsorption energy of biochar for different pollutants and the related scaling relationship, which needs to be strengthened for a more comprehensive picture of pollutant adsorption on biochar.
The adsorption mechanism can be intuitively understood by giving the distribution image of pollutants in biochar particles. A 3D in situ visualization method was established to qualitatively and semi quantitatively characterize Pb (II) adsorption distribution in biochar particles (Zhang et al. 2022a). Specifically, the biochar and the biochar adsorbed Pb (II) were imaged by X-ray micro CT (Additional file 1: Fig. S10a). Then, K-means clustering algorithm was employed to segment sample images, and optimal segmentation thresholds were set as 6000 HU, 7,000 HU and 1,300 HU respectively. Finally, rendered images of Pb(II) adsorption distribution in biochar were presented (Additional file 1: Fig. S10b). This in-situ visualization method provides a new method for revealing the adsorption mechanism of pollutants on biochar.
5 Statistical analysis and general guidance for biochar
In order to facilitate the deployment of machine learning to biochar, the application of machine learning in biochar pyrolysis and adsorption for pollutant removal was comprehensively analyzed (Fig. 5). The adsorption data set of biochar for pollutant removal is generally constructed by a top-down method, that is, obtaining first-hand data through batch experiments or extracting published experimental data from literature through data mining technology. The bottom-up method based on (first principal calculation ? or first-principles calculation ?) or molecular dynamics simulation is hardly employed to build the adsorption data set of biochar for pollutant removal, which may be due to the difficulty in establishing a true and accurate calculation model of biochar. In the process of biochar pyrolysis, the top five machine learning algorithms employed frequently include ANN, RF, SVM, XGBoost, and MLR (Fig. 5a). For the adsorption of pollutants on biochar, the situation is somewhat different, and the top five machine learning algorithms applied include ANN, RF, SVM, ANFIS, and GA (Fig. 5b). R2 and MSE are the most frequent model evaluation indexes in biochar pyrolysis and pollutant adsorption (Fig. 5c and d). For the adsorption of pollutants by biochar, heavy metals, PPCPs and dyes are the most concerned types of pollutants, accounting for 73%, 13.5% and 6.8%, respectively (Fig. 5e). Further, according to the frequency of occurrence, the heavy metals employed include Pb(II), Cu(II), As(III), Cd(II), and Zn(II) (Fig. 5f). For biochar pyrolysis, the commonly employed input variables include pyrolysis parameters, source and physicochemical properties of biochar. The commonly applied output variable is the yield of biochar. For the adsorption of pollutants on biochar, the commonly employed input variables are the physicochemical characteristics of biochar, the valence state, molecular weight, functional group of pollutants, initial concentration, temperature, stirring time, stirring frequency, adsorbent dosage and pH. The pyrolysis parameters of biochar are sometimes employed as input variables for modeling adsorption process. The commonly applied output variables include adsorption capacity, adsorption efficiency and removal efficiency. At present, adsorption energy or Gibbs free energy has not been used as an output variable .
Comprehensive statistics on the applications of machine learning in optimizing biochar pyrolysis and modeling adsorption process of biochar for pollutant removal. a The frequency of algorithms employed in machine learning for biochar pyrolysis; b The frequency of algorithms employed in machine learning for pollutant removal via biochar; c The frequency of model evaluation indexes employed in machine learning for biochar pyrolysis; d The frequency of model evaluation indexes employed in machine learning for pollutant removal via biochar; e The ratio of pollutants in machine learning for adsorption via biochar; f The frequency of heavy metal elements in machine learning for adsorption via biochar
For the modeling of biochar synthesis and adsorption, when focusing on the optimization of process parameters, the highly interpretable algorithms such as kNN, SVM and RF can be employed for prediction and evaluation. The advantages of the kNN algorithm lie in its simplicity, convenience and excellent performance. The principle of SVM algorithm is to minimize structural risk, which is manifested in the minimization of empirical risk and confidence interval, making SVM friendly to small sample data sets. As an integrated algorithm based on decision tree, RF is particularly suitable for analyzing feature importance, even for high-dimensional data sets. However, ANN algorithm may not be suitable for optimizing experimental conditions, because of its inherent shortcomings, namely, it is unable to provide analytical formulas, and the weight value obtained is not a regression coefficient, so it cannot be employed to analyze causality. However, ANN possesses the ability to express any nonlinear relationship, so it is applicable when the yield or adsorption efficiency prediction is concerned. However, attention should be paid to the size of the dataset when selecting ANN. In general, ANN is prone to over-fitting on small-scale data sets, so it is not applicable. In addition, thanks to fuzzy logic, ANFIS may show stronger model accuracy than ANN.
Generally, the optimal machine learning model often shows significantly superior prediction accuracy compared with the original machine learning model. Hyperparameter optimization is an essential step to obtain the optimal machine learning model. Common hyperparameter optimization methods include grid search, random search and Bayesian optimization (Bischl et al. 2023). It should be pointed out that for small data sets, automatic parameter adjustment may be a convenient way, but it depends on the experience of the tester. Further, single machine learning model may not be applicable, especially for some sparse data sets. In order to solve the dilemma faced by single machine learning model, some machine learning technologies have been developed. An integrated learning method is a meta-algorithm that combines multiple machine learning technologies into one machine learning model to achieve bagging, boosting or stacking (Deng et al. 2020; Ye et al. 2021). The prediction performance of integrated learning model is better than that of single machine learning model. In addition, modern machine learning models are extremely complex, and it is appropriate to optimize models by iteration. Active learning tries to solve the labeling bottleneck of samples, and actively selects the most valuable unlabeled samples for labeling first, so as to achieve excellent model accuracy with as few labeled samples as possible (Faulon and Faure 2021; Sayin et al. 2021). Active learning could overcome the shortcomings of traditional machine learning to some extent. In short, the optimal machine learning model can be obtained through hyperparameter optimization, integrated learning and active learning.
In order to facilitate the rapid deployment of machine learning to biochar, the scope, benefits, and limitations of different machine learning models and descriptors are presented in Additional file 1: Tables S3 and S4. An application guide of machine learning for biochar synthesis optimization and adsorption modeling was proposed (Fig. 6). Before model training, it is necessary to build a data set of pyrolysis process and adsorption process. It is recommended to employ batch experiments or mine useful information in published literature to build data sets for model training. For the pyrolysis process of biochar, it is recommended to use pyrolysis temperature, pyrolysis time, source of biochar, carbon content in feed, oxygen concentration, carbon to hydrogen ratio, carbon to nitrogen ratio and ash content as descriptors. It is recommended to employ biochar yield as the output variable. It should be pointed out that when co-pyrolysis is considered, the competitive reaction of biochar, bio-oil and biogas may be involved. At this time, appropriate descriptors need to be carefully selected. When biochar is used for pollutant adsorption, it is recommended to employ the physicochemical properties of biochar, the electronic properties and structural properties of pollutants, operating parameters (adsorption time, temperature, stirring rate, pH, adsorbent dosage, pollutant concentration and coexisting competitive ions) as descriptors. Equilibrium concentration, adsorption capacity, adsorption efficiency or removal efficiency are recommended as output variables. Because there is a significant causal relationship between these recommended descriptors and biochar yield or adsorption performance, and clear model expressions can be obtained to improve the interpretability of model. Furthermore, if the data set is successfully constructed by bottom-up method, it is beneficial to understand the pyrolysis and adsorption processes of biochar at the electronic and atomic levels. In particular, the adsorption data set constructed by the bottom-up approach may facilitate the understanding of the nonlinear dependence of adsorption process. For the adsorption data set constructed by the bottom-up method, the electronic properties represented by p-band center, d-band center, orbital electron number, first ionization energy and electronegativity are recommended as descriptors. The recommended output variable is adsorption energy or Gibbs free energy. In order to select an appropriate training model, it is necessary to integrate various factors, including the nature of the problem concerned (regression, classification or clustering), the computing power of existing resources, the method to build a dataset, the accessibility of the number of data points, and the content of the output variables concerned. Considering that the scale of biochar datasets is generally small, the algorithms represented by RF and SVM may be appropriate. Blind employment of ANN algorithm may be inappropriate, especially when the data size is limited. ANN and ANFIS are particularly suitable when the data scale is large and the problem concerned is the yield or adsorption capacity of biochar. When selecting model evaluation indicators, R2 and RMSE are recommended as unified model evaluation indicators for regression problems. When machine learning is applied to biochar, it is mainly concerned with regression problems, that is, predicting the yield of biochar and its adsorption capacity for pollutants, and interpretability analysis for the pyrolysis and adsorption processes of biochar. Machine learning has rarely been reported for the classification of biochar. However, considering that sometimes the concern is whether the biochar prepared is a superior adsorbent in the larger searchable, several evaluation indexes of machine learning classification problems applicable to the whole process of biochar adsorption were given. For classification problems, it may be appropriate to select accuracy, error rate, recall rate, F1 and AUC.
6 Conclusions and future perspectives
In conclusion, this review comprehensively summarized the latest progress of machine learning for biochar adsorbents from the perspective of the whole process regulation for the first time, covering optimization of pyrolysis process, evaluation of economic costs and modeling of adsorption process. The workflow of machine learning and the frequently employed machine learning algorithms for biochar were systematically summarized. The general guidance of machine learning for optimization of biochar pyrolysis process and adsorption process for pollutant removal was proposed. In order to facilitate the synthesis optimization and adsorption modeling of biochar for pollutant removal via machine learning, the following issues need to be carefully considered:
-
1.
The scale and reliability of data sets are key to the deployment of machine learning into biochar. Because biochar synthesis is a complex system and the DFT calculation of adsorption enthalpy is expensive, the idea of constructing data set from bottom to up by theoretical calculation is not practical. Therefore, a top-down approach is generally required to obtain the corresponding data set. However, there is a lack of corresponding data database. In addition, due to the existence of experimental errors and differences in experimental parameters, there are some anomalies in the data set. Developing automatic experimental instruments that can produce a large amount of field data in a short time and establishing biochar database are beneficial to the integration of machine learning and biochar. It is debatable whether statistical rules derived from employing machine learning are reliable when data size is small. Because machine learning generally applies to large data sets, and the resulting models are reliable. When training data is extremely scarce and machine learning is carried out rashly, it may get the cognition which is different from the reality or even completely wrong.
-
2.
Natural language processing technology and image recognition technology provide powerful tools for building large-scale data sets. Through natural language processing technology and image recognition technology, the text information and picture information in the literature could be quickly obtained, such as the synthesis steps, adsorption performance and morphology of biochar. Moreover, by mining valuable information in massive literature, it is conducive to maximizing the effectiveness of published experimental data.
-
3.
At present, almost all the published data are positive data, while some failed and negative data have not been disclosed and shared, which leads to biased and unbalanced biochar data sets. When the data set is not unbiased, it does not reflect the real situation. Although biased data sets can be dealt with by some technical means, the key to fundamentally solving the problem of data set distribution lies in encouraging the sharing or publishing of failed experimental data.
-
4.
Appropriate algorithm and its setting of hyperparameters are extremely important for the convergence speed and prediction performance of model. In the biochar data obtained through experiments, the risk of over-fitting is increased due to the small data scale. Therefore, it is recommended to select algorithms with strong robustness and generalization ability when training biochar datasets. In addition, exposing the settings of hyperparameters and implementation code facilitates the reproduction of model. It is important to note that sharing the details of model could avoid unnecessary duplication of effort.
-
5.
The widely applied ANN algorithm model is less explanatory, and the training of the model has great randomness or flexibility, so the model training results may change with some artificial and subjective factors. In order to improve the randomness and interpretability of machine learning model, and break through black box, it is urgent and necessary to develop machine learning algorithms with high interpretability of gray box or white box to promote the acquisition of clear physical and chemical laws.
-
6.
In practice, machine learning may get inconsistent results with reality, especially when data is scarce. In order to solve the above problem, some strategies are proposed. First of all, selecting descriptors that have causal relationship with the predicted value could enhance the interpretability of the model and obtain the predicted data consistent with the experiment as much as possible. In addition, when employing machine learning modeling, it is also applicable to adjust the model parameters in time according to the feedback of experiments. Employing the existing data of the system similar to the object being studied to expand the scarce data set could improve the prediction accuracy of the model. Furthermore, in order to solve the dilemma of small data sets and inconsistency with experiments encountered by machine learning, the most fundamental thing is to obtain a large number of reliable first-hand data and build corresponding databases.
-
7.
The competitive reaction of bio-oil and biogas is often involved in the pyrolysis process of biochar. However, at present, the competitive reaction involved is seldom considered, which may not be conducive to a comprehensive understanding of the pyrolysis reaction of biochar. In future research, the competitive reaction among biochar, bio-oil and biogas needs to be deeply considered to obtain higher yield of target products.
-
8.
Machine learning is expected to further clarify adsorption mechanism, including electrostatic interaction, complexation and adsorption energy. There is a lack of machine learning to understand the nonlinear dependence scaling relationship of pollutants in the adsorption process of biochar surface. The nonlinear dependence scaling relationship is an important part of understanding adsorption pathway. In addition, it is necessary to strengthen the influence of pollutant types, including polarity, solubility, molecular weight and even functional groups, on adsorption process. In addition, applying machine learning and image segmentation technology to reconstruct the adsorption distribution of pollutants in biochar is a new tool to intuitively understand adsorption mechanism.
-
9.
Soil heavy metal pollution has become one of the most serious environmental problems facing China. Biochar is considered to be an effective adsorbent for remediation of metal pollution in soil. However, due to the complex and diverse nature of soil, biochar and heavy metals, it is difficult to accurately measure the remediation efficiency of heavy metals. Machine learning has been widely reported to predict the efficiency of heavy metal immobilization in biochar-modified soil, which is helpful to determine the best conditions for enhancing heavy metal immobilization in biochar-modified soil (Palansooriya et al. 2022; Sun et al. 2022; Zhang et al. 2022e). Further, a database with 930 data sets on biochar improved soil was created, including 74 biochar and 43 soils (Sun et al. 2022). However, as a carbon material with rich pore structure and large specific surface area, the evolution of biochar over time cannot be ignored. After field application, microbial activity, temperature and precipitation could seriously change the physicochemical properties of biochar, and even lead to the fragmentation and dissolution of biochar (Wang et al. 2020b). Machine learning is naturally suitable for simulating and predicting the evolution of biochar-modified soil over time, including the aging of biochar-modified soil, so as to promote the transformation of artificial aging methods from qualitative methods to quantitative methods and provide good evidence for field application.
-
10.
The research of machine learning for biochar is just in its infancy, but the future development trend is bright. On the one hand, the application of machine learning in the optimization of biochar synthesis need to strengthen in order to produce more efficient and economical products. On the other hand, it is necessary to strengthen the modeling of pollutant adsorption process in water phase, solid phase and gas phase based on biochar via machine learning.
Availability of data and materials
The datasets used or analyzed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- ANFIS-SSO:
-
Adaptive neuro fuzzy inference system employing social-spider optimization
- ANN:
-
Artificial neural network
- CEC:
-
Cation exchange capacity
- CFN:
-
Cascaded feedforward network
- DNN:
-
Deep artificial neural network
- DPBC:
-
Activated date palm biochar
- ELT:
-
Integrated learning tree
- EROI:
-
Return on energy investment
- FANN:
-
Feedforward ANN
- FFN:
-
Feedforward network
- GA:
-
Genetic algorithm
- GWP:
-
Global warming potential
- KELM:
-
Nonlinear regression generalized decay function
- Kernel:
-
Extreme learning machine
- kNN:
-
K-nearest neighbor
- LFER:
-
Linear free energy relationship
- LS-SVM:
-
Least squares support vector machine
- MAE:
-
Mean absolute error
- MAPE:
-
Mean absolute percentage error
- MBE:
-
Mean deviation error
- MISO-ANN:
-
Multiple-input single output-ANN
- MLR:
-
Multiple linear regression
- MnLR:
-
Multiple nonlinear regression
- MPSP:
-
Minimum product selling price
- MSE:
-
Mean square error
- PCC:
-
Peelian correlation coefficient
- PSO:
-
Particle swarm optimization
- Qm :
-
Maximum adsorption capacity
- QSA:
-
Queue search algorithm
- R:
-
Correlation coefficient
- R2 :
-
Coefficient of determination
- RF:
-
Random forest
- RMSE:
-
Root mean square error
- RSM:
-
Response surface methodology
- TWBC:
-
Tea waste biochar
- XGB:
-
Extreme gradient boosting
References
Abd El Aziz M, Hemdan AM, Ewees AA et al (2017) Prediction of biochar yield using adaptive neuro-fuzzy inference system with particle swarm optimization. IEEE PES-IAS PowerAfrica Conference 2017 Jun 27–30. Accra, GHANA, pp. 115–120
Abdulsalam J, Lawal AI, Setsepu RL, Onifade M, Bada S (2020) Application of gene expression programming, artificial neural network and multilinear regression in predicting hydrochar physicochemical properties. Bioresour Bioprocess. https://doi.org/10.1186/s40643-020-00350-6
Adedeji PA, Akinlabi SA, Madushele N, Olatunji OO (2020) Neuro-fuzzy resource forecast in site suitability assessment for wind and solar energy: a mini review. J Cleaner Prod. https://doi.org/10.1016/j.jclepro.2020.122104
Ahmed MM, Isa NAM (2017) Knowledge base to fuzzy information granule: a review from the interpretability-accuracy perspective. Appl Soft Comput 54:121–140. https://doi.org/10.1016/j.asoc.2016.12.055
Alabdrabalnabi A, Gautam R, Mani Sarathy S (2022) Machine learning to predict biochar and bio-oil yields from co-pyrolysis of biomass and plastics. Fuel. https://doi.org/10.1016/j.fuel.2022.125303
Albalasmeh A, Gharaibeh MA, Mohawesh O et al (2020) Characterization and artificial neural networks modelling of methylene blue adsorption of biochar derived from agricultural residues: effect of biomass type, pyrolysis temperature, particle size. J Saudi Chem Soc 24:811–823. https://doi.org/10.1016/j.jscs.2020.07.005
Alhothali A, Khurshid H, Mustafa MRU et al (2022) Evaluation of contemporary computational techniques to optimize adsorption process for simultaneous removal of COD and TOC in wastewater. Adsorpt Sci Technol 2022:1–16. https://doi.org/10.1155/2022/7874826
Alsawy T, Rashad E, El-Qelish M, Mohammed RH (2022) A comprehensive review on the chemical regeneration of biochar adsorbent for sustainable wastewater treatment. Npj Clean Water. https://doi.org/10.1038/s41545-022-00172-3
Altikat A, Alma MH (2022a) Application of new hybrid models based on artificial neural networks for modeling pyrolysis yields of Atriplex nitens S. Int J Energy Res 46:4445–4461. https://doi.org/10.1002/er.7441
Altikat A, Alma MH (2022b) Prediction carbonization yields and the sensitivity analyses using deep learning neural networks and support vector machines. Int J Environ Sci Technol. https://doi.org/10.1007/s13762-022-04407-1
An Q, Jiang YQ, Nan HY, Yu Y, Jiang JN (2019) Unraveling sorption of nickel from aqueous solution by KMnO4 and KOH-modified peanut shell biochar: implicit mechanism. Chemosphere 214:846–854. https://doi.org/10.1016/j.chemosphere.2018.10.007
An XF, Wu ZS, Shi W et al (2021) Biochar for simultaneously enhancing the slow-release performance of fertilizers and minimizing the pollution of pesticides. J Hazard Mater. https://doi.org/10.1016/j.jhazmat.2020.124865
Ascher S, Watson I, You SM (2022) Machine learning methods for modelling the gasification and pyrolysis of biomass and waste. Renewable Sustain Energy Rev. https://doi.org/10.1016/j.rser.2021.111902
Avakyan L, Tolchina D, Barkovski V et al (2022) Ultimate sensitivity of radial distribution functions to architecture of PtCu bimetallic nanoparticles. Comput Mater Sci 208:111326. https://doi.org/10.1016/j.commatsci.2022.111326
Avrutskiy VI (2020) Preventing overfitting by training derivatives. Proceedings of the Future Technologies Conference (Ftc) 2019, Vol 1 1069:144–163. https://doi.org/10.1007/978-3-030-32520-6_12
Azzi ES, Karltun E, Sundberg C (2022) Life cycle assessment of urban uses of biochar and case study in Uppsala, Sweden. Biochar. https://doi.org/10.1007/s42773-022-00144-3
Beigzadeh B, Bahrami M, Amiri MJ, Mahmoudi MR (2020) A new approach in adsorption modeling using random forest regression, Bayesian multiple linear regression, and multiple linear regression: 2,4-D adsorption by a green adsorbent. Water Sci Technol 82:1586–1602. https://doi.org/10.2166/wst.2020.440
Bhagat SK, Tiyasha T, Awadh SM, Tung TM, Jawad AH, Yaseen ZM (2021) Prediction of sediment heavy metal at the Australian Bays using newly developed hybrid artificial intelligence models. Environ Pollut 268:115663. https://doi.org/10.1016/j.envpol.2020.115663
Bisaria K, Singh R, Gupta M, Mathur A, Dixit A (2022) Novel acoustic-activated alkali-functionalized Trapa bispinosa peel biochar for green immobilization of chlorpyrifos from wastewater: artificial intelligence modelling and experimental validation. Biomass Convers Biorefin. https://doi.org/10.1007/s13399-022-02898-z
Bischl B, Binder M, Lang M et al (2023) Hyperparameter optimization: foundations, algorithms, best practices, and open challenges. Wiley Int Rev Data Mining Knowl Discovery
Bong HK, Selvarajoo A, Arumugasamy SK (2022) Stability of biochar derived from banana peel through pyrolysis as alternative source of nutrient in soil: feedforward neural network modelling study. Environ Monit Assess. https://doi.org/10.1007/s10661-021-09691-x
Boraah N, Chakma S, Kaushal P (2022) Attributes of wood biochar as an efficient adsorbent for remediating heavy metals and emerging contaminants from water: a critical review and bibliometric analysis. J Environ Chem Eng. https://doi.org/10.1016/j.jece.2022.107825
Cao L (2022) Recent advances in the application of machine-learning algorithms to predict adsorption energies. Trends Chem 4:347–360. https://doi.org/10.1016/j.trechm.2022.01.012
Cao H, Xin Y, Yuan Q (2016) Prediction of biochar yield from cattle manure pyrolysis via least squares support vector machine intelligent approach. Bioresour Technol 202:158–164. https://doi.org/10.1016/j.biortech.2015.12.024
Cashman S, Korostynska O, Shaw A, Lisboa P, Conroy L (2017) Detecting the presence and concentration of nitrate in water using microwave spectroscopy. IEEE Sens J 17:4092–4099. https://doi.org/10.1109/Jsen.2017.2705281
Cha JS, Jang SH, Lam SS et al (2021) Performance of CO2 and Fe-modified lignin char on arsenic (V) removal from water. Chemosphere. https://doi.org/10.1016/j.chemosphere.2021.130521
Chang CY, Medford AJ (2021) Application of density functional tight binding and machine learning to evaluate the stability of biomass intermediates on the Rh(111) surface. J Phys Chem C 125:18210–18216. https://doi.org/10.1021/acs.jpcc.1c05715
Cheng F, Luo H, Colosi LM (2020) Slow pyrolysis as a platform for negative emissions technology: an integration of machine learning models, life cycle assessment, and economic analysis. Energy Convers Manage. https://doi.org/10.1016/j.enconman.2020.113258
Cheng CY, Pourhejazy P, Ying KC, Lin CF (2021) Unsupervised learning-based artificial bee colony for minimizing non-value-adding operations. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2021.107280
Choudhary M, Kumar R, Neogi S (2020) Activated biochar derived from Opuntia ficus-indica for the efficient adsorption of malachite green dye, Cu2+ and Ni2+ from water. J Hazard Mater. https://doi.org/10.1016/j.jhazmat.2020.122441
Chowdhury S, Sikder J, Mandal T, Halder G (2019) Comprehensive analysis on sorptive uptake of enrofloxacin by activated carbon derived from industrial paper sludge. Sci Total Environ 665:438–452. https://doi.org/10.1016/j.scitotenv.2019.02.081
Da T-X, Ren H-K, He W-K, Gong S-Y, Chen T (2022) Prediction of uranium adsorption capacity on biochar by machine learning methods. J Environ Chem Eng. https://doi.org/10.1016/j.jece.2022.108449
Dahan E, Keller Y (2021) A unified approach to Kinship verification. IEEE Trans Pattern Anal Mach Int 43:2851–2857. https://doi.org/10.1109/Tpami.2020.3036993
Dalhat MA, Muazu ND, Essa MH (2021) Generalized decay and artificial neural network models for fixed-Bed phenolic compounds adsorption onto activated date palm biochar. J Environ Chem Eng. https://doi.org/10.1016/j.jece.2020.104711
Deng C, Ji X, Rainey C et al (2020) Integrating machine learning with human knowledge. Iscience 23:101656
Din SU, Awan JM, Imran M et al (2021) Novel nanocomposite of biochar-zerovalent copper for lead adsorption. Microsc Res Tech 84:2598–2606. https://doi.org/10.1002/jemt.23810
Ding Y, Jiang XQ, Kim YJ (2022) Relational graph convolutional networks for predicting blood-brain barrier penetration of drug molecules. Bioinformatics 38:2826–2831. https://doi.org/10.1093/bioinformatics/btac211
Dong Z, Bai X, Xu D, Li W (2022) Machine learning prediction of pyrolytic products of lignocellulosic biomass based on physicochemical characteristics and pyrolysis conditions. Bioresour Technol 367:128182. https://doi.org/10.1016/j.biortech.2022.128182
Egbosiuba TC (2022) Biochar and bio-oil fuel properties from nickel nanoparticles assisted pyrolysis of cassava peel. Heliyon. https://doi.org/10.1016/j.heliyon.2022.e10114
El Hanandeh A, Mahdi Z, Imtiaz MS (2021) Modelling of the adsorption of Pb, Cu and Ni ions from single and multi-component aqueous solutions by date seed derived biochar: comparison of six machine learning approaches. Environ Res 192:110338. https://doi.org/10.1016/j.envres.2020.110338
Ertugrul OF (2020) A novel randomized machine learning approach: reservoir computing extreme learning machine. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2020.106433
Ewees AA, Abd El Aziz M, Elhoseny M (2017) Social-spider optimization algorithm for improving ANFIS to predict biochar yield. 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT). 2017 Jul 03–05. Delhi, INDIA
Ewees AA, Elaziz MA (2018) Improved adaptive neuro-fuzzy inference system using gray wolf optimization: a case study in predicting biochar yield. J Intell Syst 29:924–940. https://doi.org/10.1515/jisys-2017-0641
Faulon JL, Faure L (2021) In silico, in vitro, and in vivo machine learning in synthetic biology and metabolic engineering. Curr Opin Chem Biol 65:85–92
Findlay MR, Freitas DN, Mobed-Miremadi M, Wheeler KE (2018) Machine learning provides predictive analysis into silver nanoparticle protein corona formation from physicochemical properties. Environ Sci: Nano 5:64–71. https://doi.org/10.1039/c7en00466d
Goswami L, Kushwaha A, Kafle SR, Kim B-S (2022) Surface modification of biochar for dye removal from wastewater. Catalysts. https://doi.org/10.3390/catal12080817
Graham S, Epstein D, Rajpoot N (2020) Dense steerable filter CNNs for exploiting rotational symmetry in histology images. IEEE Trans Med Imaging 39:4124–4136. https://doi.org/10.1109/Tmi.2020.3013246
Guo MX, Li H, Baldwin B, Morrison J (2019) Thermochemical processing of animal manure for bioenergy and biochar. Int J Green Engry 13:963–968. https://doi.org/10.1080/15435075.2015.1087851
Gupta S, Patel P, Mondal P (2022) Biofuels production from pine needles via pyrolysis: Process parameters modeling and optimization through combined RSM and ANN based approach. Fuel. https://doi.org/10.1016/j.fuel.2021.122230
Hai A, Bharath G, Daud M et al (2021) Valorization of groundnut shell via pyrolysis: Product distribution, thermodynamic analysis, kinetic estimation, and artificial neural network modeling. Chemosphere 283:131162. https://doi.org/10.1016/j.chemosphere.2021.131162
Han M, Zhao Q, Li W et al (2021) Global soil organic carbon changes and economic revenues with biochar application. GCB Bioenergy 14:364–377. https://doi.org/10.1111/gcbb.12915
Handelman GS, Kok HK, Chandra RV et al (2019) Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods (vol 212, pg 38, 2019). Am J Roentgenol 212:479–479. https://doi.org/10.2214/Ajr.18.20994
Haq ZU, Ullah H, Khan MNA, Raza Naqvi S, Ahad A, Amin NAS (2022) Comparative study of machine learning methods integrated with genetic algorithm and particle swarm optimization for bio-char yield prediction. Bioresour Technol 363:128008. https://doi.org/10.1016/j.biortech.2022.128008
Harrington GH, Kelly C, Attari V, Arroyave R, Kalidindi SR (2022) Application of a chained-ANN for learning the process-structure mapping in Mg2SixSn1-x spinodal decomposition. Integr Mater Manuf I 11:433–449. https://doi.org/10.1007/s40192-022-00274-3
Isaeva VI, Vedenyapina MD, Kurmysheva AY et al (2021) Modern carbon-based materials for adsorptive removal of organic and inorganic pollutants from water and wastewater. Molecules. https://doi.org/10.3390/molecules26216628
Jabar JM, Odusote YA (2021) Utilization of prepared activated biochar from water lily (Nymphaea lotus) stem for adsorption of malachite green dye from aqueous solution. Biomass Convers Biorefin. https://doi.org/10.1007/s13399-021-01399-9
Jeyasubramanian K, Thangagiri B, Sakthivel A et al (2021) A complete review on biochar: production, property, multifaceted applications, interaction mechanism and computational approach. Fuel. https://doi.org/10.1016/j.fuel.2021.120243
Jung C, Boateng LK, Flora JRV et al (2015) Competitive adsorption of selected non-steroidal anti-inflammatory drugs on activated biochars: experimental and molecular modeling study. Chem Eng J 264:1–9. https://doi.org/10.1016/j.cej.2014.11.076
Kabir MM, Akter MM, Khandaker S et al (2022) Highly effective agro-waste based functional green adsorbents for toxic chromium(VI) ion removal from wastewater. J Mol Liq. https://doi.org/10.1016/j.molliq.2021.118327
Kang JK, Seo EJ, Lee CG, Jeong S, Park SJ (2022) Application of response surface methodology and artificial neural network for the preparation of Fe-loaded biochar for enhanced Cr(VI) adsorption and its physicochemical properties and Cr(VI) adsorption characteristics. Environ Sci Pollut Res Int 29:60852–60866. https://doi.org/10.1007/s11356-022-20009-3
Karaboga D, Kaya E (2018) Adaptive network based fuzzy inference system (ANFIS) training approaches: a comprehensive survey. Artif Intell Rev 52:2263–2293. https://doi.org/10.1007/s10462-017-9610-2
Katiyar R, Patel AK, Nguyen TB, Singhania RR, Chen CW, Dong CD (2021) Adsorption of copper(II) in aqueous solution using biochars derived from Ascophyllum nodosum seaweed. Bioresour Technol. https://doi.org/10.1016/j.biortech.2021.124829
Ke B, Nguyen H, Bui XN et al (2021a) Predicting the sorption efficiency of heavy metal based on the biochar characteristics, metal sources, and environmental conditions using various novel hybrid machine learning models. Chemosphere 276:130204. https://doi.org/10.1016/j.chemosphere.2021.130204
Ke B, Nguyen H, Bui XN, Bui HB, Nguyen-Thoi T (2021b) Prediction of the sorption efficiency of heavy metal onto biochar using a robust combination of fuzzy C-means clustering and back-propagation neural network. J Environ Manage 293:112808. https://doi.org/10.1016/j.jenvman.2021.112808
Khan M, Ullah Z, Masek O, Raza Naqvi S, Nouman Aslam Khan M (2022) Artificial neural networks for the prediction of biochar yield: a comparative study of metaheuristic algorithms. Bioresour Technol 355:127215. https://doi.org/10.1016/j.biortech.2022.127215
Kim JE, Bhatia SK, Song HJ et al (2020) Adsorptive removal of tetracycline from aqueous solution by maple leaf-derived biochar. Bioresour Technol. https://doi.org/10.1016/j.biortech.2020.123092
Kim S, Quiroz-Arita C, Monroe EA et al (2021) Application of attached algae flow-ways for coupling biomass production with the utilization of dilute non-point source nutrients in the Upper Laguna Madre. TX. Water Res 191:116816. https://doi.org/10.1016/j.watres.2021.116816
Komnitsas K, Zaharaki D, Bartzas G, Alevizos G (2017) Adsorption of scandium and neodymium on biochar derived after low-temperature pyrolysis of sawdust. Minerals. https://doi.org/10.3390/min7100200
Krueger BC, Fowler GD, Templeton MR, Moya B (2020) Resource recovery and biochar characteristics from full-scale faecal sludge treatment and co-treatment with agricultural waste. Water Res. https://doi.org/10.1016/j.watres.2019.115253
Kwiatkowski M, Kalderis D (2020) A complementary analysis of the porous structure of biochars obtained from biomass. Carbon Lett 30:325–329. https://doi.org/10.1007/s42823-019-00101-4
Lakshmi D, Akhil D, Kartik A et al (2021) Artificial intelligence (AI) applications in adsorption of heavy metals using modified biochar. Sci Total Environ. https://doi.org/10.1016/j.scitotenv.2021.149623
Lamoureux PS, Choksi TS, Streibel V, Abild-Pedersen F (2021) Combining artificial intelligence and physics-based modeling to directly assess atomic site stabilities: from sub-nanometer clusters to extended surfaces. Phys Chem Chem Phys 23:22022–22034. https://doi.org/10.1039/d1cp02198b
Lee ME, Park JH, Chung JW (2019) Comparison of the lead and copper adsorption capacities of plant source materials and their biochars. J Environ Manage 236:118–124. https://doi.org/10.1016/j.jenvman.2019.01.100
Li YF, Liang DM (2019) Safe semi-supervised learning: a brief introduction. Front Comput Sci 13:669–676. https://doi.org/10.1007/s11704-019-8452-2
Li M, Wei D, Liu T et al (2019a) EDTA functionalized magnetic biochar for Pb(II) removal: adsorption performance, mechanism and SVM model prediction. Sep Purif Technol. https://doi.org/10.1016/j.seppur.2019.115696
Li S, Harris S, Anandhi A, Chen G (2019b) Predicting biochar properties and functions based on feedstock and pyrolysis temperature: a review and data syntheses. J Cleaner Prod 215:890–902. https://doi.org/10.1016/j.jclepro.2019.01.106
Li D, Liu SL, Gao FR, Sun X (2020a) Continual learning classification method with new labeled data based on the artificial immune system. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2020.106423
Li N, Shepperd M, Guo YC (2020b) A systematic review of unsupervised learning techniques for software defect prediction. Inf Softw Technol. https://doi.org/10.1016/j.infsof.2020.106287
Li YG, Li QQ, Wu CX, Luo X, Yu X, Chen MJ (2020c) The inappropriate application of the regression Langmuir Q(m) for adsorption capacity comparison. Sci Total Environ. https://doi.org/10.1016/j.scitotenv.2019.134222
Li LY, Cao G, Zhu RS (2021a) Adsorption of Cr(VI) from aqueous solution by a litchi shell-based adsorbent. Environ Res. https://doi.org/10.1016/j.envres.2020.110356
Li TY, Xiang H, Yang Y, Wang J, Yildiz G (2021b) Prediction of char production from slow pyrolysis of lignocellulosic biomass using multiple nonlinear regression and artificial neural network. J Anal Appl Pyrolysis. https://doi.org/10.1016/j.jaap.2021.105286
Li XD, Peng B, Liu QY, Zhang HY (2022a) Microwave pyrolysis coupled with conventional pre-pyrolysis of the stalk for syngas and biochar. Bioresour Technol. https://doi.org/10.1016/j.biortech.2022.126745
Li Y, Gupta R, You S (2022b) Machine learning assisted prediction of biochar yield and composition via pyrolysis of biomass. Bioresour Technol 359:127511. https://doi.org/10.1016/j.biortech.2022.127511
Lian SM, Liu JW, Lu RK, Luo XL (2019) Captured multi-label relations via joint deep supervised autoencoder. Appl Soft Comput 74:709–728. https://doi.org/10.1016/j.asoc.2018.10.035
Liao M, Kelley SS, Yao Y (2019) Artificial neural network based modeling for the prediction of yield and surface area of activated carbon from biomass. Biofuels Bioprod Biorefin 13:1015–1027. https://doi.org/10.1002/bbb.1991
Liew YW, Arumugasamy SK, Selvarajoo A (2022) Potential of biochar as soil amendment: prediction of elemental ratios from pyrolysis of agriculture biomass using artificial neural network. Water Air Soil Pollut. https://doi.org/10.1007/s11270-022-05510-2
Ligthart A, Catal C, Tekinerdogan B (2021) Analyzing the effectiveness of semi-supervised learning approaches for opinion spam classification. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2020.107023
Liu YY, Chen JW (2022) Effect of ageing on biochar properties and pollutant management. Chemosphere. https://doi.org/10.1016/j.chemosphere.2021.133427
Liu HD, Xu GR, Li GB (2021a) Autocatalytic sludge pyrolysis by biochar derived from pharmaceutical sludge for biogas upgrading. Energy. https://doi.org/10.1016/j.energy.2021.120802
Liu L, Li C, Liu XI, Gao Y, Liu G (2021b) Study on the regulation mechanism of cadmium adsorption system mediated by extraneous dissolved organic matter. Ecotox Environ Saf. https://doi.org/10.1016/j.ecoenv.2021.112930
Liu X, Shao Z, Wang Y et al (2022a) New use for Lentinus edodes bran biochar for tetracycline removal. Environ Res. https://doi.org/10.1016/j.envres.2022.114651
Liu XW, Hu YK, Chen JG, Li KQ (2022b) Shape and boundary-aware multi-branch model for semi-supervised medical image segmentation. Comput Biol Med 143:105252. https://doi.org/10.1016/j.compbiomed.2022.105252
Liu YB, Yang XL, Zhang JQ, Zhu ZY (2022c) Process simulation of preparing biochar by biomass pyrolysis via aspen plus and its economic evaluation. Waste Biomass Valorization 13:2609–2622. https://doi.org/10.1007/s12649-021-01671-z
Liu ZR, Shurin GV, Bian L, White DL, Shurin MR, Star A (2022d) A carbon nanotube sensor array for the label-free discrimination of live and dead cells with machine learning. Anal Chem 94:3565–3573. https://doi.org/10.1021/acs.analchem.1c04661
Luo ZR, Yao B, Yang X et al (2022) Novel insights into the adsorption of organic contaminants by biochar: a review. Chemosphere. https://doi.org/10.1016/j.chemosphere.2021.132113
Masek O, Budarin V, Gronnow M et al (2013) Microwave and slow pyrolysis biochar-comparison of physical and functional properties. J Anal Appl Pyrolysis 100:41–48. https://doi.org/10.1016/j.jaap.2012.11.015
Masood A, Ahmad K (2021) A review on emerging artificial intelligence (AI) techniques for air pollution forecasting: Fundamentals, application and performance. J Cleaner Prod 322:129072. https://doi.org/10.1016/j.jclepro.2021.129072
Masrura SU, Jones-Lepp TL, Kajitvichyanukul P, Ok YS, Tsang DCW, Khan E (2022) Unintentional release of antibiotics associated with nutrients recovery from source-separated human urine by biochar. Chemosphere. https://doi.org/10.1016/j.chemosphere.2022.134426
Mayol AP, Maningo JMZ, Chua-Unsu AGAY et al. (2018) Application of artificial neural networks in prediction of pyrolysis behavior for algal mat (LABLAB) biomass. IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management. https://doi.org/10.1109/hnicem.2018.8666376
Medeiros D, Nzediegwu C, Benally C et al (2022) Pristine and engineered biochar for the removal of contaminants co-existing in several types of industrial wastewaters: a critical review. Sci Total Environ. https://doi.org/10.1016/j.scitotenv.2021.151120
Merdun H, Sezgin IV (2018) Modelling of pyrolysis product yields by artificial neural networks. Int J Renew Energy Res 8:1178–1188
Mir AA, Celebi FV, Alsolai H et al (2022) Anomalies prediction in radon time series for earthquake likelihood using machine learning-based ensemble model. IEEE Access 10:37984–37999. https://doi.org/10.1109/Access.2022.3163291
Mizikovsky D, Sanchez MN, Nefzger CM, Partida GC, Palpant NJ (2022) Organization of gene programs revealed by unsupervised analysis of diverse gene-trait associations. Nucleic Acids Res. https://doi.org/10.1093/nar/gkac413
Mondal S, Aikat K, Halder G (2016) Biosorptive uptake of ibuprofen by chemically modified Parthenium hysterophorus derived biochar: equilibrium, kinetics, thermodynamics and modeling. Ecol Eng 92:158–172. https://doi.org/10.1016/j.ecoleng.2016.03.022
Mondal S, Aikat K, Siddharth K et al (2017) Optimizing ranitidine hydrochloride uptake of Parthenium hysterophorus derived N-biochar through response surface methodology and artificial neural network. Process Saf Environ Prot 107:388–401. https://doi.org/10.1016/j.psep.2017.03.011
Mukherjee S, Kamila B, Paul S et al (2022) Insight into biosorptive uptake of fluoride by chemically activated biochar: experimental modeling and parametric optimization. Biomass Convers Biorefin. https://doi.org/10.1007/s13399-021-02200-7
Nair RR, Mondal MM, Weichgrebe D (2020) Biochar from co-pyrolysis of urban organic wastes—investigation of carbon sink potential using ATR-FTIR and TGA. Biomass Convers Biorefin 12:4729–4743. https://doi.org/10.1007/s13399-020-01000-9
Nandy A, Duan CR, Taylor MG, Liu F, Steeves AH, Kulik HJ (2021) Computational discovery of transition-metal complexes: from high-throughput screening to machine learning. Chem Rev 121:9927–10000. https://doi.org/10.1021/acs.chemrev.1c00347
Narde SR, Remya N (2022) Biochar production from agricultural biomass through microwave-assisted pyrolysis: predictive modelling and experimental validation of biochar yield. Environ Dev Sustain 24:11089–11102. https://doi.org/10.1007/s10668-021-01898-9
Nguyen XC, Ly QV, Peng W et al (2021) Vertical flow constructed wetlands using expanded clay and biochar for wastewater remediation: a comparative study and prediction of effluents using machine learning. J Hazard Mater 413:125426. https://doi.org/10.1016/j.jhazmat.2021.125426
Nguyen XC, Ly QV, Nguyen TTH, Ngo HTT, Hu Y, Zhang Z (2022) Potential application of machine learning for exploring adsorption mechanisms of pharmaceuticals onto biochars. Chemosphere 287:132203. https://doi.org/10.1016/j.chemosphere.2021.132203
Ohale PE, Onu CE, Nwabanne JT, Aniagor CO, Okey-Onyesolu CF, Ohale NJ (2022) A comparative optimization and modeling of ammonia–nitrogen adsorption from abattoir wastewater using a novel iron-functionalized crab shell. Appl Water Sci. https://doi.org/10.1007/s13201-022-01713-4
Onyekwena CC, Xue Q, Li Q et al (2022) Support vector machine regression to predict gas diffusion coefficient of biochar-amended soil. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2022.109345
Ortiz-Monsalve S, Gutterres M, Valente P et al (2020) Degradation of a leather-dye by the combination of depolymerised wood-chip biochar adsorption and solid-state fermentation with Trametes villosa SCS-10. Bioresour Bioprocess. https://doi.org/10.1186/s40643-020-00349-z
Padakandla S (2021) A Survey of reinforcement learning algorithms for dynamically varying environments. Acm Comput Surv 54:127. https://doi.org/10.1145/3459991
Palansooriya KN, Li J, Dissanayake PD et al (2022) Prediction of soil heavy metal immobilization by biochar using machine learning. Environ Sci Technol 56:4187–4198. https://doi.org/10.1021/acs.est.1c08302
Pan XQ, Gu ZP, Chen WM, Li QB (2021) Preparation of biochar and biochar composites and their application in a Fenton-like process for wastewater decontamination: a review. Sci Total Environ. https://doi.org/10.1016/j.scitotenv.2020.142104
Pardakhti M, Moharreri E, Wanik D, Suib SL, Srivastava R (2017) Machine learning using combined structural and chemical descriptors for prediction of methane adsorption performance of metal organic frameworks (MOFs). ACS Comb Sci 19:640–645. https://doi.org/10.1021/acscombsci.7b00056
Pathy A, Meher S, Balasubramanian P (2020) Predicting algal biochar yield using eXtreme Gradient Boosting (XGB) algorithm of machine learning methods. Algal Res. https://doi.org/10.1016/j.algal.2020.102006
Paula AJ, Ferreira OP, Souza AG, Nepomuceno F, Andrade CE, Faria AF (2022) Machine learning and natural language processing enable a data-oriented experimental design approach for producing biochar and hydrochar from biomass. Chem Mater 34:979–990. https://doi.org/10.1021/acs.chemmater.1c02961
Potnuri R, Suriapparao DV, Sankar Rao C, Sridevi V, Kumar A (2022) Effect of dry torrefaction pretreatment of the microwave-assisted catalytic pyrolysis of biomass using the machine learning approach. Renewable Energy 197:798–809. https://doi.org/10.1016/j.renene.2022.08.006
Praveen S, Jegan J, Pushpa TB, Gokulan R (2021) Artificial neural network modelling for biodecolorization of Basic Violet 03 from aqueous solution by biochar derived from agro-bio waste of groundnut hull: Kinetics and thermodynamics. Chemosphere 276:130191. https://doi.org/10.1016/j.chemosphere.2021.130191
Rathnayake N, Patel S, Halder P et al (2022) Co-pyrolysis of biosolids with alum sludge: effect of temperature and mixing ratio on product properties. J Anal Appl Pyrolysis. https://doi.org/10.1016/j.jaap.2022.105488
Ravindiran G, Saravanan P, Alagumalai A, Subbarayan S (2022) Soft computing-based models and decolorization of Reactive Yellow 81 using Ulva Prolifera biochar. Chemosphere 287:132368. https://doi.org/10.1016/j.chemosphere.2021.132368
Razavi-Termeh SV, Sadeghi-Niaraki A, Choi SM (2021) Spatial modeling of asthma-prone areas using remote sensing and ensemble machine learning algorithms. Remote Sens-Basel 13:3222. https://doi.org/10.3390/rs13163222
Reddy BS, Maurya AK, Sathishkumar V et al (2021) Prediction of batch sorption of barium and strontium from saline water. Environ Res. https://doi.org/10.1016/j.envres.2021.111107
Ro KS (2016) Kinetics and energetics of producing animal manure-based biochar. Bioenergy Res 9:447–453. https://doi.org/10.1007/s12155-016-9724-4
Rovira M, Engvall K, Duwig C (2022) Identifying key features in reactive flows: a tutorial on combining dimensionality reduction, unsupervised clustering, and feature correlation. Chem Eng J 438:135250. https://doi.org/10.1016/j.cej.2022.135250
Salawu OA, Han Z, Adeleye AS (2022) Shrimp waste-derived porous carbon adsorbent: performance, mechanism, and application of machine learning. J Hazard Mater 437:129266. https://doi.org/10.1016/j.jhazmat.2022.129266
Sayin B, Krivosheev E, Yang J et al (2021) A review and experimental analysis of active learning over crowdsourced data. Artif Intell Rev 54:5283–5305
Schleder GR, Padilha ACM, Acosta CM, Costa M, Fazzio A (2019) From DFT to machine learning: recent approaches to materials science–a review. J Phys Mater. https://doi.org/10.1088/2515-7639/ab084b
Selvam SM, Balasubramanian P (2022) Influence of biomass composition and microwave pyrolysis conditions on biochar yield and its properties: a machine learning approach. Bioenergy Res. https://doi.org/10.1007/s12155-022-10447-9
Shahbaz M, AlNouss A, Ghiat I et al (2021) A comprehensive review of biomass based thermochemical conversion technologies integrated with CO2 capture and utilisation within BECCS networks. Conserv Recycl. https://doi.org/10.1016/j.resconrec.2021.105734
Shima H, Sato Y, Sakata K, Asakura T, Kikuchi J (2022) Identifying a correlation among qualitative non-numeric parameters in natural fish microbe dataset using machine learning. Appl Sci-Basel. https://doi.org/10.3390/app12125927
Sohail A, Arif F (2020) Supervised and unsupervised algorithms for bioinformatics and data science. Prog Biophys Mol Biol 151:14–22. https://doi.org/10.1016/j.pbiomolbio.2019.11.012
Somarowthu S, Yang HY, Hidebrand DGC, Ondrechen MJ (2011) High-performance prediction of functional residues in proteins with machine learning and computed input features. Biopolymers 95:390–400. https://doi.org/10.1002/bip.21589
Song ZX, Yang XL, Xu ZL, King I (2022) Graph-based semi-supervised learning: a comprehensive review. IEEE Trans Neural Networks Learn Syst. https://doi.org/10.1109/Tnnls.2022.3155478
Sun Y, Zhang YY, Lu L et al (2022) The application of machine learning methods for prediction of metal immobilization remediation by biochar amendment in soil. Sci Total Environ 829:154668. https://doi.org/10.1016/j.scitotenv.2022.154668
Suvarna M, Araujo TP, Perez-Ramirez J (2022) A generalized machine learning framework to predict the space-time yield of methanol from thermocatalytic CO2 hydrogenation. Appl Catal B-Environ 315:121530. https://doi.org/10.1016/j.apcatb.2022.121530
Talebkeikhah F, Rasam S, Talebkeikhah M, Torkashvand M, Salimi A, Moraveji MK (2020) Investigation of effective processes parameters on lead(II) adsorption from wastewater by biochar in mild air oxidation pyrolysis process. Int J Environ Anal Chem. https://doi.org/10.1080/03067319.2020.1777291
Tan LS, Ma ZH, Yang KQ et al (2020) Effect of three artificial aging techniques on physicochemical properties and Pb adsorption capacities of different biochars. Sci Total Environ. https://doi.org/10.1016/j.scitotenv.2019.134223
Tan YH, Wan XR, Ni X et al (2022) Efficient removal of Cd(II) from aqueous solution by chitosan modified kiwi branch biochar. Chemosphere. https://doi.org/10.1016/j.chemosphere.2021.133251
Tanaka A (2015) Analyses on empirical error minimzation in multiple kernel regressors. 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2015 Apr 19–24. Brisbane, AUSTRALIA, pp. 2046–2050
Tang SF, Zhou H, Tan WT et al (2022) Adsorption characteristics and mechanisms of Fe-Mn oxide modified biochar for Pb(II) in wastewater. Int J Environ Res Public Health. https://doi.org/10.3390/ijerph19148420
Tee JX, Selvarajoo A, Arumugasamy SK (2022) Prediction of carbon sequestration of biochar produced from biomass pyrolysis by artificial neural network. J Environ Chem Eng. https://doi.org/10.1016/j.jece.2022.107640
Thiruvengadam S, Edmund Murphy M, Tan JS (2021) Mathematically modelling pyrolytic polygeneration processes using artificial intelligence. Fuel. https://doi.org/10.1016/j.fuel.2021.120488
Thomas E, Borchard N, Sarmiento C, Atkinson R, Ladd B (2020) Key factors determining biochar sorption capacity for metal contaminants: a literature synthesis. Biochar 2:151–163. https://doi.org/10.1007/s42773-020-00053-3
Tomczyk A, Sokolowska Z, Boguta P (2020) Biochar physicochemical properties: Pyrolysis temperature and feedstock kind effects. Rev Environ Sci Bio 19:191–215. https://doi.org/10.1007/s11157-020-09523-3
Tran HQ, Ha C (2022) Machine learning in indoor visible light positioning systems: a review. Neurocomputing 491:117–131. https://doi.org/10.1016/j.neucom.2021.10.123
Tseng KK, Zhang YQ, Zhu QL, Yung KL, Ip WH (2020) Semi-supervised image depth prediction with deep learning and binocular algorithms. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2020.106272
Tyagi U (2022) Enhanced adsorption of metal ions onto Vetiveria zizanioides biochar via batch and fixed bed studies. Bioresour Technol. https://doi.org/10.1016/j.biortech.2021.126475
Uddin MG, Nash S, Rahman A, Olbert AI (2022) A comprehensive method for improvement of water quality index (WQI) models for coastal water quality assessment. Water Res 219:118532. https://doi.org/10.1016/j.watres.2022.118532
Van Calster B, Vickers AJ (2015) Calibration of risk prediction models: impact on decision-analytic performance. Med Decis Making 35:162–169. https://doi.org/10.1177/0272989x14547233
Wan S, Zhao XB, Niu ZH et al (2022) Influence of ambient air pollution on successful pregnancy with frozen embryo transfer: a machine learning prediction model. Ecotoxicol Environ Saf 236:113444. https://doi.org/10.1016/j.ecoenv.2022.113444
Wang B, Yu J, Liao H, Zhu WK, Ding PP, Zhou J (2020a) Adsorption of lead(II) from aqueous solution with high efficiency by hydrothermal biochar derived from honey. Int J Environ Res Public Health. https://doi.org/10.3390/ijerph17103441
Wang L, O’Connor D, Rinklebe J et al (2020b) Biochar aging: mechanisms, physicochemical changes, assessment, and implications for field applications. Environ Sci Technol 54:14797–14814. https://doi.org/10.1021/acs.est.0c04033
Wang XM, Gao JH, Long MS, Wang JM (2021) Self-tuning for data-efficient deep learning. Int Conf Mach Learn 139(139):7748–7759
Wang W, Wu X, Long S (2022a) Optimizing the methylene blue removal from aqueous solution using pomelo peel based biochar assisted by RSM and ANN-PSO. Pol J Environ Stud 31:329–346. https://doi.org/10.15244/pjoes/137947
Wang ZW, Yang Z, Liu LJ, Ye YP, Xie XY (2022b) Electron transfer mechanism of chitosan-modified natural manganese ore-cornstalk biochar composites with activated peroxymonosulfate: the role of functional groups on the surface of biochar-based composites. Sep Purif Technol. https://doi.org/10.1016/j.seppur.2022.122107
Wani I, Sharma A, Kushvaha V, Madhushri P, Peng L (2020) Effect of pH, volatile content, and pyrolysis conditions on surface area and O/C and H/C ratios of biochar: towards understanding performance of biochar using simplified approach. J Hazard Toxic Radioact Waste. https://doi.org/10.1061/(asce)hz.2153-5515.0000545
Wei N, Zhang QJ, Zhang YJ et al (2022) Super-learner model realizes the transient prediction of CO2 and NOx of diesel trucks: model development, evaluation and interpretation. Environ Int 158:106977. https://doi.org/10.1016/j.envint.2021.106977
Wong YJ, Arumugasamy SK, Chung CH, Selvarajoo A, Sethu V (2020) Comparative study of artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS) and multiple linear regression (MLR) for modeling of Cu(II) adsorption from aqueous solution using biochar derived from rambutan (Nephelium lappaceum) peel. Environ Monit Assess 192:439. https://doi.org/10.1007/s10661-020-08268-4
Wu Y, Zhang PY, Zeng GM et al (2016) Enhancing sewage sludge dewaterability by a skeleton builder: Biochar produced from sludge cake conditioned with rice husk flour and FeCl3. ACS Sustain Chem Eng 4:5711–5717. https://doi.org/10.1021/acssuschemeng.6b01654
Wu LP, Wei CB, Zhang SR, Wang YD, Kuzyakov Y, Ding XD (2019) MgO-modified biochar increases phosphate retention and rice yields in saline-alkaline soil. J Cleaner Prod 235:901–909. https://doi.org/10.1016/j.jclepro.2019.07.043
Wu LR, Yuan LF, Zhao GJ, Lin HT, Li SZ (2022a) Deep clustering and visualization for end-to-end high-dimensional data analysis. IEEE Trans Neural Networks Learn Syst. https://doi.org/10.1109/Tnnls.2022.3151498
Wu ML, Liu B, Li J, Su XT, Liu WZ, Li XQ (2022b) Influence of pyrolysis temperature on sludge biochar: the ecological risk assessment of heavy metals and the adsorption of Cd(II). Environ Sci Pollut Res. https://doi.org/10.1007/s11356-022-22827-x
Xia R, Wang G, Zhang Y et al (2020) River algal blooms are well predicted by antecedent environmental conditions. Water Res 185:116221. https://doi.org/10.1016/j.watres.2020.116221
Xie YQ, Wang L, Li HL et al (2022) A critical review on production, modification and utilization of biochar. J Anal Appl Pyrol. https://doi.org/10.1016/j.jaap.2021.105405
Xu XT, Wang C, Gui BX et al (2022) Application of machine learning to predict the inhibitory activity of organic chemicals on thyroid stimulating hormone receptor. Environ Res 212:113175. https://doi.org/10.1016/j.envres.2022.113175
Yadav K, Jagadevan S (2021) Influence of torrefaction and pyrolysis on engineered biochar and its applicability in defluoridation: insight into adsorption mechanism, batch adsorber design and artificial neural network modelling. J Anal Appl Pyrolysis. https://doi.org/10.1016/j.jaap.2021.105015
Yadav K, Raphi M, Jagadevan S (2021) Adsorption of copper(II) on chemically modified biochar: a single-stage batch adsorber design and predictive modeling through artificial neural network. Biomass Convers Biorefin. https://doi.org/10.1007/s13399-021-01494-x
Yan XL, Zhang J, Russo DP, Zhu H, Yan B (2020) Prediction of nano-bio interactions through convolutional neural network analysis of nanostructure images. ACS Sustain Chem Eng 8:19096–19104. https://doi.org/10.1021/acssuschemeng.0c07453
Yang WY, Liang H, Peng F, Liu ZL, Liu J, Qiao ZW (2019) Computational screening of metal-organic framework membranes for the separation of 15 gas mixtures. Nanomaterials 9:467. https://doi.org/10.3390/nano9030467
Yang W, Shang JY, Li BG, Flury M (2020) Surface and colloid properties of biochar and implications for transport in porous media. Crit Rev Environ Sci Technol 50:2484–2522. https://doi.org/10.1080/10643389.2019.1699381
Yang XC, Nguyen XC, Tran QB et al (2022a) Machine learning-assisted evaluation of potential biochars for pharmaceutical removal from water. Environ Res. https://doi.org/10.1016/j.envres.2022.113953
Yang YD, Piao YX, Wang RF, Su YM, Qiu JR, Liu N (2022b) Mechanism of biochar functional groups in the catalytic reduction of tetrachloroethylene by sulfides. Environ Pollut. https://doi.org/10.1016/j.envpol.2022.118921
Yau KLA, Qadir J, Khoo HL, Ling MH, Komisarczuk P (2017) A survey on reinforcement learning models and algorithms for traffic signal control. Acm Comput Surv 50:34. https://doi.org/10.1145/3068287
Ye Y, Shi J, Zhu D et al (2021) Management of medical and health big data based on integrated learning-based health care system: a review and comparative analysis. Comput Meth Prog Bio 209:106293
Yildirim H, Ozkale MR (2021) LL-ELM: a regularized extreme learning machine based on L-1-norm and Liu estimator. Neural Comput Appl 33:10469–10484. https://doi.org/10.1007/s00521-021-05806-0
Yu F, Sun L, Zhou YM et al (2016) Biosorbents based on agricultural wastes for ionic liquid removal: an approach to agricultural wastes management. Chemosphere 165:94–99. https://doi.org/10.1016/j.chemosphere.2016.08.133
Yu A, Liu Y, Li X, Yang Y, Zhou Z, Liu H (2021) Modeling and optimizing of NH4+ removal from stormwater by coal-based granular activated carbon using RSM and ANN coupled with GA. Water. https://doi.org/10.3390/w13050608
Yuan LH, Li C, Cao JT, Zhao QB (2020) Rank minimization on tensor ring: an efficient approach for tensor decomposition and completion. Mach Learn 109:603–622. https://doi.org/10.1007/s10994-019-05846-7
Zafar M, Van Vinh N, Behera SK, Park HS (2017) Ethanol mediated As(III) adsorption onto Zn-loaded pinecone biochar: experimental investigation, modeling, and optimization using hybrid artificial neural network-genetic algorithm approach. J Environ Sci (china) 54:114–125. https://doi.org/10.1016/j.jes.2016.06.008
Zeng ZW, Tan XF, Liu YG et al (2018) Comprehensive adsorption studies of doxycycline and ciprofloxacin antibiotics by biochars prepared at different temperatures. Front Chem. https://doi.org/10.3389/fchem.2018.00080
Zhang D, Jiao LC, Bai X, Wang S, Hou B (2018) A robust semi-supervised SVM via ensemble learning. Appl Soft Comput 65:632–643. https://doi.org/10.1016/j.asoc.2018.01.038
Zhang K, Zhong SF, Zhang HC (2020) Predicting aqueous adsorption of organic compounds onto biochars, carbon nanotubes, granular activated carbons, and resins with machine learning. Environ Sci Technol 54:7008–7018. https://doi.org/10.1021/acs.est.0c02526
Zhang HK, Wang ZL, Cai JF, Wu SC, Li JJ (2021a) Machine-learning-enabled tricks of the trade for rapid host material discovery in Li-S battery. ACS Appl Mater Inter 13:53388–53397. https://doi.org/10.1021/acsami.1c10749
Zhang X, Zheng HH, Wu J et al (2021b) Physicochemical and adsorption properties of biochar from biomass-based pyrolytic polygeneration: effects of biomass species and temperature. Biochar 3:657–670. https://doi.org/10.1007/s42773-021-00102-5
Zhang H, Li Y, Xie R et al (2022a) A particle scale micro-CT approach for 3D in-situ visualizing the Pb(II) adsorption in different crop residue-derived chars. Bioresour Technol 344:126269. https://doi.org/10.1016/j.biortech.2021.126269
Zhang W, Huang W, Tan J, Guo Q, Wu B (2022b) Heterogeneous catalysis mediated by light, electricity and enzyme via machine learning: paradigms, applications and prospects. Chemosphere. https://doi.org/10.1016/j.chemosphere.2022.136447
Zhang W, Huang W, Tan J, Huang D, Ma J, Wu B (2022c) Modeling, optimization and understanding of adsorption process for pollutant removal via machine learning: recent progress and future perspectives. Chemosphere 311:137044. https://doi.org/10.1016/j.chemosphere.2022.137044
Zhang WA, Gu X, Tang LB, Yin YP, Liu DS, Zhang YM (2022d) Application of machine learning, deep learning and optimization algorithms in geoengineering and geoscience: comprehensive review and future challenge. Gondwana Res 109:1–17. https://doi.org/10.1016/j.gr.2022.03.015
Zhang Y, Ren M, Tang Y et al (2022e) Immobilization on anionic metal(loid)s in soil by biochar: a meta-analysis assisted by machine learning. J Hazard Mater. https://doi.org/10.1016/j.jhazmat.2022.129442
Zhao Y, Li Y, Fan D, Song J, Yang F (2021) Application of kernel extreme learning machine and Kriging model in prediction of heavy metals removal by biochar. Bioresour Technol 329:124876. https://doi.org/10.1016/j.biortech.2021.124876
Zhao Y, Fan D, Li Y, Yang F (2022) Application of machine learning in predicting the adsorption capacity of organic compounds onto biochar and resin. Environ Res 208:112694. https://doi.org/10.1016/j.envres.2022.112694
Zheng X, Nguyen H (2022) A novel artificial intelligent model for predicting water treatment efficiency of various biochar systems based on artificial neural network and queuing search algorithm. Chemosphere 287:132251. https://doi.org/10.1016/j.chemosphere.2021.132251
Zhou X, Zhao J, Chen M, Zhao G, Wu S (2022) Influence of catalyst and solvent on the hydrothermal liquefaction of woody biomass. Bioresour Technol 346:126354. https://doi.org/10.1016/j.biortech.2021.126354
Zhu X, Wang X, Ok YS (2019a) The application of machine learning methods for prediction of metal sorption onto biochars. J Hazard Mater 378:120727. https://doi.org/10.1016/j.jhazmat.2019.06.004
Zhu XZ, Li YN, Wang XN (2019b) Machine learning prediction of biochar yield and carbon contents in biochar based on biomass characteristics and pyrolysis conditions. Bioresour Technol. https://doi.org/10.1016/j.biortech.2019.121527
Zhu X, He M, Sun Y et al (2022a) Insights into the adsorption of pharmaceuticals and personal care products (PPCPs) on biochar and activated carbon with the aid of machine learning. J Hazard Mater 423:127060. https://doi.org/10.1016/j.jhazmat.2021.127060
Zhu X, Xu Z, You S et al (2022b) Machine learning exploration of the direct and indirect roles of Fe impregnation on Cr(VI) removal by engineered biochar. Chem Eng J. https://doi.org/10.1016/j.cej.2021.131967
Zhu XF, Labianca C, He MJ et al (2022c) Life-cycle assessment of pyrolysis processes for sustainable production of biochar from agro-residues. Bioresour Technol. https://doi.org/10.1016/j.biortech.2022.127601
Zuo HF, Qin XX, Liu ZG, Fu YL (2021) Preparation and characterization of modified corn stalk biochar. BioResources 16:7427–7442. https://doi.org/10.15376/biores.16.4.7428-7443
Acknowledgements
The present work was funded by the National Natural Science Foundation of China, the Central Public-Interest Scientific Institution Basal Research Fund of China and Gusu Innovation and Entrepreneurship Leading Talent Plan.
Funding
This work was supported by the Ministry of Scienceand Technology of the People’s Republic of China (No.2019YFC0408302), the National Natural Science Foundation of China (21906078, 52070137), the Central Public-Interest Scientific Institution Basal Research Fund of the People’s Republic of China (PM-zx703-202204-104) and Gusu Innovation and Entrepreneurship Leading Talent Plan (ZXL2022500).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by WZ, BW, JT and WH. The first draft of the manuscript was written by WZ and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors report no conflict of interest. The authors alone are responsible for the content and writing of this paper.
Additional information
Handling editor: Baoshan Xing.
Supplementary Information
Additional file 1.
Text S1. Mainstream machine learning algorithms; Fig. S1. Model performance in a) testing datasets and b) validation datasets for predicting mass yield; Fig. S2. a) PCC between any two variables of interest. Relative importance of input variables: b) biochar yield, c) proximate composition of biochar and d) ultimate composition of biochar; Fig. S3. a) The breakdown of energy (top) and climate change (bottom) profiles for pyrolysis of biomass, b) Comparison of EROI vs GWP for biomass pyrolysis at different temperature; Fig. S4. A special case that clearly describes the design of biochar for pollutant adsorption via machine learning; Fig. S5. RMSE a) and R2 b) values of kNN, RF and Cubist; Fig. S6. Correlation and boxplot of the ANN and QSA-ANN models. a) QSA-ANN on the training dataset; b) ANN on the training dataset; c) QSA-ANN on the testing dataset; d) ANN on the testing dataset; Fig. S7. The relative importance of independent variables for a) adsorption capacity and b) affinity prediction, c, d) Univariate plots of the five most important variables for adsorption capacity prediction for all heavy metal; Fig. S8. Model performance of a) supporting vector regression, b) neural network, c) RF, and d) updated RF for predicting the immobilization efficiency of heavy metal in biochar-amended soils; Fig. S9. Partial dependence plots of Cr(VI) removal capacity on the physicochemical properties of biochar: a) Fe content, b) The ratio of O to C, c) C content and d) Specific surface area; Fig. S10. a) The qualitative data of 3D in-situ visualization of WSBC + Pb(II) particles, b) The normalization semi quantitively 3D in-situ visualization data of WSBC + Pb(II) particles; Table S1. Summary of machine learning for optimizing pyrolysis process of biochar; Table S2. Summary of machine learning for modeling pollutant adsorption process of biochar; Table S3. The scope, benefits, and limitations of different machine learning algorithms; Table S4. The scope, benefits, and limitations of different descriptors.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, W., Chen, R., Li, J. et al. Synthesis optimization and adsorption modeling of biochar for pollutant removal via machine learning. Biochar 5, 25 (2023). https://doi.org/10.1007/s42773-023-00225-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42773-023-00225-x