Predictions of elemental composition of coal and biomass from their proximate analyses using ANFIS, ANN and MLR

The elemental composition of coal and biomass provides significant parameters used in the design of almost all energy conversion systems and projects. The laboratory tests used to determine the elemental composition of coal and biomass are time-consuming and costly. However, limited research has suggested that there is a correlation between parameters obtained from elemental and proximate analyses of these materials. In this study, predictive models of the elemental composition of coal and biomass have been developed using soft computing and regression analyses. Thirty-one samples, each with parameters from elemental and proximate analyses, were used to develop multiple prediction models. Carbon, hydrogen, and oxygen were selected as the dependent variables. Using volatile matter, fixed carbon, moisture and ash contents as independent variables, three different prediction models were developed for each dependent parameter using ANFIS, ANN, and MLR. In addition, a routine for selecting the best predictive model is suggested. The reliability of the established models was tested using various prediction performance indices, and the models were found to be satisfactory. Therefore, the developed models can be used to determine the elemental composition of coal and biomass for practical purposes.


Introduction
The world's energy demand has steadily increased owing to rising population and living standards (Chen et al. 2015). As a result, fossil fuel reserves are gradually being depleted (Mohr et al. 2015; Shafiee and Topal 2009). Coal is the major single source of the world's energy, and it acts as a guarantor of energy security, supplying 38% of the world's electricity (IEA 2018). Furthermore, it will still account for 26% of the world's electricity supply in 2040, as predicted by the International Energy Agency (2018).
Studies on renewable energy and alternative fuels around the world have been reported in the literature. Wood biomass has been acknowledged as a potential source of renewable energy because of its accessibility in most areas of the world (Van der Stelt et al. 2011). Properties such as high moisture content, low bulk density, low calorific value and high energy requirements for grinding are the limitations of biomass that restrict its wider use for power generation (Haseli 2018). One of the main properties for the utilization of coal and biomass materials is the elemental composition (Chen et al. 2015). A special instrument is required to determine the elemental composition of coal, while data for proximate analysis can be readily acquired using common equipment (Shen et al. 2010).
The elemental composition of biomass is a significant property that defines the amount of energy available and is used to evaluate the clean and efficient use of biomass materials (Chen et al. 2015). The elemental composition is also needed for evaluating chemical conversion processes and for predicting the flue gas flow and the air quality in coal combustion (Nhuchhen 2016). The proximate analysis characterizes the fuel and indicates the appropriate usage of a coal. Using proximate and elemental analyses, a number of fuel parameters can be examined (Mathews et al. 2014).
The characteristics of coal and biomass are necessary for evaluating their potential and for the effective operation of energy conversion processes. In the past, several relationships have been established using ultimate and proximate analyses. Chelgani et al. (2008) established a technique to predict the grindability of coal by multiple regression and artificial neural network (ANN) models from the data obtained from proximate and ultimate analyses. Furthermore, on the basis of the ultimate analysis of solid, liquid and gaseous fuels, a relationship to predict the higher heating value was established by Channiwala and Parikh (2002). Similarly, the heating value of biomass and municipal solid waste (MSW) was determined using the data obtained from proximate analysis (Parikh et al. 2007; Komilis et al. 2012). A review of previous work shows that no studies have established models to predict the elemental composition from the proximate analysis of coal and biomass materials, except for the relationship developed by Vakkilainen (2000) for black liquor only. This gap has necessitated the current study.
The design of energy conversion systems requires the elemental composition of coal, biomass and other related materials. Hitherto, only limited research has been published on correlations for evaluating the elemental composition from the proximate analysis of these materials. There has been a significant increase in recent research on biomass, coal and related materials, which requires an elemental analysis of these materials for the assessment of the complete process of any thermochemical conversion technique. Therefore, this research aims to evaluate the elemental composition of both coal and biomass materials obtained from South Africa (SA) and Nigeria coalfields from their proximate analysis using soft computing and regression analyses. The study makes use of ANN, adaptive neuro-fuzzy inference system (ANFIS) and multilinear regression (MLR) based on laboratory test results to eliminate the need for time-consuming and costly experimental elemental analysis. Laboratory tests were conducted to obtain the proximate analysis and elemental analysis of the coal and biomass materials. The results of the proximate analysis are used as the input parameters in the proposed models, and the elemental composition is the targeted output. The predictions of the ANFIS, ANN and MLR models are compared with those of existing models, and the model with the best performance in terms of the coefficient of determination, average absolute error, average biased error and mean absolute error is proposed for predicting the elemental composition of both coal and biomass materials.

Experimental investigation
To develop the models, proximate and elemental data relating to different coal samples and biomass (forest and agricultural wastes) were used to cover a wide range of values for fixed carbon (FC), moisture (M), volatile matter (VM), ash (A), carbon (C), hydrogen (H), nitrogen (N), oxygen (O) and sulphur (S) contents. A total of 31 samples (8 coal samples from Nigeria (NIG), 8 coal samples from South Africa (SA), 12 wood biomass samples from SA and 3 refuse-derived fuels from SA) were collected for this study using a grab sampling method. Since there are no specific sampling protocols identified for biomass materials, the samples were collected with due care to obtain the most representative samples. Each coal sample was kept in a plastic bag (made from aluminium-coated polyester) and labelled with a chosen number. The sample lumps were reduced to appropriate dimensions (10 mm) using a crusher (Rocklabs MK III). The samples were then milled to a size fraction of 250 µm for proximate and ultimate analyses. The proximate analyses were carried out according to ASTM D5142, with approximately 1 g used to estimate the A, VM and M contents. The FC was obtained by subtracting the sum of the moisture, volatile matter and ash contents from 100%. The elemental analysis was conducted based on ASTM D5373-14:2015 for CHN using a LECO CHN 628 with an add-on 628 S module. Approximately 0.25 g of each sample was used, at analysis temperatures of up to 1450 °C and with an analysis time of between 60 and 300 s. A database of the proximate and elemental analyses obtained from the experimental tests is presented in Table 1. To enable the general application of the proposed models, the models were trained and validated on the data set using ANFIS, ANN, and MLR and compared with one another.
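The fixed-carbon calculation described above can be sketched in a few lines. This is a minimal illustration only: the function name `fixed_carbon` and the sample values are hypothetical and are not taken from Table 1.

```python
def fixed_carbon(moisture, volatile_matter, ash):
    """Fixed carbon (wt%) by difference: FC = 100 - (M + VM + A)."""
    total = moisture + volatile_matter + ash
    if not 0.0 <= total <= 100.0:
        raise ValueError("M + VM + A must lie between 0 and 100 wt%")
    return 100.0 - total


# Hypothetical proximate analysis (wt%), not a row of Table 1
fc = fixed_carbon(moisture=5.0, volatile_matter=30.0, ash=15.0)
print(fc)  # 50.0
```

All four values must be on the same basis (for example air-dried) for the difference to be meaningful.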
Model development

Artificial neural network
The ANN is a soft computing method that imitates the way the human brain processes information, such as reasoning, learning, memorizing and induction, through a complex network. This is made possible by interconnected structures comprising several simple processing neurons that can perform large parallel computations for data processing and information representation (Dehghani and Ataee-pour 2011). To create an ANN model, there are various ways in which the neurons can be connected. The feed-forward (FF) ANN was suggested by Shahin et al. (2002) for solving extremely non-linear and complex problems in which time-dependent parameters are not involved in the input parameters. The multi-layer perceptron (MLP) neural network is a well-recognized FF-ANN (Simpson 1990; Haykin 1999; Monjezi et al. 2013). An MLP has several nodes/neurons in three layers, i.e. an input, a hidden and an output layer, connected together by weights. Du et al. (2002) and Kalinli et al. (2011) have successfully established the efficiency of MLP ANNs in high-dimensional functional approximation. However, an ANN requires training of the network before the results can be interpreted. Various learning algorithms are used for training MLP-FF networks, but the backpropagation (BP) algorithm is commonly applied (Rumelhart et al. 1986; Fausett 1994; Dreyfus 2005).
In the BP-ANN, the input data are imported into the input layer, which initiates the propagation into the hidden neurons via the connection weights. The input from each neuron in the input layer, $I_i$, is multiplied by the weight, $w_{ik}$. At each neuron, the sum of the input signals multiplied by their respective weights is determined and added to a threshold value referred to as the bias value, $b_{hk}$. To obtain the output of the node, the combined input, $J_i$, is subjected to a nonlinear transfer function (tan sigmoid or log sigmoid). The targeted output of the entire network can then be calculated by applying the same principle as for the hidden nodes, but in this case the transfer function can be either nonlinear, such as a sigmoidal function, or linear (Eq. (1)). In BP, the signals are propagated from the input layer through the hidden layer to the output layer, which is known as the forward pass. The values obtained by the system are then compared with the targeted actual values, and the system error between the two is computed. The resulting errors are then returned to the system to update the weights, which is known as the backward pass. In this process, the errors of both the training and testing datasets are reduced. This procedure is repeated in the feed-forward backpropagation ANN until the resulting errors have converged to the threshold level specified by the system's error function, such as the root-mean-squared error (RMSE). To build the ANN network, sufficient datasets are required, although there is no established rule for determining the number of datasets sufficient for building a suitable ANN model. Equation (1) shows the general form of the principle of operation of the ANN model.
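$$D = f_{\mathrm{purlin}}\left[ b_0 + \sum_{k=1}^{n} \left( w_k \, f_{\mathrm{sig}}\left( b_{hk} + \sum_{i=1}^{m} w_{ik} C_i \right) \right) \right] \tag{1}$$

where $b_0$ is the bias in the output layer; $w_k$ is the weight of connection between the $k$th neuron of the hidden layer and the single output neuron; $b_{hk}$ is the bias in the $k$th neuron of the hidden layer; $n$ is the number of neurons in the hidden layer; $m$ is the number of input variables; $w_{ik}$ is the weight of connection between the $i$th input parameter and the $k$th neuron of the hidden layer; $C_i$ is the input variable $i$; $D$ is the output variable; and $f_{\mathrm{purlin}}$ and $f_{\mathrm{sig}}$ are the linear and nonlinear transfer functions, respectively.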
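As a rough illustration of the forward pass described by Eq. (1), the following sketch evaluates a network with one hidden layer. It assumes a tan-sigmoid hidden layer and, by default, a linear output; the function name `ann_forward` and all weights and biases shown are placeholders chosen for illustration, not the fitted values of the models in this study.

```python
import math


def ann_forward(inputs, w_ih, b_h, w_ho, b_o, output_fn=lambda z: z):
    """Forward pass of a one-hidden-layer feed-forward network.

    w_ih[k][i] : weight from input i to hidden neuron k
    b_h[k]     : bias of hidden neuron k
    w_ho[k]    : weight from hidden neuron k to the single output neuron
    b_o        : bias of the output neuron
    output_fn  : output transfer function (linear by default)
    """
    hidden = [
        math.tanh(b_k + sum(w * x for w, x in zip(w_k, inputs)))  # tan-sigmoid
        for w_k, b_k in zip(w_ih, b_h)
    ]
    return output_fn(b_o + sum(w * h for w, h in zip(w_ho, hidden)))


# Placeholder 4-3-1 network (illustrative numbers only)
w_ih = [[0.1, -0.2, 0.3, 0.0],
        [0.0, 0.5, -0.1, 0.2],
        [-0.3, 0.1, 0.1, 0.4]]
b_h = [0.0, 0.1, -0.1]
w_ho = [0.5, -0.4, 0.3]
b_o = 0.05

scaled_inputs = [0.2, -0.5, 0.1, 0.7]  # hypothetical scaled A, VM, FC, M
prediction = ann_forward(scaled_inputs, w_ih, b_h, w_ho, b_o)
print(prediction)
```

Passing `output_fn=math.tanh` gives a tan-sigmoid output neuron instead of a linear one.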

ANN models for the predictions of elemental compositions
The ANN model proposed in this study was created using an MLP-FF network trained with the BP training algorithm. Three different ANN models were developed, one for predicting each of H, O, and C. This is necessary because the sizes of the matrices of the targeted outputs are not equal for the elemental compositions, and it also enables a fair comparison with the ANFIS and MLR models. Four input variables representing the A, VM, FC, and M contents were used in each of the models. A total of 28 experimental datasets obtained in this study, as shown in Table 1, were used for developing the ANN model for C, while 20 datasets each were used for the respective ANN predictions of H and O. The ANN models were implemented in the MATLAB environment using its embedded neural network toolbox. The input and output variables were scaled between -1 and 1 using Eq. (2) to achieve dimensional consistency of the parameters and also to avoid over-fitting of the trained network.
$$Y_i = \frac{2\,(X_i - X_{\min})}{X_{\max} - X_{\min}} - 1 \tag{2}$$

where $Y_i$ is the scaled parameter, $X_i$ is the actual data to be scaled, and $X_{\max}$ and $X_{\min}$ are the maximum and minimum values of the actual data, respectively. A network architecture with one hidden layer was adopted in this study, and a trial-and-error approach was used to arrive at the optimal network architecture. A three-layer network (one input, one hidden and one output layer) with four neurons in the input layer, three neurons in the hidden layer, and one neuron in the output layer was chosen for building the proposed ANN models. A non-linear (TANSIG) transfer function was used for both the input layer and the output layer. The obtained optimal ANN architecture for the three proposed models is shown in Fig. 1.
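The scaling step can be sketched as a linear min-max mapping onto [-1, 1], which is assumed here to be the form of Eq. (2). The helper names `scale_to_unit_range` and `unscale` and the sample values are hypothetical.

```python
def scale_to_unit_range(values):
    """Linearly map values onto [-1, 1] using their own minimum and maximum."""
    x_min, x_max = min(values), max(values)
    if x_max == x_min:
        raise ValueError("cannot scale a constant series")
    return [2.0 * (x - x_min) / (x_max - x_min) - 1.0 for x in values]


def unscale(scaled, x_min, x_max):
    """Invert the mapping to recover values in the original units."""
    return [(y + 1.0) * (x_max - x_min) / 2.0 + x_min for y in scaled]


# Hypothetical volatile matter contents (wt%)
vm = [10.0, 20.0, 30.0]
vm_scaled = scale_to_unit_range(vm)
print(vm_scaled)                       # [-1.0, 0.0, 1.0]
print(unscale(vm_scaled, 10.0, 30.0))  # [10.0, 20.0, 30.0]
```

Network outputs predicted in the scaled range need the inverse mapping before they can be compared with measured compositions.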
The respective performances of the obtained models for each of the elemental compositions are shown in Fig. 2. The figures show that in each of the cases, the mean squared error decreases up to the points where the best performances were obtained and their values tend to reach asymptotic values after the best performance. The pattern of the curves for the training, validations, and testing are similar, indicating that the models are successful.
The regression plots of the proposed ANN models are also illustrated in Fig. 3. The figure shows that the R values for the training, validation, and testing of the three models are above 97%. Hence, the proposed ANNs can successfully predict the elemental compositions of solid fuels.
To enable the easy application of the proposed ANN models for predicting the elemental compositions of the solid fuels (i.e. coal and biomass), the proposed ANN models were transformed into mathematical models through their weights and biases, based on the general ANN equation presented in Eq. (1). The mathematical formulas obtained for C, H, and O are presented in Eqs. (3) to (5). The predictions output directly from the ANN models and those of Eqs. (3), (4) and (5) were compared to validate the mathematically transformed ANN models, as illustrated in Fig. 4. The coefficient of determination for the three models is 100%, indicating that the proposed equations reproduce their respective ANN models.

Adaptive Neuro-Fuzzy Inference System (ANFIS)
ANFIS is a soft computing method that incorporates the concept of fuzzy logic into neural networks (Jang 1993). It is widely used in various areas of engineering science and the earth sciences (Habibagahi 2002; Iphar 2012; Sahu et al. 2011; Sahu and Mahapatra 2013; Onifade et al. 2019). ANFIS has the ability to approximate any real continuous function on a compact set to any degree of accuracy (Jang et al. 1997). It uses linguistic information based on fuzzy logic and the learning capability of the ANN. ANFIS is a fuzzy mapping algorithm that replicates and evaluates the input and output data via hybrid learning to estimate the optimal distribution of membership functions on the basis of the Takagi-Sugeno-Kang (TSK) fuzzy inference system (Jang and Gulley 1995; Loukas 2001). ANFIS is essentially based on the fuzzy "If-Then" rules from the Takagi and Sugeno fuzzy model (Jang et al. 1997), as shown in Fig. 5. A typical Sugeno fuzzy system with two input parameters, one output parameter and two rules is displayed in Fig. 5. The corresponding ANFIS structure of this system is also shown in Fig. 5 (Rafiei-Sardooi et al. 2018; Seifi and Riahi 2018). Its rules are:
Rule 1: If $x$ is $A_1$ and $y$ is $B_1$, then $f_1 = p_1 x + q_1 y + r_1$
Rule 2: If $x$ is $A_2$ and $y$ is $B_2$, then $f_2 = p_2 x + q_2 y + r_2$

Descriptions of node functions
The ANFIS architecture has two node types: square nodes and circle nodes. A square node contains unknown parameters, whereas a circle node does not (i.e. only the multiplication of fuzzy membership functions and the normalization of the firing strengths take place in the two respective circle nodes). The nodes in the same layer have the same function family, as explained below.

Layer 1: In this layer, every node output is fuzzified by the membership grade of a fuzzy set corresponding to each input (Eq. (6)). The membership function of this fuzzy set may be triangular, trapezoidal, generalized bell or Gaussian.

$$O_{1,i} = \mu_{A_i}(x) \tag{6}$$

where $x$ is the input to node $i$, $O_{1,i}$ is the membership grade of a fuzzy set $A_i$ and identifies the degree to which a certain input $x$ satisfies the quantifier $A_i$, and $\mu_{A_i}$ is the membership function, which could take any of the afore-mentioned forms. For instance, $\mu_{A_i}$ for a typical bell-shaped function is given in Eq. (7).

$$\mu_{A_i}(x) = \frac{1}{1 + \left| \dfrac{x - c_i}{a_i} \right|^{2 b_i}} \tag{7}$$

where $a_i$, $b_i$, and $c_i$ are known as the premise parameters in this layer. These parameters control the shape of the function.

Layer 2: Each node in this layer is a circle node labelled M, the output of which is the product of its incoming Layer 1 outputs (Eq. (8)). The output of each node in this layer is the firing strength of a rule.

$$w_i = \mu_{A_i}(x)\,\mu_{B_i}(y), \quad i = 1, 2 \tag{8}$$

Layer 3: This is the normalization layer. Every node is labelled as an encircled N (Fig. 5) and normalizes the firing strength generated by the preceding product layer. The $i$th node computes the ratio of the firing strength of the $i$th rule to the sum of the firing strengths of all rules, as presented in Eq. (9).

$$\bar{w}_i = \frac{w_i}{w_1 + w_2}, \quad i = 1, 2 \tag{9}$$

Layer 4: This layer is the defuzzification layer. Each node $i$ in this layer is an adaptive node with a node function as described in Eq. (10).

$$O_{4,i} = \bar{w}_i f_i = \bar{w}_i \left( p_i x + q_i y + r_i \right) \tag{10}$$

where $\bar{w}_i$ is the normalized firing strength from Layer 3. The parameter set of this node is $p_i$, $q_i$, and $r_i$. The parameters in this layer are termed the consequent parameters. For the input parameters with three (3) membership functions, for instance, the $f_i$ in Eq. (10) will be $f_i = p_i x + q_i y + r_i z + s_i$.

Layer 5: The single node in this layer is a fixed node labelled Σ. It calculates the total output as the sum of all incoming signals, as shown in Eq. (11).

$$O_{5} = \sum_i \bar{w}_i f_i = \frac{\sum_i w_i f_i}{\sum_i w_i} \tag{11}$$
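The five layers can be traced numerically for the two-input, two-rule Sugeno system described above. The sketch below assumes generalized bell membership functions and first-order consequents; the function names and every parameter value are hypothetical, chosen only to show how a crisp output is produced.

```python
def bell_mf(x, a, b, c):
    """Generalized bell membership function with premise parameters a, b, c."""
    return 1.0 / (1.0 + abs((x - c) / a) ** (2.0 * b))


def sugeno_two_rule(x, y, premise_x, premise_y, consequents):
    """Forward pass through the five ANFIS layers for two inputs and two rules."""
    # Layer 1: membership grades of x in A_1, A_2 and of y in B_1, B_2
    mu_a = [bell_mf(x, *params) for params in premise_x]
    mu_b = [bell_mf(y, *params) for params in premise_y]
    # Layer 2: firing strength of each rule (product of its membership grades)
    w = [ma * mb for ma, mb in zip(mu_a, mu_b)]
    # Layer 3: normalized firing strengths
    total = sum(w)
    w_bar = [wi / total for wi in w]
    # Layer 4: rule outputs f_i = p_i*x + q_i*y + r_i, weighted by w_bar_i
    weighted = [wb * (p * x + q * y + r)
                for wb, (p, q, r) in zip(w_bar, consequents)]
    # Layer 5: overall output as the sum of all incoming signals
    return sum(weighted)


# Hypothetical premise (a, b, c) and consequent (p, q, r) parameters
premise_x = [(2.0, 2.0, 0.0), (2.0, 2.0, 4.0)]
premise_y = [(1.0, 2.0, 0.0), (1.0, 2.0, 2.0)]
consequents = [(1.0, 0.5, 0.0), (0.2, 1.0, 1.0)]
print(sugeno_two_rule(1.0, 0.5, premise_x, premise_y, consequents))
```

In a trained ANFIS, the premise parameters are adjusted by gradient descent and the consequent parameters by least squares in the hybrid learning scheme; here they are simply fixed by hand.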

ANFIS model development for the prediction of elemental composition
In this study, a five-layer ANFIS model was established to predict the elemental composition of coal and biomass. The grid partitioning approach was used to create the FIS model, and a hybrid technique was used to evaluate the premise and consequent parameters. A four-input, one-output model was employed to determine each of C, H and O.
A Gaussian-type membership function (gaussmf) was selected for the inputs and a constant-type membership function was used for the output when obtaining the fuzzy inference system for C and H. For O, the triangular membership function (trimf) was chosen for the inputs and the constant-type membership function was again used for the output. Each of the input membership functions was categorised into three linguistic variables. The low (L), high (H) and very high (VH) linguistic variables were used for FC and VM, while very low (VL), low (L) and high (H) were used for A and M. Typical membership functions for input 2 (VM) and input 3 (A), with the respective Gaussian and triangular membership functions, are shown in Fig. 6. The data set was selected randomly but included the highest and lowest values. The data set was normalized within the range of 0 and 1. The ANFIS model was implemented in the MATLAB environment. Eighty-one rules in total were created in each of the models (Fig. 7). The predictive capability of the models was tested using eight additional data points having the same distribution as the training data set for carbon, while four additional data points each were used for hydrogen (H) and oxygen (O) testing. The overall performance of the models was compared with the predictions of the other models using the whole data sets used in developing the models. The premise and the consequent parameters are presented in Tables 2 and 3, respectively.
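The count of eighty-one rules follows from grid partitioning: every combination of one linguistic label per input forms a rule, so four inputs with three labels each give 3^4 = 81 rules. A short enumeration, using the labels stated above (the variable names are illustrative):

```python
from itertools import product

# Three linguistic labels per input, as described in the text
labels = {
    "FC": ["L", "H", "VH"],
    "VM": ["L", "H", "VH"],
    "A": ["VL", "L", "H"],
    "M": ["VL", "L", "H"],
}

# Grid partitioning: one rule per combination of labels across the inputs
rules = list(product(*labels.values()))
print(len(rules))  # 81
print(rules[0])    # ('L', 'L', 'VL', 'VL')
```

With a constant-type output membership function, each of these rules carries a single consequent constant.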

Multilinear regression (MLR)
The multilinear regression technique is widely used in different engineering fields to solve a wide range of problems. For instance, MLR was used by Shen et al. (2010) and Parikh et al. (2007) for predicting the elemental compositions of fuels. In the area of geotechnical engineering, MLR has been used to predict the strength of rock, rock fragmentation and shear strength parameters by Gokceoglu and Zorlu (2004), Bahrami et al. (2011), respectively. MLR is normally utilized to establish a relationship between the dependent and independent variables (Onifade 2018; Onifade and Genc 2018). Hence, MLR measures the influence of the independent variables on the dependent variable. For a regression involving one dependent and one independent variable, the general regression equation is shown in Eq. (12).
In Eq. (12), $b$ is a constant, and $D$ and $E$ are the dependent and independent variables, respectively. The equation can be extended to accommodate more than one independent variable, as presented in Eq. (13), in which $E_1, E_2, E_3, \ldots, E_n$ are the different independent variables used to predict $D$.
MLR was used in this study for the prediction of the elemental composition of fuels (coal and biomass). The dependent variables are C, H, and O, whereas the independent variables are FC, VM, A, and M. The MLR was implemented with the OriginPro software. To do this, the dependent and independent variables were imported into OriginPro and, under the analysis drop-down menu, multilinear regression was selected to perform the required MLR analysis. The obtained MLR equations for the three dependent variables are presented in Eqs. (14) to (16).
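For readers without access to OriginPro, a multilinear fit can be sketched with ordinary least squares. This is a generic implementation via the normal equations, assumed here as a stand-in for the software routine rather than a reproduction of it; the function name `fit_mlr` and the synthetic data (generated from y = 1 + 2*x1 + 3*x2) are illustrative and unrelated to Table 1.

```python
def fit_mlr(rows, targets):
    """Ordinary least squares fit; returns [intercept, b_1, ..., b_n]."""
    X = [[1.0] + list(r) for r in rows]  # prepend a column of ones
    p = len(X[0])
    # Augmented normal equations: (X^T X) beta = X^T y
    A = [
        [sum(x[i] * x[j] for x in X) for j in range(p)]
        + [sum(x[i] * t for x, t in zip(X, targets))]
        for i in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("singular system (collinear predictors)")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][p] / A[i][i] for i in range(p)]


# Synthetic data from y = 1 + 2*x1 + 3*x2 (illustrative only)
rows = [(1.0, 2.0), (2.0, 1.0), (3.0, 4.0), (4.0, 3.0), (5.0, 5.0)]
targets = [9.0, 8.0, 19.0, 18.0, 26.0]
coefficients = fit_mlr(rows, targets)
print([round(c, 6) for c in coefficients])
```

The recovered coefficients should be close to 1, 2 and 3, up to floating-point rounding.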
For the predicted C (Fig. 8), the $R^2$ values of the models suggested by Shen et al. (2010), Parikh et al. (2007) and Nhuchhen (2016) are 0.2513, 0.4474 and 0.8327, respectively, while the $R^2$ value of the predictions of the MLR, … 10). In all the compared cases, the proposed models have higher coefficients of correlation than the existing models. Hence, they can give reasonable predictions of C, H, and O. To further showcase the performance of the proposed models, an error analysis was conducted, as presented in the next subsection.

Error analysis
The performance of the proposed models and the models obtained from the literature was evaluated using three forms of estimation error, namely the mean absolute error (MAE) (Eq. (26)), average absolute error (AAE) (Eq. (27)), and average biased error (ABE) (Eq. (28)), as presented in Table 5.
In Eqs. (26) to (28), $E$ and $P$ represent the experimentally measured and predicted elemental compositions of the solid fuels, while $n$ is the number of sample data points used for the model developments. The AAE estimates the degree of closeness between the predicted and measured elemental compositions, while the ABE computes the degree of bias of the models' errors. The smaller the absolute value of the ABE is, the smaller the bias of the correlation, while a positive ABE value indicates that the average of the predicted values of the elemental composition is greater than the experimentally measured one. Similarly, the MAE gives the amount of error in the same unit as the physical quantity. In addition, $R^2$ indicates the goodness of fit of the proposed correlations (Shen et al. 2010; Nhuchhen 2016). The models having the smallest MAE and high $R^2$ values were selected as the best models for predicting the elemental compositions of solid fuels. The error analysis performed in this study is presented in Table 5. For the MAE, the ANFIS model has the smallest value for carbon, hydrogen, and oxygen, while the Shen et al. (2010) model has the highest. In the case of the AAE, the ANFIS model also has the smallest values for all the predicted elemental compositions, which implies that the ANFIS predictions are closest to the experimental values, while the Shen et al. (2010) model has the highest AAE for both C and O and the Nhuchhen (2016) model has the highest AAE for H. The ABE values for both the ANFIS and MLR models in the case of C are positive, while those of the remaining models are negative, indicating that the ANFIS and MLR predictions are slightly overestimated, although their ABE values are very close to zero.
On the other hand, the ABE values estimated for H using the ANFIS, MLR, and Nhuchhen (2016) models are positive while those of the others are negative; hence the ANFIS, MLR and Nhuchhen (2016) models slightly overestimate the value of H, but the Nhuchhen (2016) model deviates more from the experimental values. In addition, the ABE for O shows that four of the models overestimate the oxygen content of the fuels, though the deviation of the MLR model from the experimental values is very small, while two of the models slightly underestimate the oxygen content. It can be inferred from the error analysis that the ANFIS models have the smallest errors, followed by the ANN and then the MLR models. Similarly, the Shen et al. (2010) model has the highest error values, followed by the Parikh et al. (2007) model and then the Nhuchhen (2016) model. Hence, the performance of the proposed models is promising.
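The error indices discussed above can be computed as in the following sketch. The definitions are assumptions based on common usage: MAE as the mean of |P - E| in the units of the quantity, AAE and ABE as percentage relative errors (with ABE keeping the sign of P - E so that a positive value means overestimation), and R^2 as one minus the ratio of the residual to the total sum of squares. The sample values are hypothetical.

```python
def mae(measured, predicted):
    """Mean absolute error, in the same unit as the measured quantity."""
    return sum(abs(p - e) for e, p in zip(measured, predicted)) / len(measured)


def aae(measured, predicted):
    """Average absolute error, as a percentage of the measured values."""
    n = len(measured)
    return 100.0 * sum(abs((p - e) / e) for e, p in zip(measured, predicted)) / n


def abe(measured, predicted):
    """Average biased error (%); positive means overestimation on average."""
    n = len(measured)
    return 100.0 * sum((p - e) / e for e, p in zip(measured, predicted)) / n


def r_squared(measured, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_e = sum(measured) / len(measured)
    ss_res = sum((e - p) ** 2 for e, p in zip(measured, predicted))
    ss_tot = sum((e - mean_e) ** 2 for e in measured)
    return 1.0 - ss_res / ss_tot


# Hypothetical carbon contents (wt%): measured vs. predicted
measured = [50.0, 60.0, 80.0]
predicted = [55.0, 57.0, 80.0]
for name, fn in [("MAE", mae), ("AAE", aae), ("ABE", abe), ("R2", r_squared)]:
    print(name, round(fn(measured, predicted), 4))
```

Because AAE takes absolute values while ABE does not, a model can have a near-zero ABE and still a large AAE when over- and under-predictions cancel.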

Sensitivity analysis
Sensitivity analysis is a technique used to evaluate the input parameters that most affect the output parameters. Jong and Lee (2004) reported that the cosine amplitude approach can be employed to evaluate sensitivity analysis. The cosine amplitude method is as illustrated in Eq. (29).
In Eq. (29), $I_i$ and $O_{tj}$ represent the input and output parameters, and $n$ is the number of sample data points.
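A minimal sketch of the cosine amplitude calculation is given below. It assumes the usual form of the method, in which the strength of relation between an input series and an output series is the sum of their products divided by the square root of the product of their sums of squares; the function name and the data are illustrative only.

```python
import math


def cosine_amplitude(input_series, output_series):
    """Strength of relation between one input and one output over n samples."""
    numerator = sum(i * o for i, o in zip(input_series, output_series))
    denominator = math.sqrt(
        sum(i * i for i in input_series) * sum(o * o for o in output_series)
    )
    return numerator / denominator


# Hypothetical data: fixed carbon (input) and carbon content (output), wt%
fc = [40.0, 45.0, 55.0, 60.0]
c = [58.0, 62.0, 70.0, 75.0]
print(round(cosine_amplitude(fc, c), 4))
```

For non-negative data the value lies between 0 and 1, with values closer to 1 indicating a stronger relation between the input and the output.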
The cosine amplitude method (CAM) has also been used by various researchers in the field of geotechnical engineering. For instance, Monjezi et al. (2012) studied the influence of various parameters on the uniaxial compressive strength of rock. Momeni et al. (2014) determined the parameters influencing the bearing capacity of stockpile using the cosine amplitude approach. However, none of the existing studies on predicting the elemental composition of solid fuels has conducted a sensitivity analysis using the CAM. Hence, in this study, a sensitivity analysis was conducted using the outputs from the three methods, and the order of importance of the independent variables is presented in Fig. 11. The results in Fig. 11, based on the cosine amplitude approach in Eq. (29), show the strength of the relation between the input and output parameters for the three models developed. For C (Fig. 11a), the FC content has the highest influence, as expected, followed by VM; the A and M contents have a similar impact on C. On the other hand, VM has the highest influence on both H and O (Figs. 11b and c), followed by the M, FC and then A contents, for the three models proposed in this study.

Conclusions
It is difficult to evaluate the elemental composition of coal and biomass through laboratory tests, as the tests are time-consuming and costly. As a result, the elemental composition of coal and biomass sometimes cannot be obtained by laboratory tests, especially for small to medium-size engineering projects. In the first part of the study, in order to overcome this limitation, prediction models were developed using proximate and elemental analysis data obtained from laboratory tests, with ANFIS, ANN and MLR as the prediction tools. The prediction models developed have strong prediction capacities and can be used to estimate the elemental composition of coal and biomass.
In the second part of the study, the developed prediction models were compared with some existing empirical equations for estimating the elemental composition of similar materials. For all the parameters of elemental composition investigated (i.e. carbon, hydrogen, and oxygen), the proposed models have higher coefficients of correlation than the existing models. In addition, error analyses were performed to further compare the performance of the proposed models with that of the existing models in the literature, using MAE, AAE, and ABE as prediction error indicators. For the three elements of coal and biomass investigated, the models developed in this study have the smallest values of MAE, AAE, and ABE, except for carbon and hydrogen, where the ABE estimates of the proposed and existing models overlap. This shows that using the prediction models established in this study leads to smaller errors compared to the existing models.
While comparing the ANFIS, ANN and MLR models proposed in this study, it is obvious that the ANFIS models have higher predictive capacities than the ANN models and, lastly, are followed by the MLR models. This may be mainly due to non-linearity between independent and dependent variables. However, the performances of the proposed models are satisfactory and they can be used for practical purposes.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.