Abstract
Palm oil mill effluent (POME) contributes to 23.7% of the methane emissions in Malaysia. Development of a methane emission prediction tool by using machine learning (ML) enables the estimated volume of methane released to be determined. In this study, Gaussian Process Regression (GPR) along with its respective kernels was explored for the development of the prediction tool. Synthetic minority oversampling technique (SMOTE) was also implemented to study the effect of the training sample size used on the model validation. The GPR model was trained using synthetic data created from SMOTE, while the measure data from the plant was used to test the reliability of the trained model. The application of SMOTE was capable in producing high model validation performance (R2 = 0.98, RMSE = 0.133, MSE = 0.018 and MAE = 0.08) using the common squared exponential kernel GPR model. However, the Matern 5/2 and rational quadratic kernel GPR model had the best model validation performance (R2 = 0.98, RMSE = 0.131, MSE = 0.017 and MAE = 0.083). In terms of model testing performance, rational quadratic kernel had the best performance with R2 = 0.99, RMSE = 0.061, MSE = 0.0037 and MAE = 0.044. The results of this study indicate the prediction tool developed using SMOTE-based rational quadratic kernel GPR model can predict methane emissions with high accuracy. The methane emissions prediction tool developed is an alternative cost friendly and reliable option to existing methods.
Similar content being viewed by others
Data availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- POME:
-
Palm oil mill effluent
- ML:
-
Machine learning
- MCS:
-
Methane capture system
- GRP:
-
Gaussian Process Regression
- SMOTE:
-
Synthetic minority oversampling technique
References
Abdurahman NH, Rosli YM and Azhari NH (2013) ‘The performance evaluation of anaerobic methods for palm oil mill effluent (POME) treatment: a review’. Int Perspect Water Qual Manag Pollut Control. https://doi.org/10.5772/54331 [Preprint]
Agerton M, Gilbert B (2021) The key to reducing methane emissions? Actual measurement, Forbes. Available at: https://www.forbes.com/sites/thebakersinstitute/2021/08/16/the-key-to-reducing-methane-emissions-actual-measurement/?sh=3aa8b7923b88. Accessed 18 April 2022
Akbaş H, Bilgen B, Turhan AM (2015) An integrated prediction and optimization model of biogas production system at a wastewater treatment facility. Bioresource Technology 196:566–576. https://doi.org/10.1016/J.BIORTECH.2015.08.017
Akhbari A, Kutty PK, Chuen OC, Ibrahim S (2020a) A study of palm oil mill processing and environmental assessment of palm oil mill effluent treatment. Environ Eng Res 25(2):212–221. https://doi.org/10.4491/EER.2018.452
Akhbari A, Kutty PK, Chuen OC and Ibrahim S (2020b) ‘A study of palm oil mill processing and environmental assessment of palm oil treatment. 25(1):1–17
Ashfaque JM, Iqbal A (2019) Introduction to support vector machines and kernel methods, publication at https://www.researchgate.net/publication/332370436
Baquerizo-Crespo RJ, Astals S, Pérez-Ones O, Pereda-Reyes I (2021) Mathematical modeling challenges associated with waste anaerobic biodegradability. In: Maddela NR, Chakraborty S, Garcia Cruzatty L (eds) Advances in the domain of environmental biotechnology. Springer Nature Singapore Pte Ltd., Singapore, pp 357–392. https://doi.org/10.1007/978-981-15-8999-7_14
Carotenuto C, Guarino G, Morrone B, Minale M (2016) Temperature and pH effect on methane production from buffalo manure anaerobic digestion. Int J Heat Technol 34(Special Issue 2):S425–S429. https://doi.org/10.18280/IJHT.34SP0233.
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/JAIR.953
Chen Z, Wang B (2018) How priors of initial hyperparameters affect Gaussian process regression models. Neurocomputing 275:1702–1710. https://doi.org/10.1016/j.neucom.2017.10.028
Chin MJ, Poh PE, Tey BT, Chan ES, Chin KL (2013) Biogas from palm oil mill effluent (POME): opportunities and challenges from Malaysia’s perspective. Renew Sustain Energy Rev 26:717–726. https://doi.org/10.1016/J.RSER.2013.06.008
Douzas G, Bacao F, Last F (2018) Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf Sci 465:1–20. https://doi.org/10.1016/J.INS.2018.06.056
De Clercq D, Jalota D, Shang R, Ni K, Zhang Z, Khan A, Wen Z, Caicedo L, Yuan K (2019) ‘Machine learning powered software for accurate prediction of biogas production: A case study on industrial-scale Chinese production data. J Clean Prod 218:390–399. https://doi.org/10.1016/J.JCLEPRO.2019.01.031
De Clercq D, Wen Z, Fei F, Caicedo L, Yuan K, Shang R (2020) Interpretable machine learning for predicting biomethane production in industrial-scale anaerobic co-digestion. Sci Total Environ 712:134574. https://doi.org/10.1016/J.SCITOTENV.2019.134574
Glen S (2015) Residual plot: definition and examples. Available at: https://www.statisticshowto.com/residual-plot/. Accessed 24 June 2022
Harsono SS, Grundmann P, Soebronto S (2014) Anaerobic treatment of palm oil mill effluents: potential contribution to net energy yield and reduction of greenhouse gas emissions from biodiesel production. J Clean Prod 64:619–627. https://doi.org/10.1016/j.jclepro.2013.07.056
Hazirah Che Ithnin N, Hashim H (2019) Predictive modelling for biogas generation from palm oil mill effluent (POME). Chemical Engineering Transactions 72. https://doi.org/10.3303/CET1972053
IEA (2022) Global Methane Pledge – Policies - IEA. Available at: https://www.iea.org/policies/14258-global-methane-pledge?jurisdiction=International&msclkid=1767b550be1511ec89d00cf6813f1e8e (Accessed: 17 April 2022)
Jeong K, Abbas A, Shin J, Son M, Kim YM, Cho KH (2021) Prediction of biogas production in anaerobic co-digestion of organic wastes using deep learning models. Water Res 205:17697. https://doi.org/10.1016/j.watres.2021.117697
Jiang J, He S, Kang X, Sun Y, Yuan Z, Xing T, Guo Y, Li L (2020) Effect of organic loading rate and temperature on the anaerobic digestion of municipal solid waste: process performance and energy recovery. Front Energy Res 8:89. https://doi.org/10.3389/FENRG.2020.00089/BIBTEX
Kamyab H, Chelliapan S, Din MFM, Rezania S, Khademi T, Kumar A (2018) Palm oil mill effluent as an environmental pollutant. Palm Oil 13:13–28. https://doi.org/10.5772/intechopen.75811
Larsen BS (2022) Synthetic minority over-sampling technique (SMOTE)—file exchange—MATLAB Central
Liu Y, Huang D, Liu B, Feng Q, Cai B (2021) Adaptive ranking based ensemble learning of Gaussian process regression models for quality-related variable prediction in process industries. Appl Soft Comput 101:107060. https://doi.org/10.1016/j.asoc.2020.107060
Loh SK, Nasrin, Azri M, Adela M, Muzzammil, Jay D, Eleanor D (2017) ‘Biogas capture—a means of reducing greenhouse gas emissions from palm oil mill effluent. Oil Palm Bullet 75:27–36
Mahmoodi-Eshkaftaki M, Ebrahimi R (2021) Integrated deep learning neural network and desirability analysis in biogas plants: A powerful tool to optimize biogas purification. Energy 231:121073. https://doi.org/10.1016/J.ENERGY.2021.121073
Maldonado S, López J, Vairetti C (2019) An alternative SMOTE oversampling strategy for high-dimensional datasets. Appl Soft Comput 76:380–389. https://doi.org/10.1016/J.ASOC.2018.12.024
McCarthy N (2020) Which countries produce the most palm oil? [Infographic]. Available at: https://www.forbes.com/sites/niallmccarthy/2020/10/02/which-countries-produce-the-most-palm-oil-infographic/?sh=16a5bed71e42 (Accessed: 13 April 2022)
Ministry of Environment and Water (2020) Malaysia third biennial update report to the UNFCCC. Ministry of Environment and Water 1–285. Available at: https://unfccc.int/sites/default/files/resource/MALAYSIA_BUR3-UNFCCC_Submission.pdf. Accessed 17 Apr 2022
Mohd MI (2015) Renewable energy policy status and challenges of POME-biogas industry in Malaysia. Pertanika Journal of Scholarly Research Reviews 1:33-39. Available at: https://www.researchgate.net/publication/287817723. Accessed 17 Apr 2022
Murphy DJ, Goggin K, Paterson RRM (2021) Oil palm in the 2020s and beyond: challenges and solutions. CABI Agric Biosci 2(1):1–22. https://doi.org/10.1186/s43170-021-00058-3
Nugaliyadde A, Wong KW, Sohel F and Xie H (2017) ‘Reinforced memory network for question answering’, In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 10635 LNCS, pp. 482–490. https://doi.org/10.1007/978-3-319-70096-0_50/COVER/
Pan Y, Zeng X, Xu H, Sun Y, Wang D, Wu J (2021) ‘Evaluation of Gaussian process regression kernel functions for improving groundwater prediction. J Hydrol 603(PB):126960. https://doi.org/10.1016/j.jhydrol.2021.126960
Putro LHS, Budianta D, Rohendi D, Rejo A (2019) Biomethane emissions: measurement in wastewater pond at palm oil mill by using TGS2611 methane gas sensor. Journal of Ecological Engineering 20(6):25–35. https://doi.org/10.12911/22998993/108696
Putro, HSL, Budianta D, Rohendi D and Rejo A (2020) ‘Modeling methane emission of wastewater anaerobic pond at palm oil mill using radial basis function neural network green house gas mitigation for street lighting sector in Palembang city view project modeling methane emission of wastewater anaerobic pond ’, Int J Inform Technol. https://doi.org/10.18517/ijaseit.10.1.9577 [Preprint]
Rasmussen CE, Williams CKI (2006) Introduction. Gaussian processes for machine learning. The MIT Press, Cambridge, pp 1–6
Rasmussen CE (2004) ‘Gaussian Processes in Machine Learning’, In Bousquet O, von Luxburg U and Rätsch G (eds.) Advanced lectures on machine learning: ML Summer Schools 2003, Canberra, Australia, February 2 - 14, 2003, Tübingen, Germany, August 4 - 16, 2003, Revised Lectures. Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 63–71. https://doi.org/10.1007/978-3-540-28650-9_4
Refaeilzadeh P, Tang L and Liu H (2016) ‘Cross-validation’, Encyclopedia of database systems, pp. 1–7. https://doi.org/10.1007/978-1-4899-7993-3_565-2
Sarwani MK, Fawzi M, Osman SA, Nasrin AB (2019) Bio-methane from palm oil mill effluent (POME): transportation fuel potential in Malaysia, J Adv Res Fluid Mech Thermal Sci 63(1):. Available at: https://www.akademiabaru.com/submit/index.php/arfmts/article/view/2713/1775 (Accessed: 10 June 2022)
Shahidul MI, Malcolm ML, Eugene JJ (2018) Methane production potential of pome: a review on waste-to-energy [WTE] MODEL. Sci Int (Lahore) 30(5):717–728
Shairah N, Shahrifun A, Nazilah N, Aris A, Omar Q, Ahmad N (2015) Characterization of palm oil mill secondary effluent (Pomse). Malays J Civ Eng 27(1):144–151. https://doi.org/10.11113/mjce.v27n1.364
Singh G, Kumar Panda R, Professor A, Panda RK (2011) Daily sediment yield modeling with artificial neural network using 10-fold cross validation method: a small agricultural watershed, Kapgari, India. Int J Earth Sci Eng 4(January):20410247 Available at: https://www.researchgate.net/publication/265988179. Accessed 28 Apr 2022
Tan YD, Lim JS (2019) Feasibility of palm oil mill effluent elimination towards sustainable Malaysian palm oil industry. Renew Sustain Energy Rev 111(January):507–522. https://doi.org/10.1016/j.rser.2019.05.043
Tyagi VK, Lo SL (2011) Application of physico-chemical pretreatment methods to enhance the sludge disintegration and subsequent anaerobic digestion: an up to date review. Rev Environ Sci Biotechnol 10(3):215–242. https://doi.org/10.1007/S11157-011-9244-9
United States Environmental Protection Agency (2021) Importance of methane, global methane initiative. https://www.epa.gov/gmi; https://www.epa.gov/gmi/importance-methane
Wan X, Li X, Wang X, Yi X, Zhao Y, He X, Wu R, Huang M (2022) ‘Water quality prediction model using Gaussian process regression based on deep learning for carbon neutrality in papermaking wastewater treatment system. Environ Res 211(January):112942. https://doi.org/10.1016/j.envres.2022.112942
Wei S, Chen Z, Arumugasamy SK and Chew IML (2022) Data augmentation and machine learning techniques for control strategy development in bio-polymerization process. Environ Sci Ecotechnol 100172. https://doi.org/10.1016/j.ese.2022.100172
Wilson A, Adams R (2013, May). Gaussian process kernels for pattern discovery and extrapolation. In: International conference on machine learning. PMLR, pp 1067–1075. http://proceedings.mlr.press/v28/wilson13.html
Wu TY, Mohammad AW, Jahim JM, Anuar N (2009) A holistic approach to managing palm oil mill effluent (POME): biotechnological advances in the sustainable reuse of POME. Biotechnol Adv 27(1):40–52. https://doi.org/10.1016/j.biotechadv.2008.08.005
Yang Y, Zheng S, Ai Z, Jafari MMM (2021) On the prediction of biogas production from vegetables, fruits, and food wastes by ANFIS- and LSSVM-based models. Biomed Res Int 2021. https://doi.org/10.1155/2021/9202127
Yazdi SK & Scholz M (2010) Assessing storm water detention systems treating road runoff with an artificial neural network predicting fecal indicator organisms. Water Air Soil Pollut 206:. https://doi.org/10.1007/s11270-009-0084-y
Zaied BK, Rashid M, Nasrullah M, Bari BS, Zularisam AW, Singh L, Kumar D, Krishnan S (2023) Prediction and optimization of biogas production from POME co-digestion in solar bioreactor using artificial neural network coupled with particle swarm optimization (ANN-PSO). Biomass Conversion and Biorefinery 13(1):73–88. https://doi.org/10.1007/s13399-020-01057-6
Zhang Y, Yan L, Chi L, Long X, Mei Z, Zhang Z (2008) Startup and operation of anaerobic EGSB reactor treating palm oil mill effluent. J Environ Sci 20(6):658–663. https://doi.org/10.1016/S1001-0742(08)62109-9
Zhu X, Vondrick C, Fowlkes CC, Ramanan D (2016) Do we need more training data? Int J Comput Vision 119(1):76–92. https://doi.org/10.1007/S11263-015-0812-2/FIGURES/17
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hia, H.Y., Selvanathan, K., Ragu, K. et al. Development of a Methane Emission Prediction Tool (POMEP178) for Palm Oil Mill Effluent Using Gaussian Process Regression. Process Integr Optim Sustain 7, 921–930 (2023). https://doi.org/10.1007/s41660-023-00331-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41660-023-00331-0