Abstract
Due to climate change, forest regions in California are increasingly experiencing severe wildfires, with other issues affecting the rest of the world. Machine learning (ML) and artificial intelligence (AI) models have emerged to predict wildfire hazards and aid mitigation efforts. However, the wildfire prediction modelling domain faces inconsistencies due to database manipulations for multi-class classification. To help to address this issue, our paper focuses on creating wildfire prediction models through One-class classification algorithms: Support Vector Machine, Isolation Forest, AutoEncoder, Variational AutoEncoder, Deep Support Vector Data Description, and Adversarially Learned Anomaly Detection. To minimise bias in the selection of the training and testing data, Five-Fold Cross-Validation was used to validate all One-class ML models. These One-class ML models outperformed Two-class ML models using the same ground truth data, with mean accuracy levels between 90 and 99 percent. Shapley values were used to derive the most important features affecting the wildfire prediction model, which is a novel contribution to the field of wildfire prediction. Among the most important factors were the seasonal maximum and mean dew point temperatures. In providing access to our algorithms, using Python Flask and a web-based tool, the top-performing models were operationalized for deployment as a REST API, with the potential to strengthen wildfires mitigation strategies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
https://www.fire.ca.gov/our-impact/statisticsStatistics on CA wildfires and CAL FIRE activity.
- 2.
A different case study with 2.2 million acres burned in Western Australia was conducted as the second case study. However, due to page limitations, we are unable to discuss this data set and its associated results in this paper.
- 3.
This thesis provides more detail on the steps involved in data pre-processing [10].
- 4.
- 5.
More information on the cost calculation can be found on [10, pp. 167–168].
References
Abdollahi, A., Pradhan, B.: Explainable artificial intelligence (XAI) for interpreting the contributing factors feed into the wildfire susceptibility prediction model. Sci. Total Environ. 879, 163004 (2023). https://doi.org/10.1016/j.scitotenv.2023.163004
Alkhatib, R., Sahwan, W., Alkhatieb, A., Schütt, B.: A brief review of machine learning algorithms in forest fires science. Appl. Sci. 13(14) (2023). https://doi.org/10.3390/app13148275, https://www.mdpi.com/2076-3417/13/14/8275
de Bem, P., de Carvalho Júnior, O., Matricardi, E., Guimarães, R., Gomes, R.: Predicting wildfire vulnerability using logistic regression and artificial neural networks: a case study in Brazil. Int. J. Wildland Fire 28(1), 35–45 (2018). https://doi.org/10.1071/WF18018
Bergstra, J., Yamins, D., Cox, D.D.: Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures. In: Proceedings of the 30th International Conference on International Conference on Machine Learning, vol. 28, pp. I-115–I-123. ICML 2013, JMLR.org (2013)
Center, N.I.F.: National Wildfire Coordinating Group (NWCG). Interagency Standards for Fire and Fire Aviation Operations. Createspace Independent Publishing Platform, Great Basin Cache Supply Office: Boise, ID, USA (2019)
Cortes, C., Vapnik, V.: Support vector machine. Mach. learn. 20(3), 273–297 (1995)
Donovan, G.H., Prestemon, J.P., Gebert, K.: The effect of newspaper coverage and political pressure on wildfire suppression costs. Soc. Nat. Resour. 24(8), 785–798 (2011)
Ghorbanzadeh, O., et al.: Spatial prediction of wildfire susceptibility using field survey GPS data and machine learning approaches. Fire 2(3), 43 (2019)
Goldarag, Y., Mohammadzadeh, A., Ardakani, A.: Fire risk assessment using neural network and logistic regression. J. Indian Soc. Remote Sens. 44, 1–10 (2016). https://doi.org/10.1007/s12524-016-0557-6
Ismail, F.N.: Novel machine learning approaches for wildfire prediction to overcome the drawbacks of equation-based forecasting, Ph. D. dissertation, University of Otago (2022)
Jaafari, A., Pourghasemi, H.R.: 28 - Factors influencing regional-scale wildfire probability in Iran: an application of random forest and support vector machine. In: Pourghasemi, H.R., Gokceoglu, C. (eds.) Spatial Modeling in GIS and R for Earth and Environmental Sciences, pp. 607–619. Elsevier (2019)
Jain, P., Coogan, S.C., Subramanian, S.G., Crowley, M., Taylor, S., Flannigan, M.D.: A review of machine learning applications in wildfire science and management. Environ. Rev. 28(4), 478–505 (2020)
Jiménez-Ruano, A., Mimbrero, M.R., de la Riva Fernández, J.: Understanding wildfires in mainland Spain. a comprehensive analysis of fire regime features in a climate-human context. Appl. Geogr. 89, 100–111 (2017)
Jolly, W.M., Freeborn, P.H., Page, W.G., Butler, B.W.: Severe fire danger index: a forecastable metric to inform firefighter and community wildfire risk management. Fire 2(3), 47 (2019). https://doi.org/10.3390/fire2030047
Khan, S.S., Madden, M.G.: One-class classification: taxonomy of study and review of techniques. Knowl. Eng. Rev. 29(3), 345–374 (2014)
Kim, S., Choi, Y., Lee, M.: Deep learning with support vector data description. Neurocomputing 165, 111–117 (2015)
Liu, F.T., Ting, K.M., Zhou, Z.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 413–422. IEEE (2008)
Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 4768–4777. Curran Associates Inc. (2017)
Ma, J., Cheng, J., Jiang, F., Gan, V., Wang, M., Zhai, C.: Real-time detection of wildfire risk caused by powerline vegetation faults using advanced machine learning techniques. Adv. Eng. Inform. 44, 101070 (2020). https://doi.org/10.1016/j.aei.2020.101070
Michael, Y., Helman, D., Glickman, O., Gabay, D., Brenner, S., Lensky, I.M.: Forecasting fire risk with machine learning and dynamic information derived from satellite vegetation index time-series. Sci. Total Environ. 764, 142844 (2021). https://doi.org/10.1016/j.scitotenv.2020.142844
Miller, C., Hilton, J., Sullivan, A., Prakash, M.: SPARK – a bushfire spread prediction tool. In: ISESS 2015. IAICT, vol. 448, pp. 262–271. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-15994-2_26
Nhu, V.H., et al.: Shallow landslide susceptibility mapping: a comparison between logistic model tree, logistic regression, naïve bayes tree, artificial neural network, and support vector machine algorithms. Int. J. Environ. Res. Public Health 17(8), 2749 (2020)
Ntinopoulos, N., Sakellariou, S., Christopoulou, O., Sfougaris, A.: Fusion of remotely-sensed fire-related indices for wildfire prediction through the contribution of artificial intelligence. Sustainability 15(15), 1–24 (2023). https://doi.org/10.3390/su151511527
Nunes, A., Lourenço, L., Meira Castro, A.C.: Exploring spatial patterns and drivers of forest fires in Portugal (1980–2014). Sci. Total Environ. 573, 1190–1202 (2016). https://doi.org/10.1016/j.scitotenv.2016.03.121
Papadopoulos, A., Paschalidou, A., Kassomenos, P., McGregor, G.: On the association between synoptic circulation and wildfires in the Eastern Mediterranean. Theoret. Appl. Climatol. 115(3), 483–501 (2014)
Patterson, J., Gibson, A.: Deep Learning: A Practitioner’s Approach. O’Reilly Media, Inc. (2017)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Reisen, F., Duran, S.M., Flannigan, M., Elliott, C., Rideout, K.: Wildfire smoke and public health risk. Int. J. Wildland Fire 24(8), 1029–1044 (2015)
Ruff, L., et al.: Deep one-class classification. In: Dy, J., Krause, A. (eds.) Proceedings of the 35th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 80, pp. 4393–4402. PMLR (2018)
Sayad, Y.O., Mousannif, H., Al Moatassime, H.: Predictive modeling of wildfires: a new dataset and machine learning approach. Fire Saf. J. 104, 130–146 (2019). https://doi.org/10.1016/j.firesaf.2019.01.006
Tax, D.M., Duin, R.P.: Support vector domain description. Pattern Recogn. Lett. 20(11–13), 1191–1199 (1999)
Tien Bui, D., Bui, Q.T., Nguyen, Q.P., Pradhan, B., Nampak, H., Trinh, P.T.: A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agric. For. Meteorol. 233, 32–44 (2017)
Tonini, M., D’Andrea, M., Biondi, G., Degli Esposti, S., Trucchia, A., Fiorucci, P.: A machine learning-based approach for wildfire susceptibility mapping. the case study of the liguria region in Italy. Geosciences 10(3), 105 (2020)
Zenati, H., Romain, M., Foo, C.S., Lecouat, B., Chandrasekhar, V.: Adversarially learned anomaly detection. In: 2018 IEEE International Conference on Data Mining (ICDM), pp. 727–736. IEEE (2018)
Zhao, Y., Nasrullah, Z., Li, Z.: PyOD: a python toolbox for scalable outlier detection. J. Mach. Learn. Res. 20(96), 1–7 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ismail, F.N., Sengupta, A., Woodford, B.J., Licorish, S.A. (2024). A Comparison of One-Class Versus Two-Class Machine Learning Models for Wildfire Prediction in California. In: Benavides-Prado, D., Erfani, S., Fournier-Viger, P., Boo, Y.L., Koh, Y.S. (eds) Data Science and Machine Learning. AusDM 2023. Communications in Computer and Information Science, vol 1943. Springer, Singapore. https://doi.org/10.1007/978-981-99-8696-5_17
Download citation
DOI: https://doi.org/10.1007/978-981-99-8696-5_17
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8695-8
Online ISBN: 978-981-99-8696-5
eBook Packages: Computer ScienceComputer Science (R0)