Abstract
Reference crop evapotranspiration (ETO) is a basic component of the hydrological cycle and its estimation is critical for agricultural water resource management and scheduling. In this study, three tree-based machine learning algorithms (random forest [RF], gradient boosting decision tree [GBDT], and extreme gradient boosting [XGBoost]) were adopted to determine the essential factors for ETO prediction. The tree-based models were optimized using the Bayesian optimization (BO) algorithm and were compared with three standalone models in terms of daily ETO and monthly mean ETO estimations in North China, with different input combinations of essential variables. The results indicated that solar radiation (Rs) and air temperature (Ts), including the maximum, minimum, and average temperatures, in daily ETO were the key variables affecting model prediction accuracy. Rs was the most influential factor in the monthly average ETO model followed by Ts. Both relative humidity (RH) and wind speed at 2 m (U2) had little impact on ETO prediction at different scales, although their importance differed. Compared with the GBDT and RF models, the XGBoost model exhibited the best performance for daily ETO and monthly mean ETO estimations. The hybrid tree-based models with the BO algorithm outperformed standalone tree-based models. Overall, compared with the other inputs, the model with three inputs (Rs, Ts, and RH/U2) had the highest accuracy. The BO-XGBoost model exhibited superior performance in terms of the global performance index (GPI) for daily ETO and monthly mean ETO predictions and is recommended as a more accurate model for predicting daily ETO and monthly mean ETO in North China or areas with a similar climate.
Similar content being viewed by others
Data availability
The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request.
Code availability
Not applicable.
References
Abdullah SS, Malek MA, Abdullah NS et al (2015) Extreme learning machines: a new approach for prediction of reference evapotranspiration. J Hydrol 527:184–195. https://doi.org/10.1016/j.jhydrol.2015.04.073
Ahmad MJ, Choi KS (2018) Influence of climate variables on FAO Penman-Monteith reference evapotranspiration in the Upper Chenab Canal command area of Pakistan. Paddy Water Environ, 16(3):425–438. https://doi.org/10.1007/s10333-018-0636-0
Alibabaei K, Pedro DG, Lima TM (2021) Modeling soil water content and reference evapotranspiration from climate data using deep learning method. Appl Sci 11(11):5029. https://doi.org/10.3390/app11115029
Alipour A, Yarahmadi J, Mahdavi M (2014) Comparative study of M5 model tree and artificial neural network in estimating reference evapotranspiration using MODIS products. J Climatol 2014:1–11. https://doi.org/10.1155/2014/839205
Alizamir M, Kisi O, Adnan RM et al (2020) Modelling reference evapotranspiration by combining neuro-fuzzy and evolutionary strategies. Acta Geophys 68(3):14. https://doi.org/10.1007/s11600-020-00446-9
Allen RG, Pereira LS, Raes D et al (1998) Crop evapotranspiration-guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56 Fao. Rome 300:D05109
Babak M, Saeid M (2020) Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric Water Manag 237:106145. https://doi.org/10.1016/j.agwat.2020.106145
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Chen X, Cui N, Li J (2019) Simulation of reference crop evapotranspiration in hilly area of central Sichuan based on MARS. Trans Chin Soc Agric Eng 35(16):152–160
Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining 785–794. https://doi.org/10.1145/2939672.2939785
Chia MY, Feng HY, Koo CH (2021) Swarm-based optimization as stochastic training strategy for estimation of reference evapotranspiration using extreme learning machine. Agric Water Manag 243:106447. https://doi.org/10.1016/j.agwat.2020.106447
Fan J, Yue W, Wu L et al (2018) Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric for Meteorol 263:225–241. https://doi.org/10.1016/j.agrformet.2018.08.019
Feng Y, Cui N, Zhao L et al (2016) Comparison of ELM, GANN, WNN and empirical models for estimating reference evapotranspiration in humid region of Southwest China. J Hydrol 536:376–383. https://doi.org/10.1016/j.jhydrol.2016.02.053
Feng Y, Cui N, Zhang Q et al (2017a) Comparison of artificial intelligence and empirical models for estimation of daily diffuse solar radiation in North China Plain. Int J Hydrogen Energy 42(21):14418–14428. https://doi.org/10.1016/j.ijhydene.2017.04.084
Feng Y, Peng Y, Cui N et al (2017b) Modeling reference evapotranspiration using extreme learning machine and generalized regression neural network only with temperature data. Comput Electron Agric 136:71–78. https://doi.org/10.1016/j.compag.2017.01.027
Hai T, Diop L, Bodian A et al (2018) Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso. Agric Water Manag 208:140–151. https://doi.org/10.1016/j.agwat.2018.06.018
Hassan MA, Khalil A, Kaseb S et al (2017) Exploring the potential of tree-based ensemble methods in solar radiation modeling. Appl Energy 203:897–916. https://doi.org/10.1016/j.apenergy.2017.06.104
Liu Y, Yu M, Ma X et al (2017) Estimating models for reference evapotranspiration with core meteorological parameters via path analysis. Hydrol Res 48:340–354. https://doi.org/10.2166/nh.2016.240
Mehdizadeh S, Saadatnejadgharahassanlo H, Behmanesh J (2017) Calibration of Hargreaves-Samani and Priestley-Taylor equations in estimating reference evapotranspiration in the Northwest of Iran. Arch Agron Soil Sci 63(7):942–955. https://doi.org/10.1080/03650340.2016.1249474
Mohammadi B, Mehdizadeh S (2020) Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric Water Manag 237:106145. https://doi.org/10.1016/j.agwat.2020.106145
Pereira LS, Allen RG, Smith M et al (2015) Crop evapotranspiration estimation with FAO56: Past and future. Agric Water Manag 147:4–20. https://doi.org/10.1016/j.agwat.2014.07.031
Shiri J (2018) Improving the performance of the mass transfer-based reference evapotranspiration estimation approaches through a coupled wavelet-random forest methodology. J Hydrol 561:737–750. https://doi.org/10.1016/j.jhydrol.2018.04.042
Wang L, He X, Jakob FS et al (2020) Models and measurements of seven years of evapotranspiration on a high elevation site on the Central Tibetan Plateau. J Mt Sci 17(12):3039–3053. https://doi.org/10.1007/s11629-020-6051-1
Wu L, Fan J (2019) Comparison of neuron-based, kernel-based, tree-based and curve-based machine learning models for predicting daily reference evapotranspiration. Plos One 14(5):0217520. https://doi.org/10.1371/journal.pone.0217520
Wu L, Peng Y, Fan J et al (2019) Machine learning models for the estimation of monthly mean daily reference evapotranspiration based on cross-station and synthetic data. Hydrol Res 50(6):1730–1750. https://doi.org/10.2166/nh.2019.060
Yin Z, Wen X, Feng Q et al (2017) Integrating genetic algorithm and support vector machine for modeling daily reference evapotranspiration in a semi-arid mountain area. Hydrol Res 48(5):1177–1191. https://doi.org/10.2166/nh.2016.205
Yu H, Wen X, Li B et al (2020) Uncertainty analysis of artificial intelligence modeling daily reference evapotranspiration in the northwest end of China. Comput Electron Agric 176:105653. https://doi.org/10.1016/j.compag.2020.105653
Zhang Y, Zhao Z, Zheng J (2020) CatBoost: A new approach for estimating daily reference crop evapotranspiration in arid and semi-arid regions of Northern China. J Hydrol 588:125087. https://doi.org/10.1016/j.jhydrol.2020.125087
Zhao J, Xia H, Yue Q et al (2020) Spatiotemporal variation in reference evapotranspiration and its contributing climatic factors in China under future scenarios. Int J Climatol 40(8):3813–3831. https://doi.org/10.1002/joc.6429
Zhao L, Xing L, Wang Y et al (2023) Prediction Model for Reference Crop Evapotranspiration Based on the Back-propagation Algorithm with Limited Factors. Water Resour Manag 37(3):1207–1222. https://doi.org/10.1007/s11269-022-03423-7
Zhao L, Zhao X, Zhou H et al (2021) Prediction model for daily reference crop evapotranspiration based on hybrid algorithm and principal components analysis in Southwest China. Comput Electron Agric 190:106424. https://doi.org/10.1016/j.compag.2021.106424
Zhou Z, Zhao L, Lin A et al (2020) Exploring the potential of deep factorization machine and various gradient boosting models in modeling daily reference evapotranspiration in China. Arab J Geosci 13(24):1287. https://doi.org/10.1007/s12517-020-06293-8
Acknowledgements
We would like to thank the National Climatic Centre of the China Meteorological Administration for providing the climate database used in this study.
Funding
This work was also supported by National Natural Science Foundation of China (52309050, 51922072, 51779161, 51009101), Key R&D and Promotion Projects in Henan Province (Science and Technology Development) (No. 232102110264 & 202102110112), the Fundamental Research Funds for the Central Universities (2019CDLZ-10, 2020CDDZ-19), Experimental Technology Development Fund Project of Henan University of Science and Technology(No. SY2021008), PhD Research Startup Foundation of Henan University of Science and Technology (No. 13480025), Henan Provincial Tobacco Company Luoyang City Company Technology Innovation Pro (No. 2023410300200043), and Key Scientific Research Projects of Colleges and Universities in Henan Province (No. 24B416001).
Author information
Authors and Affiliations
Contributions
Conceptualization: Long Zhao, Ningbo Cui; Methodology: Long Zhao, Yuhang Wang, Yi Shi, Xinbo Zhao; Formal analysis and investigation: Long Zhao, Yuhang Wang, Yi Shi, Xinbo Zhao, Shuo Zhang; Writing—original draft preparation: Long Zhao, Yuhang Wang; Writing—review and editing: Ningbo Cui; Funding acquisition: Long Zhao, Ningbo Cui; Supervision: Ningbo Cui. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhao, L., Wang, Y., Shi, Y. et al. Selecting essential factors for predicting reference crop evapotranspiration through tree-based machine learning and Bayesian optimization. Theor Appl Climatol 155, 2953–2972 (2024). https://doi.org/10.1007/s00704-023-04760-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-023-04760-2