Abstract
Accurate forecasting of daily evapotranspiration (ET) is essential for enhancing real-time irrigation scheduling and informed decision-making in water resources allocation. This study investigates the intricate relationships between meteorological variables and evapotranspiration (ET) to enhance the accuracy of ET estimation models. Robust correlations were identified, emphasizing the significance of net radiation (Rn) in predicting ET. The study explores three distinct scenarios, incorporating different combinations of weather variables as input. The first scenario incorporates all weather variables, including date and time, as inputs for model development. The second scenario utilizes only Rn as input to predict ET values. In the third and final scenario, all weather variables, along with date and time, are employed as inputs for comprehensive model development. The multivariate linear regression (MLR) model demonstrated exceptional performance when exclusively using Rn, achieving an impressive R2 value of 0.99 in both calibration and validation phases. However, limitations were observed when Rn was excluded, highlighting the necessity of a comprehensive set of input data. Penalized regression models, including ridge regression, LASSO, and ELNET, exhibited improved performance with the inclusion of Rn, supporting the importance of this variable in refining ET estimates. Machine learning models displayed remarkable performance, with most achieving R2 values exceeding 0.95 in scenarios involving extensive input data. The Support Vector Regression (SVR) model faced challenges, indicating potential overfitting in certain scenarios. In scenarios with limited input data, machine learning models exhibited varying performance, with the Random Forest (RF) model emerging as the most robust model with R2 value of 0.99 and 0.84 during the calibration and validation, respectively.
Similar content being viewed by others
Data availability
The datasets analysed during the current study are available from the corresponding author on reasonable request.
Code availability
The code are available from the corresponding author on reasonable request.
References
Abbas F, Afzaal H, Farooque AA, Tang S (2020) Crop Yield Prediction through Proximal Sensing and Machine Learning Algorithms. Agronomy 10:1046. https://doi.org/10.3390/agronomy10071046
Abyaneh HZ, Nia AM, Varkeshi MB et al (2011) Performance Evaluation of ANN and ANFIS Models for Estimating Garlic Crop Evapotranspiration. J Irrig Drain Eng 137:280–286. https://doi.org/10.1061/(ASCE)IR.1943-4774.0000298
Achite M, Elshaboury N, Jehanzaib M et al (2023) Performance of Machine Learning Techniques for Meteorological Drought Forecasting in the Wadi Mina Basin, Algeria. Water 15:765. https://doi.org/10.3390/w15040765
Aghajanloo M-B, Sabziparvar A-A, Hosseinzadeh Talaee P (2013) Artificial neural network–genetic algorithm for estimation of crop evapotranspiration in a semi-arid region of Iran. Neural Comput Appl 23:1387–1393. https://doi.org/10.1007/s00521-012-1087-y
Allen RG, Pereira LS, Raes D, Smith M (1998) Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. FAO - Food and Agriculture Organization of the United Nations Rome 300(9):D05109
Anapalli SS, Ahuja LR, Gowda PH et al (2016) Simulation of crop evapotranspiration and crop coefficients with data in weighing lysimeters. Agric Water Manag 177:274–283. https://doi.org/10.1016/j.agwat.2016.08.009
Azzam A, Zhang W, Akhtar F et al (2022) Estimation of green and blue water evapotranspiration using machine learning algorithms with limited meteorological data: A case study in Amu Darya River Basin, Central Asia. Comput Electron Agric 202:107403. https://doi.org/10.1016/j.compag.2022.107403
Bachour R, Walker WR, Ticlavilca AM et al (2014) Estimation of Spatially Distributed Evapotranspiration Using Remote Sensing and a Relevance Vector Machine. J Irrig Drain Eng 140:4014029. https://doi.org/10.1061/(ASCE)IR.1943-4774.0000754
Biau G, Devroye L, Lugosi G (2008) Consistency of random forests and other averaging classifiers. J Mach Learn Res 9(9):2015–2033
Biau G, Scornet E (2016) A random forest guided tour. TEST 25:197–227. https://doi.org/10.1007/s11749-016-0481-7
Breiman L (2001) Random Forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Buttar NA, Yongguang H, Shabbir A et al (2018) Estimation of evapotranspiration using Bowen ratio method. IFAC-PapersOnLine 51:807–810. https://doi.org/10.1016/j.ifacol.2018.08.096
Chen Z, Sun S, Wang Y et al (2020) Temporal convolution-network-based models for modeling maize evapotranspiration under mulched drip irrigation. Comput Electron Agric 169:105206. https://doi.org/10.1016/j.compag.2019.105206
Chia MY, Huang YF, Koo CH et al (2022) Long-term forecasting of monthly mean reference evapotranspiration using deep neural network: A comparison of training strategies and approaches. Appl Soft Comput 126:109221. https://doi.org/10.1016/j.asoc.2022.109221
Das B, Nair B, Reddy VK, Venkatesh P (2018) Evaluation of multiple linear, neural network and penalised regression models for prediction of rice yield based on weather parameters for west coast of India. Int J Biometeorol 62:1809–1822. https://doi.org/10.1007/s00484-018-1583-6
de Teixeira AHC, Bastiaanssen WGM, Ahmad MD, Bos MG (2009) Reviewing SEBAL input parameters for assessing evapotranspiration and water productivity for the Low-Middle São Francisco River basin, Brazil. Agric For Meteorol 149:462–476. https://doi.org/10.1016/j.agrformet.2008.09.016
Ekanayake P, Wickramasinghe L, Jayasinghe JMJW, Rathnayake U (2021) Regression-Based Prediction of Power Generation at Samanalawewa Hydropower Plant in Sri Lanka Using Machine Learning. Math Probl Eng 2021:1–12. https://doi.org/10.1155/2021/4913824
Elbeltagi A, Kushwaha NL, Rajput J et al (2022) Modelling daily reference evapotranspiration based on stacking hybridization of ANN with meta-heuristic algorithms under diverse agro-climatic conditions. Stoch Environ Res Risk Assess. https://doi.org/10.1007/s00477-022-02196-0
Elbeltagi A, Al-Mukhtar M, Kushwaha NL et al (2023a) Forecasting monthly pan evaporation using hybrid additive regression and data-driven models in a semi-arid environment. Appl Water Sci 13:42. https://doi.org/10.1007/s13201-022-01846-6
Elbeltagi A, Seifi A, Ehteram M et al (2023b) GLUE analysis of meteorological-based crop coefficient predictions to derive the explicit equation. Neural Comput Appl. https://doi.org/10.1007/s00521-023-08466-4
Fan J, Yue W, Wu L et al (2018) Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric for Meteorol 263:225–241. https://doi.org/10.1016/j.agrformet.2018.08.019
Feng Y, Gong D, Mei X, Cui N (2017) Estimation of maize evapotranspiration using extreme learning machine and generalized regression neural network on the China Loess Plateau. Hydrol Res 48:1156–1168. https://doi.org/10.2166/nh.2016.099
Ghiassi M, Saidane H, Zimbra DK (2005) A dynamic artificial neural network model for forecasting time series events. Int J Forecast 21:341–362. https://doi.org/10.1016/j.ijforecast.2004.10.008
Heddam S, Vishwakarma DK, Abed SA et al (2024) Hybrid river stage forecasting based on machine learning with empirical mode decomposition. Appl Water Sci 14:46. https://doi.org/10.1007/s13201-024-02103-8
Hilt DE, Seegrist DW (1977) Ridge, a computer program for calculating ridge regression estimates. USDA Forest Service Research Note NE-236, United States, Department of Agriculture, Forest Service, Northeastern Forest Experiment Station
Jiang X, Kang S, Tong L, Li F (2016) Modification of evapotranspiration model based on effective resistance to estimate evapotranspiration of maize for seed production in an arid region of northwest China. J Hydrol 538:194–207. https://doi.org/10.1016/j.jhydrol.2016.04.002
Karunanayake C, Gunathilake MB, Rathnayake U (2020) Inflow Forecast of Iranamadu Reservoir, Sri Lanka, under Projected Climate Scenarios Using Artificial Neural Networks. Appl Comput Intell Soft Comput 2020:1–11. https://doi.org/10.1155/2020/8821627
Kato S, Yamaguchi Y (2007) Estimation of storage heat flux in an urban area using ASTER data. Remote Sens Environ 110:1–17. https://doi.org/10.1016/j.rse.2007.02.011
Kaur R, Sharma S (2019) An ANN Based Approach for Software Fault Prediction Using Object Oriented Metrics. In: Luhach AK, Singh D, Hsiung P-A, et al. (eds) Advanced Informatics for Computing Research, ICAICR 2018. Communications in Computer and Information Science, vol 955. Springer Singapore, pp 341–354
Khaniya B, Karunanayake C, Gunathilake MB, Rathnayake U (2020) Projection of Future Hydropower Generation in Samanalawewa Power Plant, Sri Lanka. Math Probl Eng 2020:1–11. https://doi.org/10.1155/2020/8862067
Kramer O (2013) Dimensionality Reduction with Unsupervised Nearest Neighbors. Springer Berlin Heidelberg, Berlin
Kumar R, Lone MA, Bhat OA (2021) Determination of water requirement and crop coefficients for green gram in temperate region using lysimeter water balance. Int J Hydrol Sci Technol 12:1. https://doi.org/10.1504/ijhst.2021.10038778
Kumar A, Singh VK, Saran B et al (2022a) Development of Novel Hybrid Models for Prediction of Drought- and Stress-Tolerance Indices in Teosinte Introgressed Maize Lines Using Artificial Intelligence Techniques. Sustainability 14:2287. https://doi.org/10.3390/su14042287
Kumar R, Manzoor S, Vishwakarma DK et al (2022b) Assessment of Climate Change Impact on Snowmelt Runoff in Himalayan Region. Sustainability 14:1–23. https://doi.org/10.3390/su14031150
Kushwaha NL, Rajput J, Elbeltagi A et al (2021) Data Intelligence Model and Meta-Heuristic Algorithms-Based Pan Evaporation Modelling in Two Different Agro-Climatic Zones: A Case Study from Northern India. Atmosphere (basel) 12:1654. https://doi.org/10.3390/atmos12121654
Li Y-F, Xie M, Goh T-N (2010) Adaptive ridge regression system for software cost estimating on multi-collinear datasets. J Syst Softw 83:2332–2343. https://doi.org/10.1016/j.jss.2010.07.032
Liu Y, Wang Y, Zhang J (2012) New Machine Learning Algorithm: Random Forest. In: Liu B, Ma M, Chang J (eds) Information Computing and Applications. ICICA 2012. Lecture Notes in Computer Science, vol 7473. Springer Berlin Heidelberg, Berlin, pp 246–252
Malek E, Bingham GE (1993) Comparison of the Bowen ratio-energy balance and the water balance methods for the measurement of evapotranspiration. J Hydrol 146:209–220. https://doi.org/10.1016/0022-1694(93)90276-F
Markuna S, Kumar P, Ali R et al (2023) Application of Innovative Machine Learning Techniques for Long-Term Rainfall Prediction. Pure Appl Geophys 180:335–363. https://doi.org/10.1007/s00024-022-03189-4
Mirzania E, Vishwakarma DK, Bui Q-AT et al (2023) A novel hybrid AIG-SVR model for estimating daily reference evapotranspiration. Arab J Geosci 16:301. https://doi.org/10.1007/s12517-023-11387-0
Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models part I — A discussion of principles. J Hydrol 10:282–290. https://doi.org/10.1016/0022-1694(70)90255-6
Pavlou M, Ambler G, Seaman S et al (2016) Review and evaluation of penalised regression methods for risk prediction in low-dimensional data with few events. Stat Med 35:1159–1177. https://doi.org/10.1002/sim.6782
Peacock CE, Hess TM (2004) Estimating evapotranspiration from a reed bed using the Bowen ratio energy balance method. Hydrol Process 18:247–260. https://doi.org/10.1002/hyp.1373
Rana G, Katerji N (2000) Measurement and estimation of actual evapotranspiration in the field under Mediterranean climate: a review. Eur J Agron 13:125–153. https://doi.org/10.1016/S1161-0301(00)00070-8
Raza A, Al-Ansari N, Hu Y et al (2022) Misconceptions of Reference and Potential Evapotranspiration: A PRISMA-Guided Comprehensive Review. Hydrology 9:153. https://doi.org/10.3390/hydrology9090153
Sagar A, Hasan M, Singh DK et al (2022) Development of Smart Weighing Lysimeter for Measuring Evapotranspiration and Developing Crop Coefficient for Greenhouse Chrysanthemum. Sensors 22:6239. https://doi.org/10.3390/s22166239
Saggi MK, Jain S (2020) Application of fuzzy-genetic and regularization random forest (FG-RRF): Estimation of crop evapotranspiration (ET ) for maize and wheat crops. Agric Water Manag 229:105907. https://doi.org/10.1016/j.agwat.2019.105907
Saroughi M, Mirzania E, Vishwakarma DK et al (2023) A Novel Hybrid Algorithms for Groundwater Level Prediction. Iran J Sci Technol Trans Civ Eng. https://doi.org/10.1007/s40996-023-01068-z
Sattari MT, Apaydin H, Band SS et al (2021) Comparative analysis of kernel-based versus ANN and deep learning methods in monthly reference evapotranspiration estimation. Hydrol Earth Syst Sci 25:603–618. https://doi.org/10.5194/hess-25-603-2021
Shiri J, Nazemi AH, Sadraddini AA et al (2014) Comparison of heuristic and empirical approaches for estimating reference evapotranspiration from limited inputs in Iran. Comput Electron Agric 108:230–241. https://doi.org/10.1016/j.compag.2014.08.007
Shukla R, Kumar P, Vishwakarma DK et al (2021) Modeling of stage-discharge using back propagation ANN-, ANFIS-, and WANN-based computing techniques. Theor Appl Climatol. https://doi.org/10.1007/s00704-021-03863-y
Singh RS, Patel C, Yadav MK, Singh KK (2014) Yield forecasting of rice and wheat crops for eastern Uttar Pradesh. J Agrometeorol 16:199–202. https://doi.org/10.54386/jam.v16i2.1521
Singh AK, Kumar P, Ali R et al (2022a) An Integrated Statistical-Machine Learning Approach for Runoff Prediction. Sustainability 14:8209. https://doi.org/10.3390/su14138209
Singh VK, Panda KC, Sagar A et al (2022b) Novel Genetic Algorithm (GA) based hybrid machine learning-pedotransfer Function (ML-PTF) for prediction of spatial pattern of saturated hydraulic conductivity. Eng Appl Comput Fluid Mech 16:1082–1099. https://doi.org/10.1080/19942060.2022.2071994
Tabari H, Martinez C, Ezani A, Hosseinzadeh Talaee P (2013) Applicability of support vector machines and adaptive neurofuzzy inference system for modeling potato crop evapotranspiration. Irrig Sci 31:575–588. https://doi.org/10.1007/s00271-012-0332-6
Thissen U, van Brakel R, de Weijer A et al (2003) Using support vector machines for time series prediction. Chemom Intell Lab Syst 69:35–49. https://doi.org/10.1016/S0169-7439(03)00111-4
Tulla PS, Kumar P, Vishwakarma DK et al (2024) Daily suspended sediment yield estimation using soft-computing algorithms for hilly watersheds in a data-scarce situation: a case study of Bino watershed, Uttarakhand. Theor Appl Climatol. https://doi.org/10.1007/s00704-024-04862-5
Üneş F, Kaya YZ, Mamak M (2020) Daily reference evapotranspiration prediction based on climatic conditions applying different data mining techniques and empirical equations. Theor Appl Climatol 141:763–773. https://doi.org/10.1007/s00704-020-03225-0
Vapnik V (1998) Statistical learning theory. John Wiley & Sons, Inc., Oxford
Verma SB, Rosenberg NJ, Blad BL (1978) Turbulent Exchange Coefficients for Sensible Heat and Water Vapor under Advective Conditions. J Appl Meteorol 17:330–338
Vishwakarma DK, Kumar R, Pandey K et al (2018) Modeling of Rainfall and Ground Water Fluctuation of Gonda District Uttar Pradesh, India. Int J Curr Microbiol Appl Sci 7:2613–2618. https://doi.org/10.20546/ijcmas.2018.705.302
Vishwakarma DK, Pandey K, Kaur A et al (2022) Methods to estimate evapotranspiration in humid and subtropical climate conditions. Agric Water Manag 261:107378. https://doi.org/10.1016/j.agwat.2021.107378
Vishwakarma DK, Kuriqi A, Abed SA et al (2023) Forecasting of stage-discharge in a non-perennial river using machine learning with gamma test. Heliyon 9:e16290. https://doi.org/10.1016/j.heliyon.2023.e16290
Vishwakarma DK, Kumar P, Yadav KK et al (2024) Evaluation of CatBoost Method for Predicting Weekly Pan Evaporation in Subtropical and Sub-Humid Regions. Pure Appl Geophys. https://doi.org/10.1007/s00024-023-03426-4
Wang S-C (2003) Artificial Neural Network. In: Wang S-C (ed) Interdisciplinary Computing in Java Programming. Springer US, Boston, pp 81–100
Willmott CJ (1981) On the validation of models. Phys Geogr 2:184–194. https://doi.org/10.1080/02723646.1981.10642213
Zhou Z, Zhao L, Lin A et al (2020) Exploring the potential of deep factorization machine and various gradient boosting models in modeling daily reference evapotranspiration in China. Arab J Geosci 13:1287. https://doi.org/10.1007/s12517-020-06293-8
Zou H, Hastie T (2005) Regularization and Variable Selection Via the Elastic Net. J R Stat Soc Ser B Stat Methodol 67:301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x
Acknowledgements
The authors extend their appreciation to the Deanship of Scientific Research, King Saud University for funding through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water. Authors also thank the Department of Agrometeorology, G.B. Pant University of Agriculture and Technology for providing the required facilities to conduct the study.
Funding
This research was funded by the Deanship of Scientific Research, King Saud University through the Vice Deanship of Scientific Research Chairs; Research Chair of Prince Sultan Bin Abdulaziz International Prize for Water.
Author information
Authors and Affiliations
Contributions
Anurag Satpathi and Abhishek Danodia conducted the formal analysis, investigation and wrote the manuscript. Ajeet Singh Nain edited and supervised the work. Makrand Dhyani, Dinesh Kumar Vishwakarma, Ahmed Z. Dewidar, and Mohamed A. Mattar prepared the necessary figures and revised the manuscript. All authors have reviewed the results and approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Satpathi, A., Danodia, A., Nain, A.S. et al. Estimation of crop evapotranspiration using statistical and machine learning techniques with limited meteorological data: a case study in Udham Singh Nagar, India. Theor Appl Climatol (2024). https://doi.org/10.1007/s00704-024-04953-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00704-024-04953-3