Abstract
Rainfall plays an essential part in numerous aspects of the natural world, including the environment, ecosystems, human societies, and the global climate system. The lack of rainfall data, typically accompanied by data gaps in arid regions like Saudi Arabia, presents substantial obstacles for hydrological and environmental studies. The objective of this study is to identify a suitable imputation technique for finding missing rainfall data on daily and monthly time scales. In this study, eight weather stations were selected, located in the vicinity of Al-Madinah Al-Munawarah City for the period of 5 years (2008–2012). Two stations (226, 371) were considered target/empty stations to compare the computed values from each method to real values, i.e., cross-validation. In this study, various techniques, including arithmetic average (AA), inverse distance weighing (IDW), normal ratio (NR), satellite products, TRMM, IMERG-GPM, CHIRPS, MERRA-2, and artificial intelligence-based and feed-forward backpropagation neural network (FFBP-NN), were evaluated. Statistical measures were used to check the reliability of each imputation technique on daily and monthly rainfall datasets. The results revealed that FFBP-NN exhibited the highest correlation values, surpassing 0.95 for both the stations on monthly and above 0.80 on daily time scale. IMERG-GPM performed well across satellite datasets, with a daily correlation over 0.50 and a monthly correlation above 0.80. Similarly, NR outperformed AA and IDW techniques in terms of correlation, providing values above 0.5 for daily and 0.89 for monthly intervals over both stations. Generally, all methods performed well on both time scales, except MERRA-2 dataset having a lower correlation coefficient. Based on the analysis, it is recommended to utilize the FFBP-NN approach for longer time series data availability while IMERG-GPM for high spatial variation in region. This research contributes to the ongoing efforts to mitigate data gaps in arid regions and supports more accurate water resource management and environmental planning.
Similar content being viewed by others
Data availability
All data generated or analyzed during this study are included in this published article.
Code availability
All codes for data cleaning and analysis associated with the current submission are available at the local authorities in Saudi Arabia.
References
Abdullah M, Al-Ansari N (2022) Missing rainfall data estimation—an approach to investigate different methods: case study of Baghdad. Arabian J Geosci 15:1740
Abhishek K, Kumar A, Ranjan R, Kumar S (2012) A rainfall prediction model using artificial neural network. IEEE Control Syst Grad Res Colloquium 1:82–87. https://doi.org/10.1109/ICSGRC.2012.6287140
Addi M, Gyasi-Agyei Y, Obuobie E, Amekudzi LK (2022) Evaluation of imputation techniques for infilling missing daily rainfall records on river basins in Ghana. Hydrol Sci J 67:613–627
Akiner M E (2021) Long-term rainfall information forecast by utilizing constrained amount of observation through artificial neural network approach. Adv Meteorol 2021. https://doi.org/10.1155/2021/5524611
Ali H, Shui L, Ehsan G (2010) Estimation of yield sediment using artificial neural network at basin scale. Aust j Basic Appl Sci 4:1668–1675
Armanuos AM, Al-Ansari N, Yaseen ZM (2020) Cross assessment of twenty-one different methods for missing precipitation data estimation. Atmosphere 11:389
Barrios A, Trincado G, Garreaud R (2018) Alternative approaches for estimating missing climate data: application to monthly precipitation records in South-Central Chile. For Ecosyst 5:1–10
Bellido-Jiménez JA, Gualda JE, García-Marín AP (2021) Assessing Machine Learning Models for Gap Filling Daily Rainfall Series in a Semiarid Region of Spain. Atmosphere 12:1158
Borga M, Vizzaccaro A (1997) On the interpolation of hydrologic variables: formal equivalence of multiquadratic surface fitting and kriging. J Hydrol 195:160–171
Chung J, Lee Y, Kim J, Jung C, Kim S (2022) Soil moisture content estimation based on sentinel-1 SAR imagery using an artificial neural network and hydrological components. Remote Sens 14:465
da Silva-Fuzzo DF, Rocha JV (2016) Validação dos dados de precipitação estimados pelo TRMM para o Estado do Paraná e sua contribuição ao monitoramento agrometeorológico. Formação 3:23. https://doi.org/10.33081/formacao.v3i23.4148
de Moraes CAL, Blanco CJC (2021) Assessment of satellite products for filling rainfall data gaps in the Amazon region. Nat Resour Model 34:e12298
dos Santos EP, Dias RLS, Maciel IP, Kolling Neto A, da Silva DD (2021) Estimation of missing hydrological data in monthly rainfall series using meteorological satellite data. Environ Earth Sci 80:1–9
Duarte LV, Formiga KTM, Costa VAF (2022a) Analysis of the IMERG-GPM precipitation product analysis in brazilian midwestern basins considering different time and spatial scales. Water 14:2472
Duarte LV, Formiga KTM, Costa VAF (2022b) Comparison of methods for filling daily and monthly rainfall missing data: statistical models or imputation of satellite retrievals? Water 14:3144
Egigu ML (2020) Techniques of filling missing values of daily and monthly rain fall data: a review. J Environ Earth Sci 3:1036
Endalew L, Mulu A (2023) Estimation of the amount of sediment entering into Shumburit reservoir from the Shumburit watershed, East Gojjam zone, Amhara Region. Ethiopia Environ Chall 11:100696
Farzandi M, Sanaeinejad H, Rezaei-Pazhan H, Sarmad M (2022) Improving estimation of missing data in historical monthly precipitation by evolutionary methods in the semi-arid area. Environ Dev Sustain 24:8313–8332
Fung KF, Chew KS, Huang YF, Ahmed AN, Teo FY, Ng JL, Elshafie A (2022) Evaluation of spatial interpolation methods and spatiotemporal modeling of rainfall distribution in Peninsular Malaysia. Ain Shams Eng J 13:101571
Gebregiorgis AS, Kirstetter PE, Hong YE, Gourley JJ, Huffman GJ, Petersen WA, Xue X, Schwaller MR (2018) To what extent is the day 1 GPM IMERG satellite precipitation estimate improved as compared to TRMM TMPA-RT? J Geophys Res Atmos 123:1694–1707
Gill P E, Murray W and Wright M H 2019. Practical optimization: SIAM-Society for Industrial and Applied Mathematics https://doi.org/10.1137/1.9781611975604
Githungo W, Otengi S, Wakhungu J, Masibayi E (2016) Infilling monthly rain gauge data gaps with satellite estimates for Asal of Kenya. Hydrology 3:40
Hao R, Bai Z (2023) Comparative Study for daily streamflow simulation with different machine learning methods. Water 15:1179
Hsu J, Huang W-R, Liu P-Y, Li X (2021) Validation of CHIRPS precipitation estimates over Taiwan at multiple timescales. Remote Sens 13:254
Hussain S, Bahrawi J, Awais M, Elhag M (2022a) Understanding the role of the radiometric indices in temporal evapotranspiration estimation in arid environments. Desalin Water Treat 256:221–234
Hussain S, Elfeki AM, Chaabani A, Yibrie EA, Elhag M (2022b) Spatio-temporal evaluation of remote sensing rainfall data of TRMM satellite over the Kingdom of Saudi Arabia. Theor Appl Climatol 150:363–377
Ismail WNW, Zin WZW, Ibrahim W (2017) Estimation of rainfall and stream flow missing data for Terengganu, Malaysia by using interpolation technique methods. Mal J Fund Appl Sci 13:214–218
Kilsdonk RA, Bomers A, Wijnberg KM (2022) Predicting urban flooding due to extreme precipitation using a long short-term memory neural network. Hydrology 9:105
Kumar A, Deo MM, Jeet P, Kumari A, Prakash O (2022) Daily rainfall prediction for Bihar using artificial neural networks: prediction of rainfall using ANN. J AgriSearch 9:320–325
Kummerow C, Simpson J, Thiele O, Barnes W, Chang A, Stocker E, Adler R, Hou A, Kakar R, Wentz F (2000) The status of the Tropical Rainfall Measuring Mission (TRMM) after two years in orbit. J Appl Meteorol 39:1965–1982
Londhe S, Dixit P, Shah S, Narkhede S (2015) Infilling of missing daily rainfall records using artificial neural network. ISH J Hydraul Eng 21:255–264
López-Bermeo C, Montoya RD, Caro-Lopera FJ, Díaz-García JA (2022) Validation of the accuracy of the CHIRPS precipitation dataset at representing climate variability in a tropical mountainous region of South America. Phys Chem Earth Parts a/b/c 127:103184
Maghsood FF, Hashemi H, Hosseini SH, Berndtsson R (2019) Ground validation of GPM IMERG precipitation products over Iran. Remote Sens 12:48
Moeletsi ME-ARC, Shabalala ZP-ARC, De Nysschen G-ARC, Moeletsi ME, Walker S (2016) Evaluation of an inverse distance weighting method for patching daily and dekadal rainfall over the Free State Province, South Africa. Water SA 42:466–474
Noh GH, Ahn KH (2022) New gridded rainfall dataset over the Korean peninsula: gap infilling, reconstruction, and validation. Int J Climatol 42:435–452
Papailiou I, Spyropoulos F, Trichakis I, Karatzas GP (2022) Artificial neural networks and multiple linear regression for filling in missing daily rainfall data. Water 14:2892
Pinthong S, Ditthakit P, Salaeh N, Hasan M A, Son C T, Linh N T T, Islam S and Yadav K K (2022) Imputation of missing monthly rainfall data using machine learning and spatial interpolation approaches in Thale Sap Songkhla River Basin, Thailand. Environ Sci Pollut Res: 1–17. https://doi.org/10.1007/s11356-022-23022-8
Ramadhan R, Yusnaini H, Marzuki M, Muharsyah R, Suryanto W, Sholihun S, Vonnisa M, Harmadi H, Ningsih AP, Battaglia A (2022) Evaluation of GPM IMERG performance using gauge data over Indonesian maritime continent at different time scales. Remote Sens 14:1172
Sanusi W, Wan Zin WZ, Mulbar U, Danial M, Side S (2017) Comparison of the methods to estimate missing values in monthly precipitation data. Int J Adv Sci Eng Inf Technol 7:2168–2174
Sattari M-T, Rezazadeh-Joudi A, Kusiak A (2017) Assessment of different methods for estimation of missing data in precipitation studies. Hydrol Res 48:1032–1044
Sharifi E, Steinacker R, Saghafian B (2016) Assessment of GPM-IMERG and other precipitation products against gauge data under different topographic and climatic conditions in Iran: Preliminary results. Remote Sens 8:135
Su J, Lü H, Ryu D, Zhu Y (2019) The assessment and comparison of TMPA and IMERG products over the major basins of Mainland China. Earth Space Sci 6:2461–2479
Tan ML, Yang X (2020) Effect of rainfall station density, distribution and missing values on SWAT outputs in tropical region. J Hydrol 584:124660
Tareke KA, Awoke AG (2023) Hydrological drought forecasting and monitoring system development using artificial neural network (ANN) in Ethiopia. Heliyon 9(2):e13287. https://doi.org/10.1016/j.heliyon.2023.e13287
Taye M, Mengistu D, Sahlu D (2023) Performance evaluation of multiple satellite rainfall data sets in central highlands of Abbay Basin. Ethiopia Eur J Remote Sens 56:2233686
Teegavarapu RS, Aly A, Pathak CS, Ahlquist J, Fuelberg H, Hood J (2018) Infilling missing precipitation records using variants of spatial interpolation and data-driven methods: use of optimal weighting parameters and nearest neighbour-based corrections. Int J Climatol 38:776–793
Turkeltaub T, Bel G (2023) The effects of rain and evapotranspiration statistics on groundwater recharge estimations for semi-arid environments. Hydrol Earth Syst Sci 27:289–302
Vieux BE, Vieux BE (2001) Distributed hydrologic modeling using GIS. Springer, Dordrecht
Vogl TP, Mangis J, Rigler A, Zink W, Alkon D (1988) Accelerating the convergence of the back-propagation method. Biol Cybern 59:257–263
Wang J, Petersen WA, Wolff DB (2021) Validation of satellite-based precipitation products from TRMM to GPM. Remote Sens 13:1745
Wuthiwongyothin S, Kalkan C, Panyavaraporn J (2021) Evaluating inverse distance weighting and correlation coefficient weighting infilling methods on daily rainfall time series. Cre Sci 13:71–79
Xia Y, Fabian P, Stohl A, Winterhalter M (1999) Forest climatology: estimation of missing values for Bavaria, Germany. Agric for Meteorol 96:131–144
Yang X, Xie X, Liu DL, Ji F, Wang L (2015) Spatial interpolation of daily rainfall data for local climate impact assessment over greater Sydney region. Adv Meteorol 2015:1–12
Youssef AM, Sefry SA, Pradhan B, Alfadail EA (2016) Analysis on causes of flash flood in Jeddah city (Kingdom of Saudi Arabia) of 2009 and 2011 using multi-sensor remote sensing data and GIS. Geomatics Nat Hazards Risk 7:1018–1042
Zhang Y, Vaze J, Chiew FH, Teng J, Li M (2014) Predicting hydrological signatures in ungauged catchments using spatial interpolation, index model, and rainfall–runoff modelling. J Hydrol 517:936–948
Zhang Y, Wu C, Yeh PJ-F, Li J, Hu BX, Feng P, Jun C (2022) Evaluation and comparison of precipitation estimates and hydrologic utility of CHIRPS, TRMM 3B42 V7 and PERSIANN-CDR products in various climate regimes. Atmos Res 265:105881
Acknowledgements
The authors, therefore, acknowledge with thanks the DSR technical and financial support.
Funding
This project was funded by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, under Grant No. (IFPRC–017–155–2020).
Author information
Authors and Affiliations
Contributions
Conceptualization, B.N., S.H., and A.E.; methodology, S. H. and A. E.; validation, S.H., B.N., A.E., and M.M.; writing original draft preparation, M.A., and S.H.; review and editing, S.H., B.N., A.E., M.M., and M.A.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Niyazi, B., Hussain, S., Elfeki, A.M. et al. Comparative evaluation of techniques for missing rainfall data estimation in arid regions: case study of Al-Madinah Al-Munawarah, Saudi Arabia. Theor Appl Climatol 155, 2195–2214 (2024). https://doi.org/10.1007/s00704-023-04752-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-023-04752-2