Abstract
Quetta Valley is an integral part of the Pishin Lora Basin (PLB), a prominently water-deficient basin within the Balochistan province of Pakistan. The Pishin Lora River (PLR) flowing through the basin supports agriculture and domestic water use. However, the available scientific research on hydrological modeling in the study area is either limited or outdated. The influence of climate change on the basin's hydrology has intensified the demand for advanced approaches in flow prediction studies. Hydrological modeling is complex, particularly in a basin like Quetta with complicated terrain, including high elevations and varied slopes with diverse rainfall distribution. A machine learning (ML) data-driven approach is a viable alternative to conceptual hydrological models, mitigating the dependency on extensive data that may not always be readily available. This study employs Artificial Neural Network Multilayer Perceptron (ANN-MLP), Random Forest (RF), and Multiple Linear Regression (MLR) models for runoff prediction. Remotely sensed meteorological datasets spanning over thirty-two years (1990–2022) were obtained and compiled at three distinct timescales: daily, weekly, and monthly. The dataset comprises rainfall, humidity, and temperature records from the Modern-Era Retrospective Analysis for Research and Applications-2 (MERRA-2) satellite. These variables serve as model inputs for predicting the runoff of the watershed. Simulated runoff data using the SCS-CN method were used in ML models in the absence of actual runoff records. Statistical tests were performed for the performance evaluation of the model, including coefficient of determination (R2), mean absolute error (MAE), root mean squared error (RMSE), relative absolute error (RAE), and Willmott index (WI). Two input combinations (M1 and M2) were used to predict the daily, weekly, and monthly runoff timescales. The results of the all models were promising for both input combinations, with a slightly superior predictive performance observed in M2. In M2, the R2 values for ANN, RF, and MLR at the daily timescale were 0.998, 0.996, and 0.994, respectively. At the weekly timescales, these values were 0.994, 0.998, and 0.995, while at the monthly timescales, they were 0.995, 0.994, and 0.997. This study establishes improved water management strategies for data-scarce regions, exploiting the power of cutting-edge machine learning tools to drive innovation in hydrological research within the region.
Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study
Abbreviations
- ANN:
-
Artificial neural network
- MLP:
-
Multilayer perceptron
- RF:
-
Random forest
- MLR:
-
Multiple linear regression
- PLB:
-
Pishin lora basin
- PLR:
-
Pishin lora river
- M1:
-
Model 1
- M2:
-
Model 2
- MAE:
-
Mean absolute error
- RMSE:
-
Root mean squared error
- R2 :
-
coefficient of determination
- WI:
-
Wilmott Index
- RAE:
-
Relative absolute error
References
Abd-Elaty I, Shoshah H, Zeleňáková M, Kushwaha NL, El-Dean OW (2022) Forecasting of flash floods peak flow for environmental hazards and water harvesting in desert area of El-Qaa Plain, Sinai. Int J Environ Res Public Health 19(10):6049
Adane GB, Hirpa BA, Gebru BM, Song C, Lee WK (2021) Integrating satellite rainfall estimates with hydrological water balance model: rainfall-runoff modeling in Awash River Basin. Ethiopia Water 13(6):800
Aftab SM, Siddiqui RH, Farooqui MA (2018) Strategies to manage aquifer recharge in Balochistan, Pakistan: an overview. IOP Conf Ser Mater Sci Eng 414(1):012023. OP Publishing
Ahmad I, Verma V, Verma MK (2015) Application of curve number method for estimation of runoff potential in GIS environment. In 2nd international conference on geological and civil engineering (Vol. 80, No. 4, pp. 16–20). IACSIT Press, Singapore
Akbarian M, Saghafian B, Golian S (2023) Monthly streamflow forecasting by machine learning methods using dynamic weather prediction model outputs over Iran. J Hydrol 620:129480
Bazai MH, Panezai S (2020) Assessment of urban sprawl and land use change dynamics through GIS and remote sensing in Quetta, Balochistan, Pakistan. J Geography Soc Sci (JGSS) 2(1):31–55
Bonta JV (1997) Determination of watershed curve number using derived distributions. J Irrig Drain Eng 123(1):28–36
Bouzeria H, Ghenim AN, Khanchoul K (2017) Using artificial neural network (ANN) for prediction of sediment loads, application to the Mellah catchment, northeast Algeria. J Water Land Dev 33(1):47
Breiman L (2001) Random Forests Machine Learning 45:5–32. https://doi.org/10.1023/A:1010933404324
Chakravarti A, Joshi N, Panjiar H (2015) Rainfall Runoff Analysis Using the Artificial Neural Network. Indian J Sci Technol 8(14):1–7
Dawood F, Akhtar MM, Ehsan M (2021) Evaluating urbanization impact on stressed aquifer of Quetta Valley, Pakistan. Desalination Water Treat 222:103–113
Dawson CW, Wilby R (1998) An artificial neural network approach to rainfall-runoff modeling. Hydrol Sci J 43(1):47–66
Elbeltagi A, Di Nunno F, Kushwaha NL, de Marinis G, Granata F (2022) River flow rate prediction in the Des Moines watershed (Iowa, USA): A machine learning approach. Stoch Env Res Risk Assess 36(11):3835–3855
Esha RI, Imteaz MA (2019) Assessing the predictability of MLR models for long-term streamflow using lagged climate indices as predictors: A case study of NSW (Australia). Hydrol Res 50(1):262–281
Gajbhiye S, Mishra SK (2012) Application of NRSC-SCS curve number model in runoff estimation using RS & GIS. In: IEEE-International conference on advances in engineering, science and management (ICAESM-2012). IEEE, pp 346–352
Gholami, V, Sahour, H (2022) Simulation of rainfall-runoff process using an artificial neural network (ANN) and field plots data. Theoretical and Applied Climatology, 1–12. https://doi.org/10.1007/s00704-021-03817-4
Ghorbani MA, Khatibi R, Goel A, FazeliFard MH, Azani A (2016) Modeling river discharge time series using support vector machine and artificial neural networks. Environ Earth Sci 75:1–13
Hamal, K, Sharma, S, Khadka, N, Baniya, B, Ali, M, Shrestha, MS, ... Dawadi, B (2020) Evaluation of MERRA-2 precipitation products using gauge observation in Nepal. Hydrology, 7(3), 40. https://doi.org/10.3390/hydrology7030040
Herawati H, Suripin S, Suharyanto S, Hetwisari T (2018) Analysis of river flow regime changes related to water availability on the Kapuas River, Indonesia. Irrigation and Drainage 67:66–71
Hussain D, Khan AA (2020) Machine learning techniques for monthly river flow forecasting of Hunza River, Pakistan. Earth Sci Inf 13:939–949
Hussain D, Hussain T, Khan AA, Naqvi SAA, Jamil A (2020) A deep learning approach for hydrological time-series prediction: A case study of Gilgit river basin. Earth Sci Inf 13:915–927
Jeong DI, Kim YO (2005) Rainfall-runoff models using artificial neural networks for ensemble streamflow prediction. Hydrological Processes: An International Journal 19(19):3819–3835
Jibril MM, Bello A, Aminu II, Ibrahim AS, Bashir A, Malami SI, Magaji MM (2022) An overview of streamflow prediction using random forest algorithm. GSC Adv Res Rev 13(1):050–057
Khan M, Noor S (2019) Irrigation runoff volume prediction using machine learning algorithms. Euro Int J Sci Technol 8:1–22
Khan SD, Mahmood K, Sultan MI, Khan AS, Xiong Y, Sagintayev Z (2010) Trace element geochemistry of groundwater from Quetta Valley, western Pakistan. Environ Earth Sci 60:573–582
Kuswanto H, Naufal A (2019) Evaluation of performance of drought prediction in Indonesia based on TRMM and MERRA-2 using machine learning methods. MethodsX 6:1238–1251
Li X, Sha J, Wang ZL (2019) Comparison of daily streamflow forecasts using extreme learning machines and the random forest method. Hydrol Sci J 64(15):1857–1866
Machado F, Mine M, Kaviski E, Fill H (2011) Monthly rainfall–runoff modelling using artificial neural networks. Hydrol Sci J-J Des Sciences Hydrologiques 56(3):349–361
Nile BK, Hassan WH, Alshama GA (2019) Analysis of the effect of climate change on rainfall intensity and expected flooding by using ANN and SWMM programs. ARPN J Eng Appl Sci 14(5):974–984
Ouma YO, Cheruyot R, Wachera AN (2021) Rainfall and runoff time-series trend analysis using LSTM recurrent neural network and wavelet neural network with satellite-based meteorological data: case study of Nzoia hydrologic basin. Complex & Intelligent Systems, pp 1–24
Papacharalampous GA, Tyralis H (2018) Evaluation of random forests and Prophet for daily streamflow forecasting. Adv Geosci 45:201–208
Patel A, Kethavath A, Kushwaha NL, Naorem A, Jagadale M, Sheetal KR, Renjith PS (2023) Review of artificial intelligence and internet of things technologies in land and water management research during 1991–2021: A bibliometric analysis. Eng Appl Artif Intell 123:106335
Poonia V, Tiwari HL (2020) Rainfall-runoff modeling for the Hoshangabad Basin of Narmada River using artificial neural network. Arab J Geosci 13:1–10
Poul AK, Shourian M, Ebrahimi H (2019) A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly stream flow prediction. Water Resour Manage 33:2907–2923
Ratner B (2009) The correlation coefficient: Its values range between+ 1/− 1, or do they? J Target Meas Anal Mark 17(2):139–142
Sagintayev, Z, Sultan, M, Khan, SD, Khan, SA, Mahmood, K, Yan, E, ... Marsala, P (2012) A remote sensing contribution to hydrologic modelling in arid and inaccessible watersheds, Pishin Lora basin, Pakistan. Hydrological Processes, 26(1), 85–99. https://doi.org/10.1002/hyp.8114
Sharma S, Sharma S, Athaiya A (2017) Activation functions in neural networks. Towards Data Sci 6(12):310–316
Sharma P, Singh S, Sharma SD (2022) Artificial neural network approach for hydrologic river flow time series forecasting. Agricultural Res 11(3):465–476
Simon Haykin (McMaster University, Hamilton, Ontario, C) (2005) Neural Networks - A Comprehensive Foundation - Simon Haykin.pdf (p. 823)
Soomro AG, Babar MM, Memon AH, Zaidi AZ, Ashraf A, Lund J (2019) Sensitivity of direct runoff to curve number using the SCS-CN method. Civ Eng J 5:2738–2746
Soulis KX (2021) Soil conservation service curve number (SCS-CN) Method: Current applications, remaining challenges, and future perspectives. Water 13(2):192
Sun N, Zhang S, Peng T, Zhang N, Zhou J, Zhang H (2022) Multi-variables-driven model based on random forest and Gaussian process regression for monthly streamflow forecasting. Water 14(11):1828
Talukdar S, Pal S, Shahfahad, Naikoo MW, Parvez A, Rahman A (2023) Trend analysis and forecasting of streamflow using random forest in the Punarbhaba River basin. Environ Monit Assess 195(1):153
Taylor KE (2005) Taylor diagram primer. Work. Pap, pp 1–4
Umar M, Waseem A, Kassi AM, Farooq M, Sabir MA (2014) Surface and subsurface water quality assessment in semi-arid region: a case study from Quetta and Sorange Intermontane Valleys. Pakistan Global Nest J 16(5):938–954
Us Saqib N, Aziz T (2023) Estimation Of Wheat Crop Productivity Of District Nawab Shah Using Metric EEFLUX imagery. J Agric Res 61(1):77–82
Us Saqib N, Saeed F, Aziz T (2022) Effect Of Man-Made Structures On Natural Wetlands In Pakistan: A Case Study Of Chotiari Reservoir. J Clean WAS (JCleanWAS) 6(1):01–04
Viji R, Rajesh Prasanna P, Ilangovan R (2015) Gis based SCS-CN method for estimating runoff in Kundahpalam watershed, Nilgries District. Tamilnadu Earth Sciences Research J 19(1):59–64
Vishwakarma, DK, Kuriqi, A, Abed, SA, Kishore, G, Al-Ansari, N, Pandey, K, ... Jewel, A (2023) Forecasting of stage-discharge in a non-perennial river using machine learning with gamma test. Heliyon, 9(5). https://doi.org/10.1016/j.heliyon.2023.e16290
Williams, B, Halloin, C, Löbel, W, Finklea, F, Lipke, E, Zweigerdt, R, Cremaschi, S (2020) Data-driven model development for cardiomyocyte production experimental failure prediction. In Computer aided chemical engineering (Vol. 48, pp. 1639–1644). Elsevier. https://doi.org/10.1016/B978-0-12-823377-1.50274-3
Yao L, Wei WEI, Yu Y, Xiao J, Chen L (2018) Rainfall-runoff risk characteristics of urban function zones in Beijing using the SCS-CN model. J Geog Sci 28:656–668
Yaseen ZM, El-Shafie A, Jaafar O, Afan HA, Sayl KN (2015) Artificial intelligence based models for streamflow forecasting: 2000–2015. J Hydrol 530:829–844
Yeoh, KL, Puay, HT, Abdullah, R, Abd Manan, TS (2023) Appraisal of data-driven techniques for predicting short-term streamflow in tropical catchment. Water Sci Technol https://doi.org/10.2166/wst.2023.193
Zhan X, Huang ML (2004) ArcCN-Runoff: an ArcGIS tool for generating curve number and runoff maps. Environ Model Softw 19(10):875–879
Acknowledgements
The authors are thankful to the Higher Education Commission, Islamabad for providing full financial support under Local Challenges Fund Project No. 20-LCF-63 titled; Metering the aquifer using a smart monitoring and data-driven approach to assist in devising an adaptive groundwater management strategy in Balochistan; and Mehran University of Engineering & Technology, Jamshoro, Pakistan for funding during the collection of literature data and the writing of this paper.
Funding
The authors declare that they have not received any fund to conduct this research.
Author information
Authors and Affiliations
Contributions
The authors confirm contribution to the paper as follows: study conception and design Ghunwa Shah, devised the research, the main conceptual ideas and proof outline and analysis and interpretation of results. Ghunwa Shah, Shahzad Hussain, Rizwan and Tariq Aziz worked out almost all of the technical details, and wrote the manuscript. Ghunwa Shah, Arjumand Zaidi, Abdul Latif Qureshi, Shahzad Hussain authors contributed to the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Communicated by H. Babaie.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shah, G., Zaidi, A., Qureshi, A.L. et al. Rainfall-runoff modeling using machine learning in the ungauged urban watershed of Quetta Valley, Balochistan (Pakistan). Earth Sci Inform (2024). https://doi.org/10.1007/s12145-024-01302-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12145-024-01302-w