Skip to main content
Log in

Rainfall-runoff modeling using machine learning in the ungauged urban watershed of Quetta Valley, Balochistan (Pakistan)

  • RESEARCH
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Quetta Valley is an integral part of the Pishin Lora Basin (PLB), a prominently water-deficient basin within the Balochistan province of Pakistan. The Pishin Lora River (PLR) flowing through the basin supports agriculture and domestic water use. However, the available scientific research on hydrological modeling in the study area is either limited or outdated. The influence of climate change on the basin's hydrology has intensified the demand for advanced approaches in flow prediction studies. Hydrological modeling is complex, particularly in a basin like Quetta with complicated terrain, including high elevations and varied slopes with diverse rainfall distribution. A machine learning (ML) data-driven approach is a viable alternative to conceptual hydrological models, mitigating the dependency on extensive data that may not always be readily available. This study employs Artificial Neural Network Multilayer Perceptron (ANN-MLP), Random Forest (RF), and Multiple Linear Regression (MLR) models for runoff prediction. Remotely sensed meteorological datasets spanning over thirty-two years (1990–2022) were obtained and compiled at three distinct timescales: daily, weekly, and monthly. The dataset comprises rainfall, humidity, and temperature records from the Modern-Era Retrospective Analysis for Research and Applications-2 (MERRA-2) satellite. These variables serve as model inputs for predicting the runoff of the watershed. Simulated runoff data using the SCS-CN method were used in ML models in the absence of actual runoff records. Statistical tests were performed for the performance evaluation of the model, including coefficient of determination (R2), mean absolute error (MAE), root mean squared error (RMSE), relative absolute error (RAE), and Willmott index (WI). Two input combinations (M1 and M2) were used to predict the daily, weekly, and monthly runoff timescales. The results of the all models were promising for both input combinations, with a slightly superior predictive performance observed in M2. In M2, the R2 values for ANN, RF, and MLR at the daily timescale were 0.998, 0.996, and 0.994, respectively. At the weekly timescales, these values were 0.994, 0.998, and 0.995, while at the monthly timescales, they were 0.995, 0.994, and 0.997. This study establishes improved water management strategies for data-scarce regions, exploiting the power of cutting-edge machine learning tools to drive innovation in hydrological research within the region.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

No datasets were generated or analysed during the current study

Abbreviations

ANN:

Artificial neural network

MLP:

Multilayer perceptron

RF:

Random forest

MLR:

Multiple linear regression

PLB:

Pishin lora basin

PLR:

Pishin lora river

M1:

Model 1

M2:

Model 2

MAE:

Mean absolute error

RMSE:

Root mean squared error

R2 :

coefficient of determination

WI:

Wilmott Index

RAE:

Relative absolute error

References

  • Abd-Elaty I, Shoshah H, Zeleňáková M, Kushwaha NL, El-Dean OW (2022) Forecasting of flash floods peak flow for environmental hazards and water harvesting in desert area of El-Qaa Plain, Sinai. Int J Environ Res Public Health 19(10):6049

    Article  Google Scholar 

  • Adane GB, Hirpa BA, Gebru BM, Song C, Lee WK (2021) Integrating satellite rainfall estimates with hydrological water balance model: rainfall-runoff modeling in Awash River Basin. Ethiopia Water 13(6):800

    Google Scholar 

  • Aftab SM, Siddiqui RH, Farooqui MA (2018) Strategies to manage aquifer recharge in Balochistan, Pakistan: an overview. IOP Conf Ser Mater Sci Eng 414(1):012023.  OP Publishing

  • Ahmad I, Verma V, Verma MK (2015) Application of curve number method for estimation of runoff potential in GIS environment. In 2nd international conference on geological and civil engineering (Vol. 80, No. 4, pp. 16–20). IACSIT Press, Singapore

    Google Scholar 

  • Akbarian M, Saghafian B, Golian S (2023) Monthly streamflow forecasting by machine learning methods using dynamic weather prediction model outputs over Iran. J Hydrol 620:129480

    Article  Google Scholar 

  • Bazai MH, Panezai S (2020) Assessment of urban sprawl and land use change dynamics through GIS and remote sensing in Quetta, Balochistan, Pakistan. J Geography Soc Sci (JGSS) 2(1):31–55

    Google Scholar 

  • Bonta JV (1997) Determination of watershed curve number using derived distributions. J Irrig Drain Eng 123(1):28–36

    Article  Google Scholar 

  • Bouzeria H, Ghenim AN, Khanchoul K (2017) Using artificial neural network (ANN) for prediction of sediment loads, application to the Mellah catchment, northeast Algeria. J Water Land Dev 33(1):47

    Article  Google Scholar 

  • Breiman L (2001) Random Forests Machine Learning 45:5–32. https://doi.org/10.1023/A:1010933404324

    Article  Google Scholar 

  • Chakravarti A, Joshi N, Panjiar H (2015) Rainfall Runoff Analysis Using the Artificial Neural Network. Indian J Sci Technol 8(14):1–7

    Article  Google Scholar 

  • Dawood F, Akhtar MM, Ehsan M (2021) Evaluating urbanization impact on stressed aquifer of Quetta Valley, Pakistan. Desalination Water Treat 222:103–113

    Article  Google Scholar 

  • Dawson CW, Wilby R (1998) An artificial neural network approach to rainfall-runoff modeling. Hydrol Sci J 43(1):47–66

    Article  Google Scholar 

  • Elbeltagi A, Di Nunno F, Kushwaha NL, de Marinis G, Granata F (2022) River flow rate prediction in the Des Moines watershed (Iowa, USA): A machine learning approach. Stoch Env Res Risk Assess 36(11):3835–3855

    Article  Google Scholar 

  • Esha RI, Imteaz MA (2019) Assessing the predictability of MLR models for long-term streamflow using lagged climate indices as predictors: A case study of NSW (Australia). Hydrol Res 50(1):262–281

    Article  Google Scholar 

  • Gajbhiye S, Mishra SK (2012) Application of NRSC-SCS curve number model in runoff estimation using RS & GIS. In: IEEE-International conference on advances in engineering, science and management (ICAESM-2012). IEEE, pp 346–352

  • Gholami, V, Sahour, H (2022) Simulation of rainfall-runoff process using an artificial neural network (ANN) and field plots data. Theoretical and Applied Climatology, 1–12. https://doi.org/10.1007/s00704-021-03817-4

  • Ghorbani MA, Khatibi R, Goel A, FazeliFard MH, Azani A (2016) Modeling river discharge time series using support vector machine and artificial neural networks. Environ Earth Sci 75:1–13

    Google Scholar 

  • Hamal, K, Sharma, S, Khadka, N, Baniya, B, Ali, M, Shrestha, MS, ... Dawadi, B (2020) Evaluation of MERRA-2 precipitation products using gauge observation in Nepal. Hydrology, 7(3), 40. https://doi.org/10.3390/hydrology7030040

  • Herawati H, Suripin S, Suharyanto S, Hetwisari T (2018) Analysis of river flow regime changes related to water availability on the Kapuas River, Indonesia. Irrigation and Drainage 67:66–71

    Article  Google Scholar 

  • Hussain D, Khan AA (2020) Machine learning techniques for monthly river flow forecasting of Hunza River, Pakistan. Earth Sci Inf 13:939–949

    Article  Google Scholar 

  • Hussain D, Hussain T, Khan AA, Naqvi SAA, Jamil A (2020) A deep learning approach for hydrological time-series prediction: A case study of Gilgit river basin. Earth Sci Inf 13:915–927

    Article  Google Scholar 

  • Jeong DI, Kim YO (2005) Rainfall-runoff models using artificial neural networks for ensemble streamflow prediction. Hydrological Processes: An International Journal 19(19):3819–3835

    Article  Google Scholar 

  • Jibril MM, Bello A, Aminu II, Ibrahim AS, Bashir A, Malami SI, Magaji MM (2022) An overview of streamflow prediction using random forest algorithm. GSC Adv Res Rev 13(1):050–057

    Article  Google Scholar 

  • Khan M, Noor S (2019) Irrigation runoff volume prediction using machine learning algorithms. Euro Int J Sci Technol 8:1–22

    Google Scholar 

  • Khan SD, Mahmood K, Sultan MI, Khan AS, Xiong Y, Sagintayev Z (2010) Trace element geochemistry of groundwater from Quetta Valley, western Pakistan. Environ Earth Sci 60:573–582

    Article  CAS  Google Scholar 

  • Kuswanto H, Naufal A (2019) Evaluation of performance of drought prediction in Indonesia based on TRMM and MERRA-2 using machine learning methods. MethodsX 6:1238–1251

    Article  Google Scholar 

  • Li X, Sha J, Wang ZL (2019) Comparison of daily streamflow forecasts using extreme learning machines and the random forest method. Hydrol Sci J 64(15):1857–1866

    Article  Google Scholar 

  • Machado F, Mine M, Kaviski E, Fill H (2011) Monthly rainfall–runoff modelling using artificial neural networks. Hydrol Sci J-J Des Sciences Hydrologiques 56(3):349–361

    Article  Google Scholar 

  • Nile BK, Hassan WH, Alshama GA (2019) Analysis of the effect of climate change on rainfall intensity and expected flooding by using ANN and SWMM programs. ARPN J Eng Appl Sci 14(5):974–984

    Google Scholar 

  • Ouma YO, Cheruyot R, Wachera AN (2021) Rainfall and runoff time-series trend analysis using LSTM recurrent neural network and wavelet neural network with satellite-based meteorological data: case study of Nzoia hydrologic basin. Complex & Intelligent Systems, pp 1–24

  • Papacharalampous GA, Tyralis H (2018) Evaluation of random forests and Prophet for daily streamflow forecasting. Adv Geosci 45:201–208

    Article  Google Scholar 

  • Patel A, Kethavath A, Kushwaha NL, Naorem A, Jagadale M, Sheetal KR, Renjith PS (2023) Review of artificial intelligence and internet of things technologies in land and water management research during 1991–2021: A bibliometric analysis. Eng Appl Artif Intell 123:106335

    Article  Google Scholar 

  • Poonia V, Tiwari HL (2020) Rainfall-runoff modeling for the Hoshangabad Basin of Narmada River using artificial neural network. Arab J Geosci 13:1–10

    Article  Google Scholar 

  • Poul AK, Shourian M, Ebrahimi H (2019) A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly stream flow prediction. Water Resour Manage 33:2907–2923

    Article  Google Scholar 

  • Ratner B (2009) The correlation coefficient: Its values range between+ 1/− 1, or do they? J Target Meas Anal Mark 17(2):139–142

    Article  Google Scholar 

  • Sagintayev, Z, Sultan, M, Khan, SD, Khan, SA, Mahmood, K, Yan, E, ... Marsala, P (2012) A remote sensing contribution to hydrologic modelling in arid and inaccessible watersheds, Pishin Lora basin, Pakistan. Hydrological Processes, 26(1), 85–99. https://doi.org/10.1002/hyp.8114

  • Sharma S, Sharma S, Athaiya A (2017) Activation functions in neural networks. Towards Data Sci 6(12):310–316

    Google Scholar 

  • Sharma P, Singh S, Sharma SD (2022) Artificial neural network approach for hydrologic river flow time series forecasting. Agricultural Res 11(3):465–476

    Article  Google Scholar 

  • Simon Haykin (McMaster University, Hamilton, Ontario, C) (2005) Neural Networks - A Comprehensive Foundation - Simon Haykin.pdf (p. 823)

  • Soomro AG, Babar MM, Memon AH, Zaidi AZ, Ashraf A, Lund J (2019) Sensitivity of direct runoff to curve number using the SCS-CN method. Civ Eng J 5:2738–2746

    Article  Google Scholar 

  • Soulis KX (2021) Soil conservation service curve number (SCS-CN) Method: Current applications, remaining challenges, and future perspectives. Water 13(2):192

    Article  Google Scholar 

  • Sun N, Zhang S, Peng T, Zhang N, Zhou J, Zhang H (2022) Multi-variables-driven model based on random forest and Gaussian process regression for monthly streamflow forecasting. Water 14(11):1828

    Article  Google Scholar 

  • Talukdar S, Pal S, Shahfahad, Naikoo MW, Parvez A, Rahman A (2023) Trend analysis and forecasting of streamflow using random forest in the Punarbhaba River basin. Environ Monit Assess 195(1):153

    Article  Google Scholar 

  • Taylor KE (2005) Taylor diagram primer. Work. Pap, pp 1–4

  • Umar M, Waseem A, Kassi AM, Farooq M, Sabir MA (2014) Surface and subsurface water quality assessment in semi-arid region: a case study from Quetta and Sorange Intermontane Valleys. Pakistan Global Nest J 16(5):938–954

    Article  Google Scholar 

  • Us Saqib N, Aziz T (2023) Estimation Of Wheat Crop Productivity Of District Nawab Shah Using Metric EEFLUX imagery. J Agric Res 61(1):77–82

    Google Scholar 

  • Us Saqib N, Saeed F, Aziz T (2022) Effect Of Man-Made Structures On Natural Wetlands In Pakistan: A Case Study Of Chotiari Reservoir. J Clean WAS (JCleanWAS) 6(1):01–04

    Google Scholar 

  • Viji R, Rajesh Prasanna P, Ilangovan R (2015) Gis based SCS-CN method for estimating runoff in Kundahpalam watershed, Nilgries District. Tamilnadu Earth Sciences Research J 19(1):59–64

    Article  Google Scholar 

  • Vishwakarma, DK, Kuriqi, A, Abed, SA, Kishore, G, Al-Ansari, N, Pandey, K, ... Jewel, A (2023) Forecasting of stage-discharge in a non-perennial river using machine learning with gamma test. Heliyon, 9(5). https://doi.org/10.1016/j.heliyon.2023.e16290

  • Williams, B, Halloin, C, Löbel, W, Finklea, F, Lipke, E, Zweigerdt, R, Cremaschi, S (2020) Data-driven model development for cardiomyocyte production experimental failure prediction. In Computer aided chemical engineering (Vol. 48, pp. 1639–1644). Elsevier. https://doi.org/10.1016/B978-0-12-823377-1.50274-3

  • Yao L, Wei WEI, Yu Y, Xiao J, Chen L (2018) Rainfall-runoff risk characteristics of urban function zones in Beijing using the SCS-CN model. J Geog Sci 28:656–668

    Article  Google Scholar 

  • Yaseen ZM, El-Shafie A, Jaafar O, Afan HA, Sayl KN (2015) Artificial intelligence based models for streamflow forecasting: 2000–2015. J Hydrol 530:829–844

    Article  Google Scholar 

  • Yeoh, KL, Puay, HT, Abdullah, R, Abd Manan, TS (2023) Appraisal of data-driven techniques for predicting short-term streamflow in tropical catchment. Water Sci Technol https://doi.org/10.2166/wst.2023.193

  • Zhan X, Huang ML (2004) ArcCN-Runoff: an ArcGIS tool for generating curve number and runoff maps. Environ Model Softw 19(10):875–879

    Article  Google Scholar 

Download references

Acknowledgements

The authors are thankful to the Higher Education Commission, Islamabad for providing full financial support under Local Challenges Fund Project No. 20-LCF-63 titled; Metering the aquifer using a smart monitoring and data-driven approach to assist in devising an adaptive groundwater management strategy in Balochistan; and Mehran University of Engineering & Technology, Jamshoro, Pakistan for funding during the collection of literature data and the writing of this paper.

Funding

The authors declare that they have not received any fund to conduct this research.

Author information

Authors and Affiliations

Authors

Contributions

The authors confirm contribution to the paper as follows: study conception and design Ghunwa Shah, devised the research, the main conceptual ideas and proof outline and analysis and interpretation of results. Ghunwa Shah, Shahzad Hussain, Rizwan and Tariq Aziz worked out almost all of the technical details, and wrote the manuscript. Ghunwa Shah, Arjumand Zaidi, Abdul Latif Qureshi, Shahzad Hussain authors contributed to the final version of the manuscript.

Corresponding author

Correspondence to Ghunwa Shah.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Communicated by H. Babaie.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shah, G., Zaidi, A., Qureshi, A.L. et al. Rainfall-runoff modeling using machine learning in the ungauged urban watershed of Quetta Valley, Balochistan (Pakistan). Earth Sci Inform (2024). https://doi.org/10.1007/s12145-024-01302-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12145-024-01302-w

Keywords

Navigation