Skip to main content
Log in

Transfer learning framework for streamflow prediction in large-scale transboundary catchments: Sensitivity analysis and applicability in data-scarce basins

  • Research Articles
  • Published:
Journal of Geographical Sciences Aims and scope Submit manuscript

Abstract

The imbalance in global streamflow gauge distribution and regional data scarcity, especially in large transboundary basins, challenge regional water resource management. Effectively utilizing these limited data to construct reliable models is of crucial practical importance. This study employs a transfer learning (TL) framework to simulate daily streamflow in the Dulong-Irrawaddy River Basin (DIRB), a less-studied transboundary basin shared by Myanmar, China, and India. Our results show that TL significantly improves streamflow predictions: the optimal TL model achieves an average Nash-Sutcliffe efficiency of 0.872, showing a marked improvement in the Hkamti sub-basin. Despite data scarcity, TL achieves a mean NSE of 0.817, surpassing the 0.655 of the process-based model MIKE SHE. Additionally, our study reveals the importance of source model selection in TL, as different parts of the flow are affected by the diversity and similarity of data in the source model. Deep learning models, particularly TL, exhibit complex sensitivities to meteorological inputs, more accurately capturing non-linear relationships among multiple variables than the process-based model. Integrated gradients (IG) analysis further illustrates TL’s ability to capture spatial heterogeneity in upstream and downstream sub-basins and its adeptness in characterizing different flow regimes. This study underscores the potential of TL in enhancing the understanding of hydrological processes in large-scale catchments and highlights its value for water resource management in transboundary basins under data scarcity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Alvarez-Garreton C, Mendoza P A, Boisier J P et al., 2018. The CAMELS-CL dataset: Catchment attributes and meteorology for large sample studies–Chile dataset. Hydrology and Earth System Sciences, 22: 5817–5846.

    Article  Google Scholar 

  • Ault T R, 2020. On the essentials of drought in a changing climate. Science, 368(6488): 256–260.

    Article  CAS  Google Scholar 

  • Bolibar J, Rabatel A, Gouttevin I et al., 2022. Nonlinear sensitivity of glacier mass balance to future climate change unveiled by deep learning. Nature Communications, 13(1): 409.

    Article  CAS  Google Scholar 

  • Borga M, Stoffel M, Marchi L et al., 2014. Hydrogeomorphic response to extreme rainfall in headwater systems: Flash floods and debris flows. Journal of Hydrology, 518: 194–205.

    Article  Google Scholar 

  • Carvalho D V, Pereira E M, Cardoso J S et al., 2019. Machine learning interpretability: A survey on methods and metrics. Electronics, 8: 832.

    Article  Google Scholar 

  • Cook C, Bakker K, 2012. Water security: Debating an emerging paradigm. Global Environmental Change, 22: 94–102.

    Article  Google Scholar 

  • Coxon G, Addor N, Bloomfield JP et al., 2020. Catchment attributes and hydro-meteorological timeseries for 671 catchments across Great Britain (CAMELS-GB). Hydrology and Earth System Sciences, 12: 2459–2483.

    Google Scholar 

  • Dong X, Chowdhury S, Qian L et al., 2019. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN. PLoS ONE, 14: e0216046.

    Article  Google Scholar 

  • Fang K, Kifer D, Lawson K et al., 2020. Evaluating the potential and challenges of an uncertainty quantification method for long short-term memory models for soil moisture predictions. Water Resources Research, 56(12): e2020WR028095.

    Article  Google Scholar 

  • Fang K, Shen C P, Kifer D et al., 2017. Prolongation of SMAP to spatiotemporally seamless coverage of continental U.S. using a deep learning neural network. Geophysical Research Letters, 44: 11030–11039.

    Article  Google Scholar 

  • Feng D P, Fang K, Shen C P, 2020. Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales. Water Resources Research, 56(9): e2019WR026793.

    Article  Google Scholar 

  • Feng D P, Lawson K, Shen C P, 2021. Mitigating prediction error of deep learning streamflow models in large data-sparse regions with ensemble modeling and soft data. Geophysical Research Letters, 48: e2021GL092999.

    Article  Google Scholar 

  • Feng D P, Liu J T, Lawson K et al., 2022. Differentiable, learnable, regionalized process-based models with multiphysical outputs can approach state-of-the-art hydrologic prediction accuracy. Water Resources Research, 58(10): e2022WR032404.

    Article  Google Scholar 

  • Feng Y, He D M, 2009. Transboundary water vulnerability and its drivers in China. Journal of Geographical Sciences, 19(2): 189–199.

    Article  Google Scholar 

  • Funk C, Peterson P, Landsfeld M et al., 2015. The climate hazards infrared precipitation with stations: A new environmental record for monitoring extremes. Scientific Data, 2: 150066.

    Article  Google Scholar 

  • Hashemi R, Brigode P, Garambois P A et al., 2022. How can we benefit from regime information to make more effective use of long short-term memory (LSTM) runoff models? Hydrology and Earth System Sciences, 26: 5793–5816.

    Article  Google Scholar 

  • He D M, Wu R D, Feng Y et al., 2014. REVIEW: China’s transboundary waters: New paradigms for water and ecological security through applied ecology. Journal of Applied Ecology, 51: 1159–1168.

    Article  Google Scholar 

  • Hochreiter S, Schmidhuber J, 1997. Long short-term memory. Neural Computation, 9: 1735–1780.

    Article  CAS  Google Scholar 

  • Höge M, Scheidegger A, Baity-Jesi M et al., 2022. Improving hydrologic models for predictions and process understanding using neural ODEs. Hydrology and Earth System Sciences, 26: 5085–5102.

    Article  Google Scholar 

  • Hu X L, Zhou Z, Xiong H B et al., 2023. Inter-comparison of global precipitation data products at the river basin scale. Hydrology Research, nh2023062.

  • Huang Q, Long D, Du M D et al., 2020. Daily continuous river discharge estimation for ungauged basins using a hydrologic model calibrated by satellite altimetry: Implications for the SWOT mission. Water Resources Research, 56(7): e2020WR027309.

    Article  Google Scholar 

  • Jahanshahi A, Patil S D, Goharian E, 2022. Identifying most relevant controls on catchment hydrological similarity using model transferability: A comprehensive study in Iran. Journal of Hydrology, 612: 128193.

    Article  Google Scholar 

  • Ji X, Chen Y F, Jiang W et al., 2022. Glacier area changes in the Nujiang-Salween River Basin over the past 45 years. Journal of Geographical Sciences, 32(6): 1177–1204.

    Article  Google Scholar 

  • Ji X, Li Y G, Luo X et al., 2020. Evaluation of bias correction methods for APHRODITE data to improve hydrologic simulation in a large Himalayan basin. Atmospheric Research, 242: 104964.

    Article  Google Scholar 

  • Karpatne A, Atluri G, Faghmous J H et al., 2017. Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Transactions on Knowledge and Data Engineering, 29: 2318–2331.

    Article  Google Scholar 

  • Kirchner J W, 2016. Aggregation in environmental systems (Part 1): Seasonal tracer cycles quantify young water fractions, but not mean transit times, in spatially heterogeneous catchments. Hydrology and Earth System Sciences, 20: 279–297.

    Article  CAS  Google Scholar 

  • Klotz D, Kratzert F, Gauch M et al., 2022. Uncertainty estimation with deep learning for rainfall-runoff modeling. Hydrology and Earth System Sciences, 26: 1673–1693.

    Article  Google Scholar 

  • Kratzert F, Klotz D, Brenner C et al., 2018. Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks. Hydrology and Earth System Sciences, 22: 6005–6022.

    Article  Google Scholar 

  • Kratzert F, Klotz D, Hochreiter S et al., 2020. A note on leveraging synergy in multiple meteorological datasets with deep learning for rainfall-runoff modeling. Hydrology and Earth System Sciences, 25: 2685–2703.

    Article  Google Scholar 

  • Kratzert F, Klotz D, Shalev G et al., 2019. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrology and Earth System Sciences, 23: 5089–5110.

    Article  Google Scholar 

  • Kratzert F, Nearing G, Addor N et al., 2023. Caravan: A global community dataset for large-sample hydrology. Scientific Data, 10: 61.

    Article  Google Scholar 

  • Leng J, Gao M L, Gong H L et al., 2023. Spatio-temporal prediction of regional land subsidence via ConvLSTM. Journal of Geographical Sciences, 33(10): 2131–2156.

    Article  Google Scholar 

  • Li B, Li R D, Sun T et al., 2023. Improving LSTM hydrological modeling with spatiotemporal deep learning and multi-task learning: A case study of three mountainous areas on the Tibetan Plateau. Journal of Hydrology, 620: 129401.

    Article  Google Scholar 

  • Li Y G, Zhang Y Y, He D M et al., 2019. Spatial downscaling of the tropical rainfall measuring mission precipitation using geographically weighted regression kriging over the Lancang River Basin, China. Chinese Geographical Science, 29: 446–462.

    Article  Google Scholar 

  • Linardatos P, Papastefanopoulos V, Kotsiantis S, 2020. Explainable AI: A review of machine learning interpretability methods. Entropy, 23: 18.

    Article  Google Scholar 

  • Lu D, Konapala G, Painter S L et al., 2021. Streamflow simulation in data-scarce basins using Bayesian and physics-informed machine learning models. Journal of Hydrometeorology, 22: 1421–1438.

    Google Scholar 

  • Luo X, Fan X M, Li Y G et al., 2020. Bias correction of a gauge-based gridded product to improve extreme precipitation analysis in the Yarlung Tsangpo–Brahmaputra River basin. Natural Hazards and Earth System Sciences, 20: 2243–2254.

    Article  Google Scholar 

  • Ma K, Feng D P, Lawson K et al., 2021. Transferring hydrologic data across continents: Leveraging data-rich regions to improve hydrologic prediction in data-sparse regions. Water Resources Research, 57(5): e2020WR028600.

    Article  Google Scholar 

  • National Water Resources Committee (NWRC), 2018. The Ayeyarwady State of the Basin Assessment. SOBA 1.2: Surface Water Resources. National Water Resources Committee.

  • Newman A, Sampson K, Clark M et al., 2014. A Large-sample Watershed-scale Hydrometeorological Dataset for the Contiguous USA. Boulder.

  • Pan S J, Yang Q, 2010. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22: 1345–1359.

    Article  Google Scholar 

  • Rahmani F, Lawson K, Ouyang W Y et al., 2020. Exploring the exceptional performance of a deep learning stream temperature model and the value of streamflow data. Environmental Research Letters, 16: 024025

    Google Scholar 

  • Read J S, Jia X W, Willard J et al., 2019. Process-guided deep learning predictions of lake water temperature. Water Resources Research, 55(11): 9173–9190.

    Article  Google Scholar 

  • Reichert P, Ammann L, Fenicia F, 2021. Potential and challenges of investigating intrinsic uncertainty of hydrological models with stochastic, time-dependent parameters. Water Resources Research, 57(3): e2020WR028400

    Article  Google Scholar 

  • Rodriguez-Galiano V, Sanchez-Castillo M, Chica-Olmo M et al., 2015. Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geology Reviews, 71: 804–818.

    Article  Google Scholar 

  • Schewe J, Gosling S N, Reyer C et al., 2019. State-of-the-art global models underestimate impacts from climate extremes. Nature Communications, 10(1): 1005.

    Article  Google Scholar 

  • Shen C P, 2018. A transdisciplinary review of deep learning research and its relevance for water resources scientists. Water Resources Research, 54(11): 8558–8593.

    Article  Google Scholar 

  • Shen C P, Appling A P, Gentine P et al., 2023. Differentiable modelling to unify machine learning and physical models for geosciences. Nature Reviews Earth & Environment, 4: 552–567.

    Article  Google Scholar 

  • Shen C P, Chen X Y, Laloy E, 2021. Editorial: Broadening the use of machine learning in hydrology. Frontiers in Water, 3: 681023.

    Article  Google Scholar 

  • Shrestha S, Imbulana N, Piman T et al., 2020. Multimodelling approach to the assessment of climate change impacts on hydrology and river morphology in the Chindwin River Basin, Myanmar. CATENA, 188: 104464.

    Article  Google Scholar 

  • Sun A Y, Scanlon B R, Zhang Z Z et al., 2019. Combining physically based modeling and deep learning for fusing grace satellite data: Can we learn from mismatch? Water Resources Research, 55(2): 1179–1195.

    Article  Google Scholar 

  • Sundararajan M, Taly A, Yan Q Q, 2017. Axiomatic attribution for deep networks. In: Proceedings of the 34th International Conference on Machine Learning. Sydney, Australia.

  • The United Nations World Water Development Report 2020: Water and Climate Change, 2020. UNESCO.

  • The United Nations World Water Development Report 2021: Valuing Water, 2021. UNESCO.

  • Thrun S, Pratt L, 1998. Learning to Learn. New York: Springer Science & Business Media.

    Book  Google Scholar 

  • Tsai W P, Feng D P, Pan M et al., 2021. From calibration to parameter learning: Harnessing the scaling effects of big data in geoscientific modeling. Nature Communications, 12(1): 5988.

    Article  CAS  Google Scholar 

  • Wang Y W, Wang L, Li X P et al., 2020. An integration of gauge, satellite, and reanalysis precipitation datasets for the largest river basin of the Tibetan Plateau. Earth System Science Data, 12: 1789–1803.

    Article  Google Scholar 

  • Yilmaz K K, Gupta H V, Wagener T, 2008. A process-based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model: process-based diagnostic evaluation of hydrologic model. Water Resources Research, 44(9): W09417

    Article  Google Scholar 

  • Zhang K, Luhar M, Brunner M I et al., 2023. Streamflow prediction in poorly gauged watersheds in the United States through data-driven sparse sensing. Water Resources Research, 59(4): e2022WR034092.

    Article  Google Scholar 

  • Zhao G, Pang B, Xu Z G et al., 2021. Improving urban flood susceptibility mapping using transfer learning. Journal of Hydrology, 602: 126777.

    Article  Google Scholar 

  • Zhu Y X, Sang Y F, Wang B et al., 2023. Heterogeneity in spatiotemporal variability of high mountain Asia’s runoff and its underlying mechanisms. Water Resources Research, 59(7): e2022WR032721.

    Article  Google Scholar 

Download references

Acknowledgments

We thank Peter Reichert for the support in the sensitivity analysis (Method used in 3.2.1) that helped improve the manuscript. The hydrologic deep learning code used in this work can be accessed at https://doi.org/10.5281/zenodo.3993880. Data for CAMELS can be downloaded at https://ral.ucar.edu/solutions/products/camels. Data for CAMELS-GB can be downloaded at https://doi.org/10.5285/8344e4f3-d2ea-44f5-8afa-86d2987543a9. Data for CAMELS-CL can be downloaded at http://www.cr2.cl/camels-cl/. Discharge data of DIRB can be downloaded at https://www.bafg.de/GRDC/EN/Home/homepage_node.html.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daming He.

Additional information

Foundation: National Key Research and Development Program of China, No.2022YFF1302405; National Natural Science Foundation of China, No.42201040; The National Key Research and Development Program of China, No.2016YFA0601601; The China Postdoctoral Science Foundation, No.2023M733006

Author: Ma Kai (1992–), PhD, specialized in transboundary hydrology and hydrologic modelling.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ma, K., Shen, C., Xu, Z. et al. Transfer learning framework for streamflow prediction in large-scale transboundary catchments: Sensitivity analysis and applicability in data-scarce basins. J. Geogr. Sci. 34, 963–984 (2024). https://doi.org/10.1007/s11442-024-2235-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11442-024-2235-x

Keywords

Navigation