Abstract
The imbalance in global streamflow gauge distribution and regional data scarcity, especially in large transboundary basins, challenge regional water resource management. Effectively utilizing these limited data to construct reliable models is of crucial practical importance. This study employs a transfer learning (TL) framework to simulate daily streamflow in the Dulong-Irrawaddy River Basin (DIRB), a less-studied transboundary basin shared by Myanmar, China, and India. Our results show that TL significantly improves streamflow predictions: the optimal TL model achieves an average Nash-Sutcliffe efficiency of 0.872, showing a marked improvement in the Hkamti sub-basin. Despite data scarcity, TL achieves a mean NSE of 0.817, surpassing the 0.655 of the process-based model MIKE SHE. Additionally, our study reveals the importance of source model selection in TL, as different parts of the flow are affected by the diversity and similarity of data in the source model. Deep learning models, particularly TL, exhibit complex sensitivities to meteorological inputs, more accurately capturing non-linear relationships among multiple variables than the process-based model. Integrated gradients (IG) analysis further illustrates TL’s ability to capture spatial heterogeneity in upstream and downstream sub-basins and its adeptness in characterizing different flow regimes. This study underscores the potential of TL in enhancing the understanding of hydrological processes in large-scale catchments and highlights its value for water resource management in transboundary basins under data scarcity.
Similar content being viewed by others
References
Alvarez-Garreton C, Mendoza P A, Boisier J P et al., 2018. The CAMELS-CL dataset: Catchment attributes and meteorology for large sample studies–Chile dataset. Hydrology and Earth System Sciences, 22: 5817–5846.
Ault T R, 2020. On the essentials of drought in a changing climate. Science, 368(6488): 256–260.
Bolibar J, Rabatel A, Gouttevin I et al., 2022. Nonlinear sensitivity of glacier mass balance to future climate change unveiled by deep learning. Nature Communications, 13(1): 409.
Borga M, Stoffel M, Marchi L et al., 2014. Hydrogeomorphic response to extreme rainfall in headwater systems: Flash floods and debris flows. Journal of Hydrology, 518: 194–205.
Carvalho D V, Pereira E M, Cardoso J S et al., 2019. Machine learning interpretability: A survey on methods and metrics. Electronics, 8: 832.
Cook C, Bakker K, 2012. Water security: Debating an emerging paradigm. Global Environmental Change, 22: 94–102.
Coxon G, Addor N, Bloomfield JP et al., 2020. Catchment attributes and hydro-meteorological timeseries for 671 catchments across Great Britain (CAMELS-GB). Hydrology and Earth System Sciences, 12: 2459–2483.
Dong X, Chowdhury S, Qian L et al., 2019. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN. PLoS ONE, 14: e0216046.
Fang K, Kifer D, Lawson K et al., 2020. Evaluating the potential and challenges of an uncertainty quantification method for long short-term memory models for soil moisture predictions. Water Resources Research, 56(12): e2020WR028095.
Fang K, Shen C P, Kifer D et al., 2017. Prolongation of SMAP to spatiotemporally seamless coverage of continental U.S. using a deep learning neural network. Geophysical Research Letters, 44: 11030–11039.
Feng D P, Fang K, Shen C P, 2020. Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales. Water Resources Research, 56(9): e2019WR026793.
Feng D P, Lawson K, Shen C P, 2021. Mitigating prediction error of deep learning streamflow models in large data-sparse regions with ensemble modeling and soft data. Geophysical Research Letters, 48: e2021GL092999.
Feng D P, Liu J T, Lawson K et al., 2022. Differentiable, learnable, regionalized process-based models with multiphysical outputs can approach state-of-the-art hydrologic prediction accuracy. Water Resources Research, 58(10): e2022WR032404.
Feng Y, He D M, 2009. Transboundary water vulnerability and its drivers in China. Journal of Geographical Sciences, 19(2): 189–199.
Funk C, Peterson P, Landsfeld M et al., 2015. The climate hazards infrared precipitation with stations: A new environmental record for monitoring extremes. Scientific Data, 2: 150066.
Hashemi R, Brigode P, Garambois P A et al., 2022. How can we benefit from regime information to make more effective use of long short-term memory (LSTM) runoff models? Hydrology and Earth System Sciences, 26: 5793–5816.
He D M, Wu R D, Feng Y et al., 2014. REVIEW: China’s transboundary waters: New paradigms for water and ecological security through applied ecology. Journal of Applied Ecology, 51: 1159–1168.
Hochreiter S, Schmidhuber J, 1997. Long short-term memory. Neural Computation, 9: 1735–1780.
Höge M, Scheidegger A, Baity-Jesi M et al., 2022. Improving hydrologic models for predictions and process understanding using neural ODEs. Hydrology and Earth System Sciences, 26: 5085–5102.
Hu X L, Zhou Z, Xiong H B et al., 2023. Inter-comparison of global precipitation data products at the river basin scale. Hydrology Research, nh2023062.
Huang Q, Long D, Du M D et al., 2020. Daily continuous river discharge estimation for ungauged basins using a hydrologic model calibrated by satellite altimetry: Implications for the SWOT mission. Water Resources Research, 56(7): e2020WR027309.
Jahanshahi A, Patil S D, Goharian E, 2022. Identifying most relevant controls on catchment hydrological similarity using model transferability: A comprehensive study in Iran. Journal of Hydrology, 612: 128193.
Ji X, Chen Y F, Jiang W et al., 2022. Glacier area changes in the Nujiang-Salween River Basin over the past 45 years. Journal of Geographical Sciences, 32(6): 1177–1204.
Ji X, Li Y G, Luo X et al., 2020. Evaluation of bias correction methods for APHRODITE data to improve hydrologic simulation in a large Himalayan basin. Atmospheric Research, 242: 104964.
Karpatne A, Atluri G, Faghmous J H et al., 2017. Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Transactions on Knowledge and Data Engineering, 29: 2318–2331.
Kirchner J W, 2016. Aggregation in environmental systems (Part 1): Seasonal tracer cycles quantify young water fractions, but not mean transit times, in spatially heterogeneous catchments. Hydrology and Earth System Sciences, 20: 279–297.
Klotz D, Kratzert F, Gauch M et al., 2022. Uncertainty estimation with deep learning for rainfall-runoff modeling. Hydrology and Earth System Sciences, 26: 1673–1693.
Kratzert F, Klotz D, Brenner C et al., 2018. Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks. Hydrology and Earth System Sciences, 22: 6005–6022.
Kratzert F, Klotz D, Hochreiter S et al., 2020. A note on leveraging synergy in multiple meteorological datasets with deep learning for rainfall-runoff modeling. Hydrology and Earth System Sciences, 25: 2685–2703.
Kratzert F, Klotz D, Shalev G et al., 2019. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrology and Earth System Sciences, 23: 5089–5110.
Kratzert F, Nearing G, Addor N et al., 2023. Caravan: A global community dataset for large-sample hydrology. Scientific Data, 10: 61.
Leng J, Gao M L, Gong H L et al., 2023. Spatio-temporal prediction of regional land subsidence via ConvLSTM. Journal of Geographical Sciences, 33(10): 2131–2156.
Li B, Li R D, Sun T et al., 2023. Improving LSTM hydrological modeling with spatiotemporal deep learning and multi-task learning: A case study of three mountainous areas on the Tibetan Plateau. Journal of Hydrology, 620: 129401.
Li Y G, Zhang Y Y, He D M et al., 2019. Spatial downscaling of the tropical rainfall measuring mission precipitation using geographically weighted regression kriging over the Lancang River Basin, China. Chinese Geographical Science, 29: 446–462.
Linardatos P, Papastefanopoulos V, Kotsiantis S, 2020. Explainable AI: A review of machine learning interpretability methods. Entropy, 23: 18.
Lu D, Konapala G, Painter S L et al., 2021. Streamflow simulation in data-scarce basins using Bayesian and physics-informed machine learning models. Journal of Hydrometeorology, 22: 1421–1438.
Luo X, Fan X M, Li Y G et al., 2020. Bias correction of a gauge-based gridded product to improve extreme precipitation analysis in the Yarlung Tsangpo–Brahmaputra River basin. Natural Hazards and Earth System Sciences, 20: 2243–2254.
Ma K, Feng D P, Lawson K et al., 2021. Transferring hydrologic data across continents: Leveraging data-rich regions to improve hydrologic prediction in data-sparse regions. Water Resources Research, 57(5): e2020WR028600.
National Water Resources Committee (NWRC), 2018. The Ayeyarwady State of the Basin Assessment. SOBA 1.2: Surface Water Resources. National Water Resources Committee.
Newman A, Sampson K, Clark M et al., 2014. A Large-sample Watershed-scale Hydrometeorological Dataset for the Contiguous USA. Boulder.
Pan S J, Yang Q, 2010. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22: 1345–1359.
Rahmani F, Lawson K, Ouyang W Y et al., 2020. Exploring the exceptional performance of a deep learning stream temperature model and the value of streamflow data. Environmental Research Letters, 16: 024025
Read J S, Jia X W, Willard J et al., 2019. Process-guided deep learning predictions of lake water temperature. Water Resources Research, 55(11): 9173–9190.
Reichert P, Ammann L, Fenicia F, 2021. Potential and challenges of investigating intrinsic uncertainty of hydrological models with stochastic, time-dependent parameters. Water Resources Research, 57(3): e2020WR028400
Rodriguez-Galiano V, Sanchez-Castillo M, Chica-Olmo M et al., 2015. Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geology Reviews, 71: 804–818.
Schewe J, Gosling S N, Reyer C et al., 2019. State-of-the-art global models underestimate impacts from climate extremes. Nature Communications, 10(1): 1005.
Shen C P, 2018. A transdisciplinary review of deep learning research and its relevance for water resources scientists. Water Resources Research, 54(11): 8558–8593.
Shen C P, Appling A P, Gentine P et al., 2023. Differentiable modelling to unify machine learning and physical models for geosciences. Nature Reviews Earth & Environment, 4: 552–567.
Shen C P, Chen X Y, Laloy E, 2021. Editorial: Broadening the use of machine learning in hydrology. Frontiers in Water, 3: 681023.
Shrestha S, Imbulana N, Piman T et al., 2020. Multimodelling approach to the assessment of climate change impacts on hydrology and river morphology in the Chindwin River Basin, Myanmar. CATENA, 188: 104464.
Sun A Y, Scanlon B R, Zhang Z Z et al., 2019. Combining physically based modeling and deep learning for fusing grace satellite data: Can we learn from mismatch? Water Resources Research, 55(2): 1179–1195.
Sundararajan M, Taly A, Yan Q Q, 2017. Axiomatic attribution for deep networks. In: Proceedings of the 34th International Conference on Machine Learning. Sydney, Australia.
The United Nations World Water Development Report 2020: Water and Climate Change, 2020. UNESCO.
The United Nations World Water Development Report 2021: Valuing Water, 2021. UNESCO.
Thrun S, Pratt L, 1998. Learning to Learn. New York: Springer Science & Business Media.
Tsai W P, Feng D P, Pan M et al., 2021. From calibration to parameter learning: Harnessing the scaling effects of big data in geoscientific modeling. Nature Communications, 12(1): 5988.
Wang Y W, Wang L, Li X P et al., 2020. An integration of gauge, satellite, and reanalysis precipitation datasets for the largest river basin of the Tibetan Plateau. Earth System Science Data, 12: 1789–1803.
Yilmaz K K, Gupta H V, Wagener T, 2008. A process-based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model: process-based diagnostic evaluation of hydrologic model. Water Resources Research, 44(9): W09417
Zhang K, Luhar M, Brunner M I et al., 2023. Streamflow prediction in poorly gauged watersheds in the United States through data-driven sparse sensing. Water Resources Research, 59(4): e2022WR034092.
Zhao G, Pang B, Xu Z G et al., 2021. Improving urban flood susceptibility mapping using transfer learning. Journal of Hydrology, 602: 126777.
Zhu Y X, Sang Y F, Wang B et al., 2023. Heterogeneity in spatiotemporal variability of high mountain Asia’s runoff and its underlying mechanisms. Water Resources Research, 59(7): e2022WR032721.
Acknowledgments
We thank Peter Reichert for the support in the sensitivity analysis (Method used in 3.2.1) that helped improve the manuscript. The hydrologic deep learning code used in this work can be accessed at https://doi.org/10.5281/zenodo.3993880. Data for CAMELS can be downloaded at https://ral.ucar.edu/solutions/products/camels. Data for CAMELS-GB can be downloaded at https://doi.org/10.5285/8344e4f3-d2ea-44f5-8afa-86d2987543a9. Data for CAMELS-CL can be downloaded at http://www.cr2.cl/camels-cl/. Discharge data of DIRB can be downloaded at https://www.bafg.de/GRDC/EN/Home/homepage_node.html.
Author information
Authors and Affiliations
Corresponding author
Additional information
Foundation: National Key Research and Development Program of China, No.2022YFF1302405; National Natural Science Foundation of China, No.42201040; The National Key Research and Development Program of China, No.2016YFA0601601; The China Postdoctoral Science Foundation, No.2023M733006
Author: Ma Kai (1992–), PhD, specialized in transboundary hydrology and hydrologic modelling.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Ma, K., Shen, C., Xu, Z. et al. Transfer learning framework for streamflow prediction in large-scale transboundary catchments: Sensitivity analysis and applicability in data-scarce basins. J. Geogr. Sci. 34, 963–984 (2024). https://doi.org/10.1007/s11442-024-2235-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11442-024-2235-x