Skip to main content
Log in

Probabilistic Prediction for Monthly Streamflow through Coupling Stepwise Cluster Analysis and Quantile Regression Methods

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

In this study, a stepwise cluster forecasting (SCF) framework is proposed for probabilistic prediction for monthly streamflow through integrating stepwise cluster analysis and quantile regression methods. The developed SCF method can capture discrete and nonlinear relationships between explanatory and response variables. A cluster tree was generated through the SCF method for reflecting complex relationships between independent (i.e. explanatory) and dependent (i.e. response) variables in the hydrologic system. Quantile regression approach was employed to construct probabilistic relationships between inputs and outputs in each leaf of the cluster tree. The developed SCF method was applied for monthly streamflow prediction in Xiangxi River based on the gauged records at Xingshan gauging station and related meteorological data. The performance of the SCF method was evaluated through indices of percent bias (PBIAS), RMSE-observations standard deviation ratio (RSR), and Nash-Sutcliffe efficiency coefficient (NSE). Two new indices, the relative distance to the bounds (RDB) and the degree of uncertainty (DOU) were proposed to reflect the uncertainty of the predictions from SCF model. The results showed that the uncertainty of the predictions was acceptable and would not change significantly in calibration and validation periods. Quantile regression was integrated into prediction process of the SCF approach to provide probabilistic forecasts. The 90 % confidence intervals could well bracket the observations in both calibration and validation periods. Comparison among different forecasting techniques showed the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Adamowski J, Karapataki C (2008) Comparison of multivariate regression and artificial neural networks for peak urban water demand forecasting: the evaluation of different ANN learning algorithms. J Hydrol Eng ASCE 15(10):729–743

    Article  Google Scholar 

  • Adamowski J, Chan HF, Prasher SO, Ozga-Zielinski B, Sliusarieva A (2012) Comparison of multiple linear and nonlinear regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods for urban water demand forecasting in Montreal, Canada. Water Resour Res 48:W01528

    Article  Google Scholar 

  • Agarwal A, Maheswaran R, Kurths J, Khosa R (2016) Wavelet Spectrum and self-organizing maps-based approach for hydrologic regionalization -a case study in the western United States. Water Resour Manag 30(12):4399–4413

    Article  Google Scholar 

  • Cigizoglu HK (2005) Generalized regression neural network in monthly flow forecasting. Civ Eng Environ Syst 22(2):71–84

    Article  Google Scholar 

  • Cooley WW, Lohnes PR (1971). Multivariate data analysis. Wiley, Inc.

  • Dai C, Cai YP, Lu WT, Liu H, Guo HC (2016) Conjunctive water use optimization for watershed-Lake water distribution system under uncertainty: a case study. Water Resour Manag 30(12):4429–4449

    Article  Google Scholar 

  • Erechtchoukova MG, Khaiter PA, Saffarpour S (2016) Short-term predictions of hydrological events on an urbanized watershed using supervised classification. Water Resour Manag 30(12):4329–4343

    Article  Google Scholar 

  • Fan YR, Huang W, Huang GH, Li Z, Li YP, Wang XQ, Jin L (2015a) A stepwise-cluster forecasting approach for monthly streamflows based on climate teleconnections. Stoch Env Res Risk A 29(6):1557–1569

    Article  Google Scholar 

  • Fan YR, Huang WW, Huang GH, Li YP, Huang K (2015b) A coupled ensemble filtering and probabilistic collocation approach for uncertainty quantification of hydrological models. J Hydrol 530:255–272

    Article  Google Scholar 

  • Fan YR, Huang GH, Huang K, Baetz BW (2015c) Planning water resources allocation under multiple uncertainties through a generalized fuzzy two-stage stochastic programming method. IEEE Trans Fuzzy Syst 23(5):1488–1504. doi:10.1109/TFUZZ.2014.2362550

    Article  Google Scholar 

  • Fan YR, Huang W, Huang GH, Huang K, Zhou X (2015d) A PCM-based stochastic hydrological model for uncertainty quantification in watershed systems. Stoch Env Res Risk A 29(3):915–927

    Article  Google Scholar 

  • Fan YR, Huang GH, Li YP, Kong XM (2016a) Bivariate hydrologic risk analysis based on coupled entropy-copula method for the Xiangxi River in Three Gorges Reservoir Area. China Theor Appl Climatol 125:381–397

    Article  Google Scholar 

  • Fan YR, Huang W, Huang GH, Li YP, Huang K (2016b) Hydrologic risk analysis in the Yangtze River basin through coupling Gaussian mixtures into copulas. Adv Water Resour 88:170–185

    Article  Google Scholar 

  • Haghighatjou P, Akhoond-Ali AM, Behnia A, Chinipardaz R (2008) Parametric and nonparametric frequency analysis of monthly precipitation in Iran. J Appl Sci 8:3242–3248

    Article  Google Scholar 

  • Han JC, Huang GH, Zhang H, Li Z, Li YP (2014) Bayesian uncertainty analysis in hydrological modeling associated with watershed subdivision level: a case study of SLURP model applied to the Xiangxi River watershed. China Stochastic Env Res Risk A. doi:10.1007/s00477-013-0792-0

    Google Scholar 

  • He L, Huang GH, Lu HW, Zeng GM (2008) Optimization of surfactant-enhanced aquifer remediation for a laboratory BTEX system under parameter uncertainty. Environ Sci Technol 42(6):2009–2014

    Article  Google Scholar 

  • Huang G (1992) A stepwise cluster analysis method for predicting air quality in an urban environment. Atmos Environ Part B. Urban Atmos 26(3):349–357

    Article  Google Scholar 

  • Huang GH, Huang YF, Wang GQ, Xiao HN (2006) Development of a forecasting system for supporting remediation design and process control based on NAPL-biodegradation simulation and stepwise-cluster analysis. Water Resour Res 42:W06413

    Google Scholar 

  • Kennedy WJ, Gentle JE (1981) Statistics: Textbooks and Monographs. Marcel Dekker, New York

    Google Scholar 

  • Li Z, Huang GH, Fan YR, Xu JL (2015) Hydrologic risk analysis for Nonstationary streamflow records under uncertainty. J Environ Inform 26(1):41–51

    Google Scholar 

  • Lu H, Zhang Y, Wang X, He L (2014) A semiparametric statistical approach for forecasting SO2 and NOx concentrations. Environ Sci Pollut Res. doi:10.1007/s11356-014-2748-4

    Google Scholar 

  • Moriasi DN, Amold JG, Van Liew MW, Binger RL, Harmel RD, Veith T (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50(3):885–900

    Article  Google Scholar 

  • Morrison DF (1967) Multivariate Statistical Methods. McGraw-Hill, New York

    Google Scholar 

  • Nikoo MR, Beiglou PHB, Mahjouri N (2016) Optimizing multiple-pollutant waste load allocation in rivers: an interval parameter game theoretic model. Water Resour Manag 30(12):4201–4220

    Article  Google Scholar 

  • Nourani V, Khanghah TR, Baghanam AH (2015) Application of entropy concept for input selection of wavelet-ANN based rainfall-runoff modeling. J Environ Inform 26(1):52–70

    Google Scholar 

  • Okkan U, Serbes ZA (2012) Rainfall-runoff modeling using least squares support vector machines. Environmetrics 23:549–564

    Article  Google Scholar 

  • Overall JE, Klett CJ (1972) Applied Multivariate Analysis. McGraw-Hill, New York

    Google Scholar 

  • Qin XS, Huang GH, Zeng GM, Chakma A (2008) Simulation-based optimization of dual-phase vacuum extraction to remove nonaqueous phase liquids in subsurface. Water Resour Res 44:W04422

    Article  Google Scholar 

  • Rahmani MA, Zarghami M (2015) The use of statistical weather generator, hybrid data driven and system dynamics models for water resources management under climate change. J Environ Inform 25(1):23–35

    Article  Google Scholar 

  • Rao CR (1965) Linear Statistical Inference and Its Applications. Wiley, Hoboken

    Google Scholar 

  • Sachindra DA, Huang F, Barton A, Perera BJC (2013) Least square support vector and multi-linear regression for statistically downscaling general circulation model outputs to catchment streamflows. Int J Climatol 33:1087–1106

    Article  Google Scholar 

  • Sadri S, Burn DH (2012) Nonparametric methods for drought severity estimation atungauged sites. Water Resour Res 48:W12505

    Article  Google Scholar 

  • Sivakumar B, Singh VP (2012) Hydrologic system complexity and nonlinear dynamic concepts for a catchment classification framework. Hydrol Earth Syst Sci 16:4119–4131

    Article  Google Scholar 

  • Song JY, Chung E (2016) Robustness, Uncertainty and Sensitivity Analyses of the TOPSIS Method for Quantitative Climate Change Vulnerability: a Case Study of Flood Damage. Water Resour Manag. doi:10.1007/s11269-016-1451-2

    Google Scholar 

  • Sujay Raghavendra N, Deka PC (2014) Support vector machine application in the field of hydrology: a review. Appl Soft Comput 19:371–386

    Google Scholar 

  • Tatsuoka MM (1971) Multivariate Analysis. John Wiley, Hoboken

    Google Scholar 

  • Turan EM, Yurdusev AM (2009) River flow estimation from upstream flow records by artificial intelligence methods. J Hydrol 369:71–77

    Article  Google Scholar 

  • Wang WC, Chau KW, Cheng CT, Qiu L (2009) A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol 374:294–306

    Article  Google Scholar 

  • Wang S, Huang GH, He L (2012) Development of a clusterwise-linear-regression-based forecasting system for characterizing DNAPL dissolution behaviors in porous media. Sci Total Environ 433:141–150

    Article  Google Scholar 

  • Wang X, Huang G, Lin Q, Nie X, Cheng G, Fan Y, Li Z, Yao Y, Suo M (2013) A stepwise cluster analysis approach for downscaled climate projection- A Canadian case study. Environ Model Softw 49:141–151

    Article  Google Scholar 

  • Wilks SS (1962) Mathematical Statistics. Wiley, New York

    Google Scholar 

  • Wu CL (2010). Hydrological predictions using data-driven models coupled with data preprocessing techniques. Ph.D thesis, The Hong Kong Polytechnic University

  • Xiao WH, Wang JH, Huang YF, Sun SC, Zhou YY (2015) An approach for estimating the nitrobenzene (NB) emission effect in frozen rivers: a case study of nitrobenzene pollution in the Songhua River, China. J Environ Inform 26(2):140–147

    Google Scholar 

  • Yang Y, Zhang CT, Zhang RX, Christakos G (2015) Improving environmental prediction by assimilating auxiliary information. J Environ Inform 26(2):91–105

    Google Scholar 

Download references

Acknowledgments

This research was supported by the Natural Science Foundation of China (No. 51190095) and the Program for Innovative Research Team in University (IRT1127) and the Natural Science and Engineering Research Council of Canada.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to G. H. Huang.

Electronic supplementary material

ESM 1

(DOCX 19 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fan, Y.R., Huang, G.H., Li, Y.P. et al. Probabilistic Prediction for Monthly Streamflow through Coupling Stepwise Cluster Analysis and Quantile Regression Methods. Water Resour Manage 30, 5313–5331 (2016). https://doi.org/10.1007/s11269-016-1489-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-016-1489-1

Keywords

Navigation