Skip to main content

Data-Driven Modelling: Concepts, Approaches and Experiences

  • Chapter
Practical Hydroinformatics

Part of the book series: Water Science and Technology Library ((WSTL,volume 68))

Abstract

Data-driven modelling is the area of hydroinformatics undergoing fast development. This chapter reviews the main concepts and approaches of data-driven modelling, which is based on computational intelligence and machine-learning methods. A brief overview of the main methods – neural networks, fuzzy rule-based systems and genetic algorithms, and their combination via committee approaches – is provided along with hydrological examples and references to the rest of the book.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Abarbanel HDI (1996) Analysis of Observed Chaotic Data. Springer-Verlag: New York.

    Google Scholar 

  • Abebe AJ, Solomatine DP, Venneker R (2000) Application of adaptive fuzzy rule-based models for reconstruction of missing precipitation events. Hydrological Sciences Journal 45(3): 425–436.

    Google Scholar 

  • Abebe AJ, Guinot V, Solomatine DP (2000) Fuzzy alpha-cut vs. Monte Carlo techniques in assessing uncertainty in model parameters. Proc. 4th Int. Conference on Hydroinformatics, Cedar Rapids.

    Google Scholar 

  • Abebe AJ, Price RK (2004) Information theory and neural networks for managing uncertainty in flood routing. ASCE Journal of Computing in Civil Engineering 18(4): 373–380.

    Article  Google Scholar 

  • Abrahart RJ, See L (2000) Comparing neural network and autoregressive moving average techniques for the provision of continuous river flow forecast in two contrasting catchments. Hydrological Processes 14: 2157–2172.

    Article  Google Scholar 

  • Abrahart RJ, See L (2002) Multi-model data fusion for river flow forecasting: an evaluation of six alternative methods based on two contrasting catchments. Hydrology and Earth System Sciences 6(4): 655–670.

    Google Scholar 

  • Babovic V, Keijzer M, Stefansson M (2000) Optimal embedding using evolutionary algorithms. Proc. 4th Int. Conference on Hydroinformatics, Cedar Rapids.

    Google Scholar 

  • Bárdossy A, Duckstein L (1995) Fuzzy Rule-Based Modeling with Applications to Geophysical, Biological and Engineering Systems. CRC press Inc: Boca Raton, Florida, USA.

    Google Scholar 

  • Beale R, Jackson T (1990) Neural Computing: An Introduction, Adam Hilger: Bristol.

    Google Scholar 

  • Bhattacharya B, Solomatine DP (2005) Neural networks and M5 model trees in modelling water level – discharge relationship. Neurocomputing 63: 381–396.

    Article  Google Scholar 

  • Bishop CM (1995) Neural Networks for Pattern Recognition. Clarendon Press: Oxford.

    Google Scholar 

  • Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth International: Belmont.

    Google Scholar 

  • Breiman L (1996) Stacked regressor. Machine Learning 24(1): 49–64.

    Google Scholar 

  • Cordón O, Herrara F (1995) A general study on genetic fuzzy systems. In: Winter G, Périaux J, Gálan M, Cuesta P (eds) Genetic Algorithms in Engineering and Computer Science. John Wiley & Sons, Chichester, pp. 33–57.

    Google Scholar 

  • Dawson CW, Wilby R (1998) An artificial neural network approach to rainfall-runoff modelling. Hydrological Sciences Journal 43(1): 47–66.

    Google Scholar 

  • Dibike Y, Solomatine DP, Abbott MB (1999) On the encapsulation of numerical-hydraulic models in artificial neural network. Journal of Hydraulic Research 37(2): 147–161.

    Google Scholar 

  • Dibike YB, Velickov S, Solomatine DP, Abbott MB (2001) Model induction with support vector machines: introduction and applications. ASCE Journal of Computing in Civil Engineering 15(3): 208–216.

    Article  Google Scholar 

  • Frapporti G, Vriend SP, Van Gaans PFM (1993) Hydrogeochemistry of the shallow Dutch groundwater: classification of the national groundwater quality monitoring network. Water Resources Research, 29(9): 2993–3004.

    Article  Google Scholar 

  • Freund Y, Schapire R (1997) A decision-theoretic generalisation of on-line learning and an application of boosting. Journal of Computer and System Science 55(1): 119–139.

    Article  Google Scholar 

  • Galeati G (1990) A comparison of parametric and non-parametric methods for runoff forecasting. Hydrology Sciences Journal 35(1): 79–94.

    Article  Google Scholar 

  • Giustolisi O, Savic DA (2006) A symbolic data-driven technique based on evolutionary polynomial regression. Journal of Hydroinformatics 8(3): 202–207.

    Google Scholar 

  • Goldberg DE (1989) Genetic Algorithms in Search Optimisation and Machine Learning. Addison-Wesley: USA.

    Google Scholar 

  • Govindaraju RS, Ramachandra Rao A (eds) (2001) Artificial Neural Networks in Hydrology. Kluwer: Dordrecht.

    Google Scholar 

  • Hall MJ, Minns AW (1999) The classification of hydrologically homogeneous regions. Hydrological Sciences Journal 44: 693–704.

    Google Scholar 

  • Hannah DM, Smith BPG, Gurnell AM, McGregor GR (2000) An approach to hydrograph classification. Hydrological Processes 14: 317–338.

    Article  Google Scholar 

  • Harris NM, Gurnell AM, Hannah DM, Petts GE (2000) Classification of river regimes: a context for hydrogeology. Hydrological Processes 14: 2831–2848.

    Article  Google Scholar 

  • Haykin S (1999) Neural Networks: A Comprehensive Foundation. McMillan: New York.

    Google Scholar 

  • Hsu KL, Gupta HV, Sorooshian S (1995) Artificial neural network modelling of the rainfall-runoff process. Water Resources Research 31(10): 2517–2530.

    Article  Google Scholar 

  • Jang J-S, Sun C-T, Mizutani E (1997) Neuro-Fuzzy and Soft Computing. Prentice Hall.

    Google Scholar 

  • Jordan MI, Jacobs RA (1995) Modular and hierarchical learning systems. In: Arbib M (ed) The Handbook of Brain Theory and Neural Networks. MIT Press, Cambridge.

    Google Scholar 

  • Karr CL (1991) Genetic algorithms for fuzzy logic controllers. AI Expert 6: 26–33.

    Google Scholar 

  • Karlsson M, Yakowitz S (1987) Nearest neighbour methods for non-parametric rainfall runoff forecasting. Water Resources Research 23(7): 1300–1308.

    Article  Google Scholar 

  • Kasabov K (1996) Foundations of Neural Networks, Fuzzy Systems and Knowledge Engineering. MIT Press: Cambridge.

    Google Scholar 

  • Khu S-T, Liong S-Y, Babovic V, Madsen H, Muttil N (2001) Genetic programming and its application in real-time runoff forecasting, Journal of the American Water Resources Association 37(2): 439–451.

    Article  Google Scholar 

  • Kosko B (1997) Fuzzy engineering. Prentice-Hall: Upper Saddle River.

    Google Scholar 

  • Koza JR (1992) Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press: Cambridge, MA

    Google Scholar 

  • Lekkas DF, Imrie CE, Lees MJ (2001) Improved non-linear transfer function and neural network methods of flow routing for real-time forecasting. Journal of Hydroinformatics 3(3): 153–164.

    Google Scholar 

  • Liong SY, Sivapragasam C (2002) Flood stage forecasting with SVM. Journal of American Water Resources Association 38(1): 173–186.

    Article  Google Scholar 

  • Lobbrecht AH, Solomatine DP (1999) Control of water levels in polder areas using neural networks and fuzzy adaptive systems. In: Savic D, Walters G (eds) Water Industry Systems: Modelling and Optimization Applications. Research Studies Press Ltd., Baldock, pp. 509–518.

    Google Scholar 

  • Minns AW, Hall MJ (1996) Artificial neural network as rainfall-runoff model. Hydrological Sciences Journal 41(3): 399–417.

    Google Scholar 

  • Mitchell TM (1997) Machine Learning. McGraw-Hill: New York.

    Google Scholar 

  • Pesti G, Shrestha BP, Duckstein L, Bogárdi I (1996) A fuzzy rule-based approach to drought assessment. Water Resources Research 32(6): 1741–1747.

    Article  Google Scholar 

  • Phoon KK, Islam MN, Liaw CY, Liong SY (2002) A practical inverse approach for forecasting nonlinear hydrological time series. ASCE Journal of Hydrologic Engineering, 7(2): 116–128.

    Article  Google Scholar 

  • Quinlan JR (1992) Learning with continuous classes. In: Adams A, Sterling L (eds) Proc. AI’92, 5th Australian Joint Conference on Artificial Intelligence. World Scientific, Singapore, pp. 343–348.

    Google Scholar 

  • Reeves CR, Rowe JE (2003) Genetic Algorithms – Principles and Perspectives. A Guide to GA Theory, Kluwer Academic Publishers Group.

    Google Scholar 

  • Rumelhart D, Hinton G, Williams R (1986) Learning internal representations by error propagation. In: Rumelhart D, McClelland J, (eds) Parallel Distributed Processing: Explorations in the microstructure of cognition. Volume 1: Foundations. MIT Press, Cambridge, MA, pp. 318–363.

    Google Scholar 

  • See LM, Openshaw S (2000) A hybrid multi-model approach to river level forecasting. Hydrological Sciences Journal 45: 523–536.

    Google Scholar 

  • Shamseldin AY, O’Connor KM (1996) A nearest neighbour linear perturbation model for river flow forecasting. Journal of Hydrology 179: 353–375.

    Article  Google Scholar 

  • Shamseldin AY, O’Connor KM (2001) A non-linear neural network technique for updating of river flow forecasts. Hydrology and Earth System Sciences 5 (4): 557–597.

    Google Scholar 

  • Solomatine DP, Torres LA (1996) Neural network approximation of a hydrodynamic model in optimizing reservoir operation. Proc. 2nd Int. Conference on Hydroinformatics, Balkema: Rotterdam, 201–206.

    Google Scholar 

  • Solomatine DP (1999) Two strategies of adaptive cluster covering with descent and their comparison to other algorithms. Journal of Global Optimization 14(1): 55–78.

    Article  Google Scholar 

  • Solomatine DP, Rojas C, Velickov S, Wust H (2000) Chaos theory in predicting surge water levels in the North Sea. Proc. 4th Int. Conference on Hydroinformatics, Cedar-Rapids.

    Google Scholar 

  • Solomatine DP, Dulal KN (2003) Model tree as an alternative to neural network in rainfall-runoff modelling. Hydrological Sciences J. 48(3): 399–411.

    Article  Google Scholar 

  • Solomatine DP, Xue Y (2004) M5 model trees and neural networks: application to flood forecasting in the upper reach of the Huai River in China. ASCE J. Hydrologic Engineering 9(6): 491–501.

    Article  Google Scholar 

  • Solomatine DP, Maskey M, Shrestha DL (2007) Instance-based learning compared to other data-driven methods in hydrologic forecasting. Hydrological Processes, 21 (DOI: 10.1002/hyp. 6592).

    Google Scholar 

  • Sudheer KP, Jain SK (2003) Radial basis function neural network for modeling rating curves. ASCE Journal of Hydrologic Engineering 8(3): 161–164.

    Article  Google Scholar 

  • Sugeno M, Kang GT (1988) Structure identification of fuzzy model, Fuzzy Sets and Systems 28(1): 15–33.

    Google Scholar 

  • Takagi T, Sugeno M (1985) Fuzzy identification of systems and its applications to modeling and control. IEEE Transactions on Systems, Man, and Cybernetic SMC-15: 116–132.

    Google Scholar 

  • Toth E, Brath A, Montanari A (2000) Comparison of short-term rainfall prediction models for real-time flood forecasting. Journal of Hydrology 239: 132–147.

    Article  Google Scholar 

  • Vapnik VN (1998) Statistical Learning Theory. Wiley & Sons: New York.

    Google Scholar 

  • Velickov S, Solomatine DP, Yu X, Price RK (2000) Application of data mining techniques for remote sensing image analysis. Proc. 4th Int. Conference on Hydroinformatics, USA.

    Google Scholar 

  • Velickov S, Solomatine DP, Price RK (2003) Prediction of nonlinear dynamical systems based on time series analysis: issues of entropy, complexity and predictability. Proc. of the XXX IAHR Congress, Thessaloniki, Greece.

    Google Scholar 

  • Vernieuwe H, Georgieva O, De Baets B, Pauwels VRN, Verhoest NEC, De Troch FP (2005) Comparison of data-driven Takagi–Sugeno models of rainfall–discharge dynamics. Journal of Hydrology 302(1–4): 173–186.

    Article  Google Scholar 

  • Wang LX (1994) Adaptive Fuzzy Systems and Control: Design and Stability Analysis. PTR Prentice Hall Inc.: Englewood Cliffs, NJ.

    Google Scholar 

  • Werbos PJ (1994) The Roots of Backpropagation. NewYork: John Wiley & Sons (includes Werbos’s 1974 Ph.D. thesis, Beyond Regression).

    Google Scholar 

  • Witten IH, Frank E (2000) Data Mining. Morgan Kaufmann: San Francisco.

    Google Scholar 

  • Xiong LH, Shamseldin AY, O’Connor KM (2001) A non-linear combination of the forecasts of rainfall–runoff models by the first-order Takagi-Sugeno fuzzy system. Journal of Hydrology 245(1–4): 196–217.

    Article  Google Scholar 

  • Yao X, Liu Y (1997) A new evolutionary system for evolving artificial neural networks. IEEE Transactions on Neural Networks 8(3): 694–713.

    Article  Google Scholar 

  • Zadeh LA (1965) Fuzzy sets. Information and Control 8: 338–353.

    Article  Google Scholar 

  • Zhang X, Song X (2006) Spatial pattern identification of soil moisture based on self-organizing neural networks. Proc. 7th Intern. Conf on Hydroinformatics, Nice, September.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Solomatine, D., See, L., Abrahart, R. (2009). Data-Driven Modelling: Concepts, Approaches and Experiences. In: Abrahart, R.J., See, L.M., Solomatine, D.P. (eds) Practical Hydroinformatics. Water Science and Technology Library, vol 68. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79881-1_2

Download citation

Publish with us

Policies and ethics