Skip to main content
Log in

How Bayesian data assimilation can be used to estimate the mathematical structure of a model

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

In previous work, we presented a method for estimation and correction of non-linear mathematical model structures, within a Bayesian framework, by merging uncertain knowledge about process physics with uncertain and incomplete observations of dynamical input-state-output behavior. The resulting uncertainty in the model input-state-output mapping is expressed as a weighted combination of an uncertain conceptual model prior and a data-derived probability density function, with weights depending on the conditional data density. Our algorithm is based on the use of iterative data assimilation to update a conceptual model prior using observed system data, and thereby construct a posterior estimate of the model structure (the mathematical form of the equation itself, not just its parameters) that is consistent with both physically based prior knowledge and with the information in the data. An important aspect of the approach is that it facilitates a clear differentiation between the influences of different types of uncertainties (initial condition, input, and mapping structure) on the model prediction. Further, if some prior assumptions regarding the structural (mathematical) forms of the model equations exist, the procedure can help reveal errors in those forms and how they should be corrected. This paper examines the properties of the approach by investigating two case studies in considerable detail. The results show how, and to what degree, the structure of a dynamical hydrological model can be estimated without little or no prior knowledge (or under conditions of incorrect prior information) regarding the functional forms of the storage–streamflow and storage–evapotranspiration relationships. The importance and implications of careful specification of the model prior are illustrated and discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Arulampalam MS, Maskell S, Gordon N, Clapp T (2002) A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Process 50(2):174–188

    Article  Google Scholar 

  • Beven KJ, Binley AM (1992) The future of distributed models: model calibration and uncertainty in prediction. Hydrol Process 6:279–298

    Article  Google Scholar 

  • Bulygina N (2007) Model structure identification and correction through data assimilation. PhD dissertation, Department of Hydrology and Water Resources, The University of Arizona, Tucson

  • Bulygina N, Gupta H (2009) Estimating the uncertain mathematical structure of a water balance model via Bayesian data assimilation, Water Resour Res 45 (special issue on ‘Uncertainty Assessment in Surface and Subsurface Hydrology’W00B13). doi:10.1029/2007WR006749

  • Clark MP, Slater AG, Rupp DE, Woods RA, Vrugt JA, Gupta GV, Wagener T, Hay LE (2008) FUSE: a modular framework to diagnose differences between hydrological models. Water Resour Res 44:W00B02. doi:10.1029/2007WR006735

  • de Vos NJ, Rientjes THM, Gupta HV (2009) Diagnostic evaluation of conceptual rainfall–runoff models using temporal clustering. Hydrol Process (in press)

  • Doucet A, Godsill S, Andriew C (2000) On sequential Monte Carlo methods for Bayesian filtering. Stat Comput 10:197–208

    Article  Google Scholar 

  • Dugundji J, Granas A (2003) Fixed point theory. Springer-Verlag. ISBN 0-387-00173-5

  • Ferguson TS (1983) Bayesian density estimation by mixture of normal distributions. In: Rizvi H, Rustagi J (eds) Recent advances in statistics. Wiley, New York

  • Freer J, Beven KJ, Ambroise B (1996) Bayesian estimation of uncertainty in runoff prediction and the value of data: an example of the GLUE approach. Water Resour Res 32(7):2161–2173

    Article  Google Scholar 

  • Gupta HV, Wagener T, Liu Y (2008) Reconciling theory with observations: towards a diagnostic approach to model evaluation. Hydrol Process 22(18):3802–3813. doi:10.1002/hyp.6989

    Article  Google Scholar 

  • Hurzeler M, Kunsch HR (1998) Monte Carlo approximation for general state-space models. J Comput Graph Stat 7(2):175–193

    Article  Google Scholar 

  • Kullback S, Leibler RA (1951) On information and sufficiency. Ann Math Stat 22:79–86

    Article  Google Scholar 

  • Liu Y, Gupta H (2007) Uncertainty in hydrologic modeling: towards an integrated hierarchical data assimilation framework. Water Resour Res 43:W07401. doi:10.1029/2006WR005756

    Article  Google Scholar 

  • Martinez GF, Gupta HV (2009) Continental scale diagnostic evaluation of the ‘abcd’ monthly water balance model for the conterminous US. Water Resour Res (in press)

  • Moradkhani H, Hsu KL, Gupta H, Sorooshian S (2005) Uncertainty assessment of hydrological model states and parameters: sequential data assimilation using the particle filter. Water Resour Res 41(5):W0501

    Article  Google Scholar 

  • Muller P, Erkanli A, West M (1996) Bayesian curve fitting using multivariate normal mixtures. Biometrika 83(1):67–79

    Article  Google Scholar 

  • Neuman S (2003) Maximum likelihood Bayesian averaging of alternative conceptual-mathematical models. Stoch Environ Res Risk Assess 17(5):291–305. doi:10.1007/s00477-003-0151-7

    Article  Google Scholar 

  • Tanner M, Wong W (1987) The calculation of posterior distributions by data augmentation. J Am Stat Assoc 82(392):528–540

    Article  Google Scholar 

  • Thiemann M, Trosset MW, Gupta HV, Sorooshian S (2001) Recursive Bayesian inference for hydrologic models. Water Resour Res 37(10):2521–2535

    Article  Google Scholar 

  • Wagener T, McIntyre N, Lees MJ, Wheater HS, Gupta HV (2003) Towards reduced uncertainty in conceptual rainfall-runoff modelling: dynamic identifiability analysis. Hydrol Process 17(2):455–476

    Article  Google Scholar 

  • Woolhiser DA, Smith RE, Goodrich DC (1990) KINEROS: a kinematic runoff and erosion model. USDA Agric Res Serv ARS-77:130

  • Ye M, Neuman SP, Meyer PD (2004) Maximum likelihood Bayesian averaging of spatial variability models in unsaturated fractured tuff. Water Resour Res 40:W05113. doi:10.1029/2003WR002557

    Article  Google Scholar 

  • Young PC (2001) Data-based mechanistic modelling and validation of rainfall-flow processes. In: Anderson MG, Bates PG (eds) Model validation. Wiley, Chichester

    Google Scholar 

Download references

Acknowledgements

We gratefully acknowledge support for this work provided by the Department of Hydrology and Water Resources of the University of Arizona, the National Weather Service Office of Hydrology under grant NA04NWS462001 and by SAHRA under NSF-STC grant EAR-9876800. The first author was also partially supported by a Doctoral Fellowship from the Salt River Project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nataliya Bulygina.

Appendix

Appendix

Bias and Nash–Sutcliffe coefficient (NS) performance measures for the output predictions Q, when either the ensembles, expected value estimate, maximum likelihood estimate, or 25, 50, 75 and 95% probability mass regions are used as predictors: \( Bias = {\frac{{\sum\nolimits_{t} {Q_{t}^{pred} - Q_{t}^{true} } }}{{\sum\nolimits_{t} {Q_{t}^{true} } }}}*100\% ,\,NS = 1 - {\frac{{\sum\nolimits_{t} {(Q_{t}^{pred} - Q_{t}^{true} )^{2} } }}{{\sum\nolimits_{t} {(Q_{t}^{true} - Q_{0}^{true} )^{2} } }}}. \) Where the ‘pred’ and ‘true’ super-indices denote predicted and true values, respectively, and Q true 0 denotes the mean of the sequence of true values. For the probability mass regions, the measures are computed using pdf-weighted averages.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bulygina, N., Gupta, H. How Bayesian data assimilation can be used to estimate the mathematical structure of a model. Stoch Environ Res Risk Assess 24, 925–937 (2010). https://doi.org/10.1007/s00477-010-0387-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-010-0387-y

Keywords

Navigation