Skip to main content

Advertisement

Log in

An adaptive response surface method for continuous Bayesian model calibration

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Non-linear numerical models of the injection phase of a carbon sequestration (CS) project are computationally demanding. Thus, the computational cost of the calibration of these models using sampling-based solutions can be formidable. The Bayesian adaptive response surface method (BARSM)—an adaptive response surface method (RSM)—is developed to mitigate the cost of sampling-based, continuous calibration of CS models. It is demonstrated that the adaptive scheme has a negligible effect on accuracy, while providing a significant increase in efficiency. In the BARSM, a meta-model replaces the computationally costly full model during the majority of the calibration cycles. In the remaining cycles, the full model is used and samples of these cycles are utilized for adaptively updating the meta-model. The idea behind the BARSM is to take advantage of the fact that sampling-based calibration algorithms typically tend to sample more frequently from areas with a larger posterior density than from areas with a smaller posterior density. This behavior of the sampling-based calibration algorithms is used to adaptively update the meta-model and to make it more accurate where it is most likely to be evaluated. The BARSM is integrated with Unscented Importance Sampling (UIS) (Sarkarfarshi and Gracie, Stoch Env Res Risk Assess 29: 975–993, 2015), which is an efficient Bayesian calibration algorithm. A synthesized case of supercritical CO2 injection in a heterogeneous saline aquifer is used to assess the performance of the BARSM and to compare it with a classical non-adaptive RSM approach and Bayesian calibration method UIS without using RSM. The BARSM is shown to reduce the computational cost compared to non-adaptive Bayesian calibration by 87 %, with negligible effect on accuracy. It is demonstrated that the error of the meta-model fitted using the BARSM, when samples are drawn from the posterior parameter distribution, is negligible and smaller than the monitoring error.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. To allow the meta-model to be used to forecast the system behaviour, the meta-model is trained using the output of the full model simulations up to \( t_{{N_{t} }} \), where \( t_{{N_{t} }} \gg t_{n} \).

  2. Since the distribution of the errors was close to a Normal distribution, the 2.5 and 97.5 % quantiles were estimated by subtracting and adding 1.96 standard deviations to the mean.

References

  • Abraham A (2005) Adaptation of fuzzy inference system using neural learning. In: Abraham A, Nedjah N, Mourelle LM (eds) Fuzzy systems engineering. Springer, Heidelberg, p 53–83

    Google Scholar 

  • Bachu S (2000) Sequestration of CO2 in geological media, criteria and approach for site selection in response to climate change. Energy Conserv Manag 41:953–970

    Article  CAS  Google Scholar 

  • Bachu S (2008) CO2 storage in geological media: role, means, status and barriers to deployment. Prog Energy Combust Sci 34:254–273

    Article  CAS  Google Scholar 

  • Balakrishnan S, Roy A, Ierapetritou MG, Flach GP, Georgopoulos PG (2003) Uncertainty reduction and characterization for complex environmental fate and transport models: an empirical bayesian framework incorporating the stochastic response surface method. Water Resour Res, 39(12):1350–1362

  • Barton RR (1994) Metamodeling: a state of the art review. Proceedings of the 1994 winter simulation conference, pp 237–244

  • Buhmann MD (2003) Radial basis functions: theory and implementations. Cambridge University Press, New York

    Book  Google Scholar 

  • Celia M, Nordbotten J (2009) Practical modelling approaches for geological storage of carbon dioxide. Ground Water 47(5):627–638

    Article  CAS  Google Scholar 

  • Ebigbo A, Class H, Helmig R (2007) CO2 leakage through an abandoned well: problem-oriented benchmarks. Comput Geosci 11:103–115

    Article  Google Scholar 

  • Espinet AJ, Shoemaker CA (2013) Comparison of optimization algorithms for parameter estimation of multiphase flow models with application to geological carbon sequestration. Adv Water Resour 54:133–148

    Article  Google Scholar 

  • Firoozabadi A, Cheng P (2010) Prospects for subsurface CO2 sequestration. Am Inst Chem Eng J 56(6):1398–1405

    Article  CAS  Google Scholar 

  • Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19(1):1–67

    Article  Google Scholar 

  • IEA (2009) CO2 emissions from fuel combustion highlights. International Energy Agency, Paris

    Google Scholar 

  • Isukapalli SS, Roy A, Georgopoulos P (2000) Efficient sensitivity/uncertainty analysis using the combined stochastic response surface method and automated differentiation: application to environmental and biological systems. Risk Anal 20(5):591–602

    Article  CAS  Google Scholar 

  • Jenkinson D, Adams D, Wild A (1991) Model estimates of CO2 emissions from soil in response to global warming. Nature 351:304–306

    Article  CAS  Google Scholar 

  • Jin R, Chen W, Simpson TW (2001) Comparative studies of metamodeling techniques under multiple modeling criteria. Struct Multidiscip Optim 23(1):1–13

    Article  Google Scholar 

  • Jin R, Du X, Chen W (2003) The use of metamodeling techniques for optimization. Struct Multidicip Optim 25(2):99–116

    Article  Google Scholar 

  • Johnson J, White D (2012) History Matching and Performence Validation. In: Hitchon B Best practices for validating CO2 geological storage: observations and guidance from the IEAGHG Weyburn-Midale CO2 Monitoring and Storage Project (p. 215). 254. Geoscience Publishing

  • Julier S, Uhlmann JK (1996) A general method for approximating nonlinear transformations of probability distributions

  • Kleijnen J (1987) Statisticall tools for simulation practitioners. Marcel Dekker, New York

    Google Scholar 

  • Malekzadeh FA, Dusseault MB (2013) A solution for the transition zone isosats in two-phase primary drainage in the presence of gravity. Comput Geosci 17(5):757–771

    Article  Google Scholar 

  • Marshall A (1956) The use of multi-stage sampling schemes in Monte Carlo computations. Symposium on Monte Carlo Methods. Wiley, New York, pp 123–140

  • Nordbotton J, Celia M, Bachu S (2005) Injection and storage of CO2 in deep saline aquifers: analytical solution for CO2 plume evolution injection. Transp Porous Media 58(3):339–360

    Article  Google Scholar 

  • Oladyshkin S, Class H, Helmig R, Nowak W (2011) An integrative approach to robust design and probabilistic risk assessment for CO2 storage in geological formations. Comput Geosci 15(3):565–577

    Article  Google Scholar 

  • Oladyshkin S, de Barros F, Nowak W (2012) Global sensitivity analysis: a flexible and efficient framework with an example from stochastic hydrogeology. Adv Water Resour 37:10–22

    Article  Google Scholar 

  • Orr FM (2004) Storage of carbon dioxide in geologic formations. J Petrol Technol 56(9):90–97

    Article  CAS  Google Scholar 

  • Razavi S, Tolson BA, Burn DH (2012a) Numerical assessment of metamodelling strategies in computationally intensive optimization. Environ Model Softw 34:67–86

    Article  Google Scholar 

  • Razavi S, Tolson BA, Burn DH (2012b) Review of surrogate modeling in water resources. Water Resour Res. doi:10.1029/2011WR011527

  • Rougier J (2009) Formal bayes methods for model calibration with uncertainty. In: Beven K, Hall J (eds) Applied uncertainty analysis for flood risk management. Imperial College Press, London

    Google Scholar 

  • Sarkarfarshi M (2015) Carbon sequestration: uncertainty and parameter estimation. University of Waterloo, Waterloo

    Google Scholar 

  • Sarkarfarshi M, Gracie R (2013) A Bayesian approach to mitigate parameter uncertainty in carbon dioxide sequestration models. 47th US rock mechanics/geomechanics symposium. ARMA, American Rock Mechanics Association, San Francisco

  • Sarkarfarshi M, Gracie R (2015) Unscented importance sampling for parameter calibration of carbon sequestration systems. Stoch Env Res Risk Assess 29(3):975–993

    Article  Google Scholar 

  • Schlumberger (2013) ECLIPSE Technical Description Version 2013

  • Seto C, McRae G (2011) Reducing risk in basin scale sequestration: a Bayesian model selection framework for improving detection. Energy Procedia 4:4199–4206

    Article  Google Scholar 

  • Smith M (1993) Neural networks for statistical modeling. Van Nostrand Reinhold, New York

    Google Scholar 

  • Smith PJ, Shafi M, Gao H (1997) Quick simulation: a review of importance sampling techniques in communications systems. IEEE J Sel Areas Commun 15(4):597–613

    Article  Google Scholar 

  • Soreide I, Whitson C (1992) Peng Robinson predictions for hydrocarbons, CO2, N2 and H2S with pure water and NaCl brine. Fluid Phase Equilib 77:217–240

    Article  Google Scholar 

  • SPE International. (n.d.). Permeability determination. From PetroWiki: http://petrowiki.org/Permeability_determination#Determining_permeability

  • Stein ML (1999) Interpolation of spatial data: some theory for Kriging. Springer, New York

    Book  Google Scholar 

  • Tavakoli R, Yoon H, Delshad M, ElSheikh AH, Wheeler MF, WB Arnold (2013) Comparison of ensemble filtering algorithms and null-space Monte Carlo for parameter estimation and uncertainty quantification using CO2 sequestration data. Water Resour Res 49:1–20

    Article  Google Scholar 

  • Vrugt JA, ter Braak CJ, Gupta HV, Robinson BA (2009) Equifinality of formal (DREAM) and informal (GLUE) Bayesian approaches in hydrologic modeling? Stoch Env Res Risk Assess 23(7):1011–1026

    Article  Google Scholar 

  • Wang GG (2003) Adaptive response surface method using inherited Latin hypercube design points. J Mech Des 125(2):210–220

    Article  Google Scholar 

  • Wang GG, Dong Z, Aitchison P (2002) Adaptive response surface method- a global optimization scheme for computation-intensive design problems. Eng Optim 33(6):707–733

    Article  Google Scholar 

Download references

Acknowledgments

The authors wish to acknowledge funding from the Natural Science and Engineering Research Council of Canada (NSERC) through Carbon Management Canada (CMC) and from NSERC’s Discovery Grant program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Robert Gracie.

Appendix 1: Unscented importance sampling (UIS)

Appendix 1: Unscented importance sampling (UIS)

Importance Sampling (IS) (Marshall 1956) is a method for generating samples from a probability distribution, denoted by \( \pi \left( \varvec{m} \right) \) here, which is not easy to sample from and is available at least up to a normalizing constant. In IS, samples, denoted by \( \hat{\varvec{m}}_{i} \) (i = 1 to N s ) here, are drawn from a proposal distribution, denoted by \( q(\varvec{m}) \), which is close to \( \pi \left( \varvec{m} \right) \) and is easy to sample from. Next, the samples are weighted to remove the bias,

$$ W_{i} = \frac{{w_{i} }}{{\mathop \sum \nolimits_{i = 1}^{N} w_{i} }} i = 1 \ldots N_{s} $$
(15)

where W i is the normalized weight for the ith sample and w i is un-normalized weight for the ith sample obtained by \( w_{i} = \pi (\hat{\varvec{m}}_{i} )/q(\hat{\varvec{m}}_{i} ) \). Statistics of \( \pi \left( \varvec{m} \right) \) can be approximated by statistics of the weighted samples.

IS can be utilized for sampling from the posterior distribution of uncertain parameters of a system in presence of random error with mean of zero, \( \varvec{\varepsilon} \), between the system behaviour model, \( \varvec{g}\left( \varvec{m} \right) \), and system behaviour measurements (monitoring data), \( \varvec{d}_{obs}^{{}} \). If \( \pi_{\varepsilon } \left(\varvec{\varepsilon}\right) \) denotes the probability distribution of \( \varvec{\varepsilon} \), \( \pi^{0} \left( \varvec{m} \right) \) denotes the prior distribution of \( \varvec{m} \) and c denotes the normalizing constant in the Bayes’ formula, un-normalized weights of the samples of the posterior distribution of m are obtained by

$$ w_{i} = c^{ - 1} \frac{{\pi_{\varepsilon } \left( {\varvec{d}_{obs} - \varvec{g}\left( {\hat{\varvec{m}}_{i} } \right)} \right)\pi^{0} \left( {\hat{\varvec{m}}_{i} } \right)}}{{q\left( {\hat{\varvec{m}}_{i} } \right)}}; \quad i = 1 \ldots N_{s} $$
(16)

When normalizing the weights, the normalizing constant is cancelled out from numerator and denominator in Eq. (16). Thus, normalizing constant in the Bayes formula (which is the problematic part in obtaining posterior distribution) does not need to be known in Bayesian IS.

A proposal distribution which is close to the true posterior is the key factor for efficient IS (Smith et al. 1997). Unscented Importance Sampling (UIS) (Sarkarfarshi and Gracie 2015) is an efficient method for obtaining a proposal distributions close to the true posterior distribution in continuous Bayesian IS. The benefits of UIS are computational efficiency and not making any assumptions about the model type nor probability distributions (prior, likelihood and posterior).

Now we briefly describe UIS formulation. Assume continuous Bayesian update of parameters of a system denoted by \( \varvec{m} \) where the system model at time t n is denoted by \( \varvec{g}\left( {t_{n} ,\varvec{m}} \right) \) and monitoring data at time t n is donted by \( \varvec{d}_{obs}^{n} \). Assume the discrepancy between the model and monitoring data, denoted by \( \varvec{\varepsilon} \), is a random variable with probability distribution of \( \pi_{\varepsilon } (\varvec{\varepsilon}) \). The posterior distribution of model parameters at time t n is denoted by \( \pi^{n} (\varvec{m}) \) and \( \pi (\varvec{m}) \) denotes the prior distribution before calibration.

At time t n , monitoring data \( \varvec{d}_{obs}^{n} \) is obtained and we want to sample from \( \pi^{n} (\varvec{m}) \) accounting for all monitoring data up to time t n . UIS at each time or cycle consists of two steps, as described in the following.

  1. 1.

    UKF step: in the UKF step, the true posterior is approximated by a Gaussian distribution using measurement update stage of Unscented Kalman Filter (UKF) (Julier and Uhlmann 1996). A full description of UKF can be found in the former reference. The UKF stage of UIS can be formulated as follows.

    1. 1.1

      Use \( \pi_{b}^{n} \left( \varvec{m} \right) = N\left( {\varvec{\mu}_{m}^{n - 1} ,{\mathbf{C}}_{m}^{n - 1} } \right) \) as the prior distribution, where:

      • 1.1.1 \(\varvec{\mu}_{m}^{n - 1} = \mathop \sum \limits_{i = 1}^{{N_{s} }} W_{i}^{n - 1} \hat{\varvec{m}}_{i}^{n - 1} \) and \( {\mathbf{C}}_{m}^{n - 1} = \mathop \sum \limits_{i = 1}^{{N_{s} }} W_{i}^{n - 1} \left( {\hat{\varvec{m}}_{i}^{n - 1} -\varvec{\mu}_{b}^{n} } \right)\left( {\hat{\varvec{m}}_{i}^{n - 1} -\varvec{\mu}_{b}^{n} } \right)^{T} \) from the previous cycle, when n > 1

      • 1.1.2 \( \varvec{\mu}_{m}^{n - 1} \) and \( {\mathbf{C}}_{m}^{n - 1} \) are mean and covariance of \( \pi^{0} \left( \varvec{m} \right) \) when n = 1

    2. 1.2

      Select the Sigma points (\( \varvec{\chi}_{i} \)) and corresponding weights (ϑ i ) in the UKF algorithm:

      $$ \varvec{\chi}_{0} =\varvec{\mu}_{m}^{n - 1} $$
      $$ - 1 < \vartheta_{0} < 1 $$
      $$ \varvec{\chi}_{i} =\varvec{\mu}_{m}^{n - 1} \pm \left( {\sqrt {\frac{{N_{m} }}{{1 - \vartheta_{0} }}{\mathbf{C}}_{m}^{n - 1} } } \right)_{i} +\, \left( { i = 1, \ldots ,N_{m} } \right) - ( i = N_{m} + 1, \ldots ,2N_{m} ) $$

      where \( \left( {\sqrt {\frac{{N_{m} }}{{1 - \vartheta_{0} }}{\mathbf{C}}_{m}^{n - 1} } } \right)_{i} \) is the ith row or column of the square root of \( \sqrt {\frac{{N_{m} }}{{1 - \vartheta_{0} }}{\mathbf{C}}_{m}^{n - 1} } \) obtained by Cholesky decomposition.

      $$ \vartheta_{i} = \frac{{1 - \vartheta_{0} }}{{2N_{s} }}; \quad i = 1,2, \ldots ,2N_{s} $$

      where N m is the number of uncertain parameters, i.e., dimension of the parameter vector \( \varvec{m} \).

    3. 1.3

      Obtain propagated Sigma points through system model by \( {\varvec{\Upsilon}}_{i} = \varvec{g}\left( {t_{n} ,\varvec{\chi}_{i} } \right) \).

    4. 1.4

      Obtain Sigma point and propagated sigma point statistics as follows:

      $$ \varvec{\mu}_{\chi } = \mathop \sum \limits_{i = 0}^{{2N_{m} }} \vartheta_{i}\varvec{\chi}_{i} ; \varvec{\mu}_{\varUpsilon } = \mathop \sum \limits_{i = 0}^{{2N_{m} }} \vartheta_{i} {\varvec{\Upsilon}}_{i} $$
      $$ {\mathbf{C}}_{\varUpsilon } = \mathop \sum \limits_{i = 0}^{{2N_{m} }} \vartheta_{i} \left[ {\varUpsilon_{i} -\varvec{\mu}_{\varUpsilon } } \right]^{T} \left[ {\varUpsilon_{i} -\varvec{\mu}_{\varUpsilon } } \right] ; {\mathbf{C}}_{\chi ,\varUpsilon } = \mathop \sum \limits_{i = 0}^{{2N_{m} }} \vartheta_{i} \left[ {\varvec{\chi}_{i} -\varvec{\mu}_{\chi } } \right]^{T} \left[ {\varUpsilon_{i} -\varvec{\mu}_{\varUpsilon } } \right] $$

      where \( \varvec{\mu}_{\chi } \) is the mean of the Sigma points, \( \varvec{\mu}_{\varUpsilon } \) is the mean of the propagated Sigma points, \( {\mathbf{C}}_{\varvec{\Upsilon}} \) is the covariance of the propagated Sigma points and \( {\mathbf{C}}_{\chi,\,\varvec{\Upsilon}} \) is the covariance between \( \varvec{\chi} \) and \( {\varvec{\Upsilon}} \).

    5. 1.5

      Obtain Gaussian posterior of UKF step with a mean of \( \varvec{\mu}_{m}^{n} =\varvec{\chi}_{0} + \varvec{K}\left( {\varvec{d}_{obs}^{n} -\varvec{\mu}_{\varUpsilon } } \right) \) and a covariance of \( {\mathbf{C}}_{m}^{n} = {\mathbf{C}}_{m}^{n - 1} - \varvec{K}\left( {{\mathbf{C}}_{\varUpsilon } + {\mathbf{C}}_{\varepsilon } } \right)\varvec{K}^{T} \) where \( \varvec{K} \) is the Kalman gain and is obtained by \( \varvec{K} = {\mathbf{C}}_{\chi ,\varUpsilon } \left( {{\mathbf{C}}_{\varUpsilon } + {\mathbf{C}}_{\varepsilon } } \right)^{ - 1} \) and C ɛ is covariance of \( \pi_{\varepsilon } (\varvec{\varepsilon}) \).

  2. 2.

    IS step: in the IS step, the Gaussian approximation of the posterior from the UKF step is used as the proposal distribution of IS.

    1. 2.1

      Use posterior of UKF step as the proposal distribution denoted by \( q^{n} \left( \varvec{m} \right) \)

    2. 2.2

      Sample from \( q^{n} \left( \varvec{m} \right) \) until desired number of samples is obtained. \( \hat{\varvec{m}}_{i}^{n} \) denotes sample i at cycle n.

    3. 2.3

      Obtain un-normalized weights by \( w_{i}^{n} = \pi^{0} (\hat{\varvec{m}}_{i}^{n} )/q^{n} \left( {\hat{\varvec{m}}_{i}^{n} } \right)\mathop \prod \limits_{l = 1}^{n} \pi_{{\varepsilon_{obs} }} \left( {\varvec{d}_{obs}^{l} - g\left( {t_{n} ,\hat{\varvec{m}}_{i}^{n} } \right)} \right) \)

    4. 2.4

      Normalize weights to obtain the importance weights at cycle n, denoted by W n i .

This algorithm is repeated when new monitoring data is obtained. The two stage scheme described above ensures that the proposal distribution is close to the true posterior at all calibration cycles and thus, increases the accuracy and computational efficiency of the IS. Moreover, no assumption is made on the type of the system model (e.g. linearity) or type of the prior, posterior and likelihood (e.g. being Gaussian). A more comprehensive description of UIS along with several case studies can be found in (Sarkarfarshi and Gracie 2015).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sarkarfarshi, M., Gracie, R. An adaptive response surface method for continuous Bayesian model calibration. Stoch Environ Res Risk Assess 31, 725–741 (2017). https://doi.org/10.1007/s00477-016-1231-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-016-1231-9

Keywords

Navigation