Skip to main content

Advertisement

Log in

A New Data-Space Inversion Procedure for Efficient Uncertainty Quantification in Subsurface Flow Problems

  • Published:
Mathematical Geosciences Aims and scope Submit manuscript

Abstract

Uncertainty quantification for subsurface flow problems is typically accomplished through model-based inversion procedures in which multiple posterior (history-matched) geological models are generated and used for flow predictions. These procedures can be demanding computationally, however, and it is not always straightforward to maintain geological realism in the resulting history-matched models. In some applications, it is the flow predictions themselves (and the uncertainty associated with these predictions), rather than the posterior geological models, that are of primary interest. This is the motivation for the data-space inversion (DSI) procedure developed in this paper. In the DSI approach, an ensemble of prior model realizations, honoring prior geostatistical information and hard data at wells, are generated and then (flow) simulated. The resulting production data are assembled into data vectors that represent prior ‘realizations’ in the data space. Pattern-based mapping operations and principal component analysis are applied to transform non-Gaussian data variables into lower-dimensional variables that are closer to multivariate Gaussian. The data-space inversion is posed within a Bayesian framework, and a data-space randomized maximum likelihood method is introduced to sample the conditional distribution of data variables given observed data. Extensive numerical results are presented for two example cases involving oil–water flow in a bimodal channelized system and oil–water–gas flow in a Gaussian permeability system. For both cases, DSI results for uncertainty quantification (e.g., P10, P50, P90 posterior predictions) are compared with those obtained from a strict rejection sampling (RS) procedure. Close agreement between the DSI and RS results is consistently achieved, even when the (synthetic) true data to be matched fall near the edge of the prior distribution. Computational savings using DSI are very substantial in that RS requires \(O(10^5\)\(10^6)\) flow simulations, in contrast to 500 for DSI, for the cases considered.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  • Aanonsen SI, Nævdal G, Oliver DS, Reynolds AC, Vallès B (2009) The ensemble Kalman filter in reservoir engineering—a review. SPE J 14(3):393–412

    Article  Google Scholar 

  • Barker JW, Cuypers M, Holden L (2001) Quantifying uncertainty in production forecasts: another look at the PUNQ-S3 problem. SPE J 6(4):433–441

    Article  Google Scholar 

  • Cardoso MA, Durlofsky LJ, Sarma P (2009) Development and application of reduced-order modeling procedures for subsurface flow simulation. Int J Numer Methods Eng 77(9):1322–1350

    Article  Google Scholar 

  • Castro SA (2007) A probabilistic approach to jointly integrate 3D/4D seismic, production data and geological information for building reservoir models. PhD thesis, Stanford University

  • Chen Y, Oliver DS, Zhang D (2009) Data assimilation for nonlinear problems by ensemble Kalman filter with reparameterization. J Pet Sci Eng 66(1–2):1–14

    Article  Google Scholar 

  • Emerick AA, Reynolds AC (2013) Ensemble smoother with multiple data assimilation. Comput Geosci 55:3–15

    Article  Google Scholar 

  • Evensen G (2003) The ensemble Kalman filter: theoretical formulation and practical implementation. Ocean Dyn 53(4):343–367

    Article  Google Scholar 

  • Evensen G, van Leeuwen PJ (2000) An ensemble Kalman smoother for nonlinear dynamics. Mon Weather Rev 128(6):1852–1867

    Article  Google Scholar 

  • Floris F, Bush M, Cuypers M, Roggero F, Syversveen AR (2001) Methods for quantifying the uncertainty of production forecasts: a comparative study. Pet Geosci 7(SUPP):87–96

    Article  Google Scholar 

  • Gao G, Reynolds AC (2006) An improved implementation of the LBFGS algorithm for automatic history matching. SPE J 11(1):5–17

    Article  Google Scholar 

  • Gao G, Zafari M, Reynolds AC (2006) Quantifying uncertainty for the PUNQ-S3 problem in a Bayesian setting with RML and EnKF. SPE J 11(4):506–515

    Article  Google Scholar 

  • Gu Y, Oliver DS (2006) The ensemble Kalman filter for continuous updating of reservoir simulation models. J Energy Resour Technol 128(1):79–87

    Article  Google Scholar 

  • He J, Xie J, Sarma P, Wen XH, Chen WH, Kamath J (2015) Model-based a priori evaluation of surveillance programs effectiveness using proxies. Paper SPE 173229 presented at the SPE reservoir simulation symposium, Houston, Texas, USA, 23–25 February

  • Kitanidis PK (1986) Parameter uncertainty in estimation of spatial functions: Bayesian analysis. Water Resour Res 22(4):499–507

    Article  Google Scholar 

  • Krishnamurti TN, Kishtawal C, Zhang Z, LaRow T, Bachiochi D, Williford E, Gadgil S, Surendran S (2000) Multimodel ensemble forecasts for weather and seasonal climate. J Clim 13(23):4196–4216

    Article  Google Scholar 

  • Le DH, Reynolds AC (2014) Estimation of mutual information and conditional entropy for surveillance optimization. SPE J 19(4):648–661

    Article  Google Scholar 

  • Liang Y, Lee H, Lim S, Lin W, Lee K, Wu C (2002) Proper orthogonal decomposition and its applications—part I: theory. J Sound Vib 252(3):527–544

    Article  Google Scholar 

  • Mallet V, Stoltz G, Mauricette B (2009) Ozone ensemble forecast with machine learning algorithms. J Geophys Res 114(D5):148–227

    Article  Google Scholar 

  • Mohaghegh SD (2005) Recent developments in application of artificial intelligence in petroleum engineering. J Pet Technol 57(04):86–91

    Article  Google Scholar 

  • Mosegaard K, Tarantola A (1995) Monte Carlo sampling of solutions to inverse problems. J Geophys Res Solid Earth 100(B7):12,431–12,447

    Article  Google Scholar 

  • Oliver DS (1996) Multiple realizations of the permeability field from well test data. SPE J 1(2):145–154

    Article  Google Scholar 

  • Oliver DS, Chen Y (2011) Recent progress on reservoir history matching: a review. Comput Geosci 15(1):185–221

    Article  Google Scholar 

  • Oliver DS, He N, Reynolds AC (1996) Conditioning permeability fields to pressure data. In: 5th European conference on the mathematics of oil recovery, Leoben, Austria, 3–6 September

  • Oliver DS, Reynolds AC, Liu N (2008) Inverse theory for petroleum reservoir characterization and history matching. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Pagowski M, Grell G, McKeen S, Dévényi D, Wilczak J, Bouchet V, Gong W, McHenry J, Peckham S, McQueen J (2005) A simple method to improve ensemble-based ozone forecasts. Geophys Res Lett. doi:10.1029/2004GL022305

    Google Scholar 

  • Park H, Scheidt C, Fenwick D, Boucher A, Caers J (2013) History matching and uncertainty quantification of facies models with multiple geological interpretations. Comput Geosci 17(4):609–621

    Article  Google Scholar 

  • Remy N, Boucher A, Wu J (2009) Applied geostatistics with SGeMS: a user’s guide. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Reynolds AC, He N, Chu L, Oliver DS (1996) Reparameterization techniques for generating reservoir descriptions conditioned to variograms and well-test pressure data. SPE J 1(4):413–426

    Article  Google Scholar 

  • Reynolds AC, He N, Oliver DS (1999) Reducing uncertainty in geostatistical description with well-testing pressure data. In: Schatzinger RA, Jordan JF (eds) Reservoir characterization—recent advances. American Association of Petroleum Geologists, Tulsa, pp 149–162

    Google Scholar 

  • Sarma P, Durlofsky LJ, Aziz K, Chen W (2006) Efficient real-time reservoir management using adjoint-based optimal control and model updating. Comput Geosci 10(1):3–36

    Article  Google Scholar 

  • Satija A, Caers J (2015) Direct forecasting of subsurface flow response from non-linear dynamic data by linear least-squares in canonical functional principal component space. Adv Water Resour 77:69–81

    Article  Google Scholar 

  • Scheidt C, Renard P, Caers J (2015) Prediction-focused subsurface modeling: investigating the need for accuracy in flow-based inverse modeling. Math Geosci 47(2):173–191

    Article  Google Scholar 

  • Schlumberger, (2013) Eclipse reference manual. Version 2013.2

  • Shlens J (2005) A tutorial on principal component analysis. http://www.cs.cmu.edu/~elaw/papers/pca.pdf

  • Strebelle S (2002) Conditional simulation of complex geological structures using multiple-point statistics. Math Geosci 34(1):1–21

    Google Scholar 

  • Sun W (2014) Data driven history matching for reservoir production forecasting. Master’s thesis, Stanford University

  • Sun W, Durlofsky LJ, Hui MH (2016) Production uncertainty quantification for a naturally fractured reservoir using a new data-space inversion procedure. In: 15th European conference on the mathematics of oil recovery, Amsterdam, Netherlands, 29 August–1 September

  • Tarantola A (2005) Inverse problem theory and methods for model parameter estimation. SIAM, Philadelphia

    Book  Google Scholar 

  • Vo HX, Durlofsky LJ (2014) A new differentiable parameterization based on principal component analysis for the low-dimensional representation of complex geological models. Math Geosci 46(7):775–813

    Article  Google Scholar 

  • Vo HX, Durlofsky LJ (2015) Data assimilation and uncertainty assessment for complex geological models using a new PCA-based parameterization. Comput Geosci 19(4):747–767

    Article  Google Scholar 

  • Zhou Y (2012) Parallel general-purpose reservoir simulation with coupled reservoir models and multisegment wells. PhD thesis, Stanford University

Download references

Acknowledgements

We thank Chevron ETC and the Stanford Smart Fields Consortium for financial support. We are grateful to Hai Xuan Vo, David Cameron, Celine Scheidt, Vladislav Bukshtynov and Oleg Volkov for useful discussions and assistance with simulation software.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenyue Sun.

Appendix: Detailed Mapping Operations

Appendix: Detailed Mapping Operations

In this Appendix, the general pattern-based mapping operations applied in this study are described. For generality and simplicity of notation, we describe the mapping operations for an ensemble of time-series functions \(y_i(t)~(i=1,2,\dots , N_{{\text {r}}})\). Note that the data values in \(({\mathbf {d}}_{{\text {full}}})_i\), discussed in Sect. 3.1, are simply the values of these time-series functions at different time steps.

Fig. 20
figure 20

Illustration of mapping operations. a Original functions, b functions after mapping

It is assumed that the functions can all be separated into the same number of stages, with the same general behavior within each stage. For example, the functions in Fig. 20a can be separated into four stages: constant, decline, constant and decline. The times separating the different stages are referred to as transition times. The goal is then to map these functions to those shown in Fig. 20b, in which the corresponding transition ‘times’ for all functions are the same.

The overall starting and ending times, \(t_{{\text {s}}}\) and \(t_{{\text {e}}}\), are assumed to be the same for all functions. A total of \(M+1\) stages are then identified (\(M=3\) for the cases shown in Fig. 20a), and the corresponding transition times are designated \(t_i^j~(i=1,2,\dots ,N_{{\text {r}}}; j=1,2,\dots , M)\). Defining \(t_i^0=t_s\) and \(t_i^{M+1}=t_e\), the mapped functions are then constructed as

$$\begin{aligned} \hat{y}_i(\tau )=y_i\left( t_i^j+(t_i^{j+1}-t_i^j)\frac{\tau -\tau ^j}{\tau ^{j+1}-\tau ^j}\right) , \quad \tau ^j \le \tau \le \tau ^{j+1}, \ \ 0 \le j \le M, \end{aligned}$$
(30)

where \(\hat{y}_i(\tau )\) denotes the mapped function for member i in the ensemble, and \(\tau ^j\) is the transition ‘time’ for all mapped functions (note \(\tau ^j\) is the same for all functions). The values of \(\tau ^j~(j=0,1,\dots ,M+1)\) must be predefined. In this paper, a particular transition ‘time’ is defined as the mean of the corresponding transition times for all of the original functions, i.e.,

$$\begin{aligned} \tau ^j = \sum _{i=1}^{N_{{\text {r}}}}\frac{t_i^j}{N_{{\text {r}}}} , \quad 0 \le j \le M + 1. \end{aligned}$$
(31)

Figure 20b shows the mapped functions corresponding to the original functions in Fig. 20a, with transition ‘times’ defined by Eq. (31).

The forward mapping operation for \(y_i(t)\) is expressed as

$$\begin{aligned} {\mathscr {F}}: y_i(t) \rightarrow \hat{y}_i(\tau ),~t_i^1,~t_i^2,\dots ,t_i^M. \end{aligned}$$
(32)

The transition times \(t_i^1\) to \(t_i^{M}\) must also be included since they are needed for the backward mapping. Note that the values in the mapped data vectors \(\widehat{{\mathbf {d}}}_i\), introduced in Sect. 3.1, are simply the values of \(\hat{y}_i(\tau )\) at different ‘time’ steps, plus the identified transition times.

Once we construct a predicted mapped function, denoted by \(\hat{y}_{{\text {p}}}(\tau )\), and its associated (predicted) transition times \(t_{{\text {p}}}^1, t_{{\text {p}}}^2, \dots , t_{{\text {p}}}^M\), the backward mapping is given by

$$\begin{aligned} {\mathscr {F}}^{-1}: \hat{y}_{{\text {p}}}(\tau ),~t_{{\text {p}}}^1,~t_{{\text {p}}}^2,\dots ,t_{{\text {p}}}^M \rightarrow y_{{\text {p}}}(t), \end{aligned}$$
(33)

where

$$\begin{aligned} y_{{\text {p}}}(t)=\hat{y}_{{\text {p}}}\left( \tau ^j+(\tau ^{j+1}-\tau ^j)\frac{t-t_{{\text {p}}}^{j}}{t_{{\text {p}}}^{j+1}-t_{{\text {p}}}^{j}}\right) , \quad t_{{\text {p}}}^{j} \le t \le t_{{\text {p}}}^{j+1}, \ \ 0 \le j \le M. \end{aligned}$$
(34)

Here \(\tau ^j\) are the pre-computed values from Eq. (31), and \(t_{{\text {p}}}^0\) and \(t_{{\text {p}}}^{M+1}\) are the known start and end simulation times. This completes the description of the forward and backward mapping operations used to ‘align’ the transitions between the various stages.

An alternate mapping approach, based on the use of histogram transformations, was introduced in Sun et al. (2016). That procedure is applicable when the production data do not display clear patterns, as is the case when wells are abruptly shut and opened at different times. Because the histogram transformations are applied independently for each data variable, the nonlinear correlations between data variables (as shown in Fig. 3e) might not be as effectively mitigated with this treatment. In future work, it will be of interest to compare results with the two approaches for systems that display clearly identifiable stages.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sun, W., Durlofsky, L.J. A New Data-Space Inversion Procedure for Efficient Uncertainty Quantification in Subsurface Flow Problems. Math Geosci 49, 679–715 (2017). https://doi.org/10.1007/s11004-016-9672-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11004-016-9672-8

Keywords

Navigation