Skip to main content

Advertisement

Log in

A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models

  • Published:
Mathematical Geosciences Aims and scope Submit manuscript

Abstract

A new approach based on principal component analysis (PCA) for the representation of complex geological models in terms of a small number of parameters is presented. The basis matrix required by the method is constructed from a set of prior geological realizations generated using a geostatistical algorithm. Unlike standard PCA-based methods, in which the high-dimensional model is constructed from a (small) set of parameters by simply performing a multiplication using the basis matrix, in this method the mapping is formulated as an optimization problem. This enables the inclusion of bound constraints and regularization, which are shown to be useful for capturing highly connected geological features and binary/bimodal (rather than Gaussian) property distributions. The approach, referred to as optimization-based PCA (O-PCA), is applied here mainly for binary-facies systems, in which case the requisite optimization problem is separable and convex. The analytical solution of the optimization problem, as well as the derivative of the model with respect to the parameters, is obtained analytically. It is shown that the O-PCA mapping can also be viewed as a post-processing of the standard PCA model. The O-PCA procedure is applied both to generate new (random) realizations and for gradient-based history matching. For the latter, two- and three-dimensional systems, involving channelized and deltaic-fan geological models, are considered. The O-PCA method is shown to perform very well for these history matching problems, and to provide models that capture the key sand–sand and sand–shale connectivities evident in the true model. Finally, the approach is extended to generate bimodal systems in which the properties of both facies are characterized by Gaussian distributions. MATLAB code with the O-PCA implementation, and examples demonstrating its use are provided online as Supplementary Materials.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

References

  • Aanonsen SI, Naevdal G, Oliver DS, Reynolds AC, Vallès B (2009) The ensemble Kalman filter in reservoir engineering—a review. SPE J 14(3):393–412

    Article  Google Scholar 

  • Brouwer DR, Jansen JD (2004) Dynamic optimization of waterflooding with smart wells using optimal control theory. SPE J 9(4):391–402

    Article  Google Scholar 

  • Caers J (2007) Comparing the gradual deformation with the probability perturbation method for solving inverse problems. Math Geol 39(1):27–52

    Article  Google Scholar 

  • Caers J, Srinivasan S, Journel A (2000) Geostatistical quantification of geological information for a fluvial-type North Sea reservoir. SPE J 3(5):457–467

    Google Scholar 

  • Castro SA (2007) A probabilistic approach to jointly integrate 3D/4D seismic, production data and geological information for building reservoir models. PhD thesis, Department of Energy Resources Engineering, Stanford University

  • Dorn O, Villegas R (2008) History matching of petroleum reservoirs using a level set technique. Inverse Probl 24(3):035015. doi:10.1088/0266-5611/24/3/035015

  • Gao G, Reynolds AC (2006) An improved implementation of the LBFGS algorithm for automatic history matching. SPE J 11(1):5–17

    Article  Google Scholar 

  • Gavalas GR, Shah PC, Seinfeld JH (1976) Reservoir history matching by Bayesian estimation. SPE J 16(6):337–350

    Article  Google Scholar 

  • Gill PE, Murray W, Saunders MA (2005) SNOPT: an SQP algorithm for large-scale constrained optimization. SIAM Rev 47(1):99–131

    Article  Google Scholar 

  • Golub GH, Van Loan CF (1996) Matrix computation, 3rd edn. The John Hopkins University Press, Baltimore

  • Hansen TM (2011) mGstat: a geostatistical matlab toolbox. http://www.mgstat.sourceforge.net

  • He J, Sarma P, Durlofsky LJ (2013) Reduced-order flow modeling and geological parameterization for ensemble-based data assimilation. Comput Geosci 55(1):54–69

    Article  Google Scholar 

  • Hu LY, Blanc G, Noetinger B (2001) Gradual deformation and iterative calibration of sequential stochastic simulations. Math Geol 33(4):475–489

    Article  Google Scholar 

  • Jafarpour B, McLaughlin DB (2007) Efficient permeability parameterization with the discrete cosine transform. Paper SPE 106453 presented at the SPE reservoir simulation symposium, Houston, Texas, USA

  • Jafarpour B, Goyal V, McLaughlin DB, Freeman WT (2010) Compressed history matching: exploiting transform-domain sparsity for regularization of nonlinear dynamic data integration problems. Math Geosci 42(1):1–27

    Article  Google Scholar 

  • Khaninezhad MM, Jafarpour B, Li L (2010) History matching with learned sparse dictionaries. Paper SPE 133654 presented at the SPE annual technical conference and exhibition, Florence, Italy

  • Kitanidis P (1995) Quasi-linear geostatistical theory for inversing. Water Resour Res 31(10):2411–2419

    Article  Google Scholar 

  • Li R, Reynolds AC, Oliver DS (2003) History matching of three-phase flow production data. SPE J 8(4): 328–340

  • Liu N, Oliver DS (2004) Automatic history matching of geologic facies. SPE J 9(4):188–195

    Article  Google Scholar 

  • Lu P, Horne R (2000) A multiresolution approach to reservoir parameter estimation using wavelet analysis. Paper SPE 62985 presented at the SPE annual technical conference and exhibition, Dallas, Texas, USA

  • Luenberger DG, Ye Y (2008) Linear and nonlinear programming, 3rd edn. Springer, Berlin

  • Ma X, Zabaras N (2011) Kernel principal component analysis for stochastic input model generation. J Comput Phys 230(19):7311–7331

  • Moskowitz MA, Paliogiannis F (2011) Functions of several real variables. World Scientific, Singapore

  • Oliver DS (1996) Multiple realizations of permeability field from well test data. SPE J 1(2):145–154

    Article  Google Scholar 

  • Oliver DS, Chen Y (2011) Recent progress in history matching: a review. Comput Geosci 15(1):185–221

    Article  Google Scholar 

  • Oliver DS, He N, Reynolds AC (1996) Conditioning permeability fields to pressure data. Paper presented at the 5th European conference on the mathematics of oil recovery, Leoben, Austria

  • Oliver DS, Reynolds AC, Liu N (2008) Inverse theory for petroleum reservoir characterization and history matching. Cambridge University Press, London

  • Oliver DS, Chen Y, Naevdal G (2011) Updating Markov chain models using the ensemble Kalman filter. Comput Geosci 15(2):325–344

    Article  Google Scholar 

  • Park H, Scheidt C, Fenwick D, Boucher A, Caers J (2013) History matching and uncertainty quantification of facies models with multiple geological interpretations. Comput Geosci 17(4):609–621

    Article  Google Scholar 

  • Remy N, Boucher A, Wu J (2009) Applied geostatistics with SGeMS: a user’s guide. Cambridge University Press, London

  • Reynolds AC, He N, Chu L, Oliver DS (1996) Reparameterization techniques for generating reservoir description conditioned to variograms and well-test pressure data. SPE J 1(4):413–426

    Article  Google Scholar 

  • Reynolds AC, He N, Oliver DS (1999) Reducing uncertainty in geostatistical description with well testing pressure data. In: Reservoir characterization—recent advances. American Association of Petroleum Geologists, pp 149–162

  • Rodrigues JRP (2006) Calculating derivatives for automatic history matching. Comput Geosci 10(1): 119–136

  • Sahni I, Horne R (2005) Multiresolution wavelet analysis for improved reservoir description. SPE Reserv Eval Eng 8(1):53–69

    Article  Google Scholar 

  • Sarma P, Durlofsky LJ, Aziz K, Chen WH (2006) Efficient real-time reservoir management using adjoint-based optimal control and model updating. Comput Geosci 10(1):3–36

    Article  Google Scholar 

  • Sarma P, Durlofsky LJ, Aziz K, Chen WH (2007) A new approach to automatic history matching using kernel PCA. Paper SPE 106176 presented at the SPE reservoir simulation symposium, Houston, Texas, USA

  • Sarma P, Durlofsky LJ, Aziz K (2008) Kernel principal component analysis for efficient, differentiable parameterization of multipoint geostatistics. Math Geosci 40(1):3–32

    Article  Google Scholar 

  • Scheidt C, Caers J (2009) Representing spatial uncertainty using distances and kernels. Math Geosci 41(4):397–419

    Article  Google Scholar 

  • Scholkopf B, Smola A, Muller KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10(5):1299–1319

    Article  Google Scholar 

  • Sezgin M, Sankur B (2004) Survey over image thresholding techniques and quantitative performance evaluation. J Electron Imaging 13(1):146–165

    Article  Google Scholar 

  • Strebelle S (2002) Conditional simulation of complex geological structures using multiple-point statistics. Math Geosci 34(1):1–21

    Google Scholar 

  • van Doren JFM, Markovinović R, Jansen JD (2006) Reduced-order optimal control of water flooding using proper orthogonal decomposition. Comput Geosci 10(1):137–158

    Article  Google Scholar 

  • Wu Z, Reynolds AC, Oliver DS (1999) Conditioning geostatistical models to two-phase production data. SPE J 4(2):142–155

    Article  Google Scholar 

  • Zhao H, Li G, Reynolds AC, Yao J (2013) Large-scale history matching with quadratic interpolation models. Comput Geosci 17(1):117–138

    Article  Google Scholar 

  • Zhou Y (2012) Parallel general-purpose reservoir simulation with coupled reservoir models and multi-segment wells. PhD thesis, Department of Energy Resources Engineering, Stanford University

Download references

Acknowledgments

We thank the industrial affiliates of the Stanford University Reservoir Simulation Research (SUPRI-B) and Smart Fields Consortia for partial funding of this work. We are grateful to Vladislav Bukshtynov and Oleg Volkov for implementing the adjoint formulation in AD-GPRS and for their assistance with its use. We also acknowledge Andre Journel, Albert Reynolds and Pallav Sarma for useful discussions and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai X. Vo.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material (ZIP 4583 KB)

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

In this Appendix, we solve the minimization problem (10), defined in Sect. 2.3. We proceed by introducing the Lagrangian \(L(x_i)\), given by Luenberger and Ye (2008)

$$\begin{aligned} L(x_i) \!=\! (1\!-\!\gamma )x^2_i \!-\! 2\left( a_i\!-\!\frac{\gamma }{2}\right) x_i\!-\! \mu _i x_i \!-\! \eta _i(1\!-\!x_i), \ \ \mu _i \ge 0, \ \eta _i \ge 0, \ {x_i} \in {[ 0,1]},\nonumber \\ \end{aligned}$$
(18)

where \(\mu _i\) and \(\eta _i \ \forall \ i \in \{1,\ldots ,N_\mathrm{c}\}\) are the Lagrange multipliers. The Karush–Kuhn–Tucker (KKT) optimality conditions for (18) are

$$\begin{aligned}&\displaystyle \frac{\partial L(x_i)}{\partial x_i}= 2(1-\gamma )x_i - 2\left( a_i-\frac{\gamma }{2}\right) -\mu _i +\eta _i =0, \end{aligned}$$
(19)
$$\begin{aligned}&\displaystyle \mu _i x_i = 0, \end{aligned}$$
(20)
$$\begin{aligned}&\displaystyle \eta _i (1-x_i) = 0, \end{aligned}$$
(21)
$$\begin{aligned}&\displaystyle \mu _i \ge 0,\ \eta _i \ge 0,\ 0 \le x_i \le 1. \end{aligned}$$
(22)

The stationary point for the quadratic convex objective function without bound constraints, \(x^\mathrm{s}_i = (a_i-\gamma /2)/(1-\gamma )\), is first computed. Given the fact that the objective function is quadratic and convex with \(0 \le \gamma <1\), the minimal point for the bound-constrained problem, designated \(x^*_i\), is either \(x^\mathrm{s}_i, 0\), or \(1\). The following three cases must be considered:

  1. (i)

    Case 1: If \(a_i \le \gamma /2\), then \(x^\mathrm{s}_i \le 0\), and the minimum point is \(x^*_i = 0\). From (21) we have \(\eta _i = 0\). Substituting \(x^*_i = 0\) and \(\eta _i = 0\) in (19), we obtain \(\mu _i = \gamma - 2a_i\). We can readily verify that, for \(x^\mathrm{s}_i \le 0\) and \(0 \le \gamma <1\), we have \(\mu _i = \gamma - 2a_i \ge 0\), so (22) will be satisfied.

  2. (ii)

    Case 2: If \(a_i\ge 1-\gamma /2\), then \(x^\mathrm{s}_i \ge 1\), and the minimum point is \(x^*_i = 1\). From (20) we have \(\mu _i = 0\). Substituting \(x^*_i = 1\) and \(\mu _i = 0\) in (19), we obtain \(\eta _i = 2a_i + \gamma - 2\). It is easy to verify that if \(x^\mathrm{s}_i \ge 1\) and \(0 \le \gamma <1\), we also have \(\eta _i = 2a_i + \gamma - 2 \ge 0\). Thus (22) will be satisfied.

  3. (iii)

    Case 3: If \(\gamma /2<a_i<1-\gamma /2\), we have \(0<x^\mathrm{s}_i<1\), and the minimum point of the constrained problem is \(x^*_i = x^\mathrm{s}_i\). We also have \(\mu _i =\eta _i =0\) from (20) and (21), which satisfies (22).

Using the Lagrange multipliers computed above, we can apply the multivariate implicit function theorem (Moskowitz and Paliogiannis 2011) to determine the derivative \({\partial x_i}\)/\({\partial \xi _j}\), which will be required for gradient-based data assimilation. We first define, from (19), (20) and (21),

$$\begin{aligned} F_1(x_i, \mu _i, \eta _i, \varvec{\xi })&= 2(1-\gamma )x_i - 2\left( a_i(\varvec{\xi })-\frac{\gamma }{2}\right) -\mu _i +\eta _i , \nonumber \\ F_2(x_i, \mu _i, \eta _i, \varvec{\xi })&= \mu _i x_i , \nonumber \\ F_3(x_i, \mu _i, \eta _i, \varvec{\xi })&= \eta _i (1-x_i). \end{aligned}$$
(23)

Denoting partial derivatives as \(F_{1,x_i}\!\! =\!\!{\partial F_1}/{\partial x_i}, F_{1,\mu _i}\!=\!\! {\partial F_1}/{\partial \mu _i}, F_{1,\eta _i} \!\!=\!\! {\partial F_1}/{\partial \eta _i}, F_{1, \xi _j}={\partial F_1}/{\partial \xi _j}\), etc., through application of the implicit function theorem for a system of equations along with Cramer’s rule, we have

$$\begin{aligned} \frac{\partial x_i}{\partial \xi _j} = -\frac{\mathrm{det}\begin{pmatrix} F_{1, \xi _j} &{} F_{1, \mu _i} &{} F_{1, \eta _i}\\ F_{2, \xi _j} &{} F_{2, \mu _i} &{} F_{2, \eta _i}\\ F_{3, \xi _j} &{} F_{3, \mu _i} &{} F_{3, \eta _i}\\ \end{pmatrix}}{\mathrm{det}\begin{pmatrix} F_{1, x_i} &{} F_{1, \mu _i} &{} F_{1, \eta _i}\\ F_{2, x_i} &{} F_{2, \mu _i} &{} F_{2, \eta _i}\\ F_{3, x_i} &{} F_{3, \mu _i} &{} F_{3, \eta _i}\\ \end{pmatrix}} \end{aligned}$$
(24)

By noting that \(a_i=\bar{m}_i + \sum _{j=1}^{l} \Phi _{ij} \xi _j\), all of the partial derivatives in (23) can be readily determined. After evaluating the determinants of the two matrices in (24), we have

$$\begin{aligned} \frac{\partial x_i}{\partial \xi _j} = \frac{2 ({\partial a_i}/{\partial \xi _j}) x_i (1-x_i)}{\mu _i+2 x_i +\eta _i x_i -2 \gamma x_i - \mu _i x_i +2 \gamma x^2_i - 2 x^2_i}, \end{aligned}$$
(25)

where \({\partial a_i}/{\partial \xi _j}=\Phi _{ij}\).

The analytical solutions for \(x_i\) and \({\partial x_i}/{\partial \xi _j}\), although derived from the O-PCA formulation introduced here, can be interpreted in terms of a post-processing (histogram transform or soft-thresholding) of the underlying PCA model (designated \(a_i\)). Namely, for this binary case, with the regularization term given by \(\gamma x_i(1-x_i)\), O-PCA maps PCA solutions with \(a_i \le \gamma /2\) to \(x_i=0\), and solutions with \(a_i\ge 1-\gamma /2\) to \(x_i=1\). For PCA solutions with \(\gamma /2 < a_i < 1-\gamma /2\), O-PCA applies the linear mapping \(x_i = (a_i-\gamma /2)/(1-\gamma )\). Consistent with this, it can be seen from (25) that, if \(x_i=1\) or \(x_i=0, {\partial x_i}/{\partial \xi _j}=0\). Otherwise, \({\partial x_i}/{\partial \xi _j}=\Phi _{ij}/({1-\gamma })\).

This analytical description of the post-processing applied by O-PCA is possible in this case because of the simple form of the regularization term. As shown in Sect. 5, O-PCA can also be applied for channelized models described by bimodal log-permeability distributions. For such cases, the regularization term is significantly more complicated, and the analytical solution of the minimization problem cannot be achieved. As a result, the specific mapping introduced by O-PCA cannot be described analytically (though the derivative \({\partial x_i}/{\partial \xi _j}\) can be obtained semi-analytically).

In the development above, \(\mathbf{x}\) is the optimization variable. As noted in Sect. 2.3, after the solution and its derivative are found, they are designated \(\mathbf{m}\) and \({\mathrm{d}\mathbf m}/{\mathrm{d}\varvec{\xi }}\). To verify our derivation, the analytical solution for \(\mathbf{m}\) was compared to solutions obtained by applying a numerical optimization to the minimization problem defined by (8). Agreement to within numerical tolerance was consistently achieved. In addition, the analytical expression (25) for \({\mathrm{d}\mathbf m}/{\mathrm{d}\varvec{\xi }}\) was compared with numerical derivatives computed using finite differences. The Frobenius norm of the difference between the two derivative matrices was less than \(10^{-6}\). These results serve as verification of the analytical solutions derived here. All results presented in this paper for binary models were generated using these analytical solutions.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vo, H.X., Durlofsky, L.J. A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models. Math Geosci 46, 775–813 (2014). https://doi.org/10.1007/s11004-014-9541-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11004-014-9541-2

Keywords

Navigation