A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models

Vo, Hai X.; Durlofsky, Louis J.

doi:10.1007/s11004-014-9541-2

A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models

Published: 23 July 2014

Volume 46, pages 775–813, (2014)
Cite this article

Mathematical Geosciences Aims and scope Submit manuscript

Hai X. Vo¹ &
Louis J. Durlofsky¹

1616 Accesses
93 Citations
Explore all metrics

Abstract

A new approach based on principal component analysis (PCA) for the representation of complex geological models in terms of a small number of parameters is presented. The basis matrix required by the method is constructed from a set of prior geological realizations generated using a geostatistical algorithm. Unlike standard PCA-based methods, in which the high-dimensional model is constructed from a (small) set of parameters by simply performing a multiplication using the basis matrix, in this method the mapping is formulated as an optimization problem. This enables the inclusion of bound constraints and regularization, which are shown to be useful for capturing highly connected geological features and binary/bimodal (rather than Gaussian) property distributions. The approach, referred to as optimization-based PCA (O-PCA), is applied here mainly for binary-facies systems, in which case the requisite optimization problem is separable and convex. The analytical solution of the optimization problem, as well as the derivative of the model with respect to the parameters, is obtained analytically. It is shown that the O-PCA mapping can also be viewed as a post-processing of the standard PCA model. The O-PCA procedure is applied both to generate new (random) realizations and for gradient-based history matching. For the latter, two- and three-dimensional systems, involving channelized and deltaic-fan geological models, are considered. The O-PCA method is shown to perform very well for these history matching problems, and to provide models that capture the key sand–sand and sand–shale connectivities evident in the true model. Finally, the approach is extended to generate bimodal systems in which the properties of both facies are characterized by Gaussian distributions. MATLAB code with the O-PCA implementation, and examples demonstrating its use are provided online as Supplementary Materials.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Eighty Years of the Finite Element Method: Birth, Evolution, and Future

Article Open access 13 June 2022

Wing Kam Liu, Shaofan Li & Harold S. Park

Topology optimization of multi-scale structures: a review

Article Open access 08 March 2021

Jun Wu, Ole Sigmund & Jeroen P. Groen

Triangulation of the Earth’s surface and its application to the geodetic velocity field modelling

Article 23 February 2024

Sandi Berk

References

Aanonsen SI, Naevdal G, Oliver DS, Reynolds AC, Vallès B (2009) The ensemble Kalman filter in reservoir engineering—a review. SPE J 14(3):393–412
Article Google Scholar
Brouwer DR, Jansen JD (2004) Dynamic optimization of waterflooding with smart wells using optimal control theory. SPE J 9(4):391–402
Article Google Scholar
Caers J (2007) Comparing the gradual deformation with the probability perturbation method for solving inverse problems. Math Geol 39(1):27–52
Article Google Scholar
Caers J, Srinivasan S, Journel A (2000) Geostatistical quantification of geological information for a fluvial-type North Sea reservoir. SPE J 3(5):457–467
Google Scholar
Castro SA (2007) A probabilistic approach to jointly integrate 3D/4D seismic, production data and geological information for building reservoir models. PhD thesis, Department of Energy Resources Engineering, Stanford University
Dorn O, Villegas R (2008) History matching of petroleum reservoirs using a level set technique. Inverse Probl 24(3):035015. doi:10.1088/0266-5611/24/3/035015
Gao G, Reynolds AC (2006) An improved implementation of the LBFGS algorithm for automatic history matching. SPE J 11(1):5–17
Article Google Scholar
Gavalas GR, Shah PC, Seinfeld JH (1976) Reservoir history matching by Bayesian estimation. SPE J 16(6):337–350
Article Google Scholar
Gill PE, Murray W, Saunders MA (2005) SNOPT: an SQP algorithm for large-scale constrained optimization. SIAM Rev 47(1):99–131
Article Google Scholar
Golub GH, Van Loan CF (1996) Matrix computation, 3rd edn. The John Hopkins University Press, Baltimore
Hansen TM (2011) mGstat: a geostatistical matlab toolbox. http://www.mgstat.sourceforge.net
He J, Sarma P, Durlofsky LJ (2013) Reduced-order flow modeling and geological parameterization for ensemble-based data assimilation. Comput Geosci 55(1):54–69
Article Google Scholar
Hu LY, Blanc G, Noetinger B (2001) Gradual deformation and iterative calibration of sequential stochastic simulations. Math Geol 33(4):475–489
Article Google Scholar
Jafarpour B, McLaughlin DB (2007) Efficient permeability parameterization with the discrete cosine transform. Paper SPE 106453 presented at the SPE reservoir simulation symposium, Houston, Texas, USA
Jafarpour B, Goyal V, McLaughlin DB, Freeman WT (2010) Compressed history matching: exploiting transform-domain sparsity for regularization of nonlinear dynamic data integration problems. Math Geosci 42(1):1–27
Article Google Scholar
Khaninezhad MM, Jafarpour B, Li L (2010) History matching with learned sparse dictionaries. Paper SPE 133654 presented at the SPE annual technical conference and exhibition, Florence, Italy
Kitanidis P (1995) Quasi-linear geostatistical theory for inversing. Water Resour Res 31(10):2411–2419
Article Google Scholar
Li R, Reynolds AC, Oliver DS (2003) History matching of three-phase flow production data. SPE J 8(4): 328–340
Liu N, Oliver DS (2004) Automatic history matching of geologic facies. SPE J 9(4):188–195
Article Google Scholar
Lu P, Horne R (2000) A multiresolution approach to reservoir parameter estimation using wavelet analysis. Paper SPE 62985 presented at the SPE annual technical conference and exhibition, Dallas, Texas, USA
Luenberger DG, Ye Y (2008) Linear and nonlinear programming, 3rd edn. Springer, Berlin
Ma X, Zabaras N (2011) Kernel principal component analysis for stochastic input model generation. J Comput Phys 230(19):7311–7331
Moskowitz MA, Paliogiannis F (2011) Functions of several real variables. World Scientific, Singapore
Oliver DS (1996) Multiple realizations of permeability field from well test data. SPE J 1(2):145–154
Article Google Scholar
Oliver DS, Chen Y (2011) Recent progress in history matching: a review. Comput Geosci 15(1):185–221
Article Google Scholar
Oliver DS, He N, Reynolds AC (1996) Conditioning permeability fields to pressure data. Paper presented at the 5th European conference on the mathematics of oil recovery, Leoben, Austria
Oliver DS, Reynolds AC, Liu N (2008) Inverse theory for petroleum reservoir characterization and history matching. Cambridge University Press, London
Oliver DS, Chen Y, Naevdal G (2011) Updating Markov chain models using the ensemble Kalman filter. Comput Geosci 15(2):325–344
Article Google Scholar
Park H, Scheidt C, Fenwick D, Boucher A, Caers J (2013) History matching and uncertainty quantification of facies models with multiple geological interpretations. Comput Geosci 17(4):609–621
Article Google Scholar
Remy N, Boucher A, Wu J (2009) Applied geostatistics with SGeMS: a user’s guide. Cambridge University Press, London
Reynolds AC, He N, Chu L, Oliver DS (1996) Reparameterization techniques for generating reservoir description conditioned to variograms and well-test pressure data. SPE J 1(4):413–426
Article Google Scholar
Reynolds AC, He N, Oliver DS (1999) Reducing uncertainty in geostatistical description with well testing pressure data. In: Reservoir characterization—recent advances. American Association of Petroleum Geologists, pp 149–162
Rodrigues JRP (2006) Calculating derivatives for automatic history matching. Comput Geosci 10(1): 119–136
Sahni I, Horne R (2005) Multiresolution wavelet analysis for improved reservoir description. SPE Reserv Eval Eng 8(1):53–69
Article Google Scholar
Sarma P, Durlofsky LJ, Aziz K, Chen WH (2006) Efficient real-time reservoir management using adjoint-based optimal control and model updating. Comput Geosci 10(1):3–36
Article Google Scholar
Sarma P, Durlofsky LJ, Aziz K, Chen WH (2007) A new approach to automatic history matching using kernel PCA. Paper SPE 106176 presented at the SPE reservoir simulation symposium, Houston, Texas, USA
Sarma P, Durlofsky LJ, Aziz K (2008) Kernel principal component analysis for efficient, differentiable parameterization of multipoint geostatistics. Math Geosci 40(1):3–32
Article Google Scholar
Scheidt C, Caers J (2009) Representing spatial uncertainty using distances and kernels. Math Geosci 41(4):397–419
Article Google Scholar
Scholkopf B, Smola A, Muller KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10(5):1299–1319
Article Google Scholar
Sezgin M, Sankur B (2004) Survey over image thresholding techniques and quantitative performance evaluation. J Electron Imaging 13(1):146–165
Article Google Scholar
Strebelle S (2002) Conditional simulation of complex geological structures using multiple-point statistics. Math Geosci 34(1):1–21
Google Scholar
van Doren JFM, Markovinović R, Jansen JD (2006) Reduced-order optimal control of water flooding using proper orthogonal decomposition. Comput Geosci 10(1):137–158
Article Google Scholar
Wu Z, Reynolds AC, Oliver DS (1999) Conditioning geostatistical models to two-phase production data. SPE J 4(2):142–155
Article Google Scholar
Zhao H, Li G, Reynolds AC, Yao J (2013) Large-scale history matching with quadratic interpolation models. Comput Geosci 17(1):117–138
Article Google Scholar
Zhou Y (2012) Parallel general-purpose reservoir simulation with coupled reservoir models and multi-segment wells. PhD thesis, Department of Energy Resources Engineering, Stanford University

Download references

Acknowledgments

We thank the industrial affiliates of the Stanford University Reservoir Simulation Research (SUPRI-B) and Smart Fields Consortia for partial funding of this work. We are grateful to Vladislav Bukshtynov and Oleg Volkov for implementing the adjoint formulation in AD-GPRS and for their assistance with its use. We also acknowledge Andre Journel, Albert Reynolds and Pallav Sarma for useful discussions and suggestions.

Author information

Authors and Affiliations

Department of Energy Resources Engineering, Stanford University, Stanford, CA, 94305-2220, USA
Hai X. Vo & Louis J. Durlofsky

Authors

Hai X. Vo
View author publications
You can also search for this author in PubMed Google Scholar
Louis J. Durlofsky
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hai X. Vo.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material (ZIP 4583 KB)

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

In this Appendix, we solve the minimization problem (10), defined in Sect. 2.3. We proceed by introducing the Lagrangian $L(x_i)$, given by Luenberger and Ye (2008)

$$\begin{aligned} L(x_i) \!=\! (1\!-\!\gamma )x^2_i \!-\! 2\left( a_i\!-\!\frac{\gamma }{2}\right) x_i\!-\! \mu _i x_i \!-\! \eta _i(1\!-\!x_i), \ \ \mu _i \ge 0, \ \eta _i \ge 0, \ {x_i} \in {[ 0,1]},\nonumber \\ \end{aligned}$$

(18)

where $\mu _i$ and $\eta _i \ \forall \ i \in \{1,\ldots ,N_\mathrm{c}\}$ are the Lagrange multipliers. The Karush–Kuhn–Tucker (KKT) optimality conditions for (18) are

$$\begin{aligned}&\displaystyle \frac{\partial L(x_i)}{\partial x_i}= 2(1-\gamma )x_i - 2\left( a_i-\frac{\gamma }{2}\right) -\mu _i +\eta _i =0, \end{aligned}$$

(19)

$$\begin{aligned}&\displaystyle \mu _i x_i = 0, \end{aligned}$$

(20)

$$\begin{aligned}&\displaystyle \eta _i (1-x_i) = 0, \end{aligned}$$

(21)

$$\begin{aligned}&\displaystyle \mu _i \ge 0,\ \eta _i \ge 0,\ 0 \le x_i \le 1. \end{aligned}$$

(22)

The stationary point for the quadratic convex objective function without bound constraints, $x^\mathrm{s}_i = (a_i-\gamma /2)/(1-\gamma )$, is first computed. Given the fact that the objective function is quadratic and convex with $0 \le \gamma <1$, the minimal point for the bound-constrained problem, designated $x^*_i$, is either $x^\mathrm{s}_i, 0$, or $1$. The following three cases must be considered:

(i)
Case 1: If $a_i \le \gamma /2$, then $x^\mathrm{s}_i \le 0$, and the minimum point is $x^*_i = 0$. From (21) we have $\eta _i = 0$. Substituting $x^*_i = 0$ and $\eta _i = 0$ in (19), we obtain $\mu _i = \gamma - 2a_i$. We can readily verify that, for $x^\mathrm{s}_i \le 0$ and $0 \le \gamma <1$, we have $\mu _i = \gamma - 2a_i \ge 0$, so (22) will be satisfied.
(ii)
Case 2: If $a_i\ge 1-\gamma /2$, then $x^\mathrm{s}_i \ge 1$, and the minimum point is $x^*_i = 1$. From (20) we have $\mu _i = 0$. Substituting $x^*_i = 1$ and $\mu _i = 0$ in (19), we obtain $\eta _i = 2a_i + \gamma - 2$. It is easy to verify that if $x^\mathrm{s}_i \ge 1$ and $0 \le \gamma <1$, we also have $\eta _i = 2a_i + \gamma - 2 \ge 0$. Thus (22) will be satisfied.
(iii)
Case 3: If $\gamma /2<a_i<1-\gamma /2$, we have $0<x^\mathrm{s}_i<1$, and the minimum point of the constrained problem is $x^*_i = x^\mathrm{s}_i$. We also have $\mu _i =\eta _i =0$ from (20) and (21), which satisfies (22).

Using the Lagrange multipliers computed above, we can apply the multivariate implicit function theorem (Moskowitz and Paliogiannis 2011) to determine the derivative ${\partial x_i}$/${\partial \xi _j}$, which will be required for gradient-based data assimilation. We first define, from (19), (20) and (21),

$$\begin{aligned} F_1(x_i, \mu _i, \eta _i, \varvec{\xi })&= 2(1-\gamma )x_i - 2\left( a_i(\varvec{\xi })-\frac{\gamma }{2}\right) -\mu _i +\eta _i , \nonumber \\ F_2(x_i, \mu _i, \eta _i, \varvec{\xi })&= \mu _i x_i , \nonumber \\ F_3(x_i, \mu _i, \eta _i, \varvec{\xi })&= \eta _i (1-x_i). \end{aligned}$$

(23)

Denoting partial derivatives as $F_{1,x_i}\!\! =\!\!{\partial F_1}/{\partial x_i}, F_{1,\mu _i}\!=\!\! {\partial F_1}/{\partial \mu _i}, F_{1,\eta _i} \!\!=\!\! {\partial F_1}/{\partial \eta _i}, F_{1, \xi _j}={\partial F_1}/{\partial \xi _j}$, etc., through application of the implicit function theorem for a system of equations along with Cramer’s rule, we have

$$\begin{aligned} \frac{\partial x_i}{\partial \xi _j} = -\frac{\mathrm{det}\begin{pmatrix} F_{1, \xi _j} &{} F_{1, \mu _i} &{} F_{1, \eta _i}\\ F_{2, \xi _j} &{} F_{2, \mu _i} &{} F_{2, \eta _i}\\ F_{3, \xi _j} &{} F_{3, \mu _i} &{} F_{3, \eta _i}\\ \end{pmatrix}}{\mathrm{det}\begin{pmatrix} F_{1, x_i} &{} F_{1, \mu _i} &{} F_{1, \eta _i}\\ F_{2, x_i} &{} F_{2, \mu _i} &{} F_{2, \eta _i}\\ F_{3, x_i} &{} F_{3, \mu _i} &{} F_{3, \eta _i}\\ \end{pmatrix}} \end{aligned}$$

(24)

By noting that $a_i=\bar{m}_i + \sum _{j=1}^{l} \Phi _{ij} \xi _j$, all of the partial derivatives in (23) can be readily determined. After evaluating the determinants of the two matrices in (24), we have

$$\begin{aligned} \frac{\partial x_i}{\partial \xi _j} = \frac{2 ({\partial a_i}/{\partial \xi _j}) x_i (1-x_i)}{\mu _i+2 x_i +\eta _i x_i -2 \gamma x_i - \mu _i x_i +2 \gamma x^2_i - 2 x^2_i}, \end{aligned}$$

(25)

where ${\partial a_i}/{\partial \xi _j}=\Phi _{ij}$.

The analytical solutions for $x_i$ and ${\partial x_i}/{\partial \xi _j}$, although derived from the O-PCA formulation introduced here, can be interpreted in terms of a post-processing (histogram transform or soft-thresholding) of the underlying PCA model (designated $a_i$). Namely, for this binary case, with the regularization term given by $\gamma x_i(1-x_i)$, O-PCA maps PCA solutions with $a_i \le \gamma /2$ to $x_i=0$, and solutions with $a_i\ge 1-\gamma /2$ to $x_i=1$. For PCA solutions with $\gamma /2 < a_i < 1-\gamma /2$, O-PCA applies the linear mapping $x_i = (a_i-\gamma /2)/(1-\gamma )$. Consistent with this, it can be seen from (25) that, if $x_i=1$ or $x_i=0, {\partial x_i}/{\partial \xi _j}=0$. Otherwise, ${\partial x_i}/{\partial \xi _j}=\Phi _{ij}/({1-\gamma })$.

This analytical description of the post-processing applied by O-PCA is possible in this case because of the simple form of the regularization term. As shown in Sect. 5, O-PCA can also be applied for channelized models described by bimodal log-permeability distributions. For such cases, the regularization term is significantly more complicated, and the analytical solution of the minimization problem cannot be achieved. As a result, the specific mapping introduced by O-PCA cannot be described analytically (though the derivative ${\partial x_i}/{\partial \xi _j}$ can be obtained semi-analytically).

In the development above, $\mathbf{x}$ is the optimization variable. As noted in Sect. 2.3, after the solution and its derivative are found, they are designated $\mathbf{m}$ and ${\mathrm{d}\mathbf m}/{\mathrm{d}\varvec{\xi }}$. To verify our derivation, the analytical solution for $\mathbf{m}$ was compared to solutions obtained by applying a numerical optimization to the minimization problem defined by (8). Agreement to within numerical tolerance was consistently achieved. In addition, the analytical expression (25) for ${\mathrm{d}\mathbf m}/{\mathrm{d}\varvec{\xi }}$ was compared with numerical derivatives computed using finite differences. The Frobenius norm of the difference between the two derivative matrices was less than $10^{-6}$. These results serve as verification of the analytical solutions derived here. All results presented in this paper for binary models were generated using these analytical solutions.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vo, H.X., Durlofsky, L.J. A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models. Math Geosci 46, 775–813 (2014). https://doi.org/10.1007/s11004-014-9541-2

Download citation

Received: 04 June 2013
Accepted: 14 April 2014
Published: 23 July 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s11004-014-9541-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models

Abstract

Access this article

Similar content being viewed by others

Eighty Years of the Finite Element Method: Birth, Evolution, and Future

Topology optimization of multi-scale structures: a review

Triangulation of the Earth’s surface and its application to the geodetic velocity field modelling

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material (ZIP 4583 KB)

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A New Differentiable Parameterization Based on Principal Component Analysis for the Low-Dimensional Representation of Complex Geological Models

Abstract

Access this article

Similar content being viewed by others

Eighty Years of the Finite Element Method: Birth, Evolution, and Future

Topology optimization of multi-scale structures: a review

Triangulation of the Earth’s surface and its application to the geodetic velocity field modelling

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material (ZIP 4583 KB)

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

Appendix: O-PCA Analytical Solution and Derivative for Binary Models

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation