Smart Proxy Modeling of a Fractured Reservoir Model for Production Optimization: Implementation of Metaheuristic Algorithm and Probabilistic Application

Ng, Cuthbert Shang Wui; Jahanbani Ghahfarokhi, Ashkan; Nait Amar, Menad; Torsæter, Ole

doi:10.1007/s11053-021-09844-2

Smart Proxy Modeling of a Fractured Reservoir Model for Production Optimization: Implementation of Metaheuristic Algorithm and Probabilistic Application

Original Paper
Open access
Published: 08 March 2021

Volume 30, pages 2431–2462, (2021)
Cite this article

Download PDF

You have full access to this open access article

Natural Resources Research Aims and scope Submit manuscript

Smart Proxy Modeling of a Fractured Reservoir Model for Production Optimization: Implementation of Metaheuristic Algorithm and Probabilistic Application

Download PDF

Cuthbert Shang Wui Ng¹,
Ashkan Jahanbani Ghahfarokhi¹,
Menad Nait Amar² &
…
Ole Torsæter¹

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Numerical reservoir simulation has been recognized as one of the most frequently used aids in reservoir management. Despite having high calculability performance, it presents an acute shortcoming, namely the long computational time induced by the complexities of reservoir models. This situation applies aptly in the modeling of fractured reservoirs because these reservoirs are strongly heterogeneous. Therefore, the domains of artificial intelligence and machine learning (ML) were used to alleviate this computational challenge by creating a new class of reservoir modeling, namely smart proxy modeling (SPM). SPM is a ML approach that requires a spatio-temporal database extracted from the numerical simulation to be built. In this study, we demonstrate the procedures of SPM based on a synthetic fractured reservoir model, which is a representation of dual-porosity dual-permeability model. The applied ML technique for SPM is artificial neural network. We then present the application of the smart proxies in production optimization to illustrate its practicality. Apart from applying the backpropagation algorithms, we implemented particle swarm optimization (PSO), which is one of the metaheuristic algorithms, to build the SPM. We also propose an additional procedure in SPM by integrating the probabilistic application to examine the overall performance of the smart proxies. In this work, we inferred that the PSO had a higher chance to improve the reliability of smart proxies with excellent training results and predictive performance compared with the considered backpropagation approaches.

Forecasting of Horizontal Gas Well Production Decline in Unconventional Reservoirs using Productivity, Soft Computing and Swarm Intelligence Models

Article 29 September 2018

Machine learning-based fracturing parameter optimization for horizontal wells in Panke field shale oil

Article Open access 13 March 2024

Artificial Neural Network Modeling and Forecasting of Oil Reservoir Performance

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Hydrocarbons are among the primary sources of energy in today’s world. This is proven by a statistical review conducted by British Petroleum (2020), which found that, in 2019, oil contributed to the largest share of the world primary energy of about 33.1%, whereas natural gas had the third largest share of 24.2%. Hence, they play a pivotal role in quenching the high demand of world energy consumption and such demand will be likely in an upward trend due to the increasing global population (Gerald et al. 2014; International Energy Agency 2018). In addition, the importance of hydrocarbons is reflected by the significant influence of their price on many other major economic domains (Lescaroux and Mignon 2009). This is illustrated clearly by the phenomenon of how many other industries have been affected by the fluctuation of oil price (Lescaroux and Mignon 2009). Therefore, it is essential to have a sustainable hydrocarbon production not only to fulfill the demand for energy consumption, but also to maintain the global economic growth. With respect to this, carbonate reservoirs are one of the main sources of hydrocarbons. These reservoirs make up approximately 60% of the global oil reserves and about 40% of the global gas reserves (Schlumberger 2020b). Most of these reservoirs are naturally fractured, and hence, accurate modeling of fluid flow in these reservoirs is one of the most critical steps to ensure the sustainable production of hydrocarbons.

In general, modeling of fluid flow in porous media can be perceived as a numerical reservoir simulation. Reservoir simulation is one of the most frequently used tools in reservoir management, which is the application of technological, labor, and financial resources to maximize the economic performance and the hydrocarbon recovery of a reservoir (Wiggins and Startzman 1990). This is because it has been implemented extensively to help predict the performance of a reservoir as well as to provide useful information for uncertainty analysis or any optimization task that includes enhanced oil recovery, hydraulic fracturing, and so forth. However, one of the challenges of accurate modeling of fractured reservoirs stems from a lack of underlying theory or principle to describe the behavior of fluid flow in these reservoirs. To mitigate this challenge, Barrenblatt (1960) established a theory pertaining to fluid flow in fractured porous media. Based on this theory, Warren and Root (1963) developed the dual-porosity method, which has been one of the most fundamental tools in simulating a fractured reservoir. However, this conventional model does not sufficiently capture the realistic behavior of fluid flow as fluid is assumed to move only through fractures, whereas the matrix blocks only supply fluid to fractures. Hence, this model was enhanced to the dual-porosity dual-permeability (DPDP) model, in which the transport of fluid between matrix blocks is considered (Uleberg and Kleppe 1996). The details regarding this model are explained further below.

Having developed the DPDP model implies that fractured reservoirs can be simulated numerically. Nonetheless, another challenge in terms of computational effort arises as the complexity of simulated fractured reservoirs increases (including as much details as possible to “describe realistically” a reservoir). Therefore, reservoir management might not be sufficiently efficient to keep up with sustainable hydrocarbon production. Fortunately, in today’s world of digitalization, methods of artificial intelligence and machine learning (AI&ML) have come to the rescue. In this context, Ertekin and Sun (2019) provided a very comprehensive review on the implementation of AI&ML methods in the field of reservoir engineering. They also proposed the use of hand-shaking protocol that would combine the advantages of both traditional and intelligent reservoir modeling to develop more powerful computational protocols. With this, the great potential and extensive utilization of AI&ML-based methods have also been demonstrated further in many technical domains of the petroleum industry (Mohaghegh 2000a, b, c; Parada and Ertekin 2012; Nait Amar and Jahanbani Ghahfarokhi 2020; Nait Amar et al. 2020). Moreover, with the help of AI&ML, Mohaghegh (2011) has coined a new class of reservoir modeling, namely smart proxy modeling (SPM). Fundamentally, SPM is the development of an artificial neural network (ANN) that receives both input and output data from a reservoir simulation model and undergoes a training phase. After the ANN has been trained to recognize the pattern induced by the data (relationship between input and output), it can yield the estimated result that matches with that produced by the reservoir model within a few seconds or minutes. Therefore, this ANN is termed “smart proxy.” Regarding this, the word “smart” reveals the ability to learn and capture the underlying physical behavior of a simulated reservoir model through pattern recognition and the word “proxy” denotes to act on behalf of the original model (Mohaghegh 2017, 2018).

For the past decade, SPM has been considered as a technological breakthrough in the petroleum industry as it has not only reduced the reservoir simulation time significantly, but it also provided the results within an acceptable range of accuracy. The successful application of smart proxies has been demonstrated in many literatures of the oil and gas industry. Mohaghegh et al. (2006) developed surrogate reservoir model (the initial nomenclature of SPM), which was an accurate representation of a sophisticated full-field reservoir model, and used it for uncertainty analysis. With this breakthrough, these surrogate models were implemented on different real fields in Saudi Arabia for geological uncertainty analysis (Mohaghegh et al. 2012a, c). Mohaghegh et al. (2012b, 2015) then reformulated the concept of SPM by categorizing it as grid-based and well-based. As the nomenclatures imply, grid-based SPM is done for the analysis of numerical model at grid block level, whereas well-based SPM is for the analysis at well level. Grid-based SPM has been applied in several real-life CO₂ sequestration projects (Mohaghegh et al. 2012b), whereas well-based SPM has been implemented for optimization of production scheduling of a real field in United Arab Emirates (Mohaghegh et al. 2015). Besides, the application of SPM was then extended gradually to other domains, such as history matching and enhanced oil recovery (EOR). He et al. (2016) coupled the use of SPM with differential evolution (DE) to perform automatic history matching. Alenezi and Mohaghegh (2016) also built a SPM that reproduced and forecasted the dynamic properties of a reservoir that has been water-flooded. Moreover, Mohaghegh (2018) discussed the utilization of SPM under the context of CO₂-EOR as a storage mechanism. Furthermore, Parada and Ertekin (2012) applied SPM to establish successfully a new screening tool for four different improved oil recovery (IOR) methods, including waterflooding, miscible injection of CO₂ and N₂, and injection of steam. Therefore, these literatures do not only show the high applicability of SPM in oil and gas industry, but they also highlight its potential for further enhancement.

Nevertheless, SPM still has few disadvantages. One of them is that a smart proxy built can only be applied to predict what the simulated reservoir might estimate only if the physics assumed in the numerical simulation is not changed. For instance, if a smart proxy is developed on a reservoir model with reservoir pressure of 4000 psia,^{Footnote 1} then it cannot be applied to perform any estimation of parameters when the reservoir pressure is not 4000 psia. To handle this problem, another smart proxy needs to be established. In addition to this, the spatio-temporal database is considered as the backbone of the SPM as it is the main component used to train the ANN model. Thus, if another smart proxy is built (as previously mentioned), then the database needs to be prepared again. Despite having such inconvenience, the time spent on preparation of this database is still much less than the time spent by numerical simulation. Pertaining to this, the preparation of a spatio-temporal database might take about few hours (or for few minutes with the help of commercial software). However, for a sophisticated reservoir simulation model, the computation might take a few days. It is important to understand that smart proxy is another example of data-driven model as it is developed by analyzing the collected data (Alenezi and Mohaghegh 2016, 2017). Hence, careful attention is required when a spatio-temporal database is created. If incorrect data are provided to the smart proxy, it will “learn wrongly” and produce unsatisfactory results. This complies with the short phrase that goes “garbage in and garbage out.”

Although there are many literatures explaining the theoretical basis of SPM, it is still treated as “black-box” as commercial software is mostly used to build a smart proxy. Thus, in this work, one of the objectives was to provide a more vivid illustration of how SPM can be performed based on a synthetic reservoir model. Besides, we present another alternative of training algorithm apart from the backpropagation algorithm that is mostly used in SPM. More intriguingly, we include a probabilistic application to evaluate further the overall performance of the developed SPMs. We opine that this integration in SPM is insightful as it helps to better reflect the performance of the proxy models. After this introduction, we discuss briefly the mathematical concepts of the DPDP model and how ANN operates. Three different algorithms, which are two examples of backpropagation algorithms, namely stochastic gradient descent (SGD) and adaptive moment estimation (Adam) algorithms, and particle swarm optimization (PSO), were implemented as the learning algorithm to train the ANN. Hence, the fundamentals of these algorithms are discussed next. Then, we explicate the background of the reservoir model simulated based on the DPDP method and the problem setting of the production optimization case. We also explain how the respective SPM is developed upon it and used in production optimization. Then, the results and discussion will follow. Prior to proceeding to conclusions, we also provide another case study, which considers a heterogeneous fractured reservoir model, to further show the robustness of the methodology discussed in this paper.

Methodology

Fundamentals of DPDP Model

In the conventional dual-porosity model, a grid block consists of two portions—the matrix block and the fractures. In this model, the fluid flows mainly through the fractures, whereas the matrix blocks only provide fluids to the fracture (Uleberg and Kleppe 1996). This phenomenon of fluid flow is illustrated in a two-dimensional case as in Figure 1.

Assuming a one-dimensional and one-phase flow case, the transport of fluid through the fracture can be mathematically expressed as (Barrenblatt 1960; Warren and Root 1963):

$$\frac{\partial }{\partial x}\left( {\frac{k}{\mu B}\frac{\partial P}{{\partial x}}} \right)_{{{{\rm fracture}}}} + \hat{q}_{{{{\rm matrix}}\_{{\rm fracture}}}} = \frac{\partial }{\partial t}\left( {\frac{\emptyset }{B}} \right)_{{{{\rm fracture}}}}$$

(1)

where k is permeability, B is the formation volume factor, $\mu$ is viscosity of fluid, and $\mathrm{\varnothing }$ is porosity. The term ${\widehat{{\rm q}}}_{{\rm matrix\_fracture}}$ indicates the supply of fluid to fractures by the matrix block, and its mathematical expression is:

$$-{ \, \widehat{{\rm q}}}_{{\rm matrix\_fracture}} = \frac{ \partial}{ \partial{\rm t}}{\left(\frac{ \emptyset }{{\rm B}}\right)}_{{\rm matrix}}$$

(2)

Because the assumption of no fluid flow between the blocks of matrix is not realistic, the dual-porosity model was extended to the DPDP model by adding a flow term in Eq. (2) (Uleberg and Kleppe 1996). Hence, the system of equations representing the DPDP model is:

$$\frac{\partial}{\partial{\rm x}}{\left(\frac{{\rm k}}{\mu{\rm B}}\frac{\partial{\rm P}}{\partial{\rm x}}\right)}_{{\rm fracture}} + { \, \widehat{{\rm q}}}_{{\rm matrix\_fracture}} \, = \, \frac{\partial}{\partial{\rm t}}{\left(\frac{ \emptyset}{{\rm B}}\right)}_{{\rm fracture}}$$

(3)

$$\frac{\partial}{\partial{\rm x}}{\left(\frac{{\rm k}}{\mu{\rm B}}\frac{\partial{\rm P}}{\partial{\rm x}}\right)}_{{\rm matrix}}-{ \, \widehat{{\rm q}}}_{{\rm matrix\_fracture}} \, = \, \frac{\partial}{\partial{\rm t}}{\left(\frac{ \emptyset }{{\rm B}}\right)}_{{\rm matrix}}$$

(4)

Regarding the exchange term, it can be further represented as:

$$-{ \, \widehat{{\rm q}}}_{{\rm matrix\_fracture}} \, = \, \sigma\frac{{{\rm k}}_{{\rm matrix}}}{\mu}\left({{\rm P}}_{{\rm matrix}} -{ \, {{\rm P}}}_{{\rm fracture}}\right)$$

(5)

where P denotes pressure, whereas $\upsigma$ is the shape factor or the geometric factor. This shape factor represents the geometry of the matrix block, and it dictates the flow fluid between the matrix blocks and the fracture system (Kazemi et al. 1976). There are many mathematical formulations available in the literature to describe this shape factor depending upon the physical effects and mechanisms considered (Warren and Root 1963; Ahmad and Olivier 2008; Su et al. 2013). In this context, one of the most widely applied forms is the one proposed by Kazemi et al. (1976), and it was used in this study. Regarding its formulation, Kazemi et al. (1976) discussed that the shape factor can be computed in a three-dimensional case as:

$$\sigma \, = \, 4 \times \left[\frac{1}{{{\rm L}}_{{\rm x}}^{2}} + \, \frac{1}{{{\rm L}}_{{\rm y}}^{2}} + \frac{1}{{{\rm L}}_{{\rm z}}^{2}}\right]$$

(6)

where the L term refers to the dimension of the matrix block in x-, y-, and z- directions.

ANN

ANN is a biologically inspired mathematical model or algorithm that can predict any relevant output within an acceptable range of accuracy after learning the relationship between the inputs and outputs provided (Wilamowski and Irwin 2011; Buduma and Locasio 2017). This biological inspiration stems from the imitation of learning method used in human brains. ANN is very robust due to its high generalization ability in capturing the nonlinearity of any process investigated (Gharbi and Mansoori 2005; Wilamowski and Irwin 2011; Nait Amar et al. 2018b). Thus, ANN is better than any traditional regression approach to solve complicated mathematical problems (Gharbi and Mansoori 2005). There are different types of ANN, such as feed-forward neural network, convolutional neural network (CNN), recurrent neural network (RNN). Multilayer perceptron (MLP), which is an example of feed-forward neural network,^{Footnote 2} was implemented here. Regarding the architecture of MLP, it is made up of three different types of layers, namely one input layer, one or more hidden layers, and one output layer (Wilamowski and Irwin 2011; Buduma and Locasio 2017). Each of these layers comprises simple calculating elements, which are known as nodes, units, or artificial neurons (Gharbi and Mansoori 2005). The output from each node in a layer is multiplied by the weights (and biases). The product enters the node in the next layer as input. These inputs are then summed and applied to activation function, also known as transfer function, to produce the output of the node. The structure or topology of an arbitrary ANN that comprises one input layer with three nodes, one hidden layer with four nodes, and one output layer with two nodes is shown in Figure 2.

Referring to Figure 2, the mechanism of ANN can be expounded mathematically as follows. From input layer to hidden layer, the output of the node is computed as:

$${{\rm o}}_{\hbox{j}} \, = \, {{\rm F}}\left(\sum_{{{\rm i}}{ = 1}}^{{{\rm N}}_{\hbox{i}}}{{\rm w}}_{\hbox{ji}}{{{\rm o}}}_{\hbox{i}}{+}{{\rm b}}_{\hbox{ji}}\right)$$

(7)

Then, from hidden layer to output layer, the output of the node is calculated as:

$${{\rm o}}_{{\rm k}} = {{\rm F}}\left(\sum_{{{\rm j}}{\rm = 1}}^{{{\rm N}}_{{\rm j}}}{{\rm w}}_{{\rm kj}}{{{\rm o}}}_{{\rm j}} + {{\rm b}}_{{\rm kj}}\right)$$

(8)

In Eqs. (7) and (8), the subscript i denotes the input layer, the subscript j means the hidden layer, and the subscript k indicates the output layer, N shows the number of nodes in each layer, o indicates either the output of node in the current layer or the input of node from previous layer (based upon the subscript), w is a set of weights, and b is a set of biases. Weights are considered as the fitting parameters in modeling of an ANN, whereas bias is an extra node that provides more flexibility for the ANN model to be trained. There are many forms of activation functions F that are readily used in ANN modeling. The major ones include sigmoid, rectified linear unit (ReLU), and hyperbolic tangent (Buduma and Locasio 2017). Here, the activation function used was ReLU and it is represented as:

$${{\rm F}}\left({{\rm x}}\right) \, = \left\{ \, \begin{array}{c}{\rm 0 for }{{\rm x}} \, \le \, {0}\\ {{\rm x}}{\rm for }{{\rm x}} \, > \, {0}\end{array}\right.$$

(9)

The derivative of the ReLU function is:

$$\frac{ \partial {{\rm F}}\left({{\rm x}}\right)}{ \partial {{\rm x}}} = \left\{ \, \begin{array}{c}{\rm 0 for }{{\rm x}} \, \le \, {0}\\ {\rm 1 for }{{\rm x}} \, >\, {0}\end{array}\right.$$

(10)

Mathematically, ANN learns the relationship or recognizes the pattern between input and output data through the tuning of the sets of weights and biases between the two layers. Through a number of epochs (or iterations), these weights and biases are optimized by minimizing any predefined error function (also known as loss or cost function), such as mean squared error, average absolute percentage error. There are different examples of algorithms that can be used to optimize these weights and biases. Backpropagation algorithm has been commonly used in this context. Examples of backpropagation algorithm are gradient descent (GD), Gauss–Newton algorithm, Levenberg–Marquardt algorithm (LM), adaptive gradient algorithm (AdaGrad), root-mean-square propagation (RMSProp), Adam, and so forth. Additionally, other metaheuristic algorithms, like PSO, DE, genetic algorithm (GA), and so forth, have also been proven useful for neural network training (Nait Amar et al. 2018a, b). As Bianchi et al. (2009) have counseled, metaheuristic algorithm is a high-level mathematical algorithm that is generally natural inspired and used to solve more sophisticated optimization problems. In this study, both backpropagation algorithm and metaheuristic approach have been employed to enable the ANN to “learn.” The selected backpropagation algorithm was GD, whereas PSO was the chosen metaheuristic training algorithm.

Backpropagation Algorithm

For the workflow of the GD algorithm, both the inputs and outputs are fed to the ANN as the training phase starts. When the inputs enter the ANN and proceed through the layers, they are gradually processed to yield the predicted output. Thereafter, the predicted output is compared with the desired output. Errors are then propagated back through the ANN. During this backpropagation, the weights and biases are adjusted to minimize the errors. Such process is repeated iteratively until either the errors become less than the predefined tolerance or the number of iterations is reached. The GD is an algorithm that applies the first-order derivative for computation. In this context, the first-order derivative of the error function is implemented to determine the minimum in the error space. The calculation of gradient at iteration t can be expressed mathematically as:

$$g_{t} ~ = ~\frac{{\partial E\left( {x,w_{t} } \right)}}{{\partial w_{t} }}~ = ~\left[ {\frac{{\partial E}}{{\partial w_{{1,t}} }}~~~~\frac{{\partial E}}{{\partial w_{{2,t}} }}~~~\frac{{\partial E}}{{\partial w_{{3,t}} }}~~ \ldots ~~\frac{{\partial E}}{{\partial w_{{N,t}} }}} \right]^{T}$$

(11)

where E indicates the error function, x the input vector, and w the weight (and bias) vector. Thereafter, the weights are updated by using the following equations. The same idea applies to the updating of the biases.

$$w_{{t + 1}} ~ = ~w_{t} ~ + ~\Delta w_{t}$$

(12)

$$w_{{t + 1}} ~ = ~w_{t} ~ - ~\left( {\gamma ~ \times ~g_{t} } \right)$$

(13)

In Eqs. (12) and (13), the weights (and biases) at iteration t + 1 are updated using the weights (and biases) at iteration t, the gradient at t, and $\gamma$, which is the learning rate or step size. Therefore, the gradient is always computed at every iteration step to adjust the weights (and biases). Pertaining to the computation of gradient of error function, it is highly dependent on the forms of error function and activation function that were used. Here, the error function used was the mean squared error, whereas the activation function used was ReLU.

The mathematical formulation of the application of GD as learning algorithm is as follows. For the following derivation, the meaning of the subscripts used here is the same as explained above. The term t means the target value or the actual output, P, denotes the total number of training sets provided; thus:

$${{\rm E}}\left({{\rm x}} , \, {\rm w, b}\right) \, = \, \frac{1}{{{\rm P}}}\sum_{{{\rm k}} = \, {1}}^{{\rm P}}{\left({{\rm t}}_{{{\rm k}} \, }-{ \, {{\rm o}}}_{{\rm k}}\right)}^{2}$$

(14)

Having defined the error function, the backpropagation algorithm starts by computing the weight update between the hidden and output layers. To perform this computation, the gradient of the error function with respect to the weights between the hidden and output layers is determined. Thereafter, the similar procedure is conducted to calculate the weight update between the hidden and input layer. This algorithm carries on iteratively until the value of error function (obtained by using the updated weights and biases) is less than a predefined tolerance or the initialized number of epochs is reached. For a more substantial understanding of the mathematical formulation of the backpropagation algorithm, peruse Wilamowski and Irwin (2011) and the relevant literatures. Here, the Keras module, which was developed by Chollet (2019), had been implemented with the help of the programming language Python 3.8.1 and TensorFlow 2.1.0 to use the GD algorithm to optimize the weights and biases. However, it is essential to note that in Keras module, instead of using GD algorithm, the stochastic gradient descent (SGD) algorithm is applied. The fundamentals of these two algorithms are the same. The main difference is that, in SGD, the gradient is only computed once at each iteration step (by randomly selecting a sample from the training set) and is used further (Buduma and Locasio 2017). By inducing this stochastic behavior, the computational cost is reduced drastically. Apart from SGD, Adam was another backpropagation algorithm used here; it is a more advanced and robust variant of SGD developed by Kingma and Ba (2015). Mathematically, it approximates the first and second moments of gradients to adaptively calculate the learning rates for different parameters (Kingma and Ba 2015). Refer to Kingma and Ba (2015) for the details of Adam. Here, Adam was also implemented using Python 3.8.1 and TensorFlow 2.1.0.

PSO

PSO was introduced by Kennedy et al. (1995) based upon the simulation of the social behavior of a flock of flying birds. As explained in several literatures (Kennedy et al. 1995; Shi and Eberhart 1999; Nait Amar et al. 2018a), mathematically, this algorithm operates by having a population of particles, which is also known as a swarm of particles. Each of these particles corresponds to a potential position or a solution in a search space. Then, the position of each particle is updated iteratively according to its position and velocity at previous timestep. The movements of the particles in the search space are controlled by their own best-known position (the local best position) and their best-known position in the entire swarm (the global best position). As this process occurs iteratively, the particles in the swarm will eventually converge to an optimal point, which is deemed as the best solution in the search space. The position and velocity for the j^th particle in a N-dimensional space at iteration t can be expressed, respectively, as:

$${{\rm x}}_{{\rm j,t}} \, = \, \left\{{{\rm x}}_{{\rm j1,t}}, {{\rm x}}_{{\rm j2,t}}, {{\rm x}}_{{\rm j3,t}},\ldots,{{\rm x}}_{{\rm jN,t}}\right\}$$

(15)

$${{\rm v}}_{{\rm j,t}} \, = \, \left\{{{\rm v}}_{{\rm j1,t}}, {{\rm v}}_{{\rm j2,t}}, {{\rm v}}_{{\rm j3,t}},\ldots,{{\rm v}}_{{\rm jN,t}}\right\}$$

(16)

Then, the velocity of each particle at next iteration t + 1 is updated as (Shi and Eberhart 1999):

$${{\rm v}}_{{\rm jN,t+1}} \, = \, {{\rm v}}_{{\rm jN,}{{\rm t}}} \, + { \, {{\rm c}}}_{1}{{{\rm r}}}_{1}\left({{\rm pbest}}_{{\rm jN,t}}\boldsymbol{ }-{ \, {{\rm x}}}_{{\rm jN,t}}\right) + { \, {{\rm c}}}_{2}{{{\rm r}}}_{2}\left({{\rm gbest}}_{{\rm N,t}}\boldsymbol{ }-{ \, {{\rm x}}}_{{\rm jN,t}}\right)$$

(17)

In Eqs. (15), (16), and (17), v_jN,t and x_jN,t represent the velocity of the jth particle at iteration t and its corresponding position in N-dimension quantity, respectively; pbest_jN,t corresponds to the N-dimension quantity of the individual j at the best position or the local best position at iteration t; gbest_N,t is the N-dimension quantity of the swarm at the best position or the global best position at iteration t; c₁ denotes the cognitive learning factor (also known as cognitive weight), whereas c₂ means the social learning factor (also known as social weight); r₁ and r₂ are random numbers extracted between 0 and 1. Upon updating the velocity, each particle moves to a new potential solution as:

$${{\rm x}}_{{\rm jN,t+1}} = {{\rm x}}_{{\rm jN,t}} + {{\rm v}}_{{\rm jN,t+1}}$$

(18)

A new parameter, inertial weight $\upomega$ introduced by Shi and Eberhart (1998), was included in Eq. (17) to improve the convergence condition. This also gradually decreases the velocity of the particles to have the swarm of particles under control (Nait Amar et al. 2018a). In other words, it plays a part in balancing the global search also known as exploration, and the local search also termed as exploitation (Shi and Eberhart 1998; Zhang et al. 2015):

$${{\rm v}}_{{{{\rm jN}},{{\rm t}} + 1}} ~ = {\omega }{{\rm v}}_{{{{\rm jN}},{{\rm t}}}} ~ + {{\rm c}}_{1} {{\rm r}}_{1} \left( {{{\rm pbest}}_{{{{\rm jN}},{{\rm t}}}} ~ - {{\rm x}}_{{{{\rm jN}},{{\rm t}}}} } \right) + {{\rm c}}_{2} {{\rm r}}_{2} \left( {{{\rm gbest}}_{{{{\rm N}},{{\rm t}}}} - {{\rm x}}_{{{{\rm jN}},{{\rm t}}}} } \right) .$$

(20)

In the context of the minimization problem, an objective function f to be minimized is defined. Then, to determine the local best solution at iteration t + 1, the following formula is given (Nait Amar et al. 2018a):

$${{\rm pbest}}_{{{\rm jN}}, {\rm t}+1} = \left\{\begin{array}{c}{{\rm pbest}}_{{\rm jN, t}}{\rm , if f} \, ({{\rm pbest}}_{{\rm jN, t}}) \, = \, {\rm f}({{\rm x}}_{{\rm jN,t}+1})\\ {{\rm x}}_{{\rm jN,t}+1}{\rm , otherwise}\end{array}\right.$$

(21)

Then, to find the global best solution at iteration t + 1, the following mathematical formulation is presented:

$${{\rm gbest}}_{{\rm jN, t+1}} \, = \, {{\rm min}}\left[{{\rm f}}\left({{\rm pbest}}_{{\rm jN, t+1}}\right)\right]$$

(22)

In this study, the objective function was the error function in the ANN modeling. To apply PSO as the training algorithm of ANN, this can be simply done by treating the weights and biases as the particles in the algorithm. Hence, the total number of particles in a swarm is the total number of weights and biases. Then, the optimization can be performed using the abovementioned formulations. Here, the package of PySwarms version 1.1.0, which was built by Miranda (2019), was implemented by using the programming language Python 3.8.1 to perform the optimization. In comparison with the SGD algorithm, one of the advantages of PSO is that it is a derivative-free algorithm. This implies that it is more robust as it can be utilized to optimize a mathematical function that is not easily differentiable.

Numerical Simulation Model

A three-dimensional, two-phase (black oil and water) reservoir simulation model was built to represent the “true” reservoir model. The “true” reservoir is in fact inspired by the dual-porosity model discussed in Firoozabadi and Thomas (1990), which is a two-dimensional and three-phase model (black oil, water, and gas—including free and dissolved gas). However, most of the reservoir parameters and relevant fluid properties were changed to develop the “true” model. This “true” reservoir model supplied the necessary data for the development of the respective SPM. This reservoir was a DPDP model made up of three layers with uniform thickness.^{Footnote 3} The top of this reservoir was set at the depth of 305 m. About the geometry of this model, each grid block had a length of 25 m, a width of 25 m, and a height of 15.2 m. Thus, the dimension of the reservoir model was 1525 m × 1525 m × 45.7 m, which corresponds to the number of blocks being 61 × 61 × 3. Regarding the well configuration, it was the five-spot pattern in which four injectors were, respectively, set to penetrate near the corners of this reservoir model and a producer was placed in the center of the reservoir. The injectors (producer) would inject water to (would produce from) all the fracture layers. Besides that, the performance of each well in this model was controlled by its respective rate. The target of the field production rate was set equal to the target of the field injection rate for pressure maintenance. For instance, if the target rate of the producer was 400 m³/day, then the target rate of each of the injector was 100 m³/day (totaling up to 400 m³/day of the target of the field injection rate). The numerical simulation of this DPDP reservoir model was conducted using ECLIPSE 100 software Schlumberger (2020a). Other details of this model are summarized in Table 1.

Table 1 Essential parameters used to develop the DPDP reservoir model

Smart Proxy Modeling of a Fractured Reservoir Model for Production Optimization: Implementation of Metaheuristic Algorithm and Probabilistic Application

Abstract

Similar content being viewed by others

Forecasting of Horizontal Gas Well Production Decline in Unconventional Reservoirs using Productivity, Soft Computing and Swarm Intelligence Models

Machine learning-based fracturing parameter optimization for horizontal wells in Panke field shale oil

Artificial Neural Network Modeling and Forecasting of Oil Reservoir Performance

Introduction

Methodology

Fundamentals of DPDP Model

ANN

Backpropagation Algorithm

PSO

Numerical Simulation Model

Production Optimization

Smart Proxy Modeling

Data Preparation and Analysis

Neural Network Training

Results and Discussion

Heterogeneous Model

Conclusions

Notes

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation