Introduction

The conventional PVT properties for black oil type reservoir fluids are the solution gas–oil ratio, Rs; the oil formation volume factor, Bo; and the gas formation volume factor, Bg. According to Fattah et al. (2009), the difference between the modified black oil (MBO) PVT properties and the conventional black oil PVT properties lies in the handling of the liquid content of the gas phase, expressed by the oil–gas ratio, Rv. The MBO approach assumes that the stock tank liquid components can exist in both the liquid and gas phases at reservoir conditions. The liquid content of the gas phase is defined as a function of pressure called the vaporized oil–gas ratio, Rv. It is analogous to the solution gas–oil ratio, Rs, which describes how much gas is dissolved in the liquid phase. The MBO (or extended black oil) approach was first introduced by Spivak and Dixon (1973).

Whitson and Torp (1983) presented a procedure to estimate the MBO properties from constant volume depletion (CVD) test data for gas condensates. Coats (1985) introduced a different method for gas condensate reservoirs using commercial EOS PVT software and a regression package to fit the PVT data obtained in the laboratory. McVay (1994) extended Coats' method to volatile oil reservoirs. Walsh and Towler (1994) introduced another method to compute the MBO properties from CVD data of gas condensates. Fevang et al. (2000) presented guidelines to help engineers choose between the MBO and compositional approaches. In 2006, Fattah et al. presented a comparison between the Whitson and Torp and Coats methods using compositional simulation; the results showed a good match between the experimental PVT data and the EOS model. In 2009, Fattah et al. developed a new set of correlations to calculate the MBO properties of volatile oils and gas condensates. Alimadadi et al. (2011) predicted the PVT properties using an ANN model, with component mole fractions of the fluid sample, solution gas–oil ratio (Rs), bubble point pressure (Pb), reservoir pressure, API oil gravity, and temperature as input parameters. Their model processed the inputs using two parallel multilayer perceptron (MLP) networks before recombining the results.

Arief et al. (2017) proposed a technique that uses surrogate models and an available laboratory database to estimate fluid properties. Two surrogate models were studied in their work: universal kriging and NN.

González et al. (2003) developed NN models to estimate the dew point pressure for retrograde gas reservoirs, with a reported prediction error of 8.74%.

Osman and Al-Marhoun (2005) established ANN models to estimate several PVT properties: formation volume factor, isothermal compressibility, and brine density as functions of temperature, salinity, and pressure. They also predicted brine viscosity as a function of brine salinity and temperature. These models were developed using 1040 data points.

Oloso et al. (2009) predicted crude oil viscosity and gas–oil ratio using support vector machines and functional networks.

Ahmadi et al. (2015) utilized the NN technique to model the bubble point pressure as a function of fluid composition and other reservoir parameters. They used a back-propagation NN together with a particle swarm optimization algorithm to minimize the error.

Sahterri et al. (2015) developed an NN model to predict the gas compressibility factor (Z-factor) using a data set of 978 points. Their model, a Wilcoxon generalized radial basis function network, estimated the Z-factor with a 2.3% average relative error.

Adeeyo (2016) developed NN models to predict the bubble point pressure and the formation volume factor at the bubble point pressure for Nigerian crude oils. A trial-and-error approach was used to find the number of neurons that gave stable results.

Artificial intelligence techniques

Artificial neural network

An artificial neural network (ANN) is a data processing model analogous to biological nervous systems such as the brain. Its most important feature is the innovative structure it uses to process information: a large number of highly interconnected elements, or neurons, working together to solve specific problems. Like people, ANNs can learn by example. An ANN can be configured for a specific application, such as pattern recognition or data classification, through a learning process. In biological systems, this process involves adjustments to the synaptic connections between neurons.

For the last two decades, AI has been used extensively in several applications in the oil industry. A good number of studies have applied various computational intelligence (CI) schemes to forecast the characteristics of gas and oil flow through reservoirs and pipes, including logistic regression (LR) (Hosmer and Lemeshow 2000), multilayer perceptrons (MLP) (Wlodzisław et al. 1997), and radial basis functions (RBF) (Guojie 2004).

Functional networks

Functional networks (FNs) are a generalization of neural networks consisting of several layers of neurons connected through links. Each computing unit, or neuron, performs a simple calculation: a scalar function f of a weighted sum of its inputs is associated with each neuron, and well-known algorithms let the network learn from the data. The main idea of functional networks is to learn the functions f themselves while suppressing the weights. In addition, these multidimensional functions can be consistently replaced by functions of single variables. With n links leading from the last layer of neurons to an output unit, the output can be written in a different form for each link, which yields a system of n − 1 equations that can be written directly from the network topology. Solving this system of equations simplifies the initial functions f associated with the neurons. Castillo et al. (2001) provided a comprehensive demonstration of FN applications in engineering and statistics. It was observed in the literature, however, that FNs have seen little application in the oil industry.
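The idea of learning the neuron functions rather than the weights can be shown with a minimal sketch (illustrative Python, not the FN implementation of this study): each unknown function is represented by a low-order polynomial whose coefficients are found by linear least squares.

```python
# Functional-network sketch: model y ~= f1(x1) + f2(x2), where f1 and f2
# are the unknown neuron functions, represented here as polynomials.

def design_row(x1, x2, degree=2):
    row = [1.0]                                   # shared constant term
    row += [x1 ** d for d in range(1, degree + 1)]  # basis of f1
    row += [x2 ** d for d in range(1, degree + 1)]  # basis of f2
    return row

def fit_least_squares(rows, targets):
    # solve the normal equations A^T A c = A^T y by Gaussian elimination
    n = len(rows[0])
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    aty = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(n)]
    for i in range(n):                            # elimination with pivoting
        p = max(range(i, n), key=lambda k: abs(ata[k][i]))
        ata[i], ata[p] = ata[p], ata[i]
        aty[i], aty[p] = aty[p], aty[i]
        for k in range(i + 1, n):
            f = ata[k][i] / ata[i][i]
            ata[k] = [a - f * b for a, b in zip(ata[k], ata[i])]
            aty[k] -= f * aty[i]
    coef = [0.0] * n
    for i in reversed(range(n)):                  # back substitution
        coef[i] = (aty[i] - sum(ata[i][j] * coef[j]
                                for j in range(i + 1, n))) / ata[i][i]
    return coef

# toy data from y = x1^2 + 3*x2; the learned functions should recover it
data = [(x1, x2, x1 ** 2 + 3 * x2) for x1 in range(-3, 4) for x2 in range(-3, 4)]
rows = [design_row(x1, x2) for x1, x2, _ in data]
coef = fit_least_squares(rows, [y for _, _, y in data])
predict = lambda x1, x2: sum(c * v for c, v in zip(coef, design_row(x1, x2)))
```

Because the target lies in the span of the chosen bases, the fitted functions reproduce it exactly; in practice the basis family is a modeling choice.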

Support vector machines

Support vector machines (SVMs) are a group of related supervised learning methods that have been applied to both classification and regression. They belong to a family of generalized linear classifiers and can also be treated as a special case of Tikhonov regularization. SVMs map the input vectors to a higher-dimensional space where a maximal separating hyperplane is constructed. Two parallel hyperplanes are constructed on either side of the hyperplane that separates the data, under the assumption that the larger the distance between these parallel hyperplanes, the better the generalization error of the classifier (Burges 1998).

SVMs have been used widely in many engineering fields, including defect prediction in software engineering (Elish and Elish 2007), surface tension prediction in chemistry (Jie Wang et al. 2007), geotechnical engineering (Anthony and Goh 2007), and oil and gas (Jian and Wenfen 2006), with very promising results.

Fluid samples used

Fattah (2005) reported PVT experimental data for thirteen reservoir fluid samples [eight gas condensates (GC) and five volatile oils (VO)]. These PVT data were used in this study. The samples were obtained from reservoirs at different locations and depths and were selected to cover a wide range of oil and gas fluid characteristics. Some samples represent near-critical fluids (VO 2, VO 5, GC 1, and GC 2) as classified by McCain and Bridges (1994). Table 1 presents the major properties of these thirteen fluid samples.

Table 1 Characteristics of fluid samples (Fattah et al. 2009)

EOS approach

For every sample in Table 1, an EOS model matching the experimental results of all available PVT laboratory experiments (CCE, DL, CVD, and separator tests) was derived. For consistency, all EOS models were developed using the Peng and Robinson (1976) EOS with volume shift correction (three-parameter EOS) (Fattah 2005). The procedure suggested by Coats and Smart (1986) was followed to match the laboratory results. The developed EOS model for each sample was then used to generate the MBO PVT properties at different separator conditions using the Whitson and Torp (1983) procedure. The MBO PVT properties include the four functions required for MBO simulation (Rv, Rs, Bo, and Bg). Our database of Rv data, generated with the PVTi module of Eclipse, consists of 1850 points from eight different gas condensate samples and 1180 points from five volatile oil samples.

Rv models using artificial intelligence techniques

Three Matlab codes, one for each tool, were written to develop these AI models; each reads the provided input data in Excel format and preprocesses it before implementation. The ANN model and the other models were developed using 13 actual reservoir fluid samples. Table 2 presents a statistical analysis of the input data. The input parameters for the three models developed in this study are reservoir pressure (psi), reservoir temperature (°R), reservoir bubble point pressure (psi), oil density and gas density at stock tank conditions (lb/cu ft), and condensate yield (bbl/MMscf), while the output parameter is the oil–gas ratio, Rv.
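The Matlab preprocessing code itself is not listed in the paper; the Python sketch below, with made-up sample values, illustrates the kind of read-and-preprocess step described above: collecting the input records and summarizing each column, as done for Table 2.

```python
import math

# Hypothetical sample records (pressure psi, temperature degR, Pb psi,
# oil density lb/cu ft, gas density lb/cu ft, condensate yield bbl/MMscf);
# the values are illustrative only, not the study's data.
samples = [
    (5000.0, 660.0, 4500.0, 48.0, 0.10, 120.0),
    (3500.0, 640.0, 3000.0, 50.0, 0.08,  80.0),
    (6200.0, 700.0, 5800.0, 46.0, 0.12, 150.0),
]

def log_normalize(rows):
    # take log10 of every (positive) input to compress the wide ranges
    return [[math.log10(v) for v in row] for row in rows]

def column_stats(rows):
    # per-column (min, max), the kind of summary reported in Table 2
    cols = list(zip(*rows))
    return [(min(c), max(c)) for c in cols]

normalized = log_normalize(samples)
stats = column_stats(normalized)
```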

Table 2 Input data statistics

Results and discussion

The ANN model architecture, in terms of number of neurons, number of layers, and type of connection function, was determined by trial and error, as this proved the most successful approach in developing the model. Two neural network types were used: feedforward backpropagation (Type 1, Fig. 1) and trainable cascade-forward backpropagation (Type 2, Fig. 2). Different transfer functions were tested, and the log-sigmoid function was found to be the best. The best learning algorithm for training was that of Type 2. Several problems were faced during training. The model became trapped at a local minimum, causing training to stop. To overcome this problem, the maximum number of validation failures was increased to 300 so the search could reach the global minimum. To train the network, 70% of the data was used, while 15% was used for validation and the remaining 15% for testing.
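The training setup described above (log-sigmoid transfer function, a 70/15/15 split, and a large validation-failure limit before stopping) can be sketched as follows. This is an illustrative single-neuron Python example under assumed toy data, not the Matlab networks of this study.

```python
import math
import random

random.seed(0)

def logsig(x):
    # log-sigmoid transfer function, the one found best in this study
    return 1.0 / (1.0 + math.exp(-x))

# toy data: learn y = logsig(2x - 1), which a single neuron can represent
data = [(x / 50.0, logsig(2 * (x / 50.0) - 1)) for x in range(100)]
random.shuffle(data)
n = len(data)
train = data[:int(0.70 * n)]                 # 70% training
valid = data[int(0.70 * n):int(0.85 * n)]    # 15% validation
test = data[int(0.85 * n):]                  # 15% testing

# gradient-descent training; stop when validation error fails to improve
# for `patience` consecutive epochs (cf. the 300 validation failures above)
w, b, lr, patience = 0.0, 0.0, 0.5, 300
best_err, fails = float("inf"), 0
for epoch in range(20000):
    for x, y in train:
        p = logsig(w * x + b)
        grad = (p - y) * p * (1 - p)         # gradient of squared error
        w -= lr * grad * x
        b -= lr * grad
    err = sum((logsig(w * x + b) - y) ** 2 for x, y in valid)
    if err < best_err - 1e-12:
        best_err, fails = err, 0
    else:
        fails += 1
        if fails >= patience:
            break

test_mse = sum((logsig(w * x + b) - y) ** 2 for x, y in test) / len(test)
```

A larger patience keeps training alive through flat stretches of the validation curve, which is the role the increased validation-failure limit played here.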

Fig. 1

Feedforward backpropagation, Type 1

Fig. 2

Trainable cascade-forward backpropagation, Type 2

Table 3 summarizes the results, indicating that the Type 2 neural network gives better predictions using three hidden layers. Using more hidden layers would increase the accuracy further, but at the cost of computation time.

Table 3 Neural network results

Table 4 compares the statistics of the Rv correlation (Fattah 2005) with the new models generated in this work. From this table, one can easily see that the NN and SVM models give the best match, with the lowest average absolute errors of 0.1496 and 0.1222%, respectively. To validate the developed models, a super test was performed on unseen data. According to this test, SVM gives the most accurate predictions, with an average relative error of 0.121%, followed by the NN model with an average relative error of 0.313%. FN was the worst model, with an average relative error of 27.3%.
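The ranking statistic used here, the average absolute relative error in percent, can be computed as in this short sketch (illustrative Python with made-up values, not the paper's code):

```python
def avg_abs_relative_error(actual, predicted):
    # mean of |predicted - actual| / |actual|, expressed in percent --
    # the statistic used in Table 4 to rank the models
    return 100.0 * sum(abs(p - a) / abs(a)
                       for a, p in zip(actual, predicted)) / len(actual)

# toy example: two Rv-like values with 1% error each
err = avg_abs_relative_error([100.0, 200.0], [101.0, 198.0])
```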

Table 4 Statistical comparison of all models and correlations

Figures 3 and 4 present the results of the Type 1 neural network using two and three hidden layers, respectively, while Figs. 5 and 6 display the results of the Type 2 network.

Fig. 3

Results of type-1 neural network using two hidden layers

Fig. 4

Results of type-1 neural network using three hidden layers

Fig. 5

Results of type-2 neural network using two hidden layers

Fig. 6

Results of type-2 neural network using three hidden layers

Functional networks were applied using 70% of the data for training and 30% for testing. Since the ranges of the input parameters differ considerably, as shown in Table 2, the logarithms of the input parameters were taken as a normalization step to improve the accuracy of the FN technique. This tool gives correlation coefficients of 0.965 for training and 0.962 for testing. Figure 7 shows the correlation between the predicted and actual Rv for training, which shows a good match, while Fig. 8 shows the same for testing. Similarly, Fig. 9 displays the cross-plot of predicted versus actual Rv for training, whereas Fig. 10 displays the cross-plot for testing. The results show good agreement, but not as good as the ANN results for both training and testing; although FN is a type of ANN, it does not predict clustered data as well as the ANN does.
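The goodness-of-fit figure quoted above is the correlation coefficient between the predicted and actual Rv values. A minimal Python sketch, assuming the standard Pearson definition, is:

```python
import math

def correlation_coefficient(actual, predicted):
    # Pearson correlation between actual and predicted values, the
    # goodness-of-fit statistic reported for training and testing
    n = len(actual)
    ma = sum(actual) / n
    mp = sum(predicted) / n
    cov = sum((a - ma) * (p - mp) for a, p in zip(actual, predicted))
    sa = math.sqrt(sum((a - ma) ** 2 for a in actual))
    sp = math.sqrt(sum((p - mp) ** 2 for p in predicted))
    return cov / (sa * sp)
```

A value near 1 corresponds to the tight cross-plots of Figs. 9 and 10.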

Fig. 7

Predicted and actual Rv as a function of data points using FN for training

Fig. 8

Predicted and the actual Rv as a function of data points using FN for testing

Fig. 9

Cross-plot of Rv using FN for training

Fig. 10

Cross-plot of Rv using FN for testing

SVMs with different kernel functions (polynomial, Gaussian, polyhomog, htrbf, and rbf) were tried, using 70% of the data for training and 30% for testing. It was noted that for this type of data only the polynomial and Gaussian kernel functions worked. This technique gives correlation coefficients of 0.995 for training and 0.999 for testing. Figure 11 shows the correlation between the predicted and actual Rv for training, which shows a good match, while Fig. 12 shows the same for testing. Similarly, Fig. 13 displays the cross-plot of predicted versus actual Rv for training, whereas Fig. 14 displays the cross-plot for testing. The results show better prediction than FN, but not as good as the ANN results for both training and testing. The advantage of SVM over ANN is its shorter run time.
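The paper does not list its SVM code. As an illustrative stand-in, the Python sketch below uses kernel ridge regression, a close relative of SVM regression, to show how the polynomial and Gaussian (RBF) kernels enter the prediction; all names and toy values here are assumptions for illustration.

```python
import math

def poly_kernel(x, z, degree=2):
    # polynomial kernel, one of the two kernel choices that worked here
    return (1.0 + x * z) ** degree

def gaussian_kernel(x, z, gamma=1.0):
    # Gaussian (RBF) kernel, the other working choice
    return math.exp(-gamma * (x - z) ** 2)

def solve(a, b):
    # small dense linear solver (Gaussian elimination with pivoting)
    n = len(b)
    a = [row[:] for row in a]
    b = b[:]
    for i in range(n):
        p = max(range(i, n), key=lambda k: abs(a[k][i]))
        a[i], a[p] = a[p], a[i]
        b[i], b[p] = b[p], b[i]
        for k in range(i + 1, n):
            f = a[k][i] / a[i][i]
            a[k] = [u - f * v for u, v in zip(a[k], a[i])]
            b[k] -= f * b[i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (b[i] - sum(a[i][j] * x[j] for j in range(i + 1, n))) / a[i][i]
    return x

def kernel_fit(xs, ys, kernel, lam=1e-6):
    # dual coefficients alpha = (K + lam*I)^-1 y; prediction is a
    # kernel-weighted sum over the training points, as in SVM regression
    k = [[kernel(xi, xj) + (lam if i == j else 0.0)
          for j, xj in enumerate(xs)] for i, xi in enumerate(xs)]
    alpha = solve(k, ys)
    return lambda x: sum(al * kernel(xi, x) for al, xi in zip(alpha, xs))

xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [x ** 2 for x in xs]                  # toy target
model = kernel_fit(xs, ys, poly_kernel)
```

Swapping `poly_kernel` for `gaussian_kernel` changes only the similarity measure, which is why trying several kernels, as done in this study, is cheap.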

Fig. 11

Predicted and actual Rv as a function of data points using SVM for training

Fig. 12

Predicted and the actual Rv as a function of data points using SVM for testing

Fig. 13

Cross-plot of Rv using SVM for training

Fig. 14

Cross-plot of Rv using SVM for testing

Figure 15 presents a further comparison between the NN and SVM models in terms of average relative error percent over the pressure range, showing that SVM clearly outperforms NN.

Fig. 15

Error comparison between neural network and support vector machine models

Conclusions

  • The artificial neural network, support vector machine, and functional network techniques are effective for estimating the oil–gas ratio, Rv.

  • The input and output parameters were preprocessed using log normalization, which gave better results for the FN and SVM techniques.

  • Support vector machines give better results than functional networks, with average correlation coefficients of 0.9970 and 0.9935, respectively.

  • Since the data analysis indicated that most of the input and output parameters are clustered, the ANN and SVM models give the best results, with average relative errors of 0.15 and 0.12%, respectively, because these models are more flexible in handling such data.

  • The super-test results also confirm these conclusions.