Machine learning accelerated approach to infer nuclear magnetic resonance porosity for a middle eastern carbonate reservoir

Mustafa, Ayyaz; Tariq, Zeeshan; Mahmoud, Mohamed; Abdulraheem, Abdulazeez

doi:10.1038/s41598-023-30708-7

Machine learning accelerated approach to infer nuclear magnetic resonance porosity for a middle eastern carbonate reservoir

Article
Open access
Published: 09 March 2023

Volume 13, article number 3956, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Machine learning accelerated approach to infer nuclear magnetic resonance porosity for a middle eastern carbonate reservoir

Download PDF

Ayyaz Mustafa¹,
Zeeshan Tariq²,
Mohamed Mahmoud³ &
…
Abdulazeez Abdulraheem³

1331 Accesses
Explore all metrics

Abstract

Carbonate rocks present a complicated pore system owing to the existence of intra-particle and interparticle porosities. Therefore, characterization of carbonate rocks using petrophysical data is a challenging task. Conventional neutron, sonic, and neutron-density porosities are proven to be less accurate as compared to the NMR porosity. This study aims to predict the NMR porosity by implementing three different machine learning (ML) algorithms using conventional well logs including neutron-porosity, sonic, resistivity, gamma ray, and photoelectric factor. Data, comprising 3500 data points, was acquired from a vast carbonate petroleum reservoir in the Middle East. The input parameters were selected based on their relative importance with respect to output parameter. Three ML techniques such as adaptive neuro-fuzzy inference system (ANFIS), artificial neural network (ANN), and functional network (FN) were implemented for the development of prediction models. The model’s accuracy was evaluated by correlation coefficient (R), root mean square error (RMSE), and average absolute percentage error (AAPE). The results demonstrated that all three prediction models are reliable and consistent exhibiting low errors and high ‘R’ values for both training and testing prediction when related to actual dataset. However, the performance of ANN model was better as compared to other two studied ML techniques based on minimum AAPE and RMSE errors (5.12 and 0.39) and highest R (0.95) for testing and validation outcome. The AAPE and RMSE for the testing and validation results were found to be 5.38 and 0.41 for ANFIS and 6.06 and 0.48 for FN model, respectively. The ANFIS and FN models exhibited ‘R’ 0.937 and 0.942, for testing and validation dataset, respectively. Based on testing and validation results, ANFIS and FN models have been ranked second and third after ANN. Further, optimized ANN and FN models were used to extract explicit correlations to compute the NMR porosity. Hence, this study reveals the successful applications of ML techniques for the accurate prediction of NMR porosity.

Estimation of NMR Total and Free Fluid Porosity from Seismic Attributes Using Intelligent Systems: A Case Study from an Iranian Carbonate Gas Reservoir

Article 28 March 2016

Towards Rapid Prediction of Nuclear Magnetic Resonance-Based Bimodal Porosities: An Example from the Middle Eastern Carbonate Reservoir

Article 27 June 2024

Adjusting porosity and permeability estimation by nuclear magnetic resonance: a case study from a carbonate reservoir of south of Iran

Article Open access 05 June 2018

Introduction

Porosity is of vital importance for describing the hydrocarbon reservoirs. It is used for various purposes such as hydrocarbon reserves estimation, describing the reservoir quality, and fluid flow properties within the oil/gas reservoirs. Total porosity is the combination of all types of porosity of the rock formation i.e. effective, isolated porosity and fractures porosity. Generally, it is assumed for the shale free formations (clean formations) and for carbonate formations that effective porosity is equal to total porosity of the formation however it is not true for all cases. Otherwise, correction has to be applied for shale content present in the rock formation as shale presence may lead to the overestimation of formation porosity. There are several techniques of porosity determination in the field and laboratory such like helium porosity in laboratory, mercury injection, nuclear magnetic resonance (NMR), x-ray computed tomography, density logs, sonic logs, neutron porosity. However, among all the techniques, NMR has been found the most reliable and robust technique for the accurate determination of accurate total porosity¹.

Various researcher investigated and proposed relationships for the porosity estimation in carbonate and other rock formations. Porosity is determined as the ratio between the pore volume (V_p) and bulk volume (V_b) of the rock formation. Porosity of the clean rock formation could be estimated using Eq. (1) proposed by Wyllie et al.².

$$\Phi = \frac{{\Delta T_{\log - } \Delta T_{ma} }}{{\Delta T_{f - } \Delta T_{ma} }}$$

(1)

where ΔT_log, ΔT_ma, and ΔT_f represent the sonic wave transit time of rock, rock matrix and fluid respectively, in µsec/ft.

Porosity could be estimated from the density as proposed by the Garmard and Poupon³.

$$\Phi = \frac{{\uprho_{ma - } \rho_{b} }}{{\uprho_{ma - } \rho_{f} }}$$

(2)

where ⍴_ma, ⍴_b, and ⍴_f represent the matrix density, bulk density and fluid density, respectively in g/cc.

The methods for porosity determination such as sonic, density and neutron, present the porosity as a function of lithology. On the contrary, NMR presents the total porosity as a function of hydrogen atoms exist in the fluid and does not depend on the lithology¹. Although NMR is believed to be a robust and reliable method of estimating the porosity however, factors such as oil viscosity and water salinity could influence the accuracy of NMR porosity⁴.

Timur⁵ introduced the producible porosity for the movable fluid in the rocks that is the ratio between the movable fluid and bulk volume of the rock formation. An empirical correlation was introduced by Timur⁶ for NMR porosity for carbonate rocks on the basis of free fluid index (FFI) data. Porosity was determined using magnetic resonance imaging logs (MRIL) during drilling by the Miller et al.¹. Another method was introduced by the Chang et al.⁷ to estimate the porosity with great accuracy using combinable magnetic resonance (CMR) tool and conventional logs for complex carbonate formations. High resolution NMR could be obtained using combinable magnetic resonance.

Estimation of total porosity and clay-bound water using NMR logs was also investigated by the Prammer⁸ and found a direct linear relation between transverse relaxation time and water content in clay. The accuracy of NMR porosity was investigated by Georgi et al.⁹ through comparing the NMR porosity with laboratory-based core porosity. Both porosities are found in good harmony with only 5% error between both measurements. Ehigie¹⁰ compared the different wireline logging tools used for estimating the porosity such like CMR, NMR and MRIL and noticed that MRIL measured porosity was not accurate and reliable in the areas of elliptical breakouts. Dual and triple porosity modal are the primary characteristics of carbonate rock formations which are caused by the presence of fractures, cavities and vugs.

NMR principle and applications

NMR is an advanced and non-destructive quantitative technique used for the evaluating the pore characteristics and fractures identification¹⁰. NMR measurements is also able to provide valuable information about the rock properties such as fluid type, fluid state, porosity, pore size distribution. It also provides the sufficient information about the free fluid index and bulk volume irreducible¹¹.

NMR measures the porosity by detecting the hydrogen nuclei present in the pore spaces. NMR measure the response of hydrogen nuclei of atoms possessing odd number of neutrons and protons under the influence of applied magnetic field at a certain frequency of resonance. The significantly strong angular momentum of hydrogen nuclei, a strong magnetic moment is developed in the protons due to their magnetic moment, and they start spinning like a magnet bar. The external magnetic field cause the polarization of hydrogen protons and a frequency, known as Larmor frequency, is induced in these protons. This induced frequency is quantitively measured as a quantum property. The NMR measurements also provide the precise estimation of the wettability in reservoir rocks¹².

The NMR porosity is basically derived from two types of time measurements: longitudinal (T₁) and transverse (T₂) relaxation times. The time required by relaxation of the radio frequency pulses to equilibrium energy state is regarded as T₁ whereas T₂ is the time required by the secondary magnetic field to loss its phase coherence which is the distinctive feature¹³. Three kinds of relaxation mechanisms control the T₂ relaxation time including diffusion-induced, bulk fluid mechanism, and surface interaction which are shown in Eq. (3).

$$\frac{1}{{{\mathrm{T}}_{2} }} = \frac{1}{{{\mathrm{T}}_{{2{\mathrm{B}}}} }} + \frac{1}{{{\mathrm{T}}_{{2{\mathrm{S}}}} }} + \frac{1}{{{\mathrm{T}}_{{2{\mathrm{D}}}} }}$$

(3)

where T_2D, T_2B and T_2S are the diffusion induced, bulk, and surface relaxation times, respectively. The diffusion relaxation is resulted from the diffusion of two magnetic fields gradients, the bulk relaxation is related to interaction among the dipole molecules of hydrogen, and the interaction between atoms and solid surfaces cause the surface relaxation¹⁴.

Machine learning applications for porosity determination

Various studies have been reported the machine learning applications for the determination of petrophysical properties such as^{13,15,16,17,18}. The log-based porosity of carbonate reservoirs was successfully predicted using the artificial intelligence (AI) approaches and validated through the core porosity. The performance of reported AI model was very good in terms of accurate prediction with an error of 8% and correlation coefficient of 0.98^13,15. Hamada and Elshafei¹⁶ developed an artificial neural network model for the prediction of permeability and porosity of tight gas sand reservoirs by using the resistivity and density logs, and NMR transverse relaxation time (T₂) as input parameters. The model was validated by comparing the results with core and NMR based porosity and permeability of gas sand reservoirs^11,16.

The NMR derived free-fluid porosity and vertical permeability were simulated and extrapolated using the two artificial intelligence approaches (fuzzy logic and artificial neural networks). Conventional logs data of two wellbores in Campos Basin, Brazil was used as model inputs. Predicted parameters were optimized through minimizing the errors by average and genetic algorithm approaches. Both techniques performed quite well with high accuracy and low errors¹⁷. NMR based permeability was calibrated with core permeability by applying the alternate conditional expectations algorithms to improve the accuracy of NMR based permeability estimation. The calibrated NMR permeability exhibited good harmony with the core permeability exhibited 0.87 correlation coefficient. Further, artificial neural networks model was further implemented to synthetically create the permeability and effective porosity in the un-logged zones or interval of sandstone reservoirs¹⁸.

NMR derived permeability and effective porosity were predicted with good accuracy through the artificial neural networks technique with Cuckoo optimization algorithm. Conventional logs such like resistivity, bulk density, sonic transit time and neutron porosity were used as model inputs to accurately predict the NMR derived porosity and permeability¹⁹. Although the model accuracy is good with low errors, however, this is a general case as there was no specific rock formation mentioned in this study. NMR log-based parameters were also predicted by Mohaghegh, using ANN for East Texas fields²⁰. However, prediction was based on well log data irrespective of rock formation or reservoir. Therefore, the model lacks to provide the accurate solution for specific type of reservoirs such as sandstone or carbonates reservoirs.

NMR log-based permeability was predicted using fuzzy-logic coupled with genetic algorithm optimization technique. Predicted permeability was then used to predict the well producibility of gas bearing siliciclastic reservoir. Model-based predicted permeability exhibited good correlation with experimentally measured core permeability²¹. A comparative study of different machine learning (ML) techniques such as conventional artificial neural networks, fuzzy decision tree, particle swarm optimization, genetic algorithm, imperialist competitive algorithm, and hybrid ML technique, was done to predict the log-based porosity and permeability of the oil reservoir in northern Persian Gulf. Comparison revealed that hybrid machine learning approach performed better than other techniques for predicting the permeability/porosity of oil reservoir in that area. The reliability of the model is reflected by good correlation with laboratory measured permeability/porosity. The model provides the conventional log-based permeability/porosity in general, however, the model is unable to provide information about the porosity and permeability of different geologic formations²².

A new machine learning approach was introduced by Ahmadi et al.²² for the accurate prediction of permeability and porosity using fuzzy logic (FL-GA) and least square support vector machine (LSSVM) in combination with genetic algorithm. Conventional logs data and NMR parameters, acquired from northern Persian Gulf oil fields, were used as input and output parameters. However, the study provided the general solution without mentioning any geologic formation or type of reservoir²³.

Although various studies reported the application of different artificial intelligence techniques for the prediction of petrophysical properties. However, the previously reported AI models were developed are either limited to the rock formations other than carbonate rocks (i.e. sandstone) or no rock formation was explicitly mentioned in the studies. One of the previous studies¹⁴ reported the application of AI for porosity in carbonate rocks using log data. Hence, this study provides the improved and accurate prediction models for porosity in carbonate reservoirs using NMR porosity data. To the best of author’s knowledge, the application of ML for NMR total porosity of the carbonate rock formations has not been reported previously.

As discussed earlier, NMR provides the porosity with great accuracy and reliability as compared to conventional logs however, NMR logging is very expensive and difficult to run, so it may not be available for all the wells. Owing to the importance of NMR porosity, it is deemed necessary to have the NMR porosity for all the wells. So, ML based models for the prediction of the NMR porosity for carbonates reservoirs would be of great significance and enable to obtain the NMR porosity for the wells where NMR logs are not run. In this study, three robust ML approaches including artificial neural networks (ANN), adaptive neuro-fuzzy inference system (ANFIS), and functional networks (FN) were used to develop the prediction models for NMR porosity for carbonates reservoirs. All three models exhibited very good accuracy in terms of high correlation coefficient and low errors for the testing and validation of the model.

Modelling approaches: ANN, ANFIS and FN

Artificial neural network (ANNs)

Artificially neural networks (ANN) is one of the widely used computational intelligence techniques for solving prediction problems, data mining, approximation, and pattern recognition^24,25. Several types of ANN are being used however, feedforward and backpropagation are more frequently used algorithms of ANN used for training and prediction²⁶. ANN uses different learning algorithms, transfer functions and network functions to solve the given prediction problem by delivering the required output prediction.

ANN resolves the given problem by imitating the assemblies and function of the human nervous system. There is an explicit architectural network forms the elements called artificial neurons which are selected on the basis of the nature of the given engineering problem²⁷. The artificial neural networks used huge algorithms’ set to establish the connections among the nonlinear variables in order to provide precise, reliable, and consistent prediction results²⁸. In general, two kinds of ANN are being employed such as feed-forward neural networks (FFNN) and feed-back neural network (FBNN). FFNN is the simplified and elementary ANN architecture comprised of perceptron of interconnected layers to develop a unidirectional process of transferring information in forward direction. The information of input functions is transferred nonlinearly through the hidden layer activation function in such a way that precise and relevant feature of output is determined²⁹. The second type is also popular and extensively used for ANN applications which has identical architectural features as FFNN with additional feature of creating a back loop. The back loop keeps refining the output variable by sending the error information back to adjust the weights until errors are no longer be improved²⁹.

The nodes of ANN network interact with each other through a developed pathways with multiple interconnections. Neural network framework may work with one hidden layer of neurons with assigned weights to each node. The input variables are fed as vectors during training phase. The weights are fine-tuned through gradient descent after the computed errors of output are looped back in FBNN. The processing is reiterated until no further improvement in weights and bias is obtained. The weights are updated using gradient descent on the basis of error function derivatives as expressed mathematically in Eq. (4)²⁹.

$$\Delta {\mathrm{w}}\left( {\mathrm{t}} \right) = \eta *\nabla {\mathrm{E}}*\left( {\mathrm{t}} \right) + \alpha *\Delta {\mathrm{w}}\left( {{\mathrm{t}} - {1}} \right)$$

(4)

where E, η, α, Δw, represent the error observed, updated weight, momentum and learning parameters, respectively²⁸.

The comparison among different supervised machine learning algorithms revealed that ANN has several advantages over the other algorithms²⁸.

1.
ANN is capable of dealing with the non-linear and sophisticated relationships between output and input parameters.
2.
ANN works efficiently for the problem with high dimensions
3.
Complicated functions and non-limiting classification groups containing input and output variables are exhibited by ANN.

The risk of under- and over-fitting is minimized through the strong tuning ability of ANN. In this study, a fully connected feed-forward neural network is used to predict the NMR porosity (Φ_NMR) using 3594 data points. Training dataset contains seventy percent data points (2516) which were selected randomly to develop the prediction correlation of the model. Testing and validation were done using the rest thirty percent data points (1078).

Adaptive neuro-fuzzy inference system (ANFIS)

Adaptive neuro-fuzzy inference system (ANFIS) is an integration of ANNs and fuzzy logic (FL) and capable of learning and reasoning of both techniques (ANNs and FL) simultaneously. ANFIS used the fuzzy logic theory to formulate the mapping of the input and output layers. Neural networks are used to regulate the mapping parameters by leaning functions³⁰. Many engineering problems are resolved through the application of adaptive neuro-fuzzy inference system (ANFIS). Accurate prediction models are developed using complicated and nonlinear algorithms of ANFIS which are structured between input and output parameters^31,32,33.

Functional networks (FN)

The functional network is the modified and straightforward edition of conventional neural networks^34,35,36. The FN is a type of supervised machine learning approach mainly utilized for resolving the regression and function approximation complications³⁷. The FN model is developed using FN algorithm which undergoes the learning process from the domain and data information. Three major components including input storing unit, muti-layer processing unit, and output storing unit, are involved in the development of FN prediction model. Algorithm determines the modelling parameters itself such as network structure, topology, and neural functions in contrast to the conventional neural networks that work on the pre-defined and fixed number of neurons. Further, the connections between the neurons (biases and weights) are learned and determined during the training process of the neural network modelling³⁸. On the other hand, number of neural functions is susceptible and changed during the parametric and structural learning process.

The workflow diagram encompasses the major phases of ML models’ development in this study are illustrated in Fig. 1. Three major components of the study are data analytics, machine learning modelling process and development of models.

Performance measures

The average absolute percentage error (AAPE), root mean squared error (RMSE), and coefficient of correlation (R) were determined for the predicted values to evaluate the accuracy and reliability of the models. In addition, co Mathematical relations of AAPE and RMSE are shown in Eqs. (5) and (6). Moreover, correlation coefficient (R) was determined as a scale of accuracy for models as well.

$${\mathrm{AAPE}} = \left| {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {\frac{{\beta_{a} - \beta_{p} }}{{\beta_{a} }}} \right)} \right|$$

(5)

$${\mathrm{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{n} [\beta_{a} - \beta_{p} ]^{2} )}}{n}}$$

(6)

where ${\beta }_{p}$ and ${\beta }_{a}$ are the predicted and actual values respectively and ‘n’ represents the total number of data points.

The study goals are to provide the convenient, quick, and cost-effective solution for reliable and accurate determination of porosity in complex carbonate reservoirs using conventional log data. The three widely accepted ML approaches including adaptive neuro-fuzzy inference system (ANFIS), artificial neural networks (ANN), and functional network (FN), were implemented to predict the total NMR porosity (Φ_NMR) through quickly and conveniently and readily available well logs. Machine learning models were developed using conventional wireline logs from carbonate reservoirs. This study offers three ML prediction models for total porosity estimation which is an essential parameter that have strong influence on hydrocarbon storage in the reservoirs.

Data analysis

Log data, used in this study was acquired from oil wells in carbonate reservoirs containing 3594 data points. Log data such as gamma ray (GR), caliper (Cal), density (ρ), neutron porosity (Φ_N), and photoelectric factor (PE) were used as input parameters, whereas total porosity (Φ_NMR) measured from NMR logs is used as output for the ML prediction models. Datasets used for training and testing of ML models are shown in Figs. 2 and 3, respectively.

Input parameters exhibited quite good correlation with output parameter NMR porosity (Φ_NMR) specially density (ρ), neutron porosity (Φ_N), and photoelectric factor (PE) as reflected by the correlation coefficient (R) (Fig. 4). The strongest correlation was observed for ρ exhibiting R value of -0.64 with Φ_NMR and least value was observed for GR case. However, gamma ray has a strong physical relation with rock porosity as it provides the valuable information about the subsurface lithologies, clay volume, other reservoirs characteristics. Due to the geological significance of gamma ray it was included as input parameter. The PE and Φ_N also demonstrated good R values (-0.58 and 0.45 respectively) with Φ_NMR making them strong candidates for the model input parameters. GR and Cal were also incorporated as the input parameters of models based on their R values (0.17 and 0.30 respectively) and model simulation with different input parameters. Coefficients of correlation and cross-plots between input and output parameters are demonstrated in Figs. 4 and 5, respectively. Furthermore, background knowledge of rock science domain was also used to select the final input variables. Negative correlations were observed for density (ρ), and photoelectric factor (PE) whereas gamma ray (GR), neutron porosity (Φ_N), and caliper (borehole diameter) have strong positive correlation with NMR porosity (Φ_NMR). The brief overview of statistical parameters for training dataset (input and output) is shown in Table 1.

Table 1 Statistical indicators for training dataset.

Full size table

Data points of all input parameters are crowded near the mean values exhibiting low variability which is reflected by their low standard deviation. However, higher data variability in GR data is demonstrated by its standard deviation (20.89). GR data points are distributed over a broad range instead of clustering around the mean value. Skewness and kurtosis illustrated the measure of shape of data points. Kurtosis of both training and testing datasets representing almost similar trends. Parameters such as GR, Φ_N, ρ, and Φ_NMR, exhibited a platykurtic spread of the data points as reflected by the lower values of kurtosis. The data of these parameters exhibited light tails compared to the normal distribution that represents good quality data with less likelihood of outliers exist in the datasets. Data points of these parameters demonstrated broader and lower peak with lack of extreme values. On the contrary, data points of input parameters such as PE and Cal exhibited a wide distribution with heavier tails. Sharper and higher peaks of data points are observed for these parameters lead to the leptokurtic data distribution. Horizontal axis of histogram is stretched longer, however, most of the data points appear in a skinny (narrow) vertical range. One of the reasons of leptokurtic distribution of Cal and PE is the non-uniform borehole profile representing wash out at various locations in the boreholes.

Different degree of skewness is reflected by the input and output parameters. Almost, similar behavior in terms of skewness is observed for both the datasets (training and testing) as shown in Tables 1 and 2. Data distribution is fairly symmetrical for GR and Φ_NMR parameters with very small skewness. Most of the data points lie around the mean values for these two parameters. Two parameters Φ_N, and ρ demonstrated a moderate positive and negative skewness respectively. Most of the data points of ρ are less than the mean value exhibiting non-symmetry in the data points whereas; dataset Φ_N is dominated by the values higher than the average values. Notably, high skewness was observed for Cal and PE. Cal data contains the values higher than the mean values as reflected by the high negative skewness. On the other hand, high positive skewness is the characteristics of PE dataset indicating that mean value is higher than majority of the data points (Tables 2 and 3). Characteristics of skewness and kurtosis could be observed in the histograms of the input parameters as shown in Fig. 6.

Table 2 Statistical indicators for testing dataset.

Full size table

Table 3 Biases and weights connecting the input, hidden, and output layers of ANN Model.

Full size table

Models optimization and hyperparameters tuning

ANN model

Artificial neural networks program (ANN) was run at different selected parameters such as number of hidden layers’ neurons, input parameters, and number of realizations in order to optimize the prediction model by minimizing the errors and improving the ‘R’. The number of folds were used for cross-validation. Tangent sigmoidal was selected as an activation function between input and hidden layer and pure linear activation function between hidden and output layer. The number of hidden layers were selected by hit and trial method and Levenberg–Marquardt was used as a learning function. After the optimization of hidden layers in the model, model was run for 100 realizations in order to capture the non-distinctiveness in the dataset. The AAPE errors of ‘training’ and ‘testing and validation’ of ANNs model were compared as shown in Fig. 7. After optimization of the model at different number of realizations and hidden layers neurons, the optimized results were obtained at 13 neurons and 94 realizations exhibiting least errors. Topology of ANN model is shown in Fig. 8. Table 3, shows the biases and weights of the input, hidden, and output layers of the AAN model.the mathematical form of the model is as follows.

$$\left( {\Phi_{NMR} } \right)_{N} = \mathop \sum \limits_{q = 1}^{{H_{n} }} W_{qr} n_{hq} + b_{r}$$

(7)

$$n_{hq} = f \left(\mathop \sum \limits_{p = 1}^{{N_{p} }} W_{pq} i_{p} + b_{q}\right)$$

(8)

$$n_{hq} = f\left( {(W_{1q} (GR)_{N} + W_{2q} \left( {Cal} \right)_{N} + W_{3q} (\Phi_{N} )_{N} + W_{4q} \left( \rho \right)_{N} + W_{5q} \left( {{\mathrm{PE}}} \right)_{N} + b_{q} } \right)$$

(9)

$${\mathrm{f}}\left( {\mathrm{x}} \right) = \frac{2}{{1 + e^{ - 2x} }} - 1 = {\mathrm{tanh}}\left( {\mathrm{x}} \right)\;\left( {{\mathrm{Hyperbolic}}\;{\mathrm{Tangent}}\;{\mathrm{Sigmoid}}\;{\mathrm{Transfer}}\;{\mathrm{Function}}} \right)$$

(10)

However, normalization was done prior to ANN simulations (using Eq. 11), and de-normalization was done for final output of the model. Equation (17).

$$Inputs = \frac{{({\uppsi }_{\max } - {\uppsi }_{\min } )\left( {i - i_{\min } } \right)}}{{(i_{\max } - i_{\min } )}} + {\uppsi }_{\min }$$

(11)

where ${\uppsi }_{\min }$ and ${\uppsi }_{\max }$ are – 1 and 1 respectively. For the minimum and maximum input values, please refer to Table 2. The input parameters after normalization, are shown below from Eqs. (12)–(16).

$$\left( {GR} \right)_{N} = 0.012 \left( {\left( {GR} \right) - 17.74} \right) - 1$$

(12)

$$\left( {Cal} \right)_{N} = 1.94 \left( {\left( {Cal) - 8.04} \right)} \right) - 1$$

(13)

$$\left( {\Phi_{N} } \right)_{N} = 0.09 \left( {\Phi_{N} - 2.96} \right) - 1$$

(14)

$$\left( \rho \right)_{N} = 5.13 \left( {\rho - 2.45} \right) - 1$$

(15)

$$\left( {PE} \right)_{N} = 0.732 \left( {PE - 2.58} \right) - 1$$

(16)

Equation (17) showing the relationship used for de-normalization of the model output values.

$$Output = \frac{{({\uppsi }_{\max } - {\uppsi }_{\min } )\left( {i - i_{\min } } \right)}}{{(i_{\max } - i_{\min } )}} + {\uppsi }_{\min }$$

(17)

The values of $i_{\min }$ and $i_{\max }$ used for de-normalization are -1 and 1 respectively. The maximum (${\uppsi }_{\max }$) and minimum (${\uppsi }_{\min }$) values of the output data are 10.09 and 2.07, respectively. For the proposed model, the de-normalization could be shown as shown in Eq. 18.

$$\left( {\Phi_{NMR} } \right) = 3.68 \left( {\left( {\Phi_{NMR} } \right)_{N} + 1} \right) - 2.73$$

(18)

The weights and biases of input, hidden and output layers of ANN prediction model are mentioned in the Table 3.

ANFIS model

The ANFIS model was developed for the prediction of ‘Φ_NMR’ using the same data points for training and testing. The model was simulated at several different numbers and types of membership functions to achieve the optimized prediction of variable. Though, two membership functions resulted the optimized predicted values after the model was run for different number of membership functions. The optimized results were obtained for gaussian and linear types of membership functions that connect the input, intermediate and output layers.

FN model

The FN model was run to establish a prediction correlation for ‘Φ_NMR’ in addition to ANN and ANFIS techniques. The model simulation was performed using different methods of FN such as Backward-Forward (BF), Forward–Backward (FB), Backward-Elimination (BE), Forward-Selection (FS), and Exhaustive Search (ES) in order to achieve the best prediction results. After the number of runs, ES was selected to for the model development as it demonstrated lowest errors and highest precision in terms of ‘R’. The ES method was executed at linear and non-linear types of functions. The best results with lowest errors and highest ‘R’ were obtained with ES and non-linear types of FN.

Generally, the optimal neural function is obtained through the parametric learning process using a known group of Fourier, polynomial, and trigonometric functions. The study generated a exclusive structure of FN model for ‘Φ_NMR’ prediction model. Arrangement of estimated neural functions present a focused parameter which is to be predicted. A set of input parameters including GR, Cal, ${\Phi }_{N}$, $\rho$, and PE were regarded as neural functions. The following Eq. (19) showing the parametric learning of this study.

$${\Phi }_{NMR} = {\text{ g}}\left( {{\text{GR}},{\text{ Cal}},{ }\Phi_{N} ,{ }\rho ,{\text{ PE}}} \right) = \mathop \sum \limits_{i = 1}^{n} {\upsigma }_{i} {\text{g}}_{i} \left( {{\text{GR}},{\text{ Cal}},{ }\Phi_{N} ,{ }\rho ,{\text{ PE}}} \right)$$

(19)

where N, n, σ, represent the size of data set, size of network, and coefficients of neural functions, respectively. The neural functions are determined during the training of the data set. For this study, polynomial group is selected which is denoted by ${1},x,x^{{2}} , \ldots ,x^{h}$ where degree of polynomial is denoted by ‘h’.

The architectural design of the functional network consists of five input parameters and five neural functions. Polynomial function (degree, h = 2) is used to acquire these neural functions. The mathematical relation for the ‘Φ_NMR’ prediction models is illustrated in Eq. (20).

$${\Phi }_{NMR} = \mathop \sum \limits_{i = 1}^{k} \omega_{i} \left( {GR} \right)^{{\sigma_{1} }} \left( {Cal} \right)^{{\sigma_{2} }} \Phi_{N}^{{\sigma_{3} }} \rho^{{\sigma_{4} }} \left( {PE} \right)^{{\sigma_{5} }}$$

(20)

where number of neural functions and corresponding neural functions are denoted by p and σ (σ₁, σ₂, σ₃, σ₄, σ₅) respectively. The numerical values of neural functions and coefficients are for the ‘Φ_NMR’ model are mentioned in the Table 4.

Table 4 Neural functions and respective coefficients.

Full size table

Likewise other two models, a total of 3594 data points, acquired from different wells, were used for the FN prediction model. Out of total data points, seventy percent (70%) were used for building the model while rest of the 30% were utilized for the evaluation of model’s consistency and reliability.

Results and discussions

Three ML models were developed to predict NMR total porosity (Φ_NMR) in carbonate reservoirs using artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS), and functional network (FN) techniques. For all the prediction models, training of the prediction model was executed using seventy percent of the dataset (seen dataset) and testing and validation of the model were done using thirty percent of the data points (unseen dataset). The datasets acquired from different wells in the middle eastern basins. All three ML techniques were run using different parameters, hyper-parameters, and training algorithms to optimize the prediction results.

The ANN model exhibited the AAPE of 5.12, 5.0, and 5.04 for the unseen (testing and validation), seen (training) and overall datapoints respectively indicating high accuracy of the predicted values. The ‘R’ of the predicted values was found to be 0.95, 0.0.94, and 0.944 for unseen (testing and validation), seen (training), and overall data points respectively that reflected the reliability and consistency of the model predicted values. Further, model showed the RMSE of testing and training datasets as 0.39 and 0.39, respectively.

Accuracy of the ANFIS model is quite good with AAPE of 5.38, 5.08, and 5.18 for unseen (testing and validation), seen (training), and overall data points, respectively. The RMSE was found to be 0.39 and 0.41 for seen and unseen data points respectively which affirms the robustness of the prediction model. The ‘R’ of the ANFIS model for the unseen and seen data points was 0.937 and 0.942.

The FN model predicted ‘Φ_NMR’ values exhibited 5.84 and 6.06 AAPE for the training (model building) and testing datasets when compared to actual data set. The RMSE and ‘R’ for the training (seen) data points were determined as 0.47 and 0.942, and for the unseen (testing and validation) data points as 0.48 and 0.942, respectively. The FN model was also found robust and reliable in terms of accuracy however, the error indices are slightly higher as compared to the other two ML techniques i.e. ANN and ANFIS used in this study. The accuracy indices for all three prediction models are shown in Table 5 in terms of AAPE, RMSE, and ‘R’.

Table 5 Accuracy indicators for the ANN, FN and ANFIS models.

Full size table

Comparison between the ANN, ANFIS, and FN models

Accuracy indicators

The cross plots between the predicted and actual data points exhibited high coefficient of determination (R²) for both seen (training) and unseen (testing and validation) data points as shown in Fig. 9. Among all, the ANN model demonstrated the highest ‘R²’ values 0.902 and 0.90 between actual and predicted ‘Φ_NMR’ for the unseen (testing and validation) and seen (training) data points, respectively. The R² is an excellent indicator for the reliability and accuracy of the predicted values. Comparatively, a slightly lesser ‘R²’ were exhibited by the ANFIS model for testing (0.879) and training (0.887) data points (Fig. 9). Likewise, the training and testing datasets of FN model exhibited very good ‘R²’ value i.e., 0.889 and 0.888, respectively. However, R² of ANFIS prediction is slightly lesser than ANN and FN models. FN prediction is slightly better than ANFIS model in terms of R². Overall prediction performance of ANN is found to be better than ANFIS and FN. Consequently, all three ML models were found to be robust, steadfast, and consistent as reflected by the R² for the given parameters and their ranges used for model development.

Furthermore, prediction errors (AAPE and RMSE) demonstrated the performance of all three ML prediction models (ANN, ANFIS and FN) to be reliable, consistent, and accurate for the prediction of ‘Φ_NMR’ with a slight difference in the performance of these models. ANN model outperformed the ANFIS and FN models in terms of lowest AAPE (5.0) and RMSE (0.39) and highest ‘R’ (0.959) for the testing and validation dataset. The ANN model performance for training and testing is shown in Fig. 10. The training and testing performances of ANFIS and FN models could be seen in Figs. 11 and 12 comparing model predicted ‘Φ_NMR’ and the actual ‘Φ_NMR’ data. The ANFIS model provides slightly improved prediction compared to FN as demonstrated by prediction errors AAPE (5.08) and RMSE (0.41) for model testing and validation. The AAPE and RMSE were observed to be 6.06 and 0.48 respectively, for the testing and validation of the FN model. The predicted ‘Φ_NMR’ values are in excellent harmony with the actual data that affirm the applicability and viability of these models for other data sets as well. The performance of ANFIS and FN models is almost the same. ANN model was found to be the most powerful and accurate technique for the prediction of ‘Φ_NMR’. Therefore, the explicit relation developed for the ANN can be used with confidence to estimate the ‘Φ_NMR’ using any data set.

All three models were checked thoroughly before the conclusion is drawn. The ANN model was found to be more consistent at both lower and high porosity values with low errors which testify the better performance of ANN model as compared to ANFIS and FN models. however, a marginal difference confirmed that all three models are able to provide the reliable prediction values for NMR porosity.

Hence, all thee ML models can be used for the prediction of ‘Φ_NMR’ using the same parameters and their ranges. The predicted and actual comparison for all three models is shown in Figs. 10, 11, 12 and 13. The errors and ‘R²’ for these ML models are depicted in Figs. 13 and 14, respectively.

Summary and conclusions

In carbonate reservoirs, accurate determination of total porosity of the reservoir has always been a great challenge because of the extremely heterogeneous nature of carbonate pore system. The main motivation of this study is to overcome the challenges for accurate determination of NMR porosity using conventional well logs. Three ML techniques including adaptive neuro-fuzzy inference system, functional networks, and artificial neural networks were employed to build the prediction models. These proposed models were found to be consistent, robust, and viable for precise estimation of NMR-based total porosity in carbonate reservoirs using conveniently available conventional logs. The conclusions of the study are as follows:

1.
The proposed ML models would be able to predict the NMR porosity with great accuracy and reliability using readily accessible standard logs such as gamma ray, density, neutron porosity, caliper, and photoelectric factor.
2.
The developed models would perform excellent within the range of the input parameters. Further, the ANN-based explicit correlation could be applied confidently to compute the porosity using any other data set that contains the same parameters and ranges. The model can be easily recalibrated to predict the parameters outside the given ranges, once the data is available.
3.
All three proposed ML models demonstrated excellent performance and robustness for the prediction of NMR porosity in terms of low AAPE and RMSE errors and high coefficient of correlation.
4.
Overall, the performance of ANN prediction model was found to be the best for predicting NMR porosity as compared to the ANFIS and FN models.
5.
The proposed ML models exhibited excellent harmony in predicting the porosity for the testing and validation data points.
6.
The acquisition of NMR porosity is an expensive and time-consuming job. Thus, developed ML models would provide an intelligent solution to address the problem and make the operations cost-efficient and time-saving.
7.
The explicit equation based on ANN trained paradigm would definitely provide a convenient way to the computation of the total porosity without prior knowledge of ML coding language.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Miller, M. N., Paltiel, Z., Gillen, M. E., Granot, J. & Bouton, J. C. Spin echo magnetic resonance logging: porosity and free fluid index determination. In SPE Annual Technical Conference and Exhibition. Society of Petroleum Engineers. https://doi.org/10.2118/20561-MS (1990).
Wyllie, M. R. J., Gregory, A. R. & Gardner, G. H. F. An experimental investigation of factors affecting elastic wave velocities in porous media. Geophysics 23, 459–493. https://doi.org/10.1190/1.1438493 (1958).
Article ADS Google Scholar
Gaymard, R. & Poupon, A. Response of neutron and formation density logs in hydrocarbon-bearing formations. SPWLA (1968).
Elsayed, M. et al. A review on the applications of nuclear magnetic resonance (NMR) in the oil and gas industry: laboratory and field scale measurements. J. Pet. Exp. Prod. Tech. 12, 2747–2784 (2022).
Article Google Scholar
Timur, A. Producible porosity and permeability of sandstone investigated through nuclear magnetic resonance principles. SPWLA (1969).
Timur, A. Nuclear magnetic resonance study of carbonate rocks. Society of Petrophysicists and Well-Log Analysts (1972).
Chang, D., Vinegar, D., Morriss, H. J. & Straley, C. Effective porosity, producible fluid and permeability in carbonates from NMR logging. SPWLA (1994)
Prammer, M. G. NMR pore size distributions and permeability at the well site. In SPE Annual Technical Conference and Exhibition, Society of Petroleum Engineers. https://doi.org/10.2118/28368-MS (1994).
Georgi, D.T., Shorey, D.S. & Ostroff, G.M. Integration of NMR and conventional log data for improved petrophysical evaluation of Shaly Sands. SPWLA, Oslo (1999).
Ehigie, S. O. NMR-openhole log interpretation: Making the most of NMR data deliverables. In Presented at SPE Nigeria Annual International Conference and Exhibition. https://doi.org/10.2118/136971-MS (2010).
Mustafa, A., Mahmoud, M. A. & Abdulraheem, A. A review of pore strcuture characterization of unconventional tight reservoirs. In SPE Abu Dhabi International Petroleum Exhibition and Conference (ADIPEC), Abu Dhabi. SPE-197825-MS (2019).
Otchere, D. A., Mohammad, M. A. A., Ganat, T. O. A., Gholami, R. & Merican, Z. M. A. A novel empirical and deep ensemble super learning approach in predicting reservoir wettability via well logs. Appl. Sci. 12, 2942 (2022).
Article CAS Google Scholar
Golsanami, N., Sun, J. & Zhang, Z. A review on the applications of the nuclear magnetic resonance (NMR) technology for investigating fractures. J. Appl. Geophys. 133, 30–38 (2016).
Article ADS Google Scholar
Daigle, H. & Johnson, A. Combining mercury intrusion and nuclear magnetic resonance measurements using percolation theory. Transp. Porous Media 111, 669–679 (2016).
Article CAS Google Scholar
Elkatatny, S., Tariq, Z., Mahmoud, M. & Abdulraheem, A. New insights into porosity determination using artificial intelligence techniques for carbonates reservoirs. Petroleum 4, 408–418 (2018).
Article Google Scholar
Hamada, G. M. & Elshafei, M. A. Neural network prediction of porosity and permeability of heterogeneous gas sand reservoirs. In SPE Technical Symposium and Exhibition. SPE 126042 (2009).
Carrasquilla, A. A. G. & Briones, V. H. T. Simulating porosity and permeability of the nuclear magnetic resonance (NMR) log in carbonate reservoirs of Campos Basin, Southeastern Brazil using conventional logs and artificial intelligence approaches. Braz. J. Geophys. 37(2), 221–233 (2019).
Article Google Scholar
Al-Ajmi, F. A. & Holditch, S. A. NMR permeability calibration using a non-parametric algorithm and data from a formation in central Arabia. In SPE Middle East Oil and Gas Show. SPE 68112 (2001).
Zargar, G., Tanha, A. A., Parizad, A., Amouri, M. & Bagheri, H. Reservoir rock properties estimation based on conventional and NMR log data using ANN-Cuckoo: A case study in one of super fields in Iran southwest. Petroleum 6, 304–310 (2020).
Article Google Scholar
Mohaghegh, S. Virtual intelligence applications in petroleum engineering: Part 1—artificial neural networks. J. Pet. Technol. 52, 64–73 (2000).
Article Google Scholar
Malki, H. A. & Baldwin, J. A neuro-fuzzy based oil/gas producibility estimation methods. In International Joint Conference on Neural Networks. IEEE (2002).
Ahmadi, M. A. & Chen, Z. Comparison of machine learning methods for estimating permeability and porosity of reservoirs via petro-physical logs. Petroleum 5(3), 271–284. https://doi.org/10.1016/j.petlm.2018.06.002 (2018).
Article Google Scholar
Ahmadi, M. A., Ahmadi, M. R., Hosseini, S. M. & Ebadi, M. Connectionist model predicts the porosity and permeability of petroleum reservoirs by means of petro-physical logs: Application of artificial intelligence. J. Pet. Sci. Eng. 123, 183–200 (2014).
Article CAS Google Scholar
Mozaffari, A. & Azad, N. L. Optimally pruned extreme learning machine with ensemble of regularization techniques and negative correlation penalty applied to automotive engine coldstart hydrocarbon. Neurocomputing 131, 143–156 (2014).
Article Google Scholar
Dehghani, M. H., Azam, K., Khorasgani, F. C. & Fard, E. D. Assessment of medical waste management in educational hospitals of Tehran university medical science. Iran. J. Environ. Health Sci. Eng. 5(2), 131–136 (2008).
CAS Google Scholar
Chau, K. W. Application of a PSO-based neural network in analysis of outcomes of construction claims. Autom. Constr. 16(5), 642–646 (2007).
Article Google Scholar
Mustafa, A. et al. Data-driven machine learning approach to predict mineralogy of organic-rich shales: An example from Qusaiba Shale, Rub’al Khali Basin, Saudi Arabia. J. Mar. Pet. Geol. 137, 105495 (2022).
Article CAS Google Scholar
Otchere, D. A., Ganat, T. O. A., Gholami, R. & Ridha, S. Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models. J. Pet. Sci. Eng. 200, 108182 (2021).
Article CAS Google Scholar
Saikia, P., Baruah, R. D., Singh, S. K. & Chaudhuri, P. K. Artificial Neural Networks in the domain of reservoir characterization: A review from shallow to deep models. Comput. Geosci. 135, 104357 (2020).
Article Google Scholar
MATLAB user guide (2011).
Dogan, E., Yiğit, M. G., Sandalci, M. & Opan, M. Modelling of evaporation from the reservoir of Yuvacik dam using adaptive neuro-fuzzy inference systems. Eng. Appl. Artif. Intell. 23(6), 961–967. https://doi.org/10.1016/j.engappai.2010.03.007 (2010).
Article Google Scholar
Jalalifar, H., Mojedifar, S., Sahebi, A. A. & Nezamabadi-pour, H. Application of the adaptive neuro-fuzzy inference system for prediction of a rock engineering classification system. Comput. Geotech. 38(6), 783–790 (2011).
Article Google Scholar
Shahrabi, M. A., Kivi, I. R., Akbari, M. & Safiabadi, A. Application of adaptive neuro-fuzzy inference system for prediction of minimum miscibility pressure. Int. J. Oil Gas Coal Technol. https://doi.org/10.1504/IJOGCT.2014.057796 (2014).
Article Google Scholar
Castillo, E. Functional networks. Neural Process. Lett. 7(3), 151–159 (1998).
Article Google Scholar
Castillo, E., Cobo, A., Gutiérrez, J. M. & Pruneda, E. Functional networks: A new network-based methodology. Comput. Civ. Infrastruct. Eng. 15(2), 90–106 (2000).
Article Google Scholar
Castillo, E., Gutiérrez, J. M., Hadi, A. S. & Lacruz, B. Some applications of functional networks in statistics and engineering. Technometrics 43(1), 10–24 (2001).
Article MathSciNet MATH Google Scholar
Tariq, Z., Abdulraheem, A., Mahmoud, M. & Ahmed, A. A rigorous data-driven approach to predict poisson’s ratio of carbonate rocks using a functional network. Petrophysics 59(6), 761–777 (2018).
Google Scholar
Korany, M. A., Mahgoub, H., Fahmy, O. T. & Maher, H. M. Application of artificial neural networks for response surface modelling in HPLC method development. J. Adv. Res. 3(1), 53–63 (2012).
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Civil and Environmental Engineering Department, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA
Ayyaz Mustafa
Physical Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
Zeeshan Tariq
Petroleum Engineering Department, College of Petroleum Engineering and Geosciences, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran, 31261, Saudi Arabia
Mohamed Mahmoud & Abdulazeez Abdulraheem

Authors

Ayyaz Mustafa
View author publications
You can also search for this author in PubMed Google Scholar
Zeeshan Tariq
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Mahmoud
View author publications
You can also search for this author in PubMed Google Scholar
Abdulazeez Abdulraheem
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M. and Z.T. did the data analsis and run the machine learning models. A.M. and Z.T. wrote the main manuscript. M.M. and A.A. interpreted the results of prediction models. M.M. prepared the graphs for results. A.A. prepared the figures for model’s results. All authors review the manuscript.

Corresponding author

Correspondence to Ayyaz Mustafa.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mustafa, A., Tariq, Z., Mahmoud, M. et al. Machine learning accelerated approach to infer nuclear magnetic resonance porosity for a middle eastern carbonate reservoir. Sci Rep 13, 3956 (2023). https://doi.org/10.1038/s41598-023-30708-7

Download citation

Received: 08 July 2022
Accepted: 28 February 2023
Published: 09 March 2023
DOI: https://doi.org/10.1038/s41598-023-30708-7
Springer Nature Limited

Machine learning accelerated approach to infer nuclear magnetic resonance porosity for a middle eastern carbonate reservoir

Abstract

Similar content being viewed by others

Estimation of NMR Total and Free Fluid Porosity from Seismic Attributes Using Intelligent Systems: A Case Study from an Iranian Carbonate Gas Reservoir

Towards Rapid Prediction of Nuclear Magnetic Resonance-Based Bimodal Porosities: An Example from the Middle Eastern Carbonate Reservoir

Adjusting porosity and permeability estimation by nuclear magnetic resonance: a case study from a carbonate reservoir of south of Iran