Introduction

Petroleum industries, both downstream and upstream, can be adversely affected by the least understood and most problematic fraction of crude oil, known as asphaltene, as oil is explored, produced, processed, and transported1,2,3,4. Asphaltenes are defined as the oil constituents that dissolve in aromatic solvents such as benzene or toluene but are insoluble in saturated hydrocarbons such as n-heptane or n-pentane5,6. When the thermodynamic parameters (such as temperature, pressure, or fluid composition) change, asphaltenes initially present as tiny particles may grow into macro-particles and consequently deposit on solid surfaces7,8. With this in mind, precipitation and subsequent deposition of asphaltenes during either enhanced oil recovery operations or natural depletion lead to severe operational and/or economic problems for oil production9,10. Some of the most critical consequences of asphaltene formation include adverse wettability alteration of the reservoir rock, decline in well inflow capacity, formation damage, clogging of production facilities, and equipment fouling11,12.

Modeling and experimental research into the temperature/pressure-dependent kinetics of asphaltene aggregation can assist in precisely predicting and/or controlling asphaltene-related issues in all stages of petroleum processing/production. In this regard, numerous laboratory studies using several experimental techniques, including the centrifugation method, small-angle X-ray and small-angle neutron scattering, confocal laser-scanning and fluorescence microscopy, optical and confocal microscopy, and particle-size analysis, have been performed to characterize asphaltene aggregation and aggregate size13,14,15,16,17,18,19,20. In addition to experimental studies, there have been efforts to investigate asphaltene aggregation/precipitation behavior from a theoretical perspective.

Moradi et al. studied the asphaltene aggregate size distribution during miscible gas injection, which increased the optimum value of the collision parameter, and then employed population balance modeling to describe the aggregation kinetics. Their conclusions indicated that during natural production, cluster aggregation plays a dominant role near the bubble point pressure of the crude oil. In addition, nitrogen injection significantly increases the amount and size of asphaltene flocs21.

Duran et al. investigated asphaltene aggregate size and precipitation in crude oil diluted with n-heptane using the population balance method. Their results showed that collision efficiency has a minor impact on precipitation and aggregation22. Soltani Soulgani et al. showed that asphaltene collision intensity is related to the particle size distribution and the density of the mixture. Their results indicated that increasing the n-hexane concentration in an asphaltene-toluene solution leads to more asphaltene aggregation. It was observed that large aggregates, owing to Brownian motion, remain stable in the solution for less than 200 min. In addition, the aggregation rate in the reaction-limited aggregation regime was found to be faster than that in the diffusion-limited aggregation regime. Also, the average size of asphaltene aggregates decreases in the asphaltene settling region23. Hemmati-Sarapardeh et al. studied the influence of two main parameters, namely asphaltene concentration and the normal alkane-to-toluene ratio, on asphaltene aggregation. According to their findings, increasing the asphaltene concentration increases the aggregate size to more than 100 microns, while increasing the n-alkane chain length decreases the aggregate size. It was also observed that heteroatoms play a significant role in the average asphaltene aggregate size. The asphaltene with the lowest polarity and aromaticity formed the most stable solution with the smallest aggregate size6.

Poozesh et al. modeled asphaltene deposition in pipelines and demonstrated the effect of medium stability and viscosity using different oil compositions. It was observed that the lower the viscosity, the greater the aggregate size and, hence, the greater the deposition rate24. Hosseini-Moghadam et al. used different types of chemical inhibitors of asphaltene aggregation in an undiluted dead oil. Among the various inhibitors, dodecylbenzene sulfonic acid showed the highest efficiency in reducing the asphaltene aggregate size at all tested temperatures, owing to efficient acid–base interactions. Additionally, the population balance approach was utilized to model and analyze the aggregate size and its kinetic behavior. Their results indicated that collision efficiency decreases with decreasing temperature and inhibitor efficiency25.

Despite the considerable experimental and theoretical/mechanistic investigations focused on exploring the different aspects governing the kinetics of asphaltene aggregation, only limited efforts have been made toward modeling this phenomenon. The use of population balance modeling, a widely accepted approach in the field of colloidal systems26,27, has been reported by some researchers22,25. The application of fractal theory and molecular dynamics simulation has also rarely been addressed in this area23,28,29. In addition, experimental measurements of asphaltene aggregation kinetics are inherently time-consuming and require interpretation procedures alongside expensive laboratory equipment. Therefore, developing a simple and efficient modeling strategy for predicting asphaltene kinetic behavior is especially important, since experimental results under different conditions are scarce in the previously published research. To the best of our knowledge, given the importance of data availability for asphaltene particle aggregation, it comes as a surprise that no attempt has been made to date to introduce simple, efficient, and easy-to-use models. In this regard, the application of artificial intelligence-based approaches seems attractive.

Artificial intelligence methods are gaining significant attention, as they offer a promising means of predicting established phenomena with exceptional accuracy. These methods leverage advanced computer algorithms and machine learning models to analyze complex data patterns, excelling at managing large datasets and uncovering intricate relationships across diverse domains. Several instances of predicting asphaltene deposition-related issues with various artificial intelligence methods have been reported in the literature.

In their research, Ghorbani et al. employed a combination of a genetic algorithm (GA) and support vector regression to predict asphaltene precipitation (expressed as a percentage) in terms of temperature, molecular weight, and dilution ratio30. Their study compared the obtained results with the outcomes of two scaling equations, confirming the superior performance of their proposed approach. Hemmati-Sarapardeh and colleagues also predicted the asphaltene precipitation amount during natural depletion. They applied Radial Basis Function (RBF) and Multilayer Perceptron (MLP) neural networks, optimized with various optimization algorithms5. Their analysis revealed that the most effective performance was achieved with the RBF-Particle Swarm Optimization (PSO) and MLP-Bayesian Regularization (BR) intelligent techniques. Sadi and Shahrabadi introduced a modeling scheme that integrates genetic algorithms (GA) and the group method of data handling to forecast the weight percentage of asphaltene precipitation. They assessed the accuracy of their approach by comparing it to the results obtained from the least squares support vector machine and scaling equations31.

Kardani et al. presented an RBF-ANN modeling approach aimed at estimating the weight percentage of precipitated asphaltene. Their results provided validation for the predictive capability of the developed model when compared to several previously proposed correlations32.

As can be inferred, most research efforts in the realm of asphaltene applications involving artificial intelligence have concentrated on predicting the amount of asphaltene precipitation. Notably, there is an apparent gap in the literature regarding investigations into the domain of aggregation kinetics.

Therefore, the main focus of the current work is to propose a new, efficient framework for precise prediction of the aggregation kinetics of asphaltene particles. In this regard, artificial intelligence approaches, including the adaptive neuro-fuzzy inference system (ANFIS), multi-layer perceptron (MLP), radial basis function neural network (RBF-NN), and extreme learning machine (ELM), were employed to relate the average diameter of asphaltene aggregates (as output) to the input parameters: temperature, pressure, time, oil specific gravity, and oil asphaltene content. The Grey Wolf Optimizer (GWO) is employed to optimize and facilitate learning in the RBF-NN, while the MLP parameters are tuned using the Bayesian Regularization (BR), Levenberg–Marquardt (LM), and Scaled Conjugate Gradient (SCG) algorithms. Subsequently, comparative graphical and statistical analyses are used to assess the effectiveness of the developed models in terms of goodness of fit and prediction capability.

Model development

In essence, the effectiveness of a machine learning technique is contingent on various factors, encompassing the foundational theoretical framework, the architecture of the model (for instance, tree structures or neural networks), and the unique characteristics of the problem being addressed33. Additionally, incorporating different optimization algorithms (such as evolutionary and gradient-based methods) into machine learning approaches improves the tuning phase of modeling, ultimately leading to better model performance. Moreover, evaluating and comparing these models plays a pivotal role in determining the most fitting modeling approach for the particular problem under consideration. Therefore, we have used models with different theoretical bases and structures to identify the most suitable approach(es) for modeling the aggregation kinetics of asphaltene particles.

Modeling strategies

Adaptive neuro-fuzzy inference system (ANFIS)

The Adaptive Neuro-Fuzzy Inference System (ANFIS), developed by R. Jang34,35, is an intelligent algorithm that hybridizes neural networks and fuzzy inference systems to reduce the deficiencies associated with each algorithm individually36,37. In this approach, backpropagation (BP) and hybrid methods are the training techniques used to train the initial FIS from the collected data.

The ANFIS structure is systematically similar to the fuzzy inference system developed by Takagi–Sugeno–Kang38,39. Gradient descent backpropagation, which is the primary learning rule in ANFIS, computes the derivative of the squared error of each output node (known as the error rate) recursively from the output towards the input nodes40. ANFIS thus applies a mixed learning technique combining gradient descent and least-squares computations. In the forward pass, the node outputs (functional signals) are propagated up to layer 4, and the consequent parameters are obtained by least squares41. In the backward pass, gradient descent then updates the premise parameters42.

The adaptive network structure is composed of five layers, numbered 1 to 5, with the nodes and connections illustrated in Fig. 1. One output (f) and at least two inputs (here denoted x and y) are considered within the fuzzy inference system (FIS).

Figure 1. Schematic representation of the ANFIS modeling approach.

To introduce the ANFIS architecture, two fuzzy if–then rules based upon a first-order Sugeno FIS are presented here43,44:

$$\text{If } x \text{ is } A_{1} \text{ and } y \text{ is } B_{1}\text{, then } f_{1} = p_{1}x + q_{1}y + r_{1}.$$
$$\text{If } x \text{ is } A_{2} \text{ and } y \text{ is } B_{2}\text{, then } f_{2} = p_{2}x + q_{2}y + r_{2}.$$

The five layers of this structure are defined as follows. Layer 1 performs the fuzzification; each node in this layer produces the membership grade of an input variable, with the node functions given by:

$$O_{1,i} = \mu_{A_{i}}(x), \quad i = 1, 2.$$
(1)
$$O_{1,i} = \mu_{B_{i-2}}(y), \quad i = 3, 4.$$
(2)

where \(O_{1,i}\) is the output of node i in layer 1, and the subscript i denotes the membership grade of the corresponding fuzzy set (A1, B1, A2, or B2).

Each node in layer 2 multiplies the incoming signals and outputs the product, which represents the firing strength of a rule, in accordance with:

$$O_{2,i} = w_{i} = \mu_{A_{i}}(x) \times \mu_{B_{i}}(y), \quad i = 1, 2.$$
(3)

In layer 3, each node computes the ratio of a rule's firing strength to the sum of all rules' firing strengths:

$$O_{3,i} = \overline{w}_{i} = \frac{w_{i}}{w_{1} + w_{2}}, \quad i = 1, 2.$$
(4)

All nodes in layer 4 are adaptive nodes with the node output:

$$O_{4,i} = \overline{w}_{i} f_{i} = \overline{w}_{i}\left(p_{i}x + q_{i}y + r_{i}\right), \quad i = 1, 2.$$
(5)

where \(p_{i}\), \(q_{i}\), and \(r_{i}\) are called the consequent parameters, and \(\overline{w}_{i}\) is the normalized firing strength.

In the end, the ultimate output, which is the sum of all incoming signals, is computed in layer 5:

$$O_{5,i} = \sum_{i=1}^{2}\overline{w}_{i}f_{i} = \frac{w_{1}f_{1} + w_{2}f_{2}}{w_{1} + w_{2}}.$$
(6)
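To make the layer-by-layer computation above concrete, the following minimal sketch evaluates Eqs. (1)–(6) for a two-rule, two-input first-order Sugeno system. It assumes Gaussian membership functions, and the premise/consequent parameter values are hypothetical placeholders chosen purely for illustration rather than the trained model used in this work.

```python
import numpy as np

def gaussian_mf(v, c, sigma):
    """Gaussian membership function (an assumed choice of MF)."""
    return np.exp(-((v - c) ** 2) / (2.0 * sigma ** 2))

def anfis_forward(x, y, premise, consequent):
    """Forward pass of a two-rule, first-order Sugeno ANFIS (Eqs. 1-6)."""
    # Layer 1: fuzzification (membership grades)
    mu_A = [gaussian_mf(x, *premise["A"][i]) for i in range(2)]
    mu_B = [gaussian_mf(y, *premise["B"][i]) for i in range(2)]
    # Layer 2: firing strengths w_i = mu_Ai(x) * mu_Bi(y)
    w = np.array([mu_A[i] * mu_B[i] for i in range(2)])
    # Layer 3: normalized firing strengths
    w_bar = w / w.sum()
    # Layer 4: rule outputs w_bar_i * (p_i x + q_i y + r_i)
    f = np.array([p * x + q * y + r for (p, q, r) in consequent])
    # Layer 5: overall output (sum of incoming signals)
    return float(np.dot(w_bar, f))

# Hypothetical (center, sigma) pairs and consequent parameters, for illustration only
premise = {"A": [(0.0, 1.0), (1.0, 1.0)], "B": [(0.0, 1.0), (1.0, 1.0)]}
consequent = [(0.5, -0.2, 0.1), (-0.3, 0.8, 0.0)]
print(anfis_forward(0.4, 0.7, premise, consequent))
```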

Radial basis function (RBF) neural network

The feed-forward network known as the radial basis function neural network, which was introduced by Broomhead and Lowe45, is commonly used for classification and regression tasks33. This method is based on the theory of function estimation, and during the training process, it transforms data into a multi-dimensional space in order to search for an optimal surface45,46.

RBF-NN comprises only three fixed layers: the input layer, the hidden layer, and the output layer47. The intermediate (hidden) layer, which is considered the most significant component, connects the input and output layers. Each neuron in this layer is located at a specific position with an assigned radius, and the distance between the input vector and the center is then calculated48. The Euclidean distance between input vectors and centers, given by Eq. (7), is used for this purpose. A simplified illustration of an RBF-NN model is shown in Fig. 2.

Figure 2. The specific structure of the employed RBF neural network.

$$r_{j} = \sqrt{\sum_{i=1}^{m}\left(x_{i} - c_{ij}\right)^{2}}, \quad j = 1, 2, \dots, J.$$
(7)

where \({r}_{j}\) is the radial distance (radius) and \({c}_{ij}\) is the center of the j-th hidden neuron.

Among all RBFs, the Gaussian function was selected in this study to transform the Euclidean distance from the hidden layer to the output. The Gaussian function is defined as follows:

$$\varphi \left(r\right)=\mathrm{exp}\left(-\frac{{r}^{2}}{2 {\sigma }^{2}}\right).$$
(8)

The spread coefficient, \(\sigma\), is a significant parameter that controls the smoothness of the RBF and should be quantified carefully. Using the Gaussian function as the activation function in the RBF-NN, the final formulation is obtained as follows:

$$y_{i} = \sum_{j=1}^{J} w_{j}\,\varphi_{ij}\left(\Vert x_{i} - c_{j}\Vert\right), \quad j = 1, 2, \dots, J \ \text{and} \ i = 1, 2, \dots, m.$$
(9)

In this formulation, \(w\) denotes the output weights, \(J\) the number of nodes in the hidden layer, and \(m\) the number of data points. The Euclidean distance is represented by \(\Vert {x}_{i}-{c}_{j}\Vert\), as previously mentioned. To optimize the performance of the RBF-NN, the two regularization parameters, namely the spread coefficient of the Gaussian function and the number of nodes in the hidden layer, must be optimized simultaneously.
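A minimal sketch of Eqs. (7)–(9) is shown below: it computes the Euclidean distances to the centers, applies the Gaussian activation with a single spread coefficient, and forms the weighted output. Fitting the output weights by linear least squares for fixed centers and spread is shown only as an assumption; in this work the network size and spread are instead tuned with the GWO algorithm described later.

```python
import numpy as np

def rbf_design_matrix(X, centers, sigma):
    """Gaussian activations phi for every input/center pair (Eqs. 7-8)."""
    r = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)  # Eq. (7)
    return np.exp(-r ** 2 / (2.0 * sigma ** 2))                      # Eq. (8)

def rbf_fit_weights(X, y, centers, sigma):
    """Least-squares output weights for fixed centers/spread (an assumed scheme)."""
    phi = rbf_design_matrix(X, centers, sigma)
    w, *_ = np.linalg.lstsq(phi, y, rcond=None)
    return w

def rbf_predict(X, centers, sigma, w):
    """Weighted sum over the hidden nodes (Eq. 9)."""
    return rbf_design_matrix(X, centers, sigma) @ w
```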

Extreme learning machine (ELM)

An extreme learning machine (ELM) is a single hidden layer feed-forward neural network (SLFN) in which the input weights are selected randomly and the output weights are determined analytically. The advantages of this method, such as its high learning speed, few adjustable parameters, suitability for various non-linear problems, and generally high performance, make the algorithm efficient49.

Considering m as the number of data points of the form \(({x}_{i}, {y}_{i})\) and n as the number of neurons in the hidden layer, the ELM formulation can be written as follows:

$${y}_{j}= \sum_{i=1}^{n}{\rho }_{i}f\left({\omega }_{i}{x}_{j}+{b}_{i}\right), j=\mathrm{1,2},3,\dots m.$$
(10)

where \({\rho }_{i}\), \(f\), \({b}_{i}\), and \({\omega }_{i}\) are the output weights, activation function, biases, and input weights of the i-th neuron, respectively. The above formulation can be written in matrix form as follows:

$$H \rho =Y,$$
(11)

where \(H\) is the hidden layer output matrix, which could be defined as follows:

$$H= \left[\begin{array}{ccc}f\left({\omega }_{1}{x}_{1}+{b}_{1}\right)& \cdots & f\left({\omega }_{n}{x}_{1}+{b}_{n}\right)\\ \vdots & \ddots & \vdots \\ f\left({\omega }_{1}{x}_{m}+{b}_{1}\right)& \cdots & f\left({\omega }_{n}{x}_{m}+{b}_{n}\right)\end{array}\right],$$
(12)

where \(\rho =({\rho }_{1}\dots {\rho }_{n}{)}^{T}\) and \(Y=({Y}_{1}\dots {Y}_{m}{)}^{T}\)50. The main regularization parameter in this approach is the number of neurons in the hidden layer, which is determined empirically.
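The analytic training step of Eqs. (10)–(12) can be sketched as follows: random input weights and biases build the hidden-layer output matrix H, and the output weights ρ are obtained from H ρ = Y via the Moore–Penrose pseudo-inverse. The tanh activation and the uniform initialization range are assumptions made only for illustration.

```python
import numpy as np

def elm_train(X, y, n_hidden, rng=np.random.default_rng(0)):
    """Train a single-hidden-layer ELM analytically (Eqs. 10-12)."""
    m, d = X.shape
    W = rng.uniform(-1.0, 1.0, size=(n_hidden, d))   # random input weights omega_i
    b = rng.uniform(-1.0, 1.0, size=n_hidden)        # random biases b_i
    H = np.tanh(X @ W.T + b)                         # hidden-layer output matrix (Eq. 12)
    rho = np.linalg.pinv(H) @ y                      # output weights solving H rho = Y (Eq. 11)
    return W, b, rho

def elm_predict(X, W, b, rho):
    """Evaluate the trained ELM on new inputs (Eq. 10)."""
    return np.tanh(X @ W.T + b) @ rho
```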

Multi-layer perceptron (MLP)

Artificial neural networks (ANNs) are among the most well-known classes of computational intelligence techniques51,52. ANNs, designed based on the biological nervous system of the human brain, can be used to capture the relationships between the inputs and outputs of a system. Each ANN has two different components: neurons or nodes (processing elements), which process information, and interconnections, which connect the neurons. MLP and RBF are the most common ANNs. An MLP neural network has three layers: the input layer, which receives the input data; the intermediate layer, called the hidden layer, which mediates between the input and output information; and finally, the output layer, which produces the output data. In a model, the internal representation of the relationship between the input and output is controlled by the hidden layers53. The number of neurons in the input layer corresponds to the number of input variables, whereas the number of neurons in the output layer typically represents the output property. It is essential to determine the number of neurons in each layer as well as the number of hidden layers. Each neuron in the hidden layer is connected to all neurons in the preceding and succeeding layers, and its value is the sum of the products of the values of the preceding neurons and their specific weight factors, plus a bias term. The resulting value is then passed through an activation function, which can be applied in both the hidden and output layers.
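As a brief illustration of the structure described above, the sketch below performs the forward pass of a single-hidden-layer MLP with the tansig (hyperbolic tangent) activation; the weight matrices and biases are placeholders that would normally be set by one of the training algorithms discussed in the following sections.

```python
import numpy as np

def mlp_forward(x, W_hidden, b_hidden, W_out, b_out):
    """Forward pass of a single-hidden-layer MLP.

    Each hidden neuron sums the weighted outputs of the preceding layer
    plus a bias and passes the result through the tansig (tanh) activation.
    """
    h = np.tanh(W_hidden @ x + b_hidden)   # hidden layer
    return np.tanh(W_out @ h + b_out)      # output layer
```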

General characteristics of the applied AI modeling techniques

ANFIS leverages the strengths of both neural networks and fuzzy logic, making it well-suited for handling complex and non-linear systems. It exhibits flexibility in accommodating various types of data and problem domains. However, the selection of appropriate membership functions can pose a challenge, and it tends to have a slower convergence rate compared to using neural networks alone54. MLP exhibits remarkable proficiency in tackling intricate non-linear challenges through its use of multi-layer networks to model complex problems. Furthermore, it excels in handling substantial input data and delivering swift predictions once trained. Nevertheless, MLP can be computationally demanding, and the quality of the training data can significantly influence the model's performance55,56. Like MLP, RBF can also handle non-linear problems; the main difference between them lies in their structure, with RBF having a simpler three-layer architecture57. The RBF network excels in efficient localized learning and accurate interpolation. However, it presents challenges in selecting the appropriate basis functions and determining the optimal number of functions. Additionally, the training phase of RBF networks tends to be computationally intensive, and they are sensitive to noisy data and high-dimensional datasets58. ELM offers key benefits, such as a reduced number of hyper-parameters, rapid training speed, and the ability to perform reasonably well with large datasets. Nevertheless, the main drawback of this method is its limited accuracy in certain systems, owing to the lack of fine-tuning of the hidden layer weights57,59.

Optimization algorithms

Levenberg–Marquardt

The Levenberg–Marquardt (LM) algorithm is an effective optimization technique that is commonly utilized in numerous applications where nonlinear least-squares problems need to be solved. Its wide usage in fields such as computer vision, machine learning, and physics is due to its capability to handle complex and noisy problems60. Additionally, the LM algorithm is a popular choice for optimizing biases and weights in multilayer perceptron neural networks. Notably, calculating the Hessian matrix in the LM algorithm is not required; instead, it is approximated using the equation provided below61,62:

$${H}_{m}= {J}_{m}^{T} {J}_{m}.$$
(13)

The Hessian matrix and Jacobian matrix are denoted by \({H}_{m}\) and \({J}_{m}\), respectively. In the MLP modeling approach, the Jacobian matrix is defined based on the weights and biases:

$$J\left(\alpha \right)= \left[\begin{array}{ccc}\frac{\partial {e}_{1}(\alpha )}{\partial {\alpha }_{1}}& \cdots & \frac{\partial {e}_{1}(\alpha )}{\partial {\alpha }_{n}}\\ \vdots & \ddots & \vdots \\ \frac{\partial {e}_{N}(\alpha )}{\partial {\alpha }_{1}}& \cdots & \frac{\partial {e}_{N}(\alpha )}{\partial {\alpha }_{n}}\end{array}\right].$$
(14)

The vector \(e\) represents the errors in the network, and the equation below can be used to calculate the gradient:

$$g= {J}^{T} e.$$
(15)

After obtaining the gradient, the algorithm employs a Newton-like equation to update and determine the solution for the next steps, which is expressed as follows:

$${\omega }_{i+1}= {\omega }_{i}-({J}^{T}J + \delta I{)}^{-1} {J}^{T}e.$$
(16)

The variable \(\omega\) represents the connection weight, and \(\delta\) is a constant that can be adjusted during network training based on the outcome of each step. Specifically, if a step is successful, \(\delta\) is decreased, whereas if a step is unsuccessful, \(\delta\) is increased. Typically, the cost function decreases with each step57,63.
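A single LM update combining Eqs. (13)–(16) is sketched below in its standard damped form, where the damping factor δ is increased after an unsuccessful step and decreased after a successful one; the step-acceptance logic around it is omitted.

```python
import numpy as np

def lm_step(J, e, w, delta):
    """One Levenberg-Marquardt weight update (Eqs. 13-16).

    J     : Jacobian of the network errors w.r.t. weights and biases (Eq. 14)
    e     : vector of network errors
    w     : current weight/bias vector
    delta : damping factor adjusted between iterations
    """
    H_approx = J.T @ J                                          # Hessian approximation (Eq. 13)
    g = J.T @ e                                                 # gradient (Eq. 15)
    step = np.linalg.solve(H_approx + delta * np.eye(w.size), g)
    return w - step                                             # Newton-like update (Eq. 16)
```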

Bayesian regularization

The Bayesian regularization (BR) algorithm is another commonly used method in machine learning and statistical modeling that helps prevent overfitting and improve model performance. By adding a prior distribution, typically a Gaussian with zero mean and a hyper-parameter variance, to the model's parameters, the algorithm aims to identify optimal parameter values that increase the posterior probability of the model given the input data. This technique is beneficial when dealing with noisy or limited data and enables the estimation of prediction uncertainty64. The objective function for this algorithm is defined as:

$${f}_{obj}=a {\sigma }_{W}+b {\sigma }_{E}.$$
(17)

The objective function (\({f}_{obj}\)) combines the sum of squared network weights (\({\sigma }_{W}\)) and the sum of squared network errors (\({\sigma }_{E}\)), with \(a\) and \(b\) denoting objective function parameters determined via Bayes' theorem. The BR algorithm endeavors to build a suitable network by minimizing this weighted sum, as indicated in Eq. (17)65. Once the optimal values for \(a\) and \(b\) have been determined, the algorithm employs algebraic manipulation to utilize the LM algorithm to minimize the objective function57,63.
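The regularized objective of Eq. (17) can be written as a short helper, assuming σW and σE are the sums of squared weights and squared errors; the Bayesian update of the coefficients a and b is not shown.

```python
import numpy as np

def br_objective(weights, errors, a, b):
    """Bayesian-regularization objective (Eq. 17): a*sigma_W + b*sigma_E."""
    sigma_w = np.sum(np.square(weights))  # sum of squared network weights
    sigma_e = np.sum(np.square(errors))   # sum of squared network errors
    return a * sigma_w + b * sigma_e
```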

Scaled conjugate gradient

The Scaled Conjugate Gradient (SCG) algorithm is a widely used numerical optimization method for optimizing machine learning model parameters, particularly in artificial neural networks. Its development aimed to create a more efficient and robust optimization technique than other commonly used methods, such as gradient descent and conjugate gradient. SCG combines conjugate gradient and line search techniques to minimize the objective function, and it incorporates a scaling procedure to adjust the step size based on the objective function's curvature.

The scaled conjugate gradient algorithm employs conjugate directions for faster convergence instead of the steepest descent direction. The initial descent direction (\({-i}_{0}\)) and the search direction (\({S}_{0}\)), also referred to as the conjugate direction, are related and can be mathematically represented as61:

$${S}_{0}= {-i}_{0}.$$
(18)

To identify the most suitable distance to move along the current search direction in this algorithm, a line search technique is employed. The technique can be described as follows:

$${u}_{j+1}= {u}_{j}+ {a}_{j} {i}_{j}.$$
(19)

The calculation of the subsequent search direction in this algorithm depends on the previous conjugate direction. The formula utilized to determine the next search direction is as follows:

$${S}_{j}= -{i}_{j}+ {b}_{j} {S}_{j-1}.$$
(20)

Notably, the SCG algorithm merges the conjugate gradient algorithm with the trust region approach, given that the line search method utilized to determine the step size may incur significant computational costs. Additionally, the latter is not the only approach used to determine the step size in the SCG algorithm57,66.
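The direction recursion of Eqs. (18)–(20) is sketched below; the computation of the step sizes a_j and scalars b_j, which is where the scaling and trust-region machinery of SCG enters, is assumed to be supplied elsewhere.

```python
import numpy as np

def conjugate_directions(grads, betas):
    """Search directions S_j from gradients i_j and scalars b_j (Eqs. 18-20)."""
    S = [-grads[0]]                                  # S_0 = -i_0 (Eq. 18)
    for j in range(1, len(grads)):
        S.append(-grads[j] + betas[j] * S[j - 1])    # S_j = -i_j + b_j * S_{j-1} (Eq. 20)
    return S
```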

Grey wolf optimizer (GWO)

In this study, a meta-heuristic algorithm, namely the Grey Wolf Optimizer, was utilized to optimize the RBF-NN parameters and obtain more precise results. Grey wolves prefer to live in packs. The leaders are a female and a male, called alphas. The alpha is primarily responsible for decisions about hunting, the sleeping place, the time to wake up, and so on. Beta is the second level in the grey wolf hierarchy. The betas are obedient wolves that help the alpha in decision-making and other pack activities. Omega is the lowest rank of grey wolves. The omega wolf functions as a submissive figure, consistently yielding to the dominant wolves, and is often referred to as a scapegoat. If a wolf does not hold the alpha, beta, or omega position, it is considered a subordinate or delta, according to some sources. Delta wolves are subordinate to the alphas and betas but dominate the omegas. The hunting behavior of grey wolves can be divided into several key stages:

  • Tracking, chasing, and approaching the prey.

  • Chasing, circling, and harassing the prey until it stops moving.

  • Attacking the prey.

In this optimization algorithm, all of the above behaviors have been implemented. Additionally, the search agents are divided into four categories: alpha, beta, delta, and omega. In this algorithm, alpha represents the best solution in the current iteration. Grey wolves have a way of identifying the prey's location and encircling it. Usually the alpha is in charge of hunting; occasionally the beta and delta wolves also play an essential part in a hunt. Nevertheless, in an abstract search space, the location of the optimum (prey) is unknown. Mathematically modeling grey wolf hunting behavior involves assuming that the alpha, beta, and delta have superior knowledge about the potential location of the prey. As a result, the top three best solutions obtained so far are saved, and the other search agents, including the omegas, are compelled to adjust their positions based on the positions of the best search agents67,68.
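The position update that follows from these assumptions is sketched below for a single search agent: three candidate moves guided by the alpha, beta, and delta wolves are averaged, and the coefficient a is decreased over the iterations to shift from exploration to exploitation. This is a minimal illustration rather than the full optimizer loop used in this study.

```python
import numpy as np

def gwo_position_update(wolf, alpha, beta, delta, a, rng):
    """Update one grey wolf's position towards the three best solutions."""
    def move_towards(leader):
        r1, r2 = rng.random(wolf.shape), rng.random(wolf.shape)
        A = 2.0 * a * r1 - a              # exploration/exploitation coefficient
        C = 2.0 * r2                      # prey-weighting coefficient
        D = np.abs(C * leader - wolf)     # distance to the leader
        return leader - A * D
    return (move_towards(alpha) + move_towards(beta) + move_towards(delta)) / 3.0

# Example: one update step with hypothetical 2-D positions
rng = np.random.default_rng(0)
new_pos = gwo_position_update(np.array([0.2, -0.4]),
                              np.array([0.9, 0.1]),    # alpha (best so far)
                              np.array([0.7, 0.0]),    # beta
                              np.array([0.5, -0.1]),   # delta
                              a=1.5, rng=rng)
```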

Data gathering

The development of an accurate modeling strategy is strongly associated with the quality of the utilized dataset. The experimental measurements applied in the present work were collected from the literature69, in which the average diameter of asphaltene aggregates was measured for two different crude oil samples (Samples A and B) at different pressure, temperature, and time values. This dataset comprises 423 data points, of which 70% were used for training and 15% each for testing and validation. The diameter of asphaltene aggregates depends on different variables such as pressure, temperature, time, the characteristics of the crude oil, the type of asphaltene (its basic structure), and the asphaltene content. In the current study, temperature, pressure, time, oil specific gravity (as a representative of crude oil characteristics), and oil asphaltene content were considered as the input variables. The output variable is the average diameter of asphaltene aggregates. The statistical criteria of the gathered dataset are shown in Table 1.

Table 1 Statistical criteria for the collected dataset in this study.

According to Table 1, each parameter spans a different range, which can degrade the training process and, subsequently, the model accuracy. Therefore, as a pre-processing step, the parameters mentioned above are mapped to a new range between − 1 and 1 using the equation below.

$${x}_{m}=2 \frac{(x-{x}_{min})}{({x}_{max}- {x}_{min})}-1.$$
(21)
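A small sketch of the scaling of Eq. (21), applied column-wise to the input matrix, is given below; the column minima and maxima obtained from the training set would be reused when scaling new data.

```python
import numpy as np

def scale_to_minus_one_one(X):
    """Map each column of X to the range [-1, 1] according to Eq. (21)."""
    X = np.asarray(X, dtype=float)
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    return 2.0 * (X - x_min) / (x_max - x_min) - 1.0
```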

Results and discussion

Model development

In this study, four intelligent models, namely ANFIS, RBF-NN, MLP, and ELM, are proposed to accurately calculate the average diameter of asphaltene aggregates based on various factors such as time, pressure, temperature, crude oil specific gravity, and asphaltene content. To conduct our research, we utilized a comprehensive data bank that covers a wide range of laboratory conditions.

The dataset was randomly divided into three groups, namely the training set, the test set, and the validation set. The training set, comprising 70% of the dataset, was used to train the models. Fifteen percent of the dataset was selected as the test set, which is utilized to evaluate the prediction capability and generality of the developed models. The remaining 15% was selected as the validation set to find the optimum parameters for each model and avoid over-fitting. To assess the developed models with different characteristics, several statistical parameters were calculated and subsequently compared to identify the most accurate one.
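A simple way to obtain such a random 70/15/15 partition is sketched below; the seed and helper name are hypothetical and do not reproduce the actual split used in this study.

```python
import numpy as np

def split_indices(n_samples, train_frac=0.70, val_frac=0.15, seed=0):
    """Randomly split sample indices into train, validation, and test sets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_samples)
    n_train = int(train_frac * n_samples)
    n_val = int(val_frac * n_samples)
    return (idx[:n_train],
            idx[n_train:n_train + n_val],
            idx[n_train + n_val:])

train_idx, val_idx, test_idx = split_indices(423)  # 423 data points in this study
```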

The optimal values of the hyper-parameters (regularization parameters) were obtained through trial and error, except for the RBF-NN, where the Grey Wolf Optimization algorithm was used to find the optimal hyper-parameter values. This process was repeated multiple times to achieve the best possible results. The GWO algorithm optimized the RBF-NN regularization parameters over 15 iterations with a population of 50 search agents. The optimized values of the RBF-NN parameters, namely the number of neurons in the hidden layer and the spread coefficient, are 99 and 0.5803, respectively. There are two popular structure types for the ANFIS modeling approach: Takagi–Sugeno–Kang (TSK) and Mamdani. In this research, the ANFIS approach was implemented using the TSK structure.

Furthermore, the FIS was generated using the Fuzzy C-Means (FCM) technique, with the 'number of clusters' parameter set to 18 and the 'exponent' configured at 2.3. The exponent serves as a tuning parameter in the ANFIS algorithm, regulating the level of fuzziness, and is conventionally assigned a value greater than 1. In the ELM modeling approach, the number of neurons was set to 104. In the case of the MLP modeling approach, the number of neurons in the hidden layer is 21, and the activation function for both the hidden and output layers is tansig. Table 2 provides a concise overview of the regularization parameter values used in the intelligent models.

Table 2 Optimum values for all paradigms’ regularization parameters.

Statistical error evaluation

Statistical analysis of error parameters is an essential component of any modeling approach. To assess the reliability and performance of the developed models, several statistical indicators are commonly used. In this study, we employed three such indicators: average absolute relative deviation percentage (AARD%), determination coefficient (\({\text{R}}^{2}\)), and root mean square error (RMSE)70. These parameters are defined as follows, respectively:

$$\text{AARD}{\%}=\left(\frac{1}{{\text{N}}}\sum_{\text{i=1}}^{\text{N}}\left(\left|\frac{{\text{D}}^{\text{exp}}-{\text{D}}^{\text{pred}}}{{\text{D}}^{\text{exp}}}\right|\right) \right)\times100$$
(22)
$${\text{R}}^{2}\text{=1-}\frac{\sum_{\text{i=1}}^{\text{N}}{\left({\text{D}}^{\text{exp}}-{\text{D}}^{\text{pred}}\right)}^{2}}{\sum_{\text{i=1}}^{\text{N}}{\left({\text{D}}^{\text{exp}}-\stackrel{\mathrm{-}}{\text{D}}\right)}^{2}}$$
(23)
$${\text{R}}\text{MSE} = \sqrt{\frac{1}{{\text{N}}}\sum_{\text{i=1}}^{\text{N}}\left({\text{D}}^{\text{exp}}{-{\text{D}}^{\text{pred}}}\right)^{2}}$$
(24)

\({\text{D}}^{\text{exp}}\) and \({\text{D}}^{\text{pred}}\) denote the experimentally measured and predicted values of the asphaltene aggregate average diameter, respectively, and N is the number of data points.
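The three indicators of Eqs. (22)–(24) translate directly into short helper functions, sketched below with the experimental and predicted diameters as NumPy arrays.

```python
import numpy as np

def aard_percent(d_exp, d_pred):
    """Average absolute relative deviation percent (Eq. 22)."""
    return 100.0 * np.mean(np.abs((d_exp - d_pred) / d_exp))

def r_squared(d_exp, d_pred):
    """Coefficient of determination (Eq. 23)."""
    ss_res = np.sum((d_exp - d_pred) ** 2)
    ss_tot = np.sum((d_exp - np.mean(d_exp)) ** 2)
    return 1.0 - ss_res / ss_tot

def rmse(d_exp, d_pred):
    """Root mean square error (Eq. 24)."""
    return np.sqrt(np.mean((d_exp - d_pred) ** 2))
```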

The statistical parameters for the proposed models are presented in Table 3. The accuracy of the models was heavily influenced by the optimization of their regularization parameters. In this study, we compared the implemented paradigms based on the AARD% value, as it is not affected by the scale of the data.

Table 3 Different statistical parameters for proposed paradigms.

Graphical error evaluation

Statistical plots, such as cumulative frequency versus absolute relative error plots, cross plots, average absolute relative deviation percent plots, relative error versus experimental data plots, and error distribution plots, are practical tools to evaluate the performance of the developed models visually. Figure 3 illustrates the average absolute relative deviation percent (AARD%) for the train, validation, test, and total sets.

Figure 3. AARD% values for the train, validation, test, and total sets.

Based on the AARD% values presented in Table 3 and Fig. 3, all models in this study exhibit satisfactory performance. However, it is evident that the GWO-RBF model provides the best results in terms of statistical parameters, while the MLP-SCG approach demonstrates the lowest prediction capability. Figure 4 displays cross plots of predicted versus experimental data points for the train, validation, and test sets. The clustering of the data points around the unit-slope line for each model confirms their accuracy.

Figure 4. Predicted versus experimental values of the average asphaltene aggregate diameter.

Figure 5 illustrates the relative error percent (RE%) versus the experimental average diameter. It is worth noting that the accumulation of data points around the zero-error line indicates a model's superiority. Figure 5 shows that all proposed approaches exhibit a satisfactory clustering of data points around the zero-error line. However, certain data points with a low average diameter are not predicted as accurately as the others. Among the models tested, GWO-RBF displays the most pronounced concentration around the zero-error line.

Figure 5. Relative error versus experimental average diameter.

Additionally, Fig. 6 depicts the cumulative frequency versus absolute relative error for all paradigms investigated in this study. As shown in this figure, for the ELM, ANFIS, MLP-BR, and MLP-SCG models, 90.31%, 91.73%, 93.38%, and 76.83% of the whole dataset have an absolute relative error of less than 4%, respectively. Under the same condition, the figure indicates more than 94% for the GWO-RBF and MLP-LM paradigms. Also, it should be noted that the maximum absolute relative errors for GWO-RBF, ANFIS, ELM, MLP-BR, MLP-LM, and MLP-SCG are 12.50%, 17.31%, 16.03%, 16.02%, 16.99%, and 21.36%, respectively. Therefore, based on the previous explanations, GWO-RBF can be selected as the best model in this study, and the paradigms can be ranked as follows:

Figure 6. Cumulative frequency versus absolute relative error.

$$GWO-RBF>MLP-LM>MLP-BR>ANFIS>ELM>MLP-SCG$$

Figure 7 further confirms the robustness of the best-developed model. This figure illustrates the relative error percent (RE%) between the predicted and experimental values of the GWO-RBF model. As shown in the plot, the majority of predicted results by the best-developed model have relative error values within the range of − 3% to + 3% for the entire dataset. This indicates the high accuracy of the presented model.

Figure 7. Data frequency versus relative error percent for all data points for the GWO-RBF model.

In addition to accuracy comparison, the machine learning approaches employed can also be compared based on their computational effort. In this regard, the models were evaluated in terms of their CPU time consumption. It is important to mention that the model codes were executed on a computer system equipped with a Core i7-4910MQ processor, running at a base frequency of 2.90 GHz.

Figure 8 provides a comparison of different methods based on CPU time. As shown, the GWO-RBF (which performed the best based on the AARD%) exhibited the highest CPU time among the applied techniques. The reason for the increased CPU time for the GWO-RBF method, compared to other methods, is its utilization of a metaheuristic optimization algorithm to find optimal solutions for two hyper-parameters within this model. This metaheuristic algorithm requires more time to converge to optimal values as it operates on a population-based approach. However, it is worth noting that such algorithms tend to achieve high-quality solutions.

Figure 8. CPU time for the applied machine learning schemes.

In contrast, MLP-SCG showed the shortest CPU time. This can be attributed to the SCG algorithm, which operates based on gradients, enabling it to converge more rapidly than the GWO algorithm, which relies on a population-based meta-heuristic approach. However, it is worth noting that gradient-based algorithms are more susceptible to becoming trapped in local optima when compared to meta-heuristic methods.

Overall, in scenarios where computational resources are limited, lightweight and efficient models (e.g., MLP and ELM) can be applied with an acceptable degree of prediction error. Conversely, when computational resources are more abundant, and a strong emphasis is placed on achieving high accuracy, it becomes viable to utilize models that offer higher accuracy at the expense of increased resource consumption, such as GWO-RBF.

Computational complexity of models

Assessing the computational complexity of different machine learning models can be a laborious task, because it hinges on several factors, including the model structure, the size of the dataset, the optimization methods employed, and so on. In this context, big O notation is used to describe the worst-case behavior of an algorithm's complexity. Furthermore, big O notation typically considers all feature inputs collectively, while the number of samples may fluctuate throughout the analysis71. Although the exact determination of big O for the models applied here is very challenging, we try to provide a rough estimate and qualitative comparison.

In the case of ELM, the computational complexity is relatively lower than that of the other applied techniques. It usually depends on the number of neurons (Nn) and the data dimension (Dd); a computational complexity of approximately O(Nn × Dd) may be assumed. The computational complexity of ANFIS is influenced by a combination of factors, including the number of fuzzy rules, the selected training method, the complexity of the membership functions, and the number of input variables. In general, ANFIS models tend to have a moderate level of computational complexity.

Nevertheless, it is essential to recognize that the exact complexity can vary considerably based on the specific configuration and the nature of the problem under consideration72; therefore, assigning a big O is very challenging. Like ANFIS, the computational complexity of MLP can range from moderate to high, depending on its specific configuration.

The MLP computational complexity could be approximately on the order of O(Ke × Nn × Dd), with Ke representing the number of training epochs. The computational complexity of an RBF network is influenced by factors such as the number of RBF units, the dimensionality of the input data, and the training procedure. In the training phase, an overall computational complexity of approximately O(M × N × Dd × I) can be assumed, where M refers to the training sample size, N represents the number of RBF units, and I indicates the number of iterations necessary to reach convergence. However, some references state the RBF complexity simply as O(N2)73.

Model trend estimation

Trend estimation is a crucial step in post-evaluating a modeling strategy and verifying a model's ability to track the variations in experimental measurements. To validate the robustness of the best-proposed model, we assessed the GWO-RBF approach's trend-prediction capability under different conditions, as shown in Fig. 9. The figure presents a comparison between the predicted and experimental average diameter of asphaltene aggregates over time for oil samples A and B under varying pressure and temperature conditions. The figure demonstrates that the GWO-RBF model accurately predicts the asphaltene aggregate diameter and the process trend under different states.

Figure 9. Comparison of predicted and experimental asphaltene aggregate diameter versus time for oil samples A and B at different temperature and pressure values.

This study offers an AI-based predictive framework for accurately predicting the kinetics of asphaltene particle aggregation, addressing a critical concern in different phases of production from petroleum reservoirs. Its effectiveness lies in the accuracy of predictions obtained through the utilization of diverse AI methodologies and the thorough examination of significant variables. In addition, the proposed strategies have the potential to be integrated with available reservoir simulators to account for asphaltene precipitation phenomena more accurately; such integration could mitigate uncertainties and improve prediction capability. Nonetheless, AI-based paradigms are usually restricted by their reliance on the available data and by potential challenges in generalizing to a wide range of conditions. In other words, the practical applicability of the developed models could be extended as more data become available for their training.

Conclusions

In the current research, reliable models based on GWO-RBF, MLP-LM, MLP-BR, MLP-SCG, ANFIS, and ELM were constructed to estimate the asphaltene aggregate diameter versus time in terms of pressure, temperature, oil asphaltene content, and oil specific gravity. A series of experimental data was acquired from the published literature to construct the models. An excellent match, in terms of statistical parameters, was obtained between the experimental and predicted values of asphaltene aggregate diameter using the developed models. However, the proposed GWO-RBF model exhibits higher accuracy than the other models, with a coefficient of determination, average absolute relative deviation percent, and root mean square error of 0.9993, 1.1326%, and 0.0537, respectively. The obtained results demonstrate the strong generalization ability and high prediction capability of the introduced models.