Introduction

Crude oil contributes a considerable portion to the energy demand of the world (Alimohammadi et al. 2019). Fulfilment of this energy requirement through sustainable production of crude oil is highly important and essential for each country to meet its demand for economic progress. However, under certain circumstances, assurance of required rate of crude oil production demand becomes extremely difficult or nearly impossible due to the solid deposit blockages arising in production flow lines or even in hydrocarbon reservoir. This interruption of hydrocarbon production could be occurred due to the formation of hydrates, emulsions, inorganic scales and organic deposits (Mansoori 2010). Among these above-mentioned solids, asphaltene is regarded to be the most troublesome organic solid deposit because it possesses complex formation phenomenon and therefore requires complicated, diverse and expensive treatment solutions (Melendez-Alvarez et al. 2016; Al-Qasim et al. 2018; Thou et al. 2002).

Initially, the word “Asphaltene” was introduced by a person having name “Boussingault” (1837). According to the widely accepted definition, Asphaltene is the fraction of crude oil which exhibits insolubility in n-alkanes particularly n-pentane and n-heptane, whereas it becomes highly miscible in aromatic compounds such as toluene, benzene, xylene. (Goual 2012). Compositionally, asphaltene molecule possesses high percentage of carbon and hydrogen elements which are present in the form of alkyl chains and aromatic rings along with the presence of small proportion of heteroatoms and metallic elements (Zheng et al. 2020). The molecular weight of asphaltene is not fixed, and researchers have reported a wide range of asphaltene molecular weight because asphaltene molecules undergo in the self-association phenomenon which depends upon its polarity, thus causing it to grow in size and ultimately increase the molecular weight (Zheng et al. 2020). However, the most accepted value is 750 g/mol (Alimohammadi et al. 2019). Lastly, structure wise; Continental, Archipelago, and Yen-Mullins model are the most popular and accepted structural asphaltene models. Figure 1a–c illustrates the structural models of asphaltene (Fakher et al. 2020).

Fig. 1
figure 1

a Archipelago asphaltene structure, b Continental asphaltene structure, and c Yen–Mullins asphaltene model (Reprint with permission from (Fakher et al. 2020) copyright ©2020 Springer)

According to several researchers, there are four governing factors which play a vital role in the precipitation of asphaltene from crude oil. These include the changes in oil composition, variation in temperature and pressure, and electrokinetic effect (Gharbi et al. 2017). Though, alterations in oil composition occur through depressurization process and blending of oils; however, during production of hydrocarbon, the changes in pressure is considered to be the foremost factors among all other affecting factors. The deposition problem could cause emergence of numerous challenges to operators in the form of pore throats clogging in reservoir, wettability alteration of reservoir rock from water wet to oil wet, increase in the oil viscosity, wells, downhole and surface equipment and facilities choking or blocking; thus, causing net hydrocarbon recovery reduction and requirement of the deployment of expensive treatment methods for mitigation (Alimohammadi et al. 2019; Gharbi et al. 2017; Madhi et al. 2018). These treatment techniques may include Chemical, Mechanical, Thermal, Biological, and Ultrasonic methods (Alimohammadi et al. 2019; Gharbi et al. 2017). Among all these mentioned methods, mechanical and chemical approaches are mostly employed.

Looking at the gravity of this asphaltene deposition problem, operating companies usually prefer prevention over mitigation of this issue, as it facilitates them to save treatment related money and expenditures because of the uninterrupted supply of hydrocarbon throughout the life of well. There are various prevention strategies that were reported over the years. Among all of them, SARA based relationships models (Saturates, Aromatics, Resins, Asphaltenes) are quite popular and common (Mohammed et al. 2021; Guzmán et al. 2017; Ali et al. 2021a). These relationships are developed based on the fact that crude oil is characterized on the basis of SARA fractions and each fraction affects the asphaltene stability in crude oil (Ali et al. 2021a). Resins and aromatics promote asphaltene stability in oils, thus ceasing asphaltene precipitation (Ashoori et al. 2017). The reason is because aromatics dissolve asphaltene molecules, while resins are polar components which interact with asphaltene molecules to provide shielding effect against other asphaltene molecules preventing their flocculation (Mansoori 2010; Ali et al. 2021a, b; Ashoori et al. 2017). Alternatively, saturates and asphaltenes component of crude oil favor asphaltene precipitation mechanism. The resins are miscible in saturates thus high saturate contents dissolve more resins letting asphaltene molecules free to combine and deposit (Ashoori et al. 2017). As far as asphaltene contents of crude oil are concerned, although, higher proportion of asphaltene facilitates asphaltene precipitation, but in past there have been some cases reported where crude oils were seen to exhibit unstable behavior even having very low contents of asphaltene (Mansoori 2010; Law et al. 2019).

In past, several investigators have reported different SARA based asphaltene stability predicting models. These models include Colloidal Instability Index, Stability Index, Colloidal Stability Index, Chamkalani’s stability classifier, Jamaluddin’s method, Modified Jamaluddin’s method, Stankiewicz plot, Qualitative-quantitative analysis (QQA) and Stability cross plot (SCP) (Mohammed et al. 2021; Guzmán et al. 2017; Ali et al. 2021a). In this research study, these aforementioned stability predicting models are applied onto the seventeen (17) crude oils SARA datasets having known information about their stability status according to the oil field experience as present in the open literature. Detailed performance analysis, both qualitative and quantitative, of the stability results obtained by each model is carried out to evaluate the models accuracy. Finally, the advantages and limitations or drawbacks of each model found in this study are discussed and accordingly new reliable predicting model is also presented.

Asphaltene stability predicting models

Colloidal instability index (CII)

Colloidal Instability Index (CII) assumes crude oil system as colloidal solution which is composed of SARA pseudo-components. Mathematically, it can be expressed as the ratio of the sum of saturates and asphaltenes contents to the sum of resins and aromatics contents. CII can be calculated using Eq. (1): (Mohammed et al. 2021; Guzmán et al. 2017; Ali et al. 2021a)

$${\text{CII}} = \frac{{\left( {{\text{Saturates in wt\%}}} \right) + \left( {{\text{Asphaltenes in wt\%}}} \right)}}{{\left( {{\text{Resins in wt\%}}} \right) + \left( {{\text{Aromatics in wt\%}}}\right)}}$$
(1)

Following are the stability judgment criteria to determine the crude oils stability: (Mohammed et al. 2021; Guzmán et al. 2017).

For value of CII ≥ 0.9, the crude oil is termed unstable, while for the value of CII ≤ 0.9, the crude oil is considered as stable. If CII value lies in between 0.7 and 0.9, then crude oil comes under uncertain zone.

Stability index (SI)

Stability index (SI) is a well-renowned stability parameter used to find asphaltene stability in crude oil samples. It is expressed as the ratio of asphaltenes to resins (A/R). To determine stability status, if SI value obtained is lower than 0.35, then crude oil is regarded as stable otherwise unstable. (Mohammed et al. 2021; Guzmán et al. 2017).

Colloidal stability index (CSI)

CSI index was proposed on the concept that the asphaltene contents present in unstable crude oils are likely to be more polar than those contained by stable oils. CSI can be computed by using Eq. (2): (Mohammed et al. 2021; Guzmán et al. 2017)

$${\text{CSI}} = \frac{{\left( {\varepsilon^{{{\text{sat}}}} {\text{*Saturates in wt\%}}} \right) + \left( {\varepsilon^{{{\text{asph}}}} {\text{*Asphaltenes in wt\%}}} \right)}}{{\left( {\varepsilon^{{{\text{res}}}} {\text{*Resins in wt\%}}} \right) + \left( {\varepsilon^{{{\text{arom}}}} {\text{*Aromatics in wt\%}}} \right)}}$$
(2)

where (ε) represents a dielectric constant and its values for SARA components for stable and unstable oils are following (Goual and Firoozabadi 2002; Hascakir 2017): Stable crude oils (εsat = 1.921, εasph = 5.5, εres = 4.7, and εarom = 2.379). Unstable crude oils (εsat = 1.921, εasph = 18.4, εres = 3.8, and εarom = 2.379). If CSI comes greater than 0.95, then oil is considered as unstable else it is categorized as stable (Ali et al. 2021a; Fan et al. 2002).

Stankiewicz plot (SP)

Stankiewicz et al. (2002) proposed a plot between ratios of saturates to aromatics and asphaltenes to resins for predicting the asphaltene precipitation risk. Figure 2 shows the Stankiewicz plot which includes a curve separating unstable and stable region (Pereira et al. 2017).

Fig. 2
figure 2

Stankiewicz plot (Reprinted with permission from (Pereira et al. 2017), Copyright © (2017) American Chemical Society)

Chamkalani stability classifier (CSC)

Chamkalani (2015) developed a stability classifier plot by applying technique called “Least square support vector machine (LS-SVM)” for the prediction of asphaltene precipitation in crude oils. Chamkalani stability classifier divided into three zones, namely: severe, mild and minor problem.

Jamaluddin method (Jamal)

Pereira et al. (2017) and Zendehboudi et al. (2014) also developed a graphical plot using Jamaluddin’s experimental data for the prediction of asphaltene stability crude oils. Figure 3 shows the Jamaluddin’s plot between asphaltene and resins weight percentage. This plot is split into two regions, namely stable and unstable regions.

Fig. 3
figure 3

Jamaluddin’s plot (Adapted with permission from (Pereira et al. 2017), Copyright © (2017) American Chemical Society)

Modified Jamaluddin’s method (M Jamal)

Pereira et al. (2017) conducted modification in Jamaluddin’s plot to improve its accuracy. The improvement is implemented by decreasing the slope of the line dividing stable and unstable zone. Figure 4 shows the modified Jamaluddin plot.

Fig. 4
figure 4

Modified Jamaluddin’s plot (Adapted with permission from (Pereira et al. 2017), Copyright © (2017) American Chemical Society)

Sepúlveda stability criterion (QQA: qualitative-quantitative analysis)

Sepúlveda et al. (Guzmán et al. 2017; Sepúlveda et al. 2010) conducted a Qualitative-Quantitative Analysis for various SARA values relationships which includes: S/A, R/A, (S/Ar*A), Ar/(S/A), Ar/A and (R/A)/(S/Ar). Finally, they were able to propose different plots for each relationship divided into three regions (unstable, metastable and stable) having assigned numerical values for each region. Figure 5 shows an example of (R/A) case. The maximum value was identified for each SARA relationship and then utilized in further calculations. The dotted line separating the two zones i.e., stable and metastable zones is established by dividing the maximum value obtained by numerical value 2, whereas the dotted line separating metastable and unstable zone is constructed by the division of maximum value by numerical value 4 for that particular relationship. These three aforementioned zones are assigned a value of 1 (for unstable region), 5 (for metastable region) and 10 (for stable region). Oppositely, for [Ar/(S/A)] relationship case the interpretation of plot becomes reverse and bottom region of the chart is assigned a value of 10, for middle region a value of 5 is placed and for the upper region the value of 1 is assigned. In addition, these assigned values are added for each SARA relationship discussed and is determined for all crude oil samples studied.

Fig. 5
figure 5

Example of R/A case (QQA: qualitative-quantitative analysis) (Reprinted with permission from (Guzmán et al. 2017), Copyright © (2017) Elsevier)

Following rule is to be applied for predicting the asphaltene stability in crude oils (Guzmán et al. 2017; Sepúlveda et al. 2010):

If summation result obtained is found to be greater than 30, then that particular crude oil will be considered as stable, if summation result is found to be between 15 and 30, then that crude oil will be termed as metastable, and if summation value obtained is found under 15, then crude oil will be regarded as unstable oil.

Sepúlveda stability criterion (SCP: stability cross plot)

Sepúlveda et al. (Guzmán et al. 2017; Sepúlveda et al. 2010) also proposed four SARA based stability cross plots as shown in Fig. 6. Unstable and stable zones are also marked on each cross plot through separating curve. These plots can identify crude oil stability status in terms of asphaltene precipitation risk.

Fig. 6
figure 6

Stability Cross Plots for SCP (Reprinted with permission from (Guzmán et al. 2017), Copyright © (2017) Elsevier)

Performance metrics

Accuracy (ACC)

The most commonly used metric to evaluate the performance of predicting model is accuracy. It is defined as the ratio of total correct predictions to the total number of predictions made. Accuracy (can be expressed mathematically as Eq. (3): (Ali et al. 2021b; Bekkar et al. 2013)]

$${\text{ACC}} = \frac{{{\text{TP}} + {\text{TN}}}}{{{\text{TP}} + {\text{TN}} + {\text{FN}} + {\text{FP}}}}$$
(3)

Sensitivity (TPR) and specificity (TNR)

Predictor sensitivity is also known as true positive rate (TPR), recall or hit rate. It indicates the correctly classified stable samples out of total number of stable samples. It can be computed using Eq. (4): (Ali et al. 2021b; Bekkar et al. 2013)

$${\text{TPR}} = \frac{{{\text{TP}}}}{{{\text{TP}} + {\text{FN}}}}$$
(4)

Similarly, specificity expresses the correctly identified unstable samples out of total number of unstable samples present. It can be calculated using Eq. (5): (Ali et al. 2021b; Bekkar et al. 2013)

$${\text{TNR}} = \frac{{{\text{TN}}}}{{{\text{FP}} + {\text{TN}}}}$$
(5)

Specificity is also commonly known as true negative rate (TNR) or inverse call. TPR and TNR values close or equal to numerical value 1 indicates good performance of predicting model.

Predictive values (PPV and NPV)

Predictive value shows predicting model performance with respect to its ability to correctly predict both stable and unstable oil samples. On the other hand, Precision or positive predictive value (PPV) of a predicting model can be expressed as the ratio of the correctly predicted stable samples to the total number of stable samples used. Equation (6) is used to calculate Predictive value (PPV): (Ali et al. 2021b; Bekkar et al. 2013)

$${\text{PPV}} = \frac{{{\text{TP}}}}{{{\text{FP}} + {\text{TP}}}}$$
(6)

Negative predictive value (NPV) or inverse precision is defined as the ratio of the number of correctly predicted unstable samples by the model to the total number of unstable samples predicted. NPV can be computed using Eq. (7): (Ali et al. 2021b; Bekkar et al. 2013)

$${\text{NPV}} = \frac{{{\text{TN}}}}{{{\text{FN}} + {\text{TN}}}}$$
(7)

Values of PPV or NPV near to numerical value of 1 indicate high accuracy of predicting model.

Where TP, TN, FP, and FN in the above-mentioned predicting models formula represent the following (Ali et al. 2021b; Bekkar et al. 2013):

TP = Total number of operationally stable samples correctly predicted by applied predicting model.

TN = Total number of operationally unstable samples correctly predicted by applied predicting model.

FP = Total number of operationally unstable samples incorrectly predicted by applied predicting model.

FN = Total number of operationally stable samples incorrectly predicted by applied predicting model.

Data gathering

For evaluating the accuracy of applied predicting models, seventeen (17) crude oil samples SARA data having known asphaltene stability status information under field conditions are taken from three different published research papers (Rogel et al. 2001; Leon et al. 2002; Wattana et al. 2005). In this research study, approximately equal number of stable (S) (08) and unstable (U) (09) crude oil samples are used so that asphaltene predicting models overall accuracy cannot be affected by the imbalanced class distribution of applied SARA dataset. Figure 7 is showing the crude oils data used in this study:

Fig. 7
figure 7

SARA data of Crude oils and their asphaltene stability operational experience

Results and discussion

Table 1 and Fig. 8 show the results obtained for different asphaltene predicting models when applied to seventeen crude oils SARA data used in this study.

Table 1 Calculated values of different SARA relationships
Fig. 8
figure 8figure 8

Showing the results formed by a Jamaluddin, b Modified Jamaluddin, c Stankiewicz plot, d Chamkalani Stability Classifier, eh Stability Cross Plot, and in Qualitative-Quantitative Analysis when implemented to 17 crude oil samples

Table 2 shows the stability predicting outcomes of nine predicting models. These results are obtained using calculated values presented in Table 1 and Fig. 8. According to Table 2, CII and CSC models yield same results. Most of the oil samples are identified as unstable or uncertain, in terms of asphaltene stability, by these two predicting models. Only one sample, namely; LM1 is correctly predicted as stable sample by CII and CSC. CSI performed best among all predicting models applied. It produces 14 correct predictions out of total 17 predictions made. Like CII and CSC, Stability index (SI) and Jamaluddin’s plot (Jamal) yield exactly same predictions. Both predicting models classified six (6) stable samples correctly out of total 8 stable samples in a dataset, while only three unstable samples are accurately identified from the total of 9 unstable samples present in a dataset. Modified Jamaluddin’s plot (Mod. Jamal) identified almost all samples as unstable except two (NM2 and West Texas oil samples) which were found to be incorrect. According to the Stankiewicz plot (SP), four unstable and three stable oil samples are correctly predicted. QQA also yielded somewhat similar results to that produced by SP, however, the predicting model, QQA, predicted some oil samples as metastable. Only four unstable samples and one stable sample are correctly predicted by QQA model. Lastly, SCP predicting model classified six stable samples and three unstable samples correctly. Only one oil sample, namely; NM1 is predicted as Metastable sample by SCP model.

Table 2 Predicting results of different models

In addition, looking at Table 2 it can be clearly observed that NM2, LM2, NM7 and West Texas oil samples are wrongly predicted while three oil samples i.e., OL3, OL4, and FR1 are accurately predicted by most of the asphaltene stability predicting models applied. As discussed in earlier section, that these asphaltene stability predicting models work on the net effect of SARA values; therefore, NM2 and West Texas crude oil, having exceptionally low asphaltene contents, are expected to exhibit good stability in crude oil but in actual conditions, oils show instability towards asphaltene. This case is similar to previously reported cases available in the published literature. For example the Boscan crude oil is found to be operationally stable despite having higher percentage of asphaltene contents in crude oil of approximately 17 percent, while other lighter Arabian oil samples containing very low asphaltene contents exhibit strong instability in field conditions (Law et al. 2019). NM7 crude oil sample is anticipated to be predicted as unstable oil sample but is wrongly predicted by all predicting models applied in this research study. This happens because NM7 possesses low resins and high asphaltene contents. LM2 is another crude oil which was also predicted incorrectly by most predicting models as unstable sample because of its high asphaltene contents. Three unstable oil samples, namely; OL3, OL4 and FR1 are predicted correctly by almost all applied predicting models, whereas none of the stable samples are classified correctly by the predicting models at the same time.

To evaluate the performance metrics for all applied asphaltene stability predicting models, the uncertain and metastable cases are considered as unstable case. This assumption is made because it is better to consider all those oils as unstable which are having incomplete stability information, and therefore, accordingly arrangements for the prevention of asphaltene precipitation are implemented for these oils. Secondly, it was reported in previous research studies that most of the SARA based stability predicting models are inclined towards predicting the oil samples as unstable (Guzmán et al. 2017; Ali et al. 2021a). According to Table 3, CSI proved to be most accurate model in predicting stability of asphaltene in oil as it is quite evident from its comparatively high accuracy. The higher value of TPR, TNR, PPV and NPV of CSI model shows that it predicted both stable and unstable oil samples well. However, it can be seen from its NPV value that instability is slightly better predicted as compared to stability by CSI. CII, CSC and SCP acquire second rank in terms of accuracy. CII and CSC yielded exactly same stability predictions and thus their all statistical metrics calculated are found to be same. Lower TPR and higher TNR values show that these all models predicted unstable samples with high accuracy as compared to stable oil samples. Moreover, their higher PPV value and lower NPV value depict that majority of unstable oil samples are not wrongly predicted but most of the stable oil samples are incorrectly identified, respectively. On the other hand, SCP model predicted stability well as compared to instability as depicted from its higher TPR value than TNR value. Lower PPV value relative to NPV value of SCP model suggests that more unstable samples are falsely classified as compared to stable oil samples. Stability index (SI) and Jamaluddin’s plot (JP) hold third rank among all predicting models. Similar to SCP model, these predicting models also predict stable samples better as compared to unstable samples as revealed by their higher TPR and lower TNR values. Furthermore, these predicting models also possess higher NPV than PPV values because they predicted more unstable samples incorrectly as compared to stable oil samples. Lastly, at fourth position, Mod Jamal, SP and QQA are placed. All these predicting models identified unstable samples better as compared to stable samples as it is evident from their lower TPR value than TNR value. These predicting models also predicted majority of the stable samples as unstable samples.

Table 3 Values of Statistical metrics calculated for all applied predicting models

The observation found in this study about stability predicting models that CII, SP, CSI, CSC, Mod. JP, QQA are good instability predicting model, while JP, SI and SCP are better stability predicting models are found in strong consensus with the previous research findings reported in the literature (Guzmán et al. 2017; Ali et al. 2021a, b; Pereira et al. 2017). Finally, the sequence of accuracy of these stability predicting models can be written as: CSI > CII, SCP, CSC > JP, SI > Mod. JP, QQA, SP.

This research study proved that each predicting model favors a certain class of crude oils (stable and unstable), except CSI, and therefore affects their prediction accuracies. As far as CSI is concerned, although this predicting model is performed comparatively better but it takes an advantage of dielectric constant values present in its formula. The SARA dielectric values to be placed in the formula for judging the asphaltene stability in crude oil are decided on the basis of prior information about stability status of crude oils. This practice tends this predicting model to perform excellently. Moreover, all SARA-based asphaltene stability predicting models take SARA values as inputs for stability evaluation. These SARA values of crude oils can be determined by variety of techniques and methodologies. The SARA value determined from different techniques could show significant difference in values for the same oil, therefore, may result in inaccurate stability predictions from these models (Pereira et al. 2017; Kharrat et al. 2007; Fan et al. 2002). Furthermore, these predicting models do not take into account the factors like operational parameters such as pressure and temperature, solubility parameter of crude oil and asphaltene, polarities of resins and asphaltenes and compositional and structural characteristics of asphaltene molecule which according to previous studies play a major role in deciding the asphaltenes stability in crude oil (Gharbi et al. 2017; Guzmán et al. 2017; Ali et al. 2021b; Punase et al. 2016; Prakoso et al. 2015, 2018; Goual and Firoozabadi 2002; Hascakir 2017; Asomaning 2003). Finally, the collection point for the testing of crude oil is also important because these SARA values may differ with respect to location due to the compositional changes occurring in crude oil during production. Therefore, it is recommended, to use SARA based models for stability screening in dead crude oil systems working at ambient conditions such as in separators or in downstream operations where operational parameters variations are comparatively low, as compared to upstream crude oil productions, causing less compositional changes in fluids.

Development of new predictor (ANJIS Asphaltene stability predicting model)

By taking the advantage of CSI excellent accuracy, an attempt is made to develop a reliable generalized stability predicting model, namely; [Abdus, Nimra, Javed, Imran & Shaine (ANJIS) Asphaltene stability predicting model] by taking the relationship of SARA mentioned in Eq. (8). k1, k2, k3 and k4 are the adjustable parameters found by tuning them through MATLAB optimization tool against CSI outcomes.

$${\text{ANJIS}} = \frac{{\left( {{\text{Sat}}^{{k_{1} }} } \right) + \left( {{\text{As}}^{{k_{2} }} } \right)}}{{\left( {{\text{Ar}}^{{k_{3} }} } \right) + \left( {R^{{k_{4} }} } \right)}}$$
(8)

After tuning process, the final Eq. (9) of a new predictor Asphaltene stability predictor (ANJIS) is found to be as:

$${\text{ANJIS}} = \frac{{\left( {{\text{Sat}}^{1.7269} } \right) + \left( {{\text{As}}^{5.186} } \right)}}{{\left( {{\text{Ar}}^{1.525} } \right) + \left( {R^{4.302} } \right)}}$$
(9)

This new model can be applied without having prior knowledge of crude oil stability status. If ANJIS < 0.03, then oil is classified as stable, while if ANJIS ≥ 0.03 then oil is identified as unstable.

ANJIS is capable of giving reliable prediction rates both for stable and unstable oil samples. Table 4 shows the prediction results of ANJIS when applied on SARA dataset of 17 crude oil samples used in this study. Although, the accuracy of ANJIS is not extraordinary high, this is quite obvious because of the other factors involved in the stability process of asphaltene other than SARA values. However, the prediction made by ANJIS is quite balanced in terms of identifying both two classes i.e., stability and instability. The other benefit of using ANJIS is that it does not categorized oil samples as metastable or uncertain which other predicting models do. Among all predictors, ANJIS rank second.

Table 4 Values of Statistical metrics calculated for ANJIS model

It is discussed earlier that predictor fails at certain conditions of SARA values, therefore ANJIS accuracy of 0.76 is good (Table 4).

For further verification of the reliability of new model (ANJIS), the model is applied on one more dataset of Rogel et al. study (Rogel et al. 2003). The SARA dataset with operational stability experience information of 16 crude oils belong to Rogel et al. (2003) study is presented in Table 5.

Table 5 SARA value of crude oils and their stability status reported by Rogel et al. (2003)

To compare the performance of ANJIS model with other predicting models, we are referring here to the study of Ali et al. (2021a). Ali and coworkers (Ali et al. 2021a) performed detailed statistical analysis of different SARA based predicting models on the Rogel et al. dataset using 15 crude oil samples. Except CSI, none of the model was able to perform excellently. The second rank models were found to be CII and CSC having accuracy of 0.67 with TNR and TPR value of 1 and 0.28, respectively, thus showing that these models only predicted unstable stable with high accuracy and poor predictors of stable samples. On the other hand, if the performance of ANJIS is observed, the predicting model has higher accuracy with balance prediction rates for both stable and unstable samples. Table 6 shows the statistical results of ANJIS for Rogel et al. dataset.

Table 6 Values of Statistical metrics calculated for ANJIS model on Rogel et al. SARA Dataset (Rogel et al. 2003)

Conclusions

  1. 1.

    All predicting models are easy to apply but showed low accuracies in predicting asphaltene stability in crude oils except CSI.

  2. 2.

    CII, SP, CSI, Chamkalani, Mod. JP, QQA predicted unstable oil samples better, while JP, SI and SCP identified stable oil samples more accurately. This factor tends models to make imbalanced predictions for the two classes of oil samples.

  3. 3.

    CSI performed comparatively better because of the placement of SARA dielectric constants in its formula depending on the oil stability status of field conditions.

  4. 4.

    SARA based models only consider net effect of SARA values, while predicting asphaltene stability and ignores variety of other factors which could impact asphaltene stability in oil thus causing risk of producing erroneous prediction results.

  5. 5.

    A new predictor, ANJIS Asphaltene stability predictor is introduced that can predict both stable and unstable crude oil samples with reasonable accuracy.

  6. 6.

    It is recommended to develop a rigorous model that could incorporate the structural and compositional features of SARA in addition to their weight percentages in crude oil along with the operational parameters like temperature and pressure to increase the accuracy of stability prediction. Therefore, ANJIS model may be incorporated in future simulation studies.