Introduction

The exploration and evaluation of petroleum resources from source rocks rely heavily on petroleum geochemistry, encompassing crucial parameters such as kerogen type, hydrogen index (HI), and oxygen index (OI). These parameters serve as pivotal proxies for characterizing the nature of generated hydrocarbons and understanding organic matter maturation, and thus for determining hydrocarbon generation potential (Durand 1980; Breyer 2012; Dembicki 2016; Chen et al. 2017; Lee 2020). However, the complexity of extracting unconventional petroleum resources presents significant challenges, underscoring the need for more efficient and less error-prone evaluation methods.

Traditional techniques like Rock-Eval pyrolysis have long been established but suffer from several limitations. These include labor-intensive procedures, the risk of sample contamination, inaccuracies in depth determination, high costs, and time-consuming processes (Lafargue et al. 1998; Behar et al. 2001). As the demand for unconventional petroleum resources escalates, these methods face criticism and calls for improved alternatives.

In recent years, machine learning techniques have emerged as potential problem solvers in petroleum geochemistry and related fields (Schmoker 1979, 1981; Fertle & Rieke 1980; Meyer & Nederlof 1984; Schmoker & Hester 1983; Fertle 1988; Herron 1988; Carpentier et al. 1989; Passey et al. 1990; Tariq et al. 2020; Rui et al. 2019; Wang et al. 2019; Khalil Khan et al. 2022). However, these solutions require more extensive validation and practical application, a task that this study aims to undertake.

This study focuses on the geochemical parameters of kerogen type, HI, and OI, hypothesizing that these can be successfully estimated through a novel application of machine learning techniques. The main objective is to establish a cost-effective and efficient methodology for evaluating source rocks using hybrid machine learning techniques, filling a pressing need within the field. The analysis will primarily concentrate on the Perth Basin, drawing insights from a comprehensive range of studies (Khoshnoodkia et al. 2011; Mahmoud et al. 2017; Alizadeh et al. 2018; Johnson et al. 2018; Rui et al. 2019; Shalaby et al. 2019; Lawal et al. 2019; Wang et al. 2019; Handhal et al. 2020; Tariq et al. 2020; Kang et al. 2021; Safaei-Farouji & Kadkhodaie 2022; Deaf et al. 2022; Nyakilla et al. 2022; Zhang et al. 2022; Khalil Khan et al. 2022; Maroufi & Zahmatkesh 2023).

While the forthcoming sections will explore the application of various machine learning techniques to predict OI and HI, such as Support Vector Machines (SVM), Group Method of Data Handling (GMDH), Multi-Layer Perceptron (MLP), Decision Tree (DT), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Radial Basis Function (RBF), it is essential to acknowledge the limitations of the methodology. Some potential limitations include the availability and quality of well-log data, the representativeness of the selected dataset, and the generalization of the results to other geological settings. Additionally, the performance of machine learning models may be influenced by hyperparameter tuning and the choice of input features. It is vital to address these limitations to ensure the robustness and reliability of the study's findings.

Ultimately, the implications of this research extend beyond the academic sphere, providing practical solutions for petroleum geochemistry. By offering an innovative, efficient, and accurate methodology for assessing the production potential of Perth Basin source rocks, this study contributes significantly to the field. The findings herein stand not only to streamline the process of source rock evaluation but also to enhance our understanding of hydrocarbon production potential in unconventional petroleum resources. By acknowledging and addressing the limitations, the study aims to bolster the confidence and applicability of the proposed methodology in real-world scenarios.

Geological setting

The Perth Basin is a sizable sedimentary basin with a north-to-northwest trend that stretches for around 1,300 km along the western edge of the Australian continent. It formed during the separation of Australia from Greater India between the Early Permian and Early Cretaceous. The basin includes a substantial onshore component and extends offshore to the continent-ocean boundary, reaching water depths of approximately 4500 m (Rollet et al. 2013).

The Darling Fault defines the basin's eastern edge, while the Indian Ocean covers its offshore region to the west. The tectonic evolution of the basin was mainly controlled by the Darling Fault (Owad-Jones & Ellis 2000).

Two primary tectonic stages, both extensional, are associated with the basin's origin and evolution. The first development phase, in the late Permian, is linked to rifting. The second occurred between the Late Jurassic and Early Cretaceous and is associated with the splitting of the Australian plate from India. Most of the rocks in the basin are clastic and range in age from the Permian to recent times (Marshall et al. 1989).

The basin is divided into a complex graben system with several sub-basins by several normal faults with a north-south trend and younger northwest-southeast-trending shift faults (Crostella & Backhouse 2000). The thickness of individual sedimentary units varies markedly across the basin, reflecting relative differences in subsidence rates. Because of this differential subsidence and the related relative sea-level fluctuations, continental and marine sedimentary settings alternated throughout the basin's history (Delle Piane et al. 2013).

Sediments from the Late Permian through the Cretaceous were deposited in this basin in settings ranging from marine to terrestrial. Sandstone, siltstone, and mudstone are typically present, along with minor amounts of coal, conglomerate, and carbonate (Playford et al. 1976; Crostella 1995).

The offshore northern Perth Basin consists of three main depocenters: the Abrolhos, Houtman, and Zeewyck sub-basins. The Abrolhos Sub-basin is an elongated, north-south-oriented depocenter containing up to 6000 m of Cisuralian to Lower Cretaceous sedimentary rocks deposited during multiple rift events (Jones et al. 2011; Grosjean et al. 2017). Figure 1 shows the structural setting of the Perth Basin together with its faults and sub-basins.

Fig. 1
figure 1

Structural map of the Perth Basin showing major faults, depocenter ages, and sub-basins (modified from Bradshaw et al. 2003)

The Early Triassic Kockatea Shale is considered the most important source and cap rock in the basin (Playford et al. 1976; Karimian Torghabeh et al. 2014). The Dandaragan Trough, where sediment thicknesses of more than 1000 m accumulated, experienced active subsidence during deposition of the Kockatea Shale, which covered the northern Perth Basin (Iasky & Mory 1993). The formation comprises limestone beds, siltstone, minor sandstone, and black shale; in outcrop it appears as thin, red, purple, or brown ferruginous siltstone or fine-grained sandstone (Crostella 1995). Except for three wells in the Houtman Sub-basin, every well drilled in the offshore northern Perth Basin intersects the Lower to Middle Triassic Woodada Sequence. The thickness of the Woodada Sequence ranges from 45 to 183 m, except at Wittecarra 1, where it reaches 685 m (Jorgensen et al. 2011). The Woodada Formation consists of interbedded fine-grained sandstone and carbonaceous siltstone. On wireline logs, the unit has a transitional character between the overlying, coarser Lesueur Sandstone and the underlying, fine-grained Kockatea Shale (Mory & Iasky 1996). Figure 2 shows the stratigraphic chart for the offshore northern Perth Basin.

Fig. 2
figure 2

Generalized stratigraphic chart for the offshore northern Perth Basin, from Rollet et al. (2013). Sequences shown are based on the chronostratigraphic framework described by Jorgensen et al. (2011) and Jones et al. (2011, 2012). Geological timescale after Gradstein et al. (2012)

Materials and methods

Data

One hundred thirty-eight cutting samples from the Kockatea and Woodada formations were collected for this study to evaluate their geochemical properties. Rock-Eval 6 pyrolysis was performed following the workflows of Espitalie et al. (1977) and Lafargue et al. (1998) under standard test conditions, with a heating rate of 25 °C min-1 up to final temperatures of 800 °C in the pyrolysis oven and 850 °C in the oxidation oven. Iron filings from the drill bit and micas introduced by lost-circulation material were removed from each sample, which was then ground, pulverized, and weighed to 60–70 mg before analysis. Rock-Eval 6 pyrolysis provides a powerful method for assessing three crucial elements of a hydrocarbon source rock: the quantity, thermal maturity, and type of its organic matter. The system outputs the main acquired parameters S1: free hydrocarbons [mg HC/g rock], S2: hydrocarbons cracked from kerogen [mg HC/g rock], S3: CO2 released during pyrolysis [mg CO2/g rock], TOC: total organic carbon (wt.%), and Tmax: the temperature at which hydrocarbon generation in a sample peaks (°C), together with several parameters calculated from these results, such as PI (production index): S1/(S1 + S2), HI (hydrogen index): (S2/TOC) × 100 [mg HC/g TOC], and OI (oxygen index): (S3/TOC) × 100 [mg CO2/g TOC].
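
For concreteness, the calculated indices follow directly from the acquired parameters; a minimal Python sketch (the function and variable names are illustrative, not part of the Rock-Eval workflow):

```python
import numpy as np

def rock_eval_indices(s1, s2, s3, toc):
    """Compute PI, HI, and OI from Rock-Eval outputs.

    s1, s2 : free and kerogen-cracked hydrocarbons [mg HC/g rock]
    s3     : CO2 released during pyrolysis [mg CO2/g rock]
    toc    : total organic carbon [wt.%]
    """
    s1, s2, s3, toc = map(np.asarray, (s1, s2, s3, toc))
    pi = s1 / (s1 + s2)        # production index
    hi = (s2 / toc) * 100.0    # hydrogen index [mg HC/g TOC]
    oi = (s3 / toc) * 100.0    # oxygen index [mg CO2/g TOC]
    return pi, hi, oi

# Example with hypothetical measurements
pi, hi, oi = rock_eval_indices(0.4, 3.2, 0.6, 1.8)
```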

Numerous researchers have investigated the relationships between geochemical parameters and well-logging tool responses, including Schmoker (1979, 1981), Fertle and Rieke (1980), Meyer and Nederlof (1984), Schmoker and Hester (1983), Fertle (1988), Herron (1988), Carpentier et al. (1989), Passey et al. (1990), Bolandi et al. (2017), and Zhao et al. (2019). Therefore, the input parameters for estimating the HI and OI values of the rock samples in this study were taken from well-log data: sonic (DT), gamma-ray (GR), neutron (NPHI), and bulk density (RHOB) logs. Table 1 and Fig. 3 present the statistical description and the complete dataset used to train the ML models for the Kockatea and Woodada formations.

Table 1 Ranges of the data used for ML modeling in this study
Fig. 3
figure 3

Well logs and geochemical diagrams of the Kockatea and Woodada formations, Perth Basin, Australia

Figure 4 shows the relationship between conventional well-logs and HI and OI. The gamma-ray (GR) log quantifies the radioactivity of a formation. Because OM contains uranium, thorium, and potassium, GR readings rise where it is present. The cross-plots show that, as GR increases, HI increases while OI decreases (Fig. 4a, e). Cross-plotting GR versus HI (Fig. 4a) yielded the highest coefficient of determination (R2 = 0.0481).

Fig. 4
figure 4

The relationship between laboratory-derived HI and OI and sonic log, bulk density log, gamma-ray log, and neutron log

The sonic (DT) log records the transit time of an elastic wave through a formation, from which its velocity can be derived. Porosity, lithology, and pore fluids such as water, oil, gas, and kerogen all affect the sonic log. Consequently, an increase in OM concentration can cause its values to increase (Kamali & Mirshady 2004).

The density (RHOB) log measures the bulk density of a formation, which is controlled by its fluids and matrix components. Because of the low density of OM (~ 1 g/cm3), source rocks typically have low bulk densities. Consequently, higher OI and lower HI are associated with greater density. Plotting OI versus RHOB yielded an R2 of 0.0184 (Fig. 4g).

The neutron log monitors the concentration of hydrogen atoms in a formation. In organic-rich intervals with high HI, neutron porosity values rise because a formation's hydrogen content and porosity correlate strongly with its OM content. For the hydrogen index, the relationship between the NPHI log and HI is the weakest (R2 = 0.0246, the lowest value) (Fig. 4d). The opposite holds for the oxygen index (OI), where the OI versus NPHI plot (Fig. 4h) shows the most direct relationship (R2 = 0.0645).

Machine learning methods

Machine learning (ML) techniques have been widely used in various scientific and engineering disciplines, including petroleum engineering, since the early 1990s. They are simple, flexible tools that allow accurate prediction and require little modeling work (Lary et al. 2016; Mohaghegh 2017; Al-Fatlawi 2018).

Despite the abundance of studies based on soft-computing techniques, kerogen type has not previously been predicted from conventional well-log data. This paper is the first attempt to do so, employing ten distinct ML techniques: group method of data handling (GMDH), decision tree (DT), random forest (RF), support vector machine (SVM), multilayer perceptron (MLP), radial basis function neural network (RBF), adaptive neuro-fuzzy inference system (ANFIS), extreme gradient boosting (XGBoost), light gradient boosting machine (LGBM), and gradient boosting (GB). Schematics of these methods are presented in Figs. 5 and 6.

Fig. 5
figure 5

Schematic of applied prediction methods

Fig. 6
figure 6

Schematic of applied classification methods

Group method of data handling (GMDH)

A feed-forward neural network called the group method of data handling (GMDH) can address complicated nonlinear problems (Ebtehaj et al. 2015). The algorithm consists of a group of self-organizing neurons: a quadratic polynomial equation connects distinct pairs of neurons in each layer to create a neuron in the subsequent layer (Nariman-Zadeh et al. 2005; Kalantary et al. 2009). The GMDH algorithm automatically determines the optimal model structure, the impact of the input variables, and the number of layers and neurons. Its primary goal is to minimize the squared discrepancy between the actual output and the predicted values (Ebtehaj et al. 2015). Detailed information on GMDH and its flowchart are presented in Table 6 and Fig. 7, respectively.

Fig. 7
figure 7

Schematic of GMDH workflow
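
As an illustration of the GMDH building block described above, a single quadratic polynomial neuron can be sketched as follows; this is a simplified example of one neuron fit, not the study's implementation:

```python
import numpy as np

def gmdh_neuron(x1, x2, y):
    """Fit the GMDH quadratic polynomial for one pair of inputs:
    y ~ a0 + a1*x1 + a2*x2 + a3*x1*x2 + a4*x1**2 + a5*x2**2
    by ordinary least squares."""
    x1, x2, y = map(np.asarray, (x1, x2, y))
    A = np.column_stack([np.ones_like(x1), x1, x2, x1 * x2, x1**2, x2**2])
    coeffs, *_ = np.linalg.lstsq(A, y, rcond=None)
    sse = np.sum((y - A @ coeffs) ** 2)  # squared-error selection criterion
    return coeffs, sse
```

In a full GMDH run, such neurons would be fitted for every pair of candidate inputs, the best-scoring ones retained on a validation split, and their outputs passed to the next layer.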

Support vector machine (SVM)

A reliable ML technique, the support vector machine (SVM) has proven helpful in many scientific fields, including industrial and medical engineering (Wong & Hsu 2006; Adewumi et al. 2016). Compared with many neural networks, the SVM is less prone to under-fitting and over-fitting (Cortes & Vapnik 1995; Vapnik 1999). By constructing hyperplanes among the data, SVM effectively performs both linear and nonlinear regression and classification. A kernel function maps the inputs into a higher-dimensional feature space; this mapping, which allows the classes to be separated distinctly, is necessary to place the hyperplane as far as possible from the data borders (Scholkopf et al. 2002). Figure 8 displays a simple schematic of the SVM. Tables 7 and 8 list the optimized SVM parameters for prediction and the optimal parameter values for the SVM classifier after hyperparameter optimization.

Fig. 8
figure 8

Schematic of SVM workflow
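
A hedged sketch of how such an SVM regressor might be set up with scikit-learn; the synthetic data, kernel choice, and hyperparameters below are placeholders rather than the tuned values reported in Tables 7 and 8:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.normal(size=(138, 4))   # placeholder for DT, GR, NPHI, RHOB features
y = rng.normal(size=138)        # placeholder for HI (or OI) targets

# Scaling matters for kernel SVMs; C and epsilon are illustrative only.
svr = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=10.0, epsilon=0.1))
svr.fit(X, y)
y_pred = svr.predict(X)
```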

Multilayer perceptron (MLP)

The MLP is among the most widely used ML models for simulating various geochemical characteristics of source rocks. An MLP comprises several layers: input, hidden, and output. The input layer is the initial layer, and its number of neurons equals the number of input variables. The output layer, the final layer in the MLP's structure, typically has only one neuron in regression problems (Mohaghegh 2000; Mohagheghian et al. 2015). The hidden layers lie between the input and output layers. To develop a solid link between the model's inputs and outputs, the number of hidden layers and their associated neurons must be adjusted within this framework. Each neuron's value in the hidden and output layers is obtained by multiplying the value of each neuron in the preceding layer by a specific weight, summing these products, adding a bias term, and passing the result through a nonlinear activation function (Mohaghegh 2000; Mohagheghian et al. 2015). Figure 9 displays the MLP flowchart. Additionally, the properties of the MLP used here, including its two hidden layers and the optimized parameters of the MLP classifier, are shown in Tables 9 and 10.

Fig. 9
figure 9

Schematic of MLP workflow
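
As a sketch, an MLP with two hidden layers of the kind described above could be configured as follows; the layer sizes, activation, and other settings are illustrative assumptions, not the values of Tables 9 and 10:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X, y = rng.normal(size=(138, 4)), rng.normal(size=138)  # placeholder log data

# Two hidden layers, echoing the architecture described in the text.
mlp = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(16, 8), activation="tanh",
                 max_iter=2000, random_state=0),
)
mlp.fit(X, y)
```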

Radial basis function neural network (RBF)

The RBF is a feed-forward neural network used for classification and regression problems (Broomhead & Lowe 1988). It consists of three layers: input, hidden, and output (Hemmati-Sarapardeh et al. 2018), with a single hidden layer connecting the input and output layers (Zhao et al. 2015). The hidden layer maps the input into a space of higher dimension than the input space. Each hidden node is associated with a predetermined center and radius, and its activation is calculated from the distance between the input vector and that center. A radial basis (kernel) transfer function then transforms the computed distances from the hidden neurons to the output neurons, and the hidden and output layers are connected linearly through specific weights. Table 11 and Fig. 10 show the properties and flowchart of the RBF network used.

Fig. 10
figure 10

Flowchart of the RBF procedure
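
A minimal numpy sketch of this structure, assuming Gaussian kernels, fixed precomputed centers (e.g., from k-means or a subset of training points), and a linear least-squares fit of the output weights; the study's own center-selection procedure is not specified here:

```python
import numpy as np

def rbf_fit(X, y, centers, sigma):
    """Fit the linear output weights of a Gaussian RBF network."""
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    Phi = np.exp(-(d ** 2) / (2 * sigma ** 2))    # hidden-layer activations
    w, *_ = np.linalg.lstsq(Phi, y, rcond=None)   # linear output layer
    return w

def rbf_predict(X, centers, sigma, w):
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return np.exp(-(d ** 2) / (2 * sigma ** 2)) @ w
```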

Adaptive neuro-fuzzy inference system (ANFIS)

Another rigorous data-driven approach approved for numerous modeling problems is the adaptive neuro-fuzzy inference system. The ANFIS approach was introduced by Jang et al. (1997) as a five-layer system that combines a fuzzy system with a neural network. ANFIS models are commonly trained by combining the back-propagation method with a hybrid learning algorithm (Afshar et al. 2014). Detailed information on ANFIS and its flowchart are presented in Table 12 and Fig. 11, respectively.

Fig. 11
figure 11

Schematic of ANFIS workflow
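
A minimal sketch of the five-layer forward pass for a two-input, first-order Sugeno system of the kind Jang et al. (1997) describe; the Gaussian membership functions and all parameter names are assumptions for illustration, not the study's configuration:

```python
import numpy as np

def anfis_forward(x, centers, sigmas, p, q, r):
    """Forward pass of a two-input, first-order Sugeno ANFIS.

    x               : input vector, shape (2,)
    centers, sigmas : Gaussian membership parameters, shape (n_rules, 2)
    p, q, r         : linear consequent parameters, shape (n_rules,)
    """
    mu = np.exp(-((x[None, :] - centers) ** 2) / (2 * sigmas ** 2))  # layer 1
    w = mu.prod(axis=1)          # layer 2: rule firing strengths
    w_bar = w / w.sum()          # layer 3: normalization
    f = p * x[0] + q * x[1] + r  # layer 4: rule consequents
    return np.sum(w_bar * f)     # layer 5: weighted aggregation
```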

Decision tree (DT)

The decision tree is one of the top-tier supervised machine learning algorithms, applicable to both classification and regression problems. Historically, the first DT was Automatic Interaction Detection (AID), introduced by Morgan and Sonquist in 1963 (Morgan & Sonquist 1963). The algorithm has a hierarchical structure built from fundamental components: the root, internal nodes, branches, and leaves. It is important to note that the resulting DT structure depends strongly on the data used: even a tiny change in the data can significantly change the optimal DT structure. The decision tree's flowchart is shown in Fig. 12.

Fig. 12
figure 12

Schematic of DT workflow
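
A brief scikit-learn sketch of a regression tree on placeholder data; the depth and leaf-size limits below are illustrative guards against the structural instability noted above:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X, y = rng.normal(size=(138, 4)), rng.normal(size=138)  # placeholder data

# Limiting depth and leaf size restrains variance from small data changes.
tree = DecisionTreeRegressor(max_depth=5, min_samples_leaf=5, random_state=0)
tree.fit(X, y)
```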

Random forest (RF)

The RF is a nonparametric prediction structure that can be used for classification and regression (Breiman 2001). Owing to its adaptability to multiple types of inputs (categorical or continuous) and its ability to describe complicated relationships in the data, it has grown prominent in many geospatial applications. Random forest addresses the overfitting problem of classification and regression trees (CART). It uses sampling with replacement to generate multiple CARTs and bootstrap aggregating, also known as bagging, to create subsets of the training data (Breiman 1996). For a given data point x, each CART in the forest makes a prediction or vote, and the forest returns the majority vote for classification or the average prediction for regression. This voting mechanism makes it feasible to identify complicated relationships in the data that might not otherwise be detected. RF performs strongly despite the noise and overfitting issues that would affect a single decision tree. Additionally, RF is naturally suited to multi-class problems and can efficiently handle very large datasets (Robnik-Šikonja 2004). The optimal parameter values obtained by the RF classifier's hyperparameter optimization method are displayed in Table 13.
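
A hedged scikit-learn sketch of the bagged-CART scheme described above; the hyperparameters are placeholders, not the tuned values of Table 13:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(138, 4))         # placeholder well-log features
y = rng.integers(0, 4, size=138)      # placeholder kerogen classes 0-3

# Bagged CARTs with majority voting; feature subsampling per split.
rf = RandomForestClassifier(n_estimators=200, max_features="sqrt",
                            random_state=0)
rf.fit(X, y)
```

Gradient boosting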

Gradient boosting is a technique for creating prediction models, frequently used in regression and classification procedures (Hastie et al. 2009; Piryonesi & El-Diraby 2020). It produces a prediction model from an ensemble of weak models, typically decision trees, with successive trees produced throughout the learning process. The first model estimates the target and determines the loss, i.e., the discrepancy between the predicted and actual values; a second model is then created to predict that loss, and the procedure continues until a satisfactory outcome is obtained (Guelman 2012). The optimal parameter values for the gradient boosting classifier, as determined by the hyperparameter optimization procedure, are displayed in Table 14.
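
A minimal scikit-learn sketch of this sequential scheme on placeholder data; the settings are illustrative rather than the tuned values of Table 14:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(0)
X, y = rng.normal(size=(138, 4)), rng.integers(0, 4, size=138)  # placeholders

# Each new tree is fitted to the residual loss of the current ensemble.
gb = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1,
                                max_depth=3, random_state=0)
gb.fit(X, y)
```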

Extreme gradient boosting (XGBoost)

Recent years have seen the XGBoost algorithm rise to prominence as a data classification and prediction technique. It has demonstrated strong performance in many ML tasks and innovative scientific endeavors (Chen & Guestrin 2016). With shrinkage and regularization approaches, XGBoost, based on the gradient tree-boosting algorithm (Hastie et al. 2009), can handle sparsity and prevent overfitting (Chen & Guestrin 2016).

A boosting algorithm's main goal is to combine weak learners' outputs sequentially to improve performance (Hastie et al. 2009). XGBoost uses gradient boosting to merge several classification and regression trees (CART). Its three essential components are a regularized objective function for enhanced generalization, shrinkage and gradient tree boosting for additive training, and column subsampling to prevent overfitting (Chen & Guestrin 2016). Some of the crucial XGBoost classifier parameters are listed in Table 15.
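
A sketch using the xgboost package; the shrinkage (learning_rate), regularization (reg_lambda), and subsampling settings below illustrate the components mentioned above and are not the study's tuned values:

```python
import numpy as np
from xgboost import XGBClassifier

rng = np.random.default_rng(0)
X, y = rng.normal(size=(138, 4)), rng.integers(0, 4, size=138)  # placeholders

xgb = XGBClassifier(n_estimators=200, learning_rate=0.1, max_depth=4,
                    reg_lambda=1.0, subsample=0.8, colsample_bytree=0.8)
xgb.fit(X, y)
```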

Light gradient boosting machine (LGBM)

The light gradient-boosting machine (LGBM) is a gradient-boosting decision tree technique used in data mining tasks such as classification, regression, and ranking. It combines the predictions of multiple decision trees to produce a final prediction that generalizes well. Integrating several "weak" learners into one "strong" learner is the fundamental tenet of LGBM. Two factors motivate building ML algorithms on this concept: first, "weak" learners are simple to train; second, an ensemble of many learners typically generalizes better than a single learner (Ke et al. 2017). The optimal parameter values for the LGBM classifier, as determined by the hyperparameter optimization method, are displayed in Table 16.
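
A brief sketch with the lightgbm package on placeholder data; the hyperparameters are illustrative, not the tuned values of Table 16:

```python
import numpy as np
from lightgbm import LGBMClassifier

rng = np.random.default_rng(0)
X, y = rng.normal(size=(138, 4)), rng.integers(0, 4, size=138)  # placeholders

lgbm = LGBMClassifier(n_estimators=200, learning_rate=0.05, num_leaves=31,
                      random_state=0)
lgbm.fit(X, y)
```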

Results

OI and HI estimation

This research implemented a comprehensive methodology to predict the geochemical properties of source rocks, explicitly focusing on the Oxygen Index (OI) and Hydrogen Index (HI). The employed models for this process were the Group Method of Data Handling (GMDH), Support Vector Machine (SVM), Decision Trees (DT), Radial Basis Function (RBF), Multilayer Perceptron (MLP), and Adaptive Neuro-Fuzzy Inference System (ANFIS).

The available data were strategically divided into two subsets: 70% was used to train the models, while the remaining 30% served as the test set for model evaluation. This division suits the data volume and has proven effective in prior research.
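
In scikit-learn terms, the split can be sketched as follows; the placeholder arrays and the random seed are arbitrary illustrative choices:

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(138, 4))   # placeholder for DT, GR, NPHI, RHOB features
y = rng.normal(size=138)        # placeholder for HI (or OI) targets

# 70/30 split as described above.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30,
                                                    random_state=42)
```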

A rigorous assessment of the six models was conducted using a range of statistical parameters to gauge their efficiency and accuracy. Metrics such as the coefficient of determination (R2), average percent relative error (APRE), root mean square error (RMSE), average absolute percent relative error (AAPRE), and standard deviation (SD) were employed. The complete results of these measures are detailed in Tables 2 and 3.

Table 2 Statistical error analysis for the developed models for OI estimation
Table 3 Statistical error analysis for the developed models for HI estimation

Figures 13 and 14 visually encapsulate the comparative assessment of the models using these statistical parameters, presenting APRE (subfigure a), AAPRE (subfigure b), RMSE (subfigure c), SD (subfigure d), and R2 (subfigure e) for each model. These illustrations emphasize the prediction of HI and OI values, offering a clear depiction of the performance of each model.

Fig. 13
figure 13

Comparison of developed models on the basis of statistical parameters of APRE a, AAPRE b, RMSE c, SD d, and R2 e for OI prediction

Fig. 14
figure 14

Comparison of developed models on the basis of statistical parameters of APRE a, AAPRE b, RMSE c, SD d, and R2 e for HI prediction

Moreover, the analysis was expanded by investigating the percentage error of each model's predictions across three unique datasets, each consisting of 46 samples. This percentage error offers a robust measure of the models' performance and supports the data in Figs. 13 and 14.

Figures 15 and 16 provide further insights through cross-plots, visually comparing the predicted and actual values of HI and OI. These figures encompass all six models, covering the training, testing, and total datasets. In tandem, Fig. 17a and b graphically illustrate the percentage error in estimating HI and OI by each model across the three datasets.

Fig. 15
figure 15

Cross plot of prediction of OI by the models versus experimental

Fig. 16
figure 16

Cross plot of prediction of HI by the models versus experimental

Fig. 17
figure 17

Illustration of the percentage error across three unique datasets, each comprising 46 samples, for six predictive models. Subfigure a depicts the percentage error for the estimation of Hydrogen Index (HI), while subfigure b shows the same for the Oxygen Index (OI)

These analyses collectively demonstrated the superior performance of the SVM model. It consistently outperformed the others, exhibiting the closest fit to the experimental values, the lowest dispersion, and the smallest deviation. The superiority of SVM is further reflected in its R2 values of 0.993 and 0.989 for the testing set, 0.983 and 0.986 for the training set, and 0.987 for the total dataset, marking it as the best model for estimating OI and HI (Tables 2 and 3). Additionally, SVM achieved the lowest AAPRE, RMSE, and SD (Tables 2 and 3). Equations (1)–(5), used to compute these statistical metrics, are given below.

Finally, Fig. 18 provides a schematic depiction of the actual and predicted kerogen types across the total (a), training (b), and testing (c) datasets using the pseudo van Krevelen diagram. This figure further enriches the analysis by comparing the actual and estimated values, adding another layer of comprehension to the models' predictive capabilities and their application in real-world geochemical analysis.

$$R^{2} = 1 - \frac{\sum\limits_{i = 1}^{N} \left( x_{i,\exp } - x_{i,pred} \right)^{2} }{\sum\limits_{i = 1}^{N} \left( x_{i,\exp } - \overline{x_{\exp }} \right)^{2} }$$
(1)
$$APRE = \frac{100}{N}\sum\limits_{i = 1}^{N} {\left( {\frac{{x_{i,\exp } - x_{i,pred} }}{{x_{i,\exp } }}} \right)}$$
(2)
$$AAPRE = \frac{100}{N}\sum\limits_{i = 1}^{N} {\left| {\frac{{x_{i,\exp } - x_{i,pred} }}{{x_{i,\exp } }}} \right|}$$
(3)
$$RMSE = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {\left( {x_{i,\exp } - x_{i,pred} } \right)^{2} } }$$
(4)
$$SD = \sqrt {\frac{1}{N - 1}\sum\limits_{i = 1}^{N} {\left( {\frac{{x_{i,\exp } - x_{i,pred} }}{{x_{i,\exp } }}} \right)^{2} } }$$
(5)
Fig. 18
figure 18

Cross plot depicting the real and estimated kerogen type for total a, training b, and testing c data using the pseudo van Krevelen diagram

In these formulas, xi,exp and xi,pred represent the experimental and model-predicted values of OI and HI, respectively, and N is the number of data points.
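
For reference, Eqs. (1)–(5) translate directly into code; a minimal numpy sketch:

```python
import numpy as np

def regression_metrics(x_exp, x_pred):
    """Compute R2, APRE, AAPRE, RMSE, and SD per Eqs. (1)-(5)."""
    x_exp = np.asarray(x_exp, dtype=float)
    x_pred = np.asarray(x_pred, dtype=float)
    n = x_exp.size
    rel = (x_exp - x_pred) / x_exp                 # relative errors
    r2 = 1 - (np.sum((x_exp - x_pred) ** 2)
              / np.sum((x_exp - x_exp.mean()) ** 2))
    apre = 100.0 / n * np.sum(rel)
    aapre = 100.0 / n * np.sum(np.abs(rel))
    rmse = np.sqrt(np.mean((x_exp - x_pred) ** 2))
    sd = np.sqrt(np.sum(rel ** 2) / (n - 1))       # relative-error SD, Eq. (5)
    return r2, apre, aapre, rmse, sd
```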

Kerogen type estimation

During the subsequent stage of the research, the prediction of kerogen types was addressed using a spectrum of machine learning classifiers. The classification was informed by the pseudo van Krevelen diagram, which segregates the kerogen into four distinct categories: type II, type III, mixed II & III, and type IV. This categorization is graphically exhibited in Fig. 19 and numerically reported in Table 4.

Fig. 19
figure 19

Kerogen types based on pseudo van Krevelen diagram

Table 4 Number of samples of each type of kerogen

A variety of machine learning algorithms, including the Light Gradient Boosting Classifier (LGBM), Extreme Gradient Boosting Classifier (XGBoost), Random Forest Classifier (RF), Multilayer Perceptron Classifier (MLP), Support Vector Classifier (SVM), and Gradient Boosting Classifier, were employed for the classification of these kerogen types. The parameters of these classifiers were optimized in a comprehensive process, the details of which are presented in Tables 8, 10, and 13 through 16.

The classifiers' performance was evaluated upon completing the classification stage, leveraging a testing dataset. The selected evaluation metrics, precision, recall, accuracy, Area Under Curve (AUC), and F1 score, facilitated a detailed appraisal of the efficacy of each classifier.

A comparative assessment of performance metrics for all models is demonstrated in Table 5 and Fig. 20, providing an objective evaluation of the efficiency of each machine learning classifier. The average precision scores for all classification models are graphically presented in Fig. 21.

Table 5 Comparative analysis using various performance metrics
Fig. 20
figure 20

Performance analysis of various classifiers

Fig. 21
figure 21

Average precision scores for all classification models

To address the occasional misleading outcomes of the ROC curve, especially in instances of class imbalance, the precision-recall curve was employed alongside the ROC curve. This practice offered a more comprehensive perspective on the performance of each classifier, consistent with the findings of Davis (2006). Figure 22a, b, c, d, e, f illustrate the ROC curves for each kerogen type across all models, emphasizing the superior performance of the Gradient Boosting classifier.
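
A hedged sketch of producing both curve families per kerogen class in a one-vs-rest fashion; y_test and y_score below are placeholders standing in for the true labels and the per-class probabilities (e.g., from predict_proba):

```python
import numpy as np
from sklearn.metrics import precision_recall_curve, roc_curve
from sklearn.preprocessing import label_binarize

rng = np.random.default_rng(0)
y_test = rng.integers(0, 4, size=46)           # placeholder true classes
y_score = rng.dirichlet(np.ones(4), size=46)   # placeholder class probabilities

# One-vs-rest curves per kerogen class (0-3); pairing ROC with
# precision-recall guards against class-imbalance artifacts.
y_bin = label_binarize(y_test, classes=[0, 1, 2, 3])
for k in range(4):
    fpr, tpr, _ = roc_curve(y_bin[:, k], y_score[:, k])
    prec, rec, _ = precision_recall_curve(y_bin[:, k], y_score[:, k])
```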

Fig. 22
figure 22

ROC–AUC analysis for a Random Forest, b SVM, c MLP, d Gradient Boosting, e LGBM, and f XGBoost (class 0 = kerogen type II, class 1 = kerogen type II & III mixed, class 2 = kerogen type III, and class 3 = kerogen type IV)

The performance of the classifiers was then consolidated and evaluated using a confusion matrix, from which the core metrics (precision, accuracy, recall, and F1-score) were calculated. Furthermore, each classifier's capability to distinguish between different classes was gauged through the Area Under Curve (AUC) metric, as per Fawcett (2006). These performance metrics were calculated using Eqs. (6) through (9).

Notably, the Gradient Boosting classifier excelled, achieving the highest scores across all metrics. It registered an impressive accuracy of 93.54% and scores of 0.94, 0.93, 0.93, and 0.95 for precision, recall, F1-score, and AUC, respectively. Moreover, it demonstrated the least misclassifications, as illustrated by the confusion matrix in Fig. 23. As a result of this comprehensive evaluation, the Gradient Boosting classifier was confirmed as a highly effective method for identifying kerogen types.

$$\mathrm{Precision} = \frac{TP}{TP + FP}$$
(6)
$$\mathrm{Recall} = \frac{TP}{TP + FN}$$
(7)
$$F1\text{-}Score = 2 \times \frac{\mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}$$
(8)
$$\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}$$
(9)

where TP = True Positive, TN = True Negative, FP = False Positive, and FN = False Negative.
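
These four metrics follow directly from the confusion-matrix counts; a minimal sketch:

```python
def binary_metrics(tp, tn, fp, fn):
    """Compute precision, recall, F1-score, and accuracy per Eqs. (6)-(9)."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    return precision, recall, f1, accuracy
```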

Fig. 23
figure 23

Confusion matrix of Gradient Boosting classifier

Discussion

OI and HI estimation

The study's results underscore the effectiveness of the SVM model for predicting the Oxygen Index (OI) and Hydrogen Index (HI) in the source rock. Based on the calculated statistical parameters, SVM consistently outperformed the other five models—GMDH, DT, RBF, MLP, and ANFIS—regarding accuracy and reliability.

While comparing the performance of the models, a suite of statistical metrics was employed, namely R2, APRE, RMSE, SD, and AAPRE. These parameters provided a holistic evaluation of the models' performance, encompassing measures of fit efficiency, relative deviation, dispersion, and precision. The SVM model registered the highest R2 value, which signifies the strongest fit between the model predictions and experimental data. Lower values of AAPRE, RMSE, and SD in SVM confirm that it has a lesser degree of dispersion and deviation, thus underscoring its superior precision.

The models' performance was further evaluated through a visual examination using cross plots. These graphical analyses reinforced the statistical findings, revealing the SVM model's high accuracy in estimating OI and HI. The SVM model demonstrated a higher concentration of data points near the Y = X line, implying a closer match between experimental and estimated values.

These findings not only point towards the strength of SVM in estimating OI and HI but also draw attention to the versatility and applicability of different models in geosciences. While SVM showed a strong performance, the other models also delivered reasonably accurate results, indicating their potential utility in other tasks or data contexts.

Kerogen type estimation

In the kerogen type estimation, six machine learning algorithms were evaluated, and among these, the Gradient Boosting model emerged as the top performer. While the SVM model excelled in OI and HI prediction, it was less effective in kerogen-type classification, emphasizing that model selection should be guided by the specific task.

Performance metrics such as accuracy, precision, recall, AUC, and F1 score were used to assess the classifiers' performance. The Gradient Boosting model achieved the highest values across all these metrics, thereby establishing its superiority in classifying kerogen types. In addition to the numerical scores, visual assessments through precision-recall and ROC curves further supported the dominance of Gradient Boosting in this task.

While Gradient Boosting was the standout performer, the results also highlight the strong performance of other algorithms, including Random Forest, LightGBM, and XGBoost. This points to the potential of these models as alternatives for kerogen-type classification, given the appropriate tuning and optimization of hyperparameters.

These findings illustrate that model selection and performance largely depend on the specific task and data characteristics. This underscores the need for careful model selection and fine-tuning to cater to the data's specificities and the analysis's objectives. Future studies could extend these findings by exploring the application of these models to more extensive or different datasets or investigating other promising machine learning models and techniques.

Conclusions

This research provides compelling evidence of the transformative impact of machine learning techniques within geosciences, specifically in predicting organic richness indicators and kerogen types. Through creating and validating bespoke machine learning models across various algorithms, this study has demonstrated their formidable capability in accurately predicting kerogen type, hydrogen index, and oxygen index from well-log data—a significant accomplishment given the often constrained availability of geochemical data. Essential conclusions from this study are summarized as follows:

  1. The Support Vector Machine model has distinguished itself as an exceptional performer in accurately predicting source rocks' Oxygen Index and Hydrogen Index. Its superior performance amplifies the significance of machine learning applications in enhancing prediction accuracy within geosciences.

  2. The models' resilience and reliability are manifest in their robustness against overfitting, a prevalent concern in machine learning implementations. This underscores the meticulous design and optimization processes invested in developing these models, reinforcing their value for real-world applications.

  3. Among the classification models, the Gradient Boosting and Random Forest models have proven to be particularly efficient in kerogen-type classification, further affirming the transformative role machine learning can play in refining classification tasks in geosciences.

  4. The proposed machine learning approach illuminates a path toward greater economic efficiency within geosciences. Optimally harnessing readily available well-log data can considerably reduce the demand for costly geochemical laboratory analyses, thus fostering more cost-effective operational practices.

  5. The versatility of this research is evident in the continued effectiveness of the proposed machine learning models, even in scenarios where data is sparse or completely absent. This underscores the models' resilience, positioning them as robust solutions to prevalent data-related challenges in the field.

  6. The innovative application of the pseudo-van Krevelen diagram approach for kerogen-type classification adds to the uniqueness of this study. This novel approach promotes a more precise categorization of kerogen types, thereby enhancing the effectiveness of these analyses.

In summary, this research contributes a new perspective to the existing literature by introducing a machine learning-centric approach for predicting organic richness indicators and kerogen type using well-log data. As a significant step towards improved efficiency and cost-effectiveness in geosciences, this study fuels promising opportunities for future exploration and research. The conclusions drawn here underscore the profound potential of machine learning within geosciences, inspiring further innovative research and application and marking a significant contribution to the continuous evolution of this crucial scientific discipline.