Introduction

The distribution of heterocyclic and polycyclic aromatic hydrocarbons in crude oils have been proposed as indicators of the source organic matter (OM), depositional environment, and thermal maturity of the parent source rocks (Radke et al. 1982, 1986, 2000; Alexander et al. 1992; Jinggui et al. 2005; Armstroff et al. 2006). Heterocyclic aromatic hydrocarbons, i.e., dibenzothiophene (DBT), dibenzofuran (DBF) phenanthrene (P) and fluorene (F) have similar structure skeletons and have been suggested to originate from a similar precursor (Asif et al. 2010), and the abundance of these compounds has been related to the nature of the OM and its depositional environment and also used for source and oil correlations (Fan et al. 1990; Radke and Willsch 1994; Radke et al. 2000; Jinggui et al. 2005; Xiangchun et al. 2011).

The West Siberian Basin in Russia (Fig. 1) is one of the most prolific oil and gas sedimentary basins in the world, with huge potential for future exploration. In addition, the West Siberian Basin has gained the attention of the academic community and upstream operators owing to its excellent source rock characteristics and commercial hydrocarbon accumulations (e.g., Pyatkov et al. 1988; Kontorovich et al. 1991; Belkin and Bachurin 1991; Peters et al. 1993; Afanasyev et al. 1993; Petrov 1994; Lopatin et al. 1997; Kontorovich et al. 1997). Most of the previous studies have focused on the organic geochemical characteristics of the shale facies within the Mesozoic sedimentary rocks and concluded favorable source rock intervals in the Jurassic succession with oil generation potential (Lopatin et al. 1997, 1998; Kontorovich et al. 1997). Nevertheless, due to the slow drilling campaign in the super Basin, untapped petroleum potential remains in underexplored huge structures. Hence, additional data and research are required to unlock potential in mature areas. In this work, the distribution of heterocyclic and polycyclic aromatic hydrocarbons in West Siberian oils was studied to better understand the source of the organic matter, lithology and depositional environment. The aromatic maturity indicators and their equivalent vitrinite reflectance (%Rc) were also applied to evaluate the maturity of the parent source rocks. Additionally, organic matter origin and depositional environment-related aromatic biomarkers were also used to infer the oil families and their potential source rocks.

Fig. 1
figure 1

Structural map of West Siberian Basin, including oil and gas fields in the Basin (modified from Rovenskaya and Nemchenko (1992))

First, the studied oil samples have been fractionated into oil composition (SARA). Then, the gas chromatography–mass spectrometry (GC–MS) analysis has been performed on the aromatic hydrocarbon fraction. Afterward, the heterocyclic and polycyclic aromatic compounds in the aromatic hydrocarbon fraction and their ratios were quantified using several ions like 166, 168, 175, 178, 182, 184, 192, and 198 and were used to evaluate the thermal maturity, source input and depositional conditions.

Geological background

The West Siberian Basin is an early Triassic rift Basin filled by Mesozoic sedimentary succession, ranging from the Triassic to the Cretaceous (Surkov et al. 1997). The lithostratigraphic column of the studied Basin is presented in Fig. 2. The Triassic to Cretaceous sediments were deposited unconformably above the Triassic volcanic rocks, which are mainly composed of clastic sediments that are punctuated by unconformities of different magnitudes and ages (Fig. 2).

Fig. 2
figure 2

Generalized stratigraphic column of the Mesozoic succession in the West Siberian Basin

Clastic sediments of the Tampey Formation represent the Triassic sedimentation while the marine sediments of the Tyumen, Abalak and Bazhenov formations form the Jurassic package. The Lower–middle Jurassic Tyumen Formation is overlain conformably by the Tampey Formation in the Northern Basin and with unconformable contact in the central and south of the Basin (Fig. 2). This formation comprises dominantly shale, along with subordinate sandstone. The stratigraphic equivalents of the Tampey Formation also consist largely of continental coal-bearing clastic rocks in the southern Basin areas (Fig. 2). During Callovian to Kimmeridgian, the sediments of the Abalak Formation were deposited unconformably above the Tampey Formation in the central and south areas of the Basin. The Abalak Formation comprises mainly shale in the Northern Basin and its stratigraphic equivalents consist largely of sandstones of the Vnsyugan Formation in the central and southern areas (Fig. 2).

The Abalak Formation is conformably overlain by the deep marine shale sequence of the latest Jurassic–earliest Cretaceous (Volgian–early Berriasian) Bazhenov Formation. The organic-rich Bazhenov Formation was deposited in a deep-water anoxic depression formed in the entire central part of the West Siberian Basin (Vyshemirsky 1986; Rudkevich et al. 1988). The shale units of the Jurassic Tampey Bazhenov formations are potential source rocks with high oil generation potential (Lopatin et al. 1997, 1998; Kontorovich et al. 1997). It also is conformably overlain by the mixed sandstone and shale sequence of the Early to Late Cretaceous formations (Fig. 2).

Materials and methods

In the current study, four oil samples were collected from the reservoir rocks exposed along oilfields in the West Siberian Basin as shown in Fig. 1. These oil samples were subjected to bulk fractionation process of oil composition (SARA) and gas chromatography–mass spectrometry (GC–MS) analysis of the aromatic hydrocarbon fraction at the Departments of Petroleum Engineering and Physical Chemistry, Kazan Federal University, Russia.

For SARA analysis, the asphaltene fraction was precipitated from the studied oils by dissolution in hexane (1:40). Others fractions, i.e., saturate, aromatic and resin were thereafter fractionated using liquid column chromatography and solvents of different polarity (Fig. 3).

Fig. 3
figure 3

The illustration scheme of the SARA separation process

GC–MS was utilized to separate the compounds in the aromatic hydrocarbon fraction of the analyzed oil samples. The test was performed on the Agilent 7890B with a flame ionization detector. Within the GC–MS furnace, the capillary column had a length of 30 m and a diameter of 0.32 mm. The samples were warmed from 40 to 300 °C (at 4 °C/min rate), after which temperature was held at 300 °C for 30 min. Finally, the heterocyclic and polycyclic aromatic compounds in the aromatic hydrocarbon fraction were detected and quantified based on their peak heights.

Results and discussion

The analyzed West Siberian Basin oils are medium (23.55–32.57° API) and paraffinic in nature, with saturate/aromatic ratios between 3.48 and 8.64. The physical and geochemical characteristics together with the hydrocarbon distribution of normal alkanes and isoprenoids show the absence of biodegradation (Hakimi et al. 2022). In addition, the normal alkane and isoprenoid distributions of the analyzed samples implied oil generation from source rocks containing mixed organic matter and deposited under suboxic to relatively oxic conditions (Hakimi et al. 2022).

Total ion chromatogram (TIC)

The total ion chromatograms (TICs) of aromatic fractions of the analyzed oil samples are shown in Fig. 4. The TICs of the aromatic are dominated by diagrammatic and triaromatic hydrocarbons (Fig. 4).

Fig. 4
figure 4

TICs distributions of heterocyclic and aromatic HCs in the aromatic fraction of the oil samples (AD samples) from the western Siberian Basin, Russia; MN, DMN, TMN, TeMN, BP, MBP, DMBP, P, MP, MF, DMP, DMF, and TMP (Please see "Abbreviations" section)

In the current study, the heterocyclic and aromatic compounds such as alkylnaphthalenes, alkylphenanthrenes, alkyldibenzothiophenes, alkyldibenzofurans, alkylbiphenyls, and alkylfluorenes were identified in the aromatic fractions of the investigated oil samples (Fig. 4). It can be seen that each compound of heterocyclic and aromatic hydrocarbon shows distinctive distributions in the analyzed samples. The dimethylnaphthalenes (DMN) and methyl biphenyls (MBP) are equally abundant in samples A, B and C while trimethylnaphthalenes (TMN) and dimethylbiphenyls (DMBP) are higher in sample D oil. It was also noted that, MBP > DMN and TMN > DMBP. Samples A and C oils also show a higher abundance of MN and DMN than other oil samples (see Fig. 4). In addition, tetramethylnaphthalenes (TeMN), MF, phenanthrene (P), dimethyl fluorenes (DMF), methyl phenanthrenes (MP) and dimethylphenanthrenes (DMP) are most abundant in sample D oil (Fig. 4). These observations of the relative distributions of heterocyclic and polycyclic aromatic hydrocarbons indicate in the West Siberian Basin are attributed to the source of organic matter input and depositional settings of the source rocks (Odier and Artaud 1992; Radke et al. 2000; Fenton et al. 2007), For example, the BP and MBP are significantly higher in their relative abundance in samples B and C oil in the other two oil samples (Fig. 4), suggesting abundant in terrestrially-derived OM (Odier and Artaud 1992; Fenton et al. 2007).

Thermal maturity evolution

In this study, multiple biomarker maturity ratios and parameters of the methylated phenanthrene and naphthalene distributions in the aromatic hydrocarbon fraction of the analyzed oil samples can be employed to assess the thermal maturity of organic matter (Radke and Welte 1983; Radke et al. 1984, 1986; Alexander et al. 1985; Budzinski et al. 1995; van Aarssen et al. 1999). The application of aromatic hydrocarbons in maturity assessment is based on the thermodynamic stability of isomers with increasing thermal stress. The aromatic maturity parameters presented in Table 1 includes methylphenanthrene ratio (MPR) methylphenanthrene index (MPI-1), alkyldibenzothiphene parameter (MDR), methylnaphthalene ratio (MNR), dimethylnaphthalene ratio (DNR-1) and trimethylnaphthalene ratio (TNR-2). The methylphenanthrenes and alkyldibenzothiphene show the dominant abundance of 3-MP and 2-MP in m/z 192 mass fragmentograms, while 4-MDBT is more abundance in the m/z 198 mass fragmentograms (Fig. 5).

Table 1 Partial aromatic hydrocarbon ratios of oil samples from West Siberian Basin, Russia (Please see "Abbreviations" section)
Fig. 5
figure 5

Partial m/z 178 + 192 + 198 mass chromatograms showing distribution of P, MP and MDPT isomers in the aromatic fraction of the western Siberian Basin’s oils (AD samples) (Please see "Abbreviations" section)

However, the MPI-1 and MDR values were estimated and range from 0.74 to 1.01 and 5.73 to 7.54, respectively (Table 1). These high values of methylphenanthrene index (MPI-1) and alkyldibenzothiphene (MDR) of the analyzed oil samples show equivalent vitrinite reflectance (%Rc MPI and %Rc MDR) in the range of 0.85–1.01% and 0.93–1.06%, respectively (Table 1), implying oil generation from mature source rocks in the peak to late mature of oil generation window (Radke and Welte 1983; Jiang and George 2019). This finding is demonstrated by the correlation diagram of calculated reflectance values from MPI-1 and MDR (Fig. 6a).

Fig. 6
figure 6

Thermal maturity cross-plots of the %Rc from: A MDR ratio and MPI-1, B TNR ratio and MPI-1, and C MDR and MNR ratios; showing that the western Siberian Basin’s oils were generated from mature source rocks in the range of peak-mature to late-mature of oil generation window (Please see "Abbreviations" section)

Furthermore, the high thermal maturity of the analyzed oil samples is also consistent with the equivalent vitrinite reflectance (%Rc MPR, %Rc MNR and %Rc TNR-2) calculated from MPR, MNR, and TNR-2 parameters (Table 1). The calculated reflectance values of the TNR-2 and MPR range from 0.92 to 0.96% and 0.94 to 1.05%, respectively (Table 1), while calculated reflectance from MNR is higher and ranges between 1.04% and 1.14% (Table 1). Additionally, the correlation diagrams of %Rc MPI-1 versus %Rc TNR-2 (Fig. 6b) and %Rc MNR versus %Rc MDR (Fig. 6c) indicate the oils are in the late maturity generation window.

Characteristics of aromatic biomarker and their interpretation in relation to OM input and environmental conditions

Numerous heterocyclic and polycyclic aromatic compounds are more abundant in terrestrial plants in general and are useful indicators of source organic facies type, depositional environment and lithologies (Fan Pu et al. 1990; Budzinski et al. 1995; Hughes et al. 1995; Radke et al. 2000; Li et al. 2013; Asahina and Suzuki 2018; Asif and Wenger 2019).

The distributions of alkylated phenanthrenes and naphthalenes have been widely employed as indicators of OM origin. Elevated abundances of 9-MP, 2,10-DMP, 3,9-DMP and 3,10-DMP have been observed in sediments of marine origin while high amounts of 1-MP and 1,7-DMP are found in sediments with higher plant origin (Budzinski et al. 1995). The 9-MP is generally abundant, with significant amounts of 1-MP in all analyzed oil samples (Fig. 5), indicating a marine organic matter with notable amounts of land plant inputs. The cross-plot of Log (1-MP/9-MP) and Log (1,7-DMP/1,3 + 2,10 + 3,9 + 3,10-DMP) in Fig. 7a delineates the organic matter source inputs. The analyzed oils plot in the bottom left quadrant of Fig. 7a, indicating a predominant input of 9-MP and 1,3 + 2,10 + 3,9 + 3,10-DMP that signifies a marine algal OM source. Similarly, the oils plot in the bottom left quadrant of the cross-plot of log (1,2,5-TMN/1,3,6-TMN) and log (1,2,7-TMN/1,3,7-TMN) in Fig. 7b, signifying the predominant abundances of 1,3,6-TMN and 1,3,7-TMN, which are associated with a marine organic matter and higher thermal maturity, respectively (van Aarssen et al. 1999; Kang et al. 2020).

Fig. 7
figure 7

Cross plots of A Log [1,7-DMP/(1,3- + 3,9- + 2,10- + 3,10-DMP)] versus Log [1-MP/9-MP], and B Log [1,2,7-TMN/1,3,7-TMN] versus Log [1,2,5-TMN/1,3,6-TMN] for the oils (Please see "Abbreviations" section)

This finding is also demonstrated by the alkylated naphthalene ratios such as trimethylnaphthalene (TMR) and dimethylnaphthalene (DMR) (Asahina and Suzuki 2018). Asahina and Suzuki (2018) proposed that the alkylated naphthalene ratios (TMR and DMR) as indicators of organic matter source input and concluded that TMR values > 0.8 and DMR values > 1.0 signify terrestrial OM origin. The DMR values (1.0–2.4) of all studied oils are > 1.0 while TMR values range from 0.5 to 1.4 (Table 1). The cross-plot of TMR and DMR shows that the analyzed oils are sourced mainly from mixed organic matter, but with the considerable contribution of marine OM for oil samples B and C (Fig. 8a).

Fig. 8
figure 8

Cross plots of A DMR and TMR ratios (after Asahina and Suzuki 2018), and B MDBF/MP versus DBF/P (after Baydjanova and George 2019); showing the organic matter into the source rocks for the analyzed oils (Please see "Abbreviations" section)

Furthermore, this interpretation is supported by dibenzofuran/phenanthrene (DBF/P) and methyldibenzofurans/methylphenanthrenes (MDBFs/MPs) ratios. Baydjanova and George (2019) proposed the cross-plot of dibenzofuran/phenanthrene (DBF/P) against methyldibenzofurans/methylphenanthrenes (MDBFs/MPs) ratios to distinguish between marine- and terrestrial-sourced organic matter. Accordingly, the studied oil samples are sourced from both terrigenous and marine organic matter (Fig. 8b). However, the elevated abundance of dibenzofuran (DBF) has been noted in the oil samples A and D is indicative of considerable terrigenous OM input (Radke et al. 2000; Asif and Wegner 2019; Fig. 8b). The relative abundances of heterocyclic aromatic compounds and their alkylated isomers have been applied reconstruct the environment of deposition (Hughes 1984; Schou and Myhr 1988; Jinggui et al. 2005; Radke et al. 2000; Asif and Wegner 2019; Baydjanova and George 2019). Higher abundances of dibenzothiophene (DBT) and naphthalene are relatively higher in source rocks from marine environments while phenanthrene (P), dibenzofuran (DBF), and fluorene (F) are higher in source rocks from freshwater sedimentary environments (Fan et al. 1990; Radke et al. 2000; Jinggui et al. 2005; Xiangchun et al. 2011).

In the analyzed oils, the relative abundances of DBF, MDBF, DBT, MDBT, MP, and F are determined from the sums of the peak heights of their diagnostic ions (i.e., m/z 168, 182, 184, 198, and 166, respectively) and are shown in Figs. 5 and 9 as well as Table 1.

Fig. 9
figure 9

Partial m/z 166 + 168 + 182 + 184 mass chromatograms showing distribution of DBT, F, MBP, DMBP, and MDBF isomers in the aromatic fractions of the western Siberian Basin’s oils (AD samples) (Please see "Abbreviations" section)

Ternary diagrams of the relative abundances of the above aromatic compounds show that the variations in the relative abundances of these aromatic compounds are effective markers of facies and depositional environments (Pu et al. 1990; Radke et al. 2000; Asif and Wegner 2019). Asif and Wegner (2019) differentiated source facies using a ternary diagram of %F, %DBF and %DBT and observed that freshwater lacustrine-, coaly-, and marine carbonate-sourced oils have the highest abundance of F, DBF and DBT, respectively. In this study, the relative abundances of F, DBF and DBT in the studied oils were observed in the 50.9–68.6%, 18.3–36.3%, and 12.9–13.8% range, respectively (Table 1). The Li et al. (2013) ternary diagram of F–DBF–DBT shows that the relative abundances of F decrease from freshwater/deltaic terrestrial to marine environments while the abundance of DBF increases (Fig. 10a). The dominance of F over DBF and DBT in the samples A and D is indicative of a non-marine (freshwater/deltaic) depositional environment (Fig. 10a). In contrast, the relatively marine depositional environment of the probable source rock for the oil samples B and C are located away from the DBT and F maxima and is found near to the DBF (Fig. 10a). In addition, the relative abundances of P compared to DBF and DBT in these oil samples supports the interpretation of freshwater/deltaic to marine depositional environments (Fig. 10b; Pu et al. 1990).

Fig. 10
figure 10

Ternary plots of A P, F, DBT, and B P, DBF, and DBT, showing depositional environment and lithology (after Li et al. 2013; Fan Pu et al. 1990) (Please see "Abbreviations" section)

The varying abundance of dibenzothiophene (DBT) is also used to provide information about the different source facies and depositional environments (Hughes et al. (1995). Hughes et al. (1995) employed the cross-plot of Pr/Ph and dibenzothiophene/phenanthrene (DBT/P) to differentiate five distinct environments and lithologies.

The Pr/Ph ratios for the studied oils range from 1.61 to 2.17, indicating deposition under sub-oxic to relatively oxic conditions (Didyk et al. 1978). The oil samples plot in zone 3 of the Hughes et al. (1995) diagram, which corresponds to marine clay-rich source rock (Fig. 11a). Similarly, the oil samples plot in the site and close to zone 3 of the modified diagram of Radke et al. (2000), corroborating the interpretation of oil from marine shale source facies (Fig. 11b). In addition, the MDBT/MDBF ratio also supports the interpretation of two oil groups sourced from a variety of sources of organic matter (Fig. 11b). The clay-rich source facies of the oil samples is further demonstrated from the cross-plot of ∑TMN/∑MN ratios versus MNR values (Asahina and Suzuki 2018; Fig. 11c).

Fig. 11
figure 11

Cross-plots of A Pr/Ph versus DBT/P ratios (after Hughes et al. 1995), B MDBT/MDBF versus Pr/Ph ratios (after Radke et al. 2000) and C ∑TMNs/∑MNs ratios and MNR values (after Asahina and Suzuki 2018); showing depositional environment and lithology of the source rocks for the analyzed oils (Please see "Abbreviations" section)

Oil–oil correlation

In this work, the oil–oil correlation and its implication for oil families and their potential source rocks were investigated using the organic matter (OM) origin, depositional environment and lithology-related biomarkers in the heterocyclic and polycyclic aromatic hydrocarbons distributions. The evaluation of these oils as a result of oil–oil correlation show that the four oil samples studied in the present paper could be grouped into two oil families (Fig. 12). These concept two unique categories are supported by the source organic matter, and environmental-related aromatic biomarker ratios that outlined in the previous subsection (Sect. "Characteristics of aromatic biomarker and their interpretation in relation to OM input and environmental conditions") and cross-plots between these oils (Figs. 8, 10). Indeed, oil samples A and D can be incorporated into a single oil family, while samples B and C can be recognized as a different oil family (Fig. 12).

Fig. 12
figure 12

Star diagram of the selected organic matter origin and depositional environment-related aromatic biomarkers for oil–oil correlation, showing two source rocks for the studied west Siberian Basin’s oils (AD samples)

Group (I) of the oil family consists of two oil samples (A and D), which are characterized by a relatively high Ph/Ph ratio between 2.11 and 2.17 (Table 1) and high abundant of 9-MP with significant amounts of 1-MP (Fig. 5), indicating a mixed organic matter with high amounts from the land plants to the source rock that deposited under suboxic to relatively oxic environmental conditions. This finding of the high terrestrial organic matter contributions is also supported by the distribution higher relative abundance of DBF and higher DMR (Fig. 8a, b). In addition, the oils from group A are mainly generated from source rocks deposited in non-marine (freshwater/deltaic) environments as demonstrated by the high abundances of F and DBF over DBT (Fig. 10).

The group II oil family includes two oil samples B and C with relatively lower Ph/Ph ratios (1.61 and 2.09) when compared to group I oils (Table 1), which indicates source rock deposited under suboxic environmental conditions. These oils also have a relatively low abundance of DBF and lower DMR that is suggestive of higher contributions of marine OM (Fig. 8a, b). The higher abundance of F in the group II oils is suggestive of a lacustrine depositional environment (Li et. al. 2013; Asif and Wegner 2019).

Summary and conclusions

Four oil samples taken from wells in the West Siberian Basin, central Russia, were geochemically analyzed and characterized to determine the organic matter (OM) origin, thermal maturity, lithology, and depositional conditions of the probable source rocks. Additionally, oil–oil correlation was utilized to determine the oil families in the samples and their genetic relationship to oil’s source rock.

  1. (1)

    Heterocyclic and polycyclic aromatic hydrocarbons reveal that the oils generally sourced from clay-rich source rock, containing mixed organic matter and deposited in fluvial-deltaic to lacustrine environments under suboxic to relatively oxic conditions.

  2. (2)

    Maturity-sensitive aromatic hydrocarbon parameters indicate that the oils were generated from source rocks in the peak-mature to late mature range of the oil generation window.

  3. (3)

    Result of oil–oil correlation analysis show two distinct oil families. The oils of group (I) oil family were generated from source rocks with relatively higher input of land plants that were deposited in freshwater/deltaic environment.

  4. (4)

    Oils of the group II oil family were sourced from rocks with relatively higher marine algal OM input and deposited in a lacustrine environment under suboxic conditions.

  5. (5)

    The heterocyclic and polycyclic aromatic results can be used as a foundation for further genetic correlation of the oils, particularly in the organic matter input and thermal maturity degree of their source rocks.