Introduction

Diesel is widely used as a commercial petroleum product; hence, it can be easily released into soil and water. Accidental exposure to various hydrocarbons such as fuel and refinery products, frequently occurs in the storage and transportation system. Diesel spill accounts for approximately 25% of annual spill accidents in Canada1. The identification of pollution origin is important to remediate diesel-contaminated media. A few studies have reported potential methods to identify the source of diesel contamination2,3,4,5. However, the discrimination of diesel refined by different refineries from the same crude oil remains a challenge.

In Korea, domestic diesel is produced from the same crude oil regardless of the refineries, and this limits the use of previously reported identification indices such as the diagnostic ratio of sesquiterpanes5 and polycyclic aromatic hydrocarbons (PAHs)6. Thus, there is a need for novel indices for diesel source identification. In this milieu, we exploited the fact that biodiesels have to be mandatorily blended with domestic diesel products (3% by volume) by the national policy. Generally, the refineries blend different biodiesels produced from empty fruit bunch and waste cooking oil. Fatty acid methyl esters (FAMEs) are the major compounds in biodiesels7,8. In this study, we estimated the FAME compositions in diesels as a potential index for discriminating specific diesel origin in contaminated soils. Four diesel products were collected from four major domestic refineries in Korea, and their relative FAME compositions were analyzed. Principal components analysis (PCA) was used to characterize the differences in the relative FAME composition in different diesel sources and soil samples artificially contaminated with each source. Furthermore, we estimated the weathering effect on the method using relative FAME composition.

Materials and methods

Materials

Four diesel products, blended with biodiesel, were obtained from gas stations operated by four major oil refineries in Korea, and the samples were designated as D1, D2, D3, and D4. The diesel samples were stored in amber glass bottles with screw caps at 4 °C. The FAME composition in each diesel sample was determined by dissolving an aliquot of diesel in dichloromethane (DCM, ≥ 99%, JT Baker), analyzing by gas chromatography-mass spectrometry (GC–MS, QP 2010, Shimadzu), and comparing with the FAME standard (FAMQ-005, AccuStandard). We collected soil in the hillside near the Korea Institute of Science and Technology (KIST); this region is not exposed to petroleum products. The distribution of soil particle size was determined by sieve separation. The abundant particle size range of the soil was 2–0.075 mm, which belongs to sand; in addition, approximately 25 wt% of soil particles were below 0.075 mm, indicating the presence of silt and clay particles. The collected soil was dried for 7 days at room temperature, passed through a 2 mm mesh size sieve, and then used to prepare diesel-contaminated soils9.

Preparation of diesel-contaminated soils and extraction of FAME

The collected soil (2 kg) was thoroughly mixed with 200 g of diesel (10% by weight) to prepare diesel-contaminated soils. The apparent color of soil changed from light to dark brown after being mixed with diesel. The soils contaminated with different diesel products were compacted to a predetermined depth to achieve similar porosity. The diesel-contaminated soils were placed in opaque jars and stored under open condition in a fume hood during the experiment.

Soil samples were collected from the four different diesel-contaminated soils using a small hand auger with an inner diameter of 16 mm at 1, 30, and 60 days after being placed in the fume hood. To analyze FAMEs in the soil samples, DCM extraction was performed by modifying a previously reported procedure10. Briefly, 2 g of soil sample was mixed with 10 mL of DCM; the mixture was agitated on a rotary shaker at 40 rpm for 1 h, and the supernatant was analyzed by GC–MS after settling the suspension.

Analytical method

FAMEs in the soil samples were analyzed by GC–MS equipped with an Omega Wax 250 column (Supelco, 30 m × 0.25 mm, thickness 0.25 µm)11 in the total ion chromatogram (TIC) and selective ion monitoring (SIM) modes. The oven temperature was programmed as reported previously12,13, and the details are summarized in Table 1.

Table 1 Analytical conditions of GC–MS.

Discrimination of specific diesel in soil samples using principal component analysis

The FAMEs that were successfully detected in the extracts of diesel-contaminated soils were selected and used for discrimination. The relative FAME composition and proportion were evaluated for discrimination of the diesel sources in the contaminated soil. For the PCA, the relative proportion of the selected FAME was calculated using each peak area. The relative proportion of FAMEs was defined as the proportion of one FAME compound area to the total peak areas of the selected FAMEs. The difference in the relative FAME proportion in the four diesel samples was presented by two principal components (PC1 and PC2). The principal components of the diesel samples were calculated by the PCA with the following equations using SPSS (ver. 13.0)14.

$$ {\text{PC1}} = {\text{A}}_{{1}} {\text{X}}_{{1}} + {\text{A}}_{{2}} {\text{X}}_{{2}} + {\text{A}}_{{3}} {\text{X}}_{{3}} + \cdots + {\text{A}}_{{\text{n}}} {\text{X}}_{{\text{n}}} $$
(1)
$$ {\text{PC2}} = {\text{B}}_{{1}} {\text{X}}_{{1}} + {\text{B}}_{{2}} {\text{X}}_{{2}} + {\text{B}}_{{3}} {\text{X}}_{{3}} + \cdots + {\text{B}}_{{\text{n}}} {\text{X}}_{{\text{n}}} $$
(2)

where, An and Bn are the component scores obtained using the factor analysis in SPSS. Xn is calculated as (X − Xavg)/Xstdev, in which X represents the relative proportion of one FAME compound in diesel sample. Coordinates of soil extracts were calculated by component scores, Xavg, and Xstdev derived from determining the coordinates of diesel sources. PC1 and PC2 could indicate the linear combination of each FAME component that produces the maximum difference among the four diesel groups. Thus, the calculated values can be used to allocate extractants of unknown samples to one of the four source groups. The diesel source in contaminated soils was discriminated by comparing the closeness of position of the extracts and the diesel sources in the PC1–PC2 plane (represented by Euclidean distance). Heptadecanoic acid methyl ester (HA), linolenic acid methyl ester (LA), and elaidic acid methyl ester (EA) were accurately measured at m/z 74 in the SIM mode (Fig. 1). Thus, the PCA was conducted for the relative proportion of HA, LA, and EA in the diesel sources and soil extracts to avoid the misjudgment driven by the disappearance of certain compounds while correlating the relative FAME compositions of the sources and samples.

Figure 1
figure 1

FAMEs in the standard and extracts of the diesel-contaminated soils analyzed by GC–MS in the SIM mode at m/z 74.00. Specific compounds for the peak numbers are listed in Table 2.

Results and discussion

Identification of FAMEs in different diesel sources

FAMEs in each diesel product were identified by GC–MS in the TIC mode (Table 2). Different sets of FAMEs were detected in each diesel product, implying that the diesels used in this study might contain different kinds of biodiesels. D1 and D2 showed additional peaks for cis-5,8,11,14,17-Eicosapentaenoic acid methyl ester whereas cis-11-14-17-Eicosatrienoic acid methyl ester was detected only in D2. The HA, LA, and EA were commonly found in all diesel products; hence, these three FAME compounds were used to discriminate the diesel sources in the contaminated soil.

Table 2 List of FAMEs in the standard and diesel sources analyzed in the TIC mode. ■ indicate that the FAME was detected and □ implies not detected in each diesel sample.

Discrimination of diesel origin in soils by analyzing the relative FAME composition

DCM extract of soil not exposed to diesel did not show distinguishable peaks for FAMEs on GC–MS chromatograms, indicating that the interference by natural compounds in soils on the analysis is negligible. Although the diesel sources presented at least four common FAME compounds, only three compounds were successfully detected in the extracts of diesel-contaminated soils, namely HA, LA, and EA. Relative FAME composition in the diesel sources and extracts of soil samples is summarized in Table 3. The HA was the most dominant compound with the relative composition of 78.5% ± 3.4%, whereas the composition of LA and EA was 6.8% ± 1.8% and 14.8% ± 1.7%, respectively. The PCA discriminated the diesel sources based on the differences in the relative FAME composition. All diesel samples were distributed at different locations in the PC1–PC2 plane (Fig. 2): D1 in the 4th quadrant, D2 in the 1st quadrant, D3 in the 2nd quadrant, and D4 in the 3rd quadrant. The cumulative eigenvalue of the two principal components was 99.97%. The soil extracts at different degrees of weathering were plotted in the PC1–PC2 plane. Each soil sample point, except that for day 60, gathered around their source diesel. The PCA results imply that the relative FAME composition might be applied as an index to discriminate diesel origin in the contaminated soil.

Table 3 Relative FAME compositions in the diesel sources and soil samples contaminated with the corresponding sources at different weathering times.
Figure 2
figure 2

PCA for the relative FAME compositions and proportion in the diesel sources and soil samples contaminated with the corresponding sources at different weathering time; ♦ D1, ● D2, ▲ D3, and ■ D4.

Weathering effect on the discrimination method using relative FAME compositions

Euclidean distances from a sample point to all of the diesel source points in the PC1–PC2 plane are presented in Fig. 3. In general, all samples showed the shortest Euclidean distance to the corresponding diesel source. However, the Euclidean distance between an extract from soil sample and the corresponding source increased with time. This can be attributed to different weathering resistances of each FAME in diesel15,16. The weathering effect implies that the diesel source discrimination method using relative FAME composition is subject to a degree of error, especially in the case of highly weathered samples.

Figure 3
figure 3

Euclidean distance between data points of the diesel sources and soil samples contaminated with (a) D1, (b) D2, (c) D3, and (d) D4.

Conclusions

Biodiesels are considered an alternative to conventional refined petroleum and are widely used in several countries; they are also used in biodiesel-blended diesel. Relative FAME composition and proportion of biodiesel and biodiesel-blended diesel might be employed as an index to discriminate the diesel source in diesel-contaminated soil. However, environmental weathering might affect the accuracy of the proposed discrimination method. To verify the applicability of this method, further studies should focus on the long-term variations in FAME compositions in diesel under more environmentally relevant conditions using various biodiesel-blended products.