Advanced land imager superiority in lithological classification utilizing machine learning algorithms

Shebl, Ali; Kusky, Timothy; Csámer, Árpád

doi:10.1007/s12517-022-09948-w

Advanced land imager superiority in lithological classification utilizing machine learning algorithms

Original Paper
Open access
Published: 04 May 2022

Volume 15, article number 923, (2022)
Cite this article

Download PDF

You have full access to this open access article

Arabian Journal of Geosciences Aims and scope Submit manuscript

Advanced land imager superiority in lithological classification utilizing machine learning algorithms

Download PDF

Ali Shebl^1,2,
Timothy Kusky^3,4 &
Árpád Csámer¹

1595 Accesses
15 Citations
1 Altmetric
Explore all metrics

Abstract

Different types of remote sensing data are commonly used as inputs for lithological classification schemes, yet determining the best data source for each specific application is still unresolved, but critical for the best interpretations. In addition, various classifiers (i.e., artificial neural network (ANN), maximum likelihood classification (MLC), and support vector machine (SVM)) have proven their variable efficiencies in lithological mapping, yet determining which technique is preeminent is still questionable. Consequently, this study aims to test the potency of Earth observing-1 Advanced Land Imager (ALI) data with the frequently utilized Sentinel 2 (S2), ASTER, and Landsat OLI (L8) data in lithological allocation using the widely accepted ANN, MLC, and SVM, for a case study in the Um Salatit area, in the Eastern Desert of Egypt. This area has a recent geological map that is used as a reference for selecting training and testing samples required for machine learning algorithms (MLAs). The results reveal (1) ALI superiority over the most commonly used S2, ASTER, and L8; (2) SVM is much better than MLC and ANN in executing lithologic allocation; (3) S2 is strongly recommended for separating higher numbers of classes compared to ASTER, L8, and ALI. Model overfitting may negatively impact S2 results in classifying small numbers of targets; (4) we can significantly enhance the classification accuracy, to transcend 90% by blending different sensor datasets. Our new approach can help significantly in further lithologic mapping in arid regions and thus be fruitful for mineral exploration programs.

Data Integration for Lithological Mapping Using Machine Learning Algorithms

Article 05 July 2022

Mapping sequences and mineral deposits in poorly exposed lithologies of inaccessible regions in Azad Jammu and Kashmir using SVM with ASTER satellite data

Article 15 March 2022

Using remote sensing data for geological mapping in semi-arid environment: a machine learning approach

Article 03 February 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Lithological identification is a crucial concern for understanding geological history, prospecting for mineral deposits, and assessing numerous environmental hazards. Remote sensing datasets are considered an efficient, rapid, relatively cheap, and readily available source for lithological mapping (Gad and Kusky 2006; Rajendran et al. 2014; Emam et al. 2016; Ge et al. 2018; Shebl et al. 2021a). Besides saving time and effort, they yield effective and accurate mapping results, especially for inaccessible areas where fieldwork is challenging. Moreover, structural mapping (Kusky et al. 2011; Abd El-Wahed et al. 2019), hydrothermal alteration mapping (Pour and Hashim 2015; Shebl and Csámer 2021a), and mineral discrimination (Amer et al. 2010; Gabr et al. 2010; Ninomiya and Fu 2019) can all be fulfilled efficiently. In all of these studies, wide areas were mapped using digital remote sensing datasets and image processing techniques without exhausting effort or huge amounts of time.

Similarly, using machine learning algorithms (MLAs) as an automatic inductive approach to recognize data patterns (Cracknell and Reading 2014), large numbers of pixels can be classified depending on smaller numbers of labeled pixels, generally referred to as training data. Thus, a significant amount of time and effort can be saved through the utilization of MLAs for lithological mapping. Once trained and learned, MLAs can predict a value and thus create and assign a label to the unknown pixels efficiently. This process is simply a kind of artificial intelligence and can be categorized as a supervised classification because pixels are transformed from unknown to labeled based on previously selected pixels seen by the algorithm. Consequently, supervised classification is premised mainly on the presence of training data (Inzana et al. 2003; Kotsiantis 2007). Alternatively, unsupervised techniques classify pixels (e.g., different rock units) via clustering, depending mainly on spectral characteristics without being fed by training areas (Kumar and Sahoo 2017). This is considered the main base in rock identification, where rocks (with various mineralogical constituents) respond variously to different wavelengths and thus have various responses and appearances in remote sensing data and can be discriminated from each other. Notable improvements in lithological mapping using remote sensing data have been made by using classifiers such as maximum likelihood classifier (MLC) (Yu et al. 2012a; Ge et al. 2018), naïve Bayes (NB) (Cracknell and Reading 2014), artificial neural networks (ANNs) (He et al. 2015; Latifovic et al. 2018), k-nearest neighbors (K-NN) (Cracknell and Reading 2014; Ge et al. 2018), support vector machines (SVMs), and random forests (RF) (Kuhn et al. 2018; Cardoso-Fernandes et al. 2020; Shebl and Csámer 2021b). Besides geological discrimination, mineral exploration programs have significantly enhanced utilizing various machine learning algorithms and classification schemes, e.g., band ratio matrix transformation (Askari et al. 2018; Noori et al. 2019), spectral angle mapper and spectral information divergence (Hadigheh and Ranjbar 2013; El-Magd et al. 2015; Ahmadirouhani et al. 2018; Sheikhrahimi et al. 2019), fuzzy logic modeling (Sekandari et al. 2020), linear spectral unmixing (Pour and Hashim 2012; Pour et al. 2019; Takodjou Wambo et al. 2020), constrained energy minimization (Zhang et al. 2007; Aboelkhair et al. 2021; Shebl et al. 2021a), and mixture tuned matched filtering (Pour and Hashim 2012; Mehr et al. 2013; Pour et al. 2018; Noori et al. 2019), utilizing various remote sensing datasets.

Notwithstanding the proven effectiveness of Advanced Land Imager (ALI) data in geological and hydrothermal alteration mapping (Pour and Hashim 2014), ALI is rarely utilized in lithological classification using MLAs. Thus, the novelty in the current contribution could be outlined in assessing ALI efficiency in delivering accurate lithological allocation and comparing the results of a test case with the widely used datasets (ASTER, Landsat 8, and Sentinel 2) and accepted classifiers (e.g., ANN, MLC, SVM). Moreover, due to the economic importance of the study area because of the presence of several ore deposits including REEs (EGSMA 1983), the current study aims to enhance a recent geological map of the study area depending mainly on objectivity introduced with MLAs, instead of subjectivity that could be evident with traditional mapping, as noticed by some differences among the previous geological maps of the investigated areas.

Materials and methods

Study area description

The Um Salatit–Mueilha area is located in the Central Eastern Desert (CED) of Egypt, as shown in Fig. 1a. Precisely, the investigated area is located between latitudes 24° 49" to 25° 18" N and longitudes 33° 50" to 34° 05" E covering an area of about 1400 km². The study area is well known for ancient mining activities and has a recently published geological map (Zoheir et al., 2019), which is useful for comparison with our results and verification. The area is covered by a widely distributed stretch of Neoproterozoic ophiolitic mélange consisting mainly of allochthonous ophiolitic fragments mingled in a sheared matrix, as well as other different mappable units (Zoheir et al., 2019). Mélange assemblages are vastly extended within the area along with other mappable units, such as metavolcanics, metagabbro-diorites, and granitic rocks, as shown in the map, Fig. 1b.

Data characteristics and preprocessing

Landsat-8 (L8) has two sensors, namely OLI (Operational Land Imager) and TIRS (Thermal Infrared Sensor), to acquire spectral data in the visible and near-infrared (VNIR), short-wave (SWIR), and thermal (TIR) infrared regions. OLI data are recorded in nine spectral bands, while TIRS data give information only in two bands, as shown in Table 1. The whole study area is covered by a scene that was acquired on 25 October 2019. Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) is commonly used in lithological discrimination (Pour and Hashim 2014; Pour et al. 2018; Kumar et al. 2020; Cardoso-Fernandes et al. 2020) and detects radiance in fourteen bands covering spectral bands from VNIR, SWIR, and TIR regions (Yamaguchi et al. 2001), as shown in Table 1. A cloud-free ASTER scene (AST_L1A_00303062007083043) acquired on 6 March 2007 is utilized for this study. Earth Observing-1 imagery was provided with three devices: Advanced Land Imager (ALI), the Hyperion, and the Linear Etalon Imaging Spectrometer Array (LEISA) Atmospheric Corrector (LAC) (Franks et al. 2017). Sensors onboard the EO-1 satellite have produced robust products for scientific analysis of the Earth during its entire 16-year mission (Franks et al. 2017). ALI recorded image data from ten spectral bands (Czapla-Myers et al. 2016), as shown in Table 1. The data used for the current study (EO1A1740422003070110PZ) was acquired in 2003. L8, ASTER, and ALI data were obtained through the U.S. Geological Survey, https://earthexplorer.usgs.gov/.

Table 1 Characteristics of the utilized optical datasets

Full size table

Sentinel 2 (S2) was developed by the European Space Agency (ESA) to provide spectral data in 13 bands (Drusch et al. 2012), as shown in Table 1. For the purpose of the current study and by the availability of S2 data from the European Space Agency (ESA), a cloud-free S2A MSI as an L1C product was downloaded. It should be emphasized that these datasets were accurately selected depending on checking metadata files and technical reports. Based on the solar zenith angle that depends on local overpass time, latitude, and date, we found that ALI data was best recorded in 2003 (Franks et al. 2017). As declared by NASA, all ASTER SWIR data collected after 1 April 2008 have been marked as unusable; therefore, we found that ASTER data acquired during 2007 would be the best. For L8 and S2, and as there are no reported technical errors, we decided to use cloud-free recently launched datasets. Thus, we consider that our data is of high-quality data for achieving the desired lithological classification. The utilized data are georeferenced to UTM, WGS 84 zone 36 N. Subsequently, we performed the fast line-of-sight atmospheric analysis of spectral hypercubes (FLAASH) atmospheric correction (Shebl and Csámer 2021c) and data resizing to the investigated study area. All of these operations were carried out using the Environment for Visualizing Images (ENVI) software version 5.6. For the Sentinel-2 dataset, bands were georeferenced to zone 36 North UTM projection using the WGS-84 datum and then radiometrically corrected using sen2cor processor in sentinel application platform (SNAP).

It is known that finer spatial resolution increases within-class variability; thus, it does not significantly enhance the classification accuracy (Hsieh et al. 2001). Thus, the spatial resolution effect should be removed. All TIR and panchromatic bands were excluded; then, cubic convolution resampling to 20 m was performed. We found that the 20 m pixel size is a reasonable value among the lowest (30 m) and the highest (10 m) spatial resolutions of the implemented data. In this way, an unbiased classification that preserves the relative superiority for each dataset (expressed by variabilities in the number of participated bands) is ensured. Consequently, the resampled bands for each sensor were stacked in files named S2, AST, L8, and ALI, having 12, 9, 7, and 9 bands, respectively. Then, two main combinations were created: S2 + AST + L8 and S2 + AST + L8 + ALI. Now, six inputs are ready to be tested for their potentiality in lithological classification.

Training and testing samples

With reference to the geologic map (Zoheir et al. 2019), well-distributed training pixels were carefully delineated for nine main classes. A total number of 18,567 training pixels were selected. Then, 3776 ground truth pixels were determined from the nine classes to test and evaluate the model performance after this feed by the training pixels as presented in Table 2. To ensure unbiased results, locations of ground truth data were carefully selected depending on the geological map and kept constant for all the classifiers and datasets. Moreover, the number of testing pixels is area-wise accurately selected (i.e., wadi deposits and syn-orogenic granite are tested by 879 and 579 pixels, respectively, due to their larger area compared to ophiolitic metagabbro which is tested only by 188 pixels (as it occupies the smallest rock unit area)).

Table 2 Areas, training and testing pixels, and abbreviations of the lithological classes

Full size table

Machine learning classifiers

Artificial neural network

Artificial neural network (ANN) is a widely known MLA and is used frequently for pattern recognition. As the name suggests, it tries to imitate the human brain in solving problems after training and learning; thus, ANN’s main processing units are named neurons or sometimes nodes. The network is formed by binding the nodes, which in turn are included in three main layers, input layer, hidden or middle layer, and an output layer, that could be reached by an iterative experiment (Haykin 2010). For this study, we performed multi-layer feed-forward ANN by the logistic activation function. To achieve optimum parameter settings for ANN, several empirical trials were made and assumed values (previously used in similar studies) were assigned to minimize generalization errors. Several local minimums were discarded till reaching the global minimum. We get better results by assigning the training root mean square (RMS) exit criterion as 0.1, training threshold contribution value as 0.9, training rate as 0.2, and training momentum as 0.9.

Maximum likelihood classifier

The maximum likelihood classifier (MLC) is a classical classifier widely used in classifications of remote sensing data. As the name suggests, unknown pixels are classified to a certain class only when they have a high probability of belonging to that class. Thus, the probability density function hypothesis is the main base for MLC (Scott and Symons 1971). For the current study, lithological generalization using MLC is carried out using ENVI 5.6 software.

Support vector machine

SVM has become one of the most important models in remote sensing and machine learning studies. Statistical learning theory (Ougiaroglou et al. 2018) that was first introduced by Vapnik in 1963 (Cortes and Vapnik 1995) is the main base for SVM. In this technique, the known datasets are supposed to be distributed in n-dimensional space and separated by a hyperplane. Logically, the best hyperplane is that introduced by the maximum isolation for the classes. The hyperplane that achieves this maximum separation is named the margin. As well as this optimal separator margin, a penalty for misclassifications is introduced to achieve the most efficient results. Normally, and as the algorithm deals with large amounts of data, a kernel function is frequently required (Wang et al. 2017). To specify the optimum parameters for the SVM classifier, linear, polynomial, radial basis function, and sigmoid kernels were applied to decide the best kernel performance. In this study, the radial basis function kernel delivers the most efficient results. The penalty parameter was set to 100, as the best value (after several trials), for managing the training errors. The gamma parameter in the kernel function was assigned as the inverse of the band number (Othman and Gloaguen 2014) to reasonably control the SVM model’s non-linearity degree.

Accuracy assessment methods

To assess classification outputs and the performance of classifiers, the accuracy for each class has been assessed using the confusion matrix. The producer’s accuracy describes how well the classifier correctly allocates the pixels, and the user’s accuracy shows how well the produced thematic map is by calculating the probability of correctly classifying a pixel into its pre-given class (Congalton 1991; Ge et al. 2018). In this study, we evaluate the results using the average accuracy (average of the producer’s accuracy and user’s accuracy) as well as the overall accuracy (OA), that is, the total number of pixels labeled correctly by MLAs as a fraction of the total number of image pixels. Moreover, the well-known kappa coefficient that measures the coincidence of the resultant thematic maps with the reference data is used to evaluate the consistency of the results (Cohen 1960) according to the following equation.

$$\kappa = \frac{{N\sum\nolimits_{i = 1}^{n} {m_{i,i} - \sum\nolimits_{i = 1}^{n} {\left( {G_{i} C_{i} } \right)} } }}{{N^{2} - \sum\nolimits_{i = 1}^{n} {\left( {G_{i} C_{i} } \right)} }}$$

where i represents the class number, N is the total number of classified values compared to truth values, the correctly classified values number of the truth class i is represented by m_i,i. C_i and G_i are the total number of predicted and truth values belonging to class i, respectively.

Results

Classification accuracy results of the nine classes reveal that Osp and Wdp are correctly classified from all the datasets and by the three classifiers, with average accuracy always above 90% (Fig. 2a–c). This is attributed to the pure distinguished spectral signatures, caused by the abundance of antigorite, lizardite, clinopyroxenite, and magnetite in the mineral composition of serpentinite (Gad and Kusky 2006) when compared to other rock units, as well as the bright, distinctive tone and fine texture of wadi deposits. Also, Nss is well classified by a percentage transcended 90% by MLC, SVM for all the datasets except with S2 and L8; its average accuracy was around 80% (Fig. 2a–c). Higher accuracies for ASTER in discriminating sandstone are attributed to significant SWIR absorption features of silicate minerals in ASTER band-passes (Mars and Rowan 2010). Syn-tectonic (Sog) and post tectonic granites (Pog) have been approximately classified from all the datasets and classifiers. A slightly lower average accuracy (60–80%) is recorded for (Mvs) metavolcanics (Fig. 2a–c). The reason for this decrease is the wide chemical and mineralogical compositions for metavolcanics in the study area (acidic to intermediate metavolcanics with their related pyroclastics). As reported by Zoheir and Weihed (2014), metavolcanics comprise a series of weakly metamorphosed calc-alkaline volcanics of andesite-dacite composition forming a mixture of lithofacies with interrelated pyroclastic volcanic tuffs and breccias.

Similarly, misclassifications are always accompanied by the ophiolitic mélange (Fig. 2a–c), and this could be explained by the definition of the term mélange itself. It describes mappable geological units or bodies of mixed rocks consisting of blocks of different ages and origins (Kusky et al. 2020). Since the mixing may occur at multiple scales (Kusky and Bradley 1999), including below our pixel resolution, it is difficult to correctly classify this unit. Thus, mixed spectral signatures are often included within the mélange, and thus, confusion is evident for all the classifiers and datasets. This is especially prominent with S2 that has higher spectral characteristics, leading to overfitting the models that adversely affect the average accuracies for ophiolitic mélange (Ome) (Fig. 2a–c), and OAs in all S2 generalization processes (Fig. 2d,e). Moreover, classification errors are always present with ophiolitic metagabbro (Omg), with an average accuracy ranging from 40 to 78% because it is sometimes misclassified as metagabbro-diorite due to the proximity in chemical and mineralogical compositions between the two classes. Thus, MLC totally misclassifies ophiolitic metagabbro (0% accuracy) (Fig. 2b). Also, MLC and SVM distinguished metagabbro-diorite more efficiently when compared to ANN. Metagabbro-diorite plutons, as well as the granitic rocks, intruded the intermediate-acidic metavolcanics. Also, several acidic dykes, granitic sheets, and quartz veins cut through different rock types (Zoheir and Weihed 2014), which sometimes affect the overall accuracy. However, considerable matching with the geologic map is observed, especially when SVM is the used classifier. The results revealed the superiority of ALI over S2, ASTER, and L8 whatever the implemented classifier, as shown in Fig. 2d,e, and described in Table 3. SVM is the most efficient classifier by delivering the highest accuracy percentages in all the applied generalization processes (Fig. 2f). For S2 + AST + L8 combination, a significant raise in the OA is presented when using MLC (86.73%) and SVM (87.79%); however, ANN classifier cannot enhance the OA beyond 77.09% (Fig. 2f).

Table 3 Overall accuracies (OA in %) and kappa coefficients (K) for the utilized datasets and classifiers

Full size table

By enhancing the previous combination with ALI (S2 + AST + L8 + ALI), a robust boost in the OA for the classifiers is observed, giving 79.21% for ANN, 89.40% for MLC, and transcending 90% for SVM (Fig. 2f, g), confirming the role of ALI in enhancing the classification accuracy using MLAs. Consequently, SVM proved its ability to classify rock units reasonably (Fig. 3) rather than MLC and ANN during all the classification processes performed in this study, as shown in Table 3. Also, ALI proved its worthiness in the generalization process (as noticed by a decrease of the salt and pepper effect that always accompanies lithological classifications, as shown in Fig. 4, when comparing metavolcanics (represented in blue)). These results are confirmed by comparing the results (with slight magnification for the southwestern corner of the study area) produced by Sentinel 2, ASTER, Landsat OLI, and ALI separately, utilizing the three classifiers to produce 12 thematic maps (i.e., 4 thematic maps for each classifier). Figure 4 strongly shows the effect of decreasing error pixels in metavolcanics by embedding ALI in the allocation process.

Discussion

In this study using ANN, MLC, and SVM, reasonable results for classifying the rock units of Um Salatit area are reported. From our point of view and after a comprehensive survey of widely accepted MLAs in performing reliable lithologic mapping through the last decade (Grebby et al. 2011; Amer et al. 2012; Yu et al. 2012b; Mehr et al. 2013; Hadigheh and Ranjbar 2013; He et al. 2015; Jellouli et al. 2016; Othman and Gloaguen 2017; Manap and San 2018; Ge et al. 2018; Bachri et al. 2019; Bentahar and Raji 2021; Karimzadeh and Tangestani 2021; Shebl et al. 2021b), we found that these classifiers are among the most widely recommended classifiers. Moreover, these classifiers employ different mechanisms for data generalization and cover the two main categories of parametric and non-parametric algorithms. Coinciding with Ge et al. (2018) and Bachri et al. (2019), SVM proved its leverage over ANN and MLC. Furthermore, the utilized and recommended SVM classifier outperforms some deep learning methods (e.g., random forest) in lithological classifications (Kumar et al. 2020). For the used datasets, we noticed several variations in generalization accuracies, and this can be explained by considering sensors with different spectral characteristics over several rock units that in turn display wide ranges of chemical and mineralogical compositions. For instance, processes produce absorption features in the visible and near-infrared radiation (0.4 to 1.1 μm) due to the presence of transition elements such as Fe²⁺, Fe³⁺ (Hunt and Ashley 1979). In this study, serpentinites and rocks containing Fe²⁺, Fe³⁺ can be distinguished due to the spectral advantages in the VNIR ranges for S2, L8, and ASTER. Also, ferric-iron-bearing minerals can be discriminated using six unique wavelength bands of ALI spanning the visible and near-infrared (Hubbard and Crowley 2005). Moreover, due to strong hydroxyl group absorption, serpentinites are rarely misclassified (92% as the lowest OA) for all the sensors. Sog, Pog, Wdp, and Nss are also well distinguished by all the data types, with slight variances in the accuracies of classifying these rocks. These variances are attributed to the performance of the classifiers, as well as mineral absorption features caused by vibrational overtones, electronic transition, charge transfer, and conduction processes (Cloutis 1996) in the reflected solar light area covered by the sensors (0.325 to 2.5 μm).

However, even though S2 has the highest spectral characteristics and the largest number of bands compared to the other sensors, S2 cannot correctly classify Ome and Omg. This can be interpreted by the overfitting for MLAs, caused by the high sensitivity offered by 12 bands of S2 in classifying 9 classes. Overfitting is considered a main defect of MLAs and can be defined by higher sensitivity of the details and noise of training data that could negatively impact the model pursuance on testing data (i.e., low bias and high variance) (Dietterich 1995). This is confirmed by the poor classification results only with the mélange (which has several spectral classes, noise, or fluctuations), where the model picked up these mixed signatures and negatively affected the generalization process resulting in lower accuracies when examined by testing data. Thus, it is recommended to use S2 in mapping many information classes rather than a lower number of spectral classes, which coincides with the results from Ge et al. (2018). On the other hand, underfitting may be the case with L8 lower accuracies for Ome and Omg because this wide range of reflected wavelengths (0.325 to 2.5 μm) is covered only by 7 bands. The best fit for the classification of Ome and Omg is achieved by ASTER and ALI, thus yielding considerably high given average classification accuracies for Ome and Omg. The higher overall accuracy of ALI is interpreted by the improved signal-to-noise ratio (SNR) that is considered one of the most significant performance aspects of ALI to increase the quality of data (Mendenhall et al. 2000; Lobell and Asner 2003). Also, it is noticed that considering the spectral characteristics from more than one sensor boosts the classification accuracies as noted for S2 + AST + L8 and S2 + AST + L8 + ALI (Fig. 2d–f). In the latter combination, the classification improvement caused by adding ALI is basically due to raising the accuracy by correct generalization for Ome and Omg (Fig. 2g). Consequently, it is recommended to use ALI, especially in identifying mélange rocks or generally when the information class includes many spectral subclasses (which is a common case in several remote sensing applications), as the output thematic map from ALI and its combination (S2 + AST + L8 + ALI) fit well with the reference geologic map. In this way, ALI can be used in several geological classifications (as several spectral classes are always included within an information class, by the effect of weathering, vegetation, or any environmental conditions) and may be in other similar applications.

We strongly recommend increasing the training data size, especially when Sentinel 2 data is implemented in the generalization process. Furthermore, executing regularization methods, k-fold cross-validation, and ensemble learning algorithms (Parsa 2021) are also strongly recommended to reduce overfitting and help achieve optimal prediction. It is should be emphasized, however, that the recommended SVM classifier outperforms some deep learning methods (e.g., random forest) in lithological classifications (Kumar et al. 2020). The current study opens the door for the use of ALI data (that is rarely utilized in lithological generalization) applications in future lithological allocations not only with transfer learning methods but also with deep learning algorithms (Shi et al. 2021; Dong et al. 2021; Parsa 2021) that have proven their efficiency in delivering reliable results. Our future research focuses mainly on feeding deep learning algorithms with ALI data (which has proven its potency in the current study) for better lithological and hydrothermal alteration mapping.

Conclusions

This study investigated the potential of ALI, S2, ASTER, and L8 data in mapping rock units of Um Salatit-Mueilha area, utilizing ANN, MLC, and SVM. The study concluded the following.

1.
SVM outperforms MLC and ANN in delivering an object-based geological map that could be used for future studies over the investigated area.
2.
We were able to better discriminate all the lithological classes studied, but ophiolitic metagabbro and ophiolitic mélange always have lower accuracies in the produced thematic maps, especially with S2. This result may be interpreted by model overfitting with the higher spectral characteristics of S2. The best results from ALI are attributed to improved data quality by enhancing the signal-to-noise ratio.
3.
Two additional combinations (S2 + ASTER + L8 and S2 + ASTER + L8 + ALI) show higher OA resulting mainly from boosting Ome and Omg accuracies.
4.
Increasing the applied datasets from different sensors significantly enhances the predictive mapping.
5.
ALI is recommended for usage in lithological classifications, especially when the number of classes is ten or lower. ALI is much better in generalizing an information class containing spectral subclasses than S2, which is recommended for allocating a higher number of classes.

References

Abd El-Wahed M, Kamh S, Ashmawy M, Shebl A (2019) Transpressive structures in the Ghadir Shear Belt, Eastern Desert, Egypt: evidence for partitioning of oblique convergence in the Arabian-Nubian Shield during Gondwana Agglutination. Acta Geol Sin - English Ed 93:1614–1646. https://doi.org/10.1111/1755-6724.13882
Article Google Scholar
Aboelkhair H, Ibraheem M, El-Magd IA (2021) Integration of airborne geophysical and ASTER remotely sensed data for delineation and mapping the potential mineralization zones in Hamash area, South Eastern Desert. Egypt Arab J Geosci 14:1–22. https://doi.org/10.1007/S12517-021-07471-Y/FIGURES/20
Article Google Scholar
Ahmadirouhani R, Karimpour MH, Rahimi B et al (2018) Integration of SPOT-5 and ASTER satellite data for structural tracing and hydrothermal alteration mineral mapping: implications for Cu–Au prospecting. 9:237–262. https://doi.org/10.1080/19479832.2018.1469548
Amer R, Kusky T, Ghulam A (2010) Lithological mapping in the Central Eastern Desert of Egypt using ASTER data. J African Earth Sci 56:75–82
Article Google Scholar
Amer R, Kusky T, El Mezayen A (2012) Remote sensing detection of gold related alteration zones in Um Rus area, Central Eastern Desert of Egypt. Adv Sp Res 49:121–134
Article Google Scholar
Askari G, Pour AB, Pradhan B et al (2018) Band ratios matrix transformation (BRMT): A sedimentary lithology mapping approach using ASTER satellite sensor. Sensors 18:3213 18:3213. https://doi.org/10.3390/S18103213
Bachri I, Hakdaoui M, Raji M et al (2019) Machine learning algorithms for automatic lithological mapping using remote sensing data: a case study from Souk Arbaa Sahel, Sidi Ifni Inlier, Western Anti-Atlas. Morocco ISPRS Int J Geo-Information 8:248. https://doi.org/10.3390/ijgi8060248
Article Google Scholar
Bentahar I, Raji M (2021) Comparison of Landsat OLI, ASTER, and Sentinel 2A data in lithological mapping : a case study of rich area (Central High Atlas, Morocco). Adv Sp Res 67:945–963. https://doi.org/10.1016/J.ASR.2020.10.037
Article Google Scholar
Cardoso-Fernandes J, Teodoro AC, Lima A, Roda-Robles E (2020) Semi-Automatization of support vector machines to map lithium (Li) bearing pegmatites. Remote Sens 12:2319 12:2319. https://doi.org/10.3390/RS12142319
Cloutis EA (1996) Review article hyperspectral geological remote sensing: evaluation of analytical techniques. Int J Remote Sens 17:2215–2242. https://doi.org/10.1080/01431169608948770
Article Google Scholar
Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Meas 20:37–46. https://doi.org/10.1177/001316446002000104
Article Google Scholar
Congalton RG (1991) A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens Environ 37:35–46. https://doi.org/10.1016/0034-4257(91)90048-B
Article Google Scholar
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297. https://doi.org/10.1007/bf00994018
Article Google Scholar
Cracknell MJ, Reading AM (2014) Geological mapping using remote sensing data: a comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information. Comput Geosci 63:22–33. https://doi.org/10.1016/j.cageo.2013.10.008
Article Google Scholar
Czapla-Myers J, Ong L, Thome K, McCorkel J (2016) Validation of EO-1 Hyperion and advanced land imager using the radiometric calibration test site at Railroad Valley, Nevada. IEEE J Sel Top Appl Earth Obs Remote Sens 9:816–826. https://doi.org/10.1109/JSTARS.2015.2463101
Article Google Scholar
Dietterich T (1995) Overfitting and undercomputing in machine learning. ACM computing surveys (CSUR) 27.3:326–327
Dong Y, Liang T, Zhang Y, Du B (2021) Spectral-spatial weighted kernel manifold embedded distribution alignment for remote sensing image classification. IEEE Trans Cybern 51:3185–3197. https://doi.org/10.1109/TCYB.2020.3004263
Article Google Scholar
Drusch M, Del Bello U, Carlier S et al (2012) Sentinel-2: ESA’s optical high-resolution mission for GMES operational services. Remote Sens Environ 120:25–36
Article Google Scholar
EGSMA (1983) Egyptian geological survey and mining authority metallogenic map of the Aswan Quadrangle Egypt. Scale 1:500000
Google Scholar
El-Magd IA, Mohy H, Basta F (2015) Application of remote sensing for gold exploration in the Fawakhir area, Central Eastern Desert of Egypt. Arab J Geosci 8:3523–3536. https://doi.org/10.1007/s12517-014-1429-4
Article Google Scholar
Emam A, Zoheir B, Johnson P (2016) ASTER-based mapping of ophiolitic rocks: examples from the Allaqi-Heiani suture, SE Egypt. Int Geol Rev 58:525–539. https://doi.org/10.1080/00206814.2015.1094382
Article Google Scholar
Franks S, Neigh C, Campbell P et al (2017) EO-1 data quality and sensor stability with changing orbital precession at the end of a 16 year mission. Remote Sens 9:412. https://doi.org/10.3390/rs9050412
Article Google Scholar
Gabr S, Ghulam A, Kusky T (2010) Detecting areas of high-potential gold mineralization using ASTER data. Ore Geol Rev 38:59–69. https://doi.org/10.1016/J.OREGEOREV.2010.05.007
Article Google Scholar
Gad S, Kusky T (2006) Lithological mapping in the Eastern Desert of Egypt, the Barramiya area, using Landsat thematic mapper (TM). J African Earth Sci 44:196–202. https://doi.org/10.1016/j.jafrearsci.2005.10.014
Article Google Scholar
Ge W, Cheng Q, Jing L et al (2018) Lithological discrimination using ASTER and Sentinel-2A in the Shibanjing ophiolite complex of Beishan orogenic in Inner Mongolia, China. Adv Sp Res 62:1702–1716. https://doi.org/10.1016/j.asr.2018.06.036
Article Google Scholar
Grebby S, Naden J, Cunningham D, Tansey K (2011) Integrating airborne multispectral imagery and airborne LiDAR data for enhanced lithological mapping in vegetated terrain. Remote Sens Environ 115:214–226. https://doi.org/10.1016/j.rse.2010.08.019
Article Google Scholar
Hadigheh SMH, Ranjbar H (2013) Lithological mapping in the eastern part of the Central Iranian volcanic belt using combined ASTER and IRS data. J Indian Soc Remote Sens 414(41):921–931. https://doi.org/10.1007/S12524-013-0284-1
Article Google Scholar
Haykin S (2010) Neural networks: a comprehensive foundation. 1999. Mc Millan, New Jersey 1–24
He J, Harris JR, Sawada M, Behnia P (2015) A comparison of classification algorithms using Landsat-7 and Landsat-8 data for mapping lithology in Canada’s arctic. Int J Remote Sens 36:2252–2276. https://doi.org/10.1080/01431161.2015.1035410
Article Google Scholar
Hsieh PF, Lee LC, Chen NY (2001) Effect of spatial resolution on classification errors of pure and mixed pixels in remote sensing. IEEE Trans Geosci Remote Sens 39:2657–2663. https://doi.org/10.1109/36.975000
Article Google Scholar
Hubbard BE, Crowley JK (2005) Mineral mapping on the Chilean-Bolivian Altiplano using co-orbital ALI, ASTER and Hyperion imagery: data dimensionality issues and solutions. Remote Sens Environ 99:173–186. https://doi.org/10.1016/j.rse.2005.04.027
Article Google Scholar
Hunt GR, Ashley RP (1979) Spectra of altered rocks in the visible and near infrared. Econ Geol 74:1613–1629. https://doi.org/10.2113/gsecongeo.74.7.1613
Article Google Scholar
Inzana J, Kusky T, Higgs G, Tucker R (2003) Supervised classifications of Landsat TM band ratio images and Landsat TM band ratio image with radar for geological interpretations of central Madagascar. J African Earth Sci 37(1–2):59–72, ISSN 1464-343X, https://doi.org/10.1016/S0899-5362(03)00071-X
Jellouli A, El Harti A, Adiri Z et al (2016) Lithological mapping using ASTER data in the Moroccan Anti Atlas belt. EGUGA 18:EPSC2016–13872
Karimzadeh S, Tangestani MH (2021) Evaluating the VNIR-SWIR datasets of WorldView-3 for lithological mapping of a metamorphic-igneous terrain using support vector machine algorithm; a case study of Central Iran. Adv Sp Res 68:2421–2440. https://doi.org/10.1016/J.ASR.2021.05.002
Article Google Scholar
Kotsiantis SB (2007) Supervised machine learning: a review of classification techniques
Kuhn S, Cracknell MJ, Reading AM (2018) Lithologic mapping using random forests applied to geophysical and remote-sensing data: a demonstration study from the Eastern Goldfields of Australia. Geophysics 83:B183–B193. https://doi.org/10.1190/geo2017-0590.1
Article Google Scholar
Kumar Y, Sahoo G (2017). An Improved Cat Swarm Optimization Algorithm Based on Opposition-Based Learning and Cauchy Operator for Clustering. https://doi.org/10.3745/JIPS.02.0022
Article Google Scholar
Kumar C, Chatterjee S, Oommen T, Guha A (2020) Automated lithological mapping by integrating spectral enhancement techniques and machine learning algorithms using AVIRIS-NG hyperspectral data in gold-bearing granite-greenstone rocks in Hutti, India. Int J Appl Earth Obs Geoinf 86:102006. https://doi.org/10.1016/J.JAG.2019.102006
Article Google Scholar
Kusky TM, Bradley DC (1999) Kinematic analysis of mélange fabrics: examples and applications from the McHugh Complex, Kenai Peninsula, Alaska. J Struct Geol 21:1773–1796
Article Google Scholar
Kusky TM, Ramadan TM, Hassaan MM, Gabr S (2011) Structural and tectonic evolution of El-Faiyum depression, North Western Desert, Egypt based on analysis of Landsat ETM+, and SRTM Data. J Earth Sci 22:75–100
Article Google Scholar
Kusky T, Wang J, Wang L, et al (2020) Mélanges through time: life cycle of the world’s largest Archean mélange compared with Mesozoic and Paleozoic subduction-accretion-collision mélanges. Earth-Science Rev. 209
Latifovic R, Pouliot D, Campbell J (2018) Assessment of convolution neural networks for surficial geology mapping in the South Rae Geological Region, Northwest Territories. Canada Remote Sens 10:307. https://doi.org/10.3390/rs10020307
Article Google Scholar
Lobell DB, Asner GP (2003) Comparison of earth observing-1 ALI and Landsat ETM+ for crop identification and yield prediction in Mexico. IEEE Trans Geosci Remote Sens 41:1277–1282. https://doi.org/10.1109/TGRS.2003.812909
Article Google Scholar
Manap HS, San BT (2018) Lithological mapping using different classification algorithms in western antalya, turkey. Int Multidiscip Sci GeoConference Surv Geol Min Ecol Manag SGEM 18:551–556. https://doi.org/10.5593/SGEM2018/2.2/S08.069
Article Google Scholar
Mars JC, Rowan LC (2010) Spectral assessment of new ASTER SWIR surface reflectance data products for spectroscopic mapping of rocks and minerals. Remote Sens Environ 114:2011–2025. https://doi.org/10.1016/j.rse.2010.04.008
Article Google Scholar
Mehr SG, Ahadnejad V, Abbaspour RA, Hamzeh M (2013) Using the mixture-tuned matched filtering method for lithological mapping with Landsat TM5 images. 34:8803–8816. https://doi.org/10.1080/01431161.2013.853144
Mendenhall JA, Lencioni DE, Evans JB (2000) Earth Observing-1 Advanced Land Imager: radiometric response calibration
Ninomiya Y, Fu B (2019) Thermal infrared multispectral remote sensing of lithology and mineralogy based on spectral properties of materials. Ore Geol Rev 108:54–72
Article Google Scholar
Noori L, Pour A, Askari G et al (2019) Comparison of different algorithms to map hydrothermal alteration zones using ASTER remote sensing data for polymetallic vein-type ore exploration: Toroud-Chahshirin Magmatic Belt (TCMB). North Iran Remote Sens 11:495. https://doi.org/10.3390/rs11050495
Article Google Scholar
Othman A, Gloaguen R (2014) Improving lithological mapping by SVM classification of spectral and morphological features: the discovery of a new chromite body in the Mawat Ophiolite Complex (Kurdistan, NE Iraq). Remote Sens 6:6867–6896. https://doi.org/10.3390/rs6086867
Article Google Scholar
Othman AA, Gloaguen R (2017) Integration of spectral, spatial and morphometric data into lithological mapping: a comparison of different machine learning algorithms in the Kurdistan Region, NE Iraq. J Asian Earth Sci 146:90–102. https://doi.org/10.1016/J.JSEAES.2017.05.005
Article Google Scholar
Ougiaroglou S, Diamantaras KI, Evangelidis G (2018) Exploring the effect of data reduction on neural network and support vector machine classification. Neurocomputing 280:101–110. https://doi.org/10.1016/j.neucom.2017.08.076
Article Google Scholar
Parsa M (2021) A data augmentation approach to XGboost-based mineral potential mapping: an example of carbonate-hosted ZnPb mineral systems of Western Iran. J Geochemical Explor 228:106811. https://doi.org/10.1016/J.GEXPLO.2021.106811
Article Google Scholar
Pour AB, Hashim M (2012) The application of ASTER remote sensing data to porphyry copper and epithermal gold deposits. Ore Geol Rev 44:1–9. https://doi.org/10.1016/J.OREGEOREV.2011.09.009
Article Google Scholar
Pour AB, Hashim M (2014) ASTER, ALI and Hyperion sensors data for lithological mapping and ore minerals exploration. Springerplus 3:130
Article Google Scholar
Pour AB, Hashim M (2015) Integrating PALSAR and ASTER data for mineral deposits exploration in tropical environments: a case study from Central Belt, Peninsular Malaysia. Int J Image Data Fusion 6:170–188
Article Google Scholar
Pour AB, Park TYS, Park Y et al (2018) Application of multi-sensor satellite data for exploration of Zn-Pb sulfide mineralization in the Franklinian Basin. North Greenland Remote Sens 10:1186. https://doi.org/10.3390/rs10081186
Article Google Scholar
Pour AB, Park T-YS, Park Y, et al (2019) Landsat-8, advanced spaceborne thermal emission and reflection radiometer, and WorldView-3 multispectral satellite imagery for prospecting copper-gold mineralization in the Northeastern Inglefield Mobile Belt (IMB), Northwest Greenland. Remote Sens 11:2430 11:2430. https://doi.org/10.3390/RS11202430
Rajendran S, Nasir S, Kusky TM, al-Khirbash S, (2014) Remote sensing based approach for mapping of CO₂ sequestered regions in Samail ophiolite massifs of the Sultanate of Oman. Earth-Science Rev 135:122–140
Article Google Scholar
Scott AJ, Symons MJ (1971) Clustering methods based on likelihood ratio criteria. Biometrics 27:387. https://doi.org/10.2307/2529003
Article Google Scholar
Sekandari M, Masoumi I, Pour AB et al (2020) Application of Landsat-8, Sentinel-2, ASTER and WorldView-3 spectral imagery for exploration of carbonate-hosted Pb-Zn deposits in the Central Iranian Terrane (CIT). Remote Sens. 12:1239. https://doi.org/10.3390/RS12081239
Shebl A, Csámer Á (2021a) Lithological, structural and hydrothermal alteration mapping utilizing remote sensing datasets: a case study around Um Salim area. Egypt IOP Conf Ser Earth Environ Sci 942:012032. https://doi.org/10.1088/1755-1315/942/1/012032
Article Google Scholar
Shebl A, Csámer Á (2021b) Stacked vector multi-source lithologic classification utilizing machine learning algorithms: data potentiality and dimensionality monitoring. Remote Sens Appl Soc Environ 24:100643. https://doi.org/10.1016/J.RSASE.2021.100643
Article Google Scholar
Shebl A, Csámer Á (2021c) Reappraisal of DEMs, radar and optical datasets in lineaments extraction with emphasis on the spatial context. Remote Sens Appl Soc Environ 24:100617. https://doi.org/10.1016/J.RSASE.2021.100617
Article Google Scholar
Shebl A, Abdellatif M, Elkhateeb SO, Csámer Á (2021a) Multisource data analysis for gold potentiality mapping of Atalla area and its environs, Central Eastern Desert, Egypt Miner. 11:641. https://doi.org/10.3390/MIN11060641
Shebl A, Abdellatif M, Hissen M et al (2021b) Lithological mapping enhancement by integrating Sentinel 2 and gamma-ray data utilizing support vector machine: a case study from Egypt. Int J Appl Earth Obs Geoinf 105:102619. https://doi.org/10.1016/J.JAG.2021.102619
Article Google Scholar
Sheikhrahimi A, Pour AB, Pradhan B, Zoheir B (2019) Mapping hydrothermal alteration zones and lineaments associated with orogenic gold mineralization using ASTER data: a case study from the Sanandaj-Sirjan Zone. Iran Adv Sp Res 63:3315–3332. https://doi.org/10.1016/J.ASR.2019.01.035
Article Google Scholar
Shi Y, Ma D, Lv J, Li J (2021) ACTL: asymmetric convolutional transfer learning for tree species identification based on deep neural network. IEEE Access 9:13643–13654. https://doi.org/10.1109/ACCESS.2021.3051015
Article Google Scholar
Takodjou Wambo JD, Pour AB, Ganno S et al (2020) Identifying high potential zones of gold mineralization in a sub-tropical region using Landsat-8 and ASTER remote sensing data: a case study of the Ngoura-Colomines goldfield, eastern Cameroon. Ore Geol Rev 122:103530. https://doi.org/10.1016/J.OREGEOREV.2020.103530
Article Google Scholar
Wang F, Zhen Z, Wang B, Mi Z (2017) Comparative study on KNN and SVM based weather classification models for day ahead short term solar PV power forecasting. Appl Sci 8:28. https://doi.org/10.3390/app8010028
Article Google Scholar
Yamaguchi Y, Fujisada H, Tsu H et al (2001) ASTER early image evaluation. Adv Sp Res 28:69–76
Article Google Scholar
Yu L, Porwal A, Holden EJ, Dentith MC (2012a) Towards automatic lithological classification from remote sensing data using support vector machines. Comput Geosci 45:229–239. https://doi.org/10.1016/j.cageo.2011.11.019
Article Google Scholar
Zhang X, Pazner M, Duke N (2007) Lithologic and mineral information extraction for gold exploration using ASTER data in the south Chocolate Mountains (California). ISPRS J Photogramm Remote Sens 62:271–282. https://doi.org/10.1016/J.ISPRSJPRS.2007.04.004
Article Google Scholar
Zoheir B, Weihed P (2014) Greenstone-hosted lode-gold mineralization at Dungash mine, Eastern Desert. Egypt J African Earth Sci 99:165–187
Article Google Scholar
Zoheir B, El-Wahed MA, Pour AB, Abdelnasser A (2019) Orogenic gold in transpression and transtension zones: field and remote sensing studies of the barramiya–mueilha sector. Egypt Remote Sens 11:2122
Article Google Scholar

Download references

Acknowledgements

The authors are thankful to USGS and ESA for providing the data. Thanks to Prof. Mahmoud Ashmawy, Prof. Mohamed Abdelwahed, and Prof. Samir Kamh (Tanta University) for their support. Ali Shebl is funded by the Stipendium Hungaricum scholarship under the joint executive program between Hungary and Egypt. The authors highly appreciate editors’ and reviewers’ valuable and profound comments that greatly enhanced our manuscript.

Funding

Open access funding provided by University of Debrecen. This research received no external funding. This research is supported by Debrecen University, China National Science Foundation (91755213, 41961144020), and China University of Geosciences (MSFGPMR02—3).

Author information

Authors and Affiliations

Department of Mineralogy and Geology, University of Debrecen, Debrecen, Hungary
Ali Shebl & Árpád Csámer
Department of Geology, Tanta University, Tanta, Egypt
Ali Shebl
State Key Lab of Geological Processes and Mineral Resources, Center for Global Tectonics, School of Earth Sciences, China University of Geosciences, Wuhan, China
Timothy Kusky
Badong National Observatory and Research Station for Geohazards, China University of Geosciences, Wuhan, China
Timothy Kusky

Authors

Ali Shebl
View author publications
You can also search for this author in PubMed Google Scholar
Timothy Kusky
View author publications
You can also search for this author in PubMed Google Scholar
Árpád Csámer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ali Shebl.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Additional information

Responsible Editor: Biswajeet Pradhan

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shebl, A., Kusky, T. & Csámer, Á. Advanced land imager superiority in lithological classification utilizing machine learning algorithms. Arab J Geosci 15, 923 (2022). https://doi.org/10.1007/s12517-022-09948-w

Download citation

Received: 12 October 2021
Accepted: 20 March 2022
Published: 04 May 2022
DOI: https://doi.org/10.1007/s12517-022-09948-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Advanced land imager superiority in lithological classification utilizing machine learning algorithms

Abstract

Similar content being viewed by others

Data Integration for Lithological Mapping Using Machine Learning Algorithms

Mapping sequences and mineral deposits in poorly exposed lithologies of inaccessible regions in Azad Jammu and Kashmir using SVM with ASTER satellite data

Using remote sensing data for geological mapping in semi-arid environment: a machine learning approach

Introduction