Assessment of Machine Learning Algorithms for Land Cover Classification in a Complex Mountainous Landscape

Amin, Gomal; Imtiaz, Iqra; Haroon, Ehsan; Saqib, Najum us; Shahzad, Muhammad Imran; Nazeer, Majid

doi:10.1007/s41651-024-00195-z

Assessment of Machine Learning Algorithms for Land Cover Classification in a Complex Mountainous Landscape

Open access
Published: 30 August 2024

Volume 8, article number 34, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Geovisualization and Spatial Analysis Aims and scope Submit manuscript

Assessment of Machine Learning Algorithms for Land Cover Classification in a Complex Mountainous Landscape

Download PDF

Abstract

Mapping land cover (LC) in mountainous regions, such as the Gilgit-Baltistan (GB) area of Pakistan, presents significant challenges due to complex terrain, limited data availability, and accessibility constraints. This study addresses these challenges by developing a robust, data-driven approach to classify LC using high-resolution Sentinel-2 (S-2) satellite imagery from 2019 within Google Earth Engine (GEE). The research evaluated the performance of various machine learning (ML) algorithms, including classification and regression tree (CART), maximum entropy (gmoMaxEnt), minimum distance (minDistance), support vector machine (SVM), and random forest (RF), without extensive hyperparameter tuning. Additionally, ten different scenarios based on various band combinations of S-2 data were used as input for running the ML models. The LC classification was performed using 2759 sample points, with 70% for training and 30% for validation. The results indicate that the RF algorithm outperformed all other classifiers under scenario S1 (using 10 bands), achieving an overall accuracy (OA) of 0.79 and a kappa coefficient of 0.76. The final RF-based LC mapping shows the following percentage distribution: barren land (46.7%), snow cover (22.9%), glacier (7.9%), grasses (7.2%), water (4.7%), wetland (2.9%), built-up (2.7%), agriculture (1.9%), and forest (1.2%). It is suggested that the best identified RF classifier within the GEE environment should be used for advanced multi-source data image classification with hyperparameter tuning to increase OA. Additionally, it is suggested to build the capacity of various stakeholders in GB for better monitoring of LC changes and resource management using geospatial big data.

Evaluating the impact of classification algorithms and spatial resolution on the accuracy of land cover mapping in a mountain environment in Pakistan

Article 01 February 2017

Spatial Resolution Impacts on Land Cover Mapping Accuracy

Article 06 August 2024

Land use and land cover classification using machine learning algorithms in google earth engine

Article 19 August 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Land cover (LC) can be significantly altered by factors such as excessive agricultural development, rapid population growth, and the overexploitation of natural resources, leading to landscape degradation (Beuchle et al. 2015). The mountain environments worldwide, including the Hindu Kush, Karakoram, and Himalayan (HKH) range, are highly susceptible to various anthropogenic and natural hazards, such as climate change, tourism, urbanization, population growth, and economic development (us Saqib et al. 2019; Saini & Singh 2024). These mountain regions are prone to natural disasters like landslides, flash floods, earthquakes, and glacier lake outburst floods (GLOFs) (Bacha et al. 2018; Jamil et al. 2019; Xu et al. 2009). Understanding the LC patterns in such areas can help in identifying locations that are at greater risk of such kind of natural disasters and can aid in developing strategies to mitigate them (Saini & Singh 2024).

Apart from these natural disasters, the rapid population growth, excessive agricultural expansion, and uncontrolled urbanization in mountainous area have led to the overexploitation of natural resources, landscape degradation, and land deterioration. Understanding the LC features and implementing better management strategies for natural resources are crucial for sustainable development in the environmental, social, and economic sectors (Dang and Kawasaki 2017). Any adverse impact on the fragile mountainous environment can pose severe challenges for the large human population residing in these areas. Different studies have emphasised the importance of LC analysis for effective management of natural resources in the fragile mountainous environments (Satti et al. 2023). To examine such changes in LC with exceptionally high accuracy, remote sensing (RS) images classified with machine learning (ML) algorithms are considered as standard tools that are persistently used all around the world (Gargiulo et al. 2020; Jia et al. 2023).

A large number of ML-based LC classification algorithms have been explored over the past decade to produce accurate, up-to-date, and long-term LC maps (Zhang & Zhang 2020; Wang et al. 2023). For instance, artificial neural network (ANN) (Yuan et al. 2009), random forest (RF) (Gislason et al. 2006; Wang et al. 2023), classification and regression tree (CART) (Shao & Lunetta 2012), and support vector machine (SVM) (He et al. 2005) have demonstrated superior performance in mapping different LC types compared to traditional classifiers (Belgiu and Drăgu 2016). The RF classifier is particularly popular in the RS community (Xiong et al. 2017) due to its high accuracy, achieved by constructing multiple decision trees (DTs). For example, RF algorithm was used in Western Himalayas to classify vegetation types with an overall accuracy (OA) of 80%, considering topographic and climate variables for improved accuracy (Singh et al. 2023). Similarly, a study conducted by Zurqani (2024) for forest canopy cover using RF achieved an OA ranging between 83.31 and 94.35%. Moreover, Mansaray et al. (2019)deployed both SVM and RF classifiers for mapping paddy rice in China using Landsat-8 and Sentinel-2 images for the 2015 and 2016, and the RF classifier demonstrated higher accuracy (95%) compared with the SVM classifier (90.8%). Delalay et al. (2019) utilized CART, maximum entropy, and RF classifiers within the Google Earth Engine (GEE) environment for LC classification using Sentinel-2 data. The results showed that the RF technique had the highest OA (95%), followed by maximum entropy (93%) and CART (61%) in the mountainous region of Nepal. A recent study (Mahmoodzada et al. 2024) utilized the SVM and the multilayer perceptron (MLP) to map snow cover area in Pamir region of Hindukush with kappa coefficient of 0.75 and 0.83, respectively. Shetty et al. (2021) evaluated the impacts of training sampling design on LC classification results within the GEE, concluded RF outperformed both CART and SVM.

The selection of an appropriate ML classifier for LC mapping is a challenging task due to the large number of available algorithms, their varying computational performance, and the conflicting information about their OA. Additionally, the combination of spectral bands used as input can significantly affect the classification accuracy (Shetty et al. 2021; Xiong et al. 2017). Various researchers have explored the use of different spectral band combinations from RS data, such as Sentinel-2 (Silveira et al. 2023) to improve the LC classification accuracy (Gumma et al. 2020; Stromann et al. 2020). However, there appears to be a lack of published research evaluating the performance of diverse ML algorithms applied to Sentinel-2 imagery for LC mapping in the HKH region of Pakistan. The majority of existing studies in this geographic context have focused on classifying a limited set of LC types (Khan et al. 2020a, b; Qamer et al. 2016; Satti et al. 2023, 2024). These investigations have typically employed single image and conventional classification algorithms for mapping within small sub-regions, often resulting in varying and even contradictory outcomes. For instance, in the current study area Khan et al., (2019) and Ali et al., (2019a, b) performed LC mapping for Gilgit city and Gilgit district in Pakistan, respectively, using Landsat data and maximum likelihood classifier (MLC) to identify five generic LC classes. This makes it challenging to compare the accuracy of the generated LC maps and identify the most reliable approach for this complex mountainous environment (Delalay et al. 2019). As of the time of writing this paper, there have been no published studies focusing on detailed LC mapping in the HKH region of Pakistan utilizing high-resolution RS imagery, despite the significant advancements in the field.

This gap in the literature underscores the need for a comprehensive evaluation of the performance of a wider range of ML algorithms, including their computational efficiency and classification accuracy, when applied to high-resolution Sentinel-2 data for LC mapping in the HKH region of Pakistan. Such an assessment would provide valuable insights to support the selection of the most appropriate LC classification approach for this ecologically significant yet geographically challenging area. When mapping LC over a large extent (such as the current study area), researchers had to consider the key challenges regarding the processing of ‘big earth data’ and the availability of images (Satti et al. 2024). Previous researchers in the study area were limited in their ability to run ML classification algorithms due to constraints in computing power and storage. However, the utilization of GEE free cloud-based computing platform has enabled scientists to utilize the satellite data for large-scale LC mapping in a more efficient and effective way (Gorelick et al. 2017; Zurqani 2024).

The objective of this scientific study is to compare and evaluate the performance of various machine learning algorithms available within the Google Earth Engine platform for land cover mapping in the HKH region of Pakistan. This assessment will be conducted without any hyperparameter tuning, using the full range of spectral band combinations from Sentinel-2 imagery with a temporal aggregation method. Moreover, the final product generated from this study will be made freely available for users, facilitating broader access and utilization of the LC mapping results to support land management, spatial planning, and disaster risk management to achieve sustainable development in the region.

Material and Methods

Study Area

Gilgit-Baltistan (GB) located in the north of Pakistan is characterized by a remote mountainous environment, surrounded by the world’s famous highest mountain ranges, i.e., Hindu Kush, Karakoram, and Himalaya. Administratively, GB is divided into ten districts, namely, Astore (5179 km²), Diamer (6901 km²), Ghanche (8525 km²), Ghizer (12043 km²), Gilgit (4009 km²), Hunza (11343 km²), Nagar (2993 km²), Kharmang (2802 km²), Shigar (8810 km²), and Skardu (7200 km²) (Amin et al. 2021) (Fig. 1).

GB can be broadly classified into five distinct ecological zones: dry alpine zones and permanent snowfields, alpine meadows and alpine scrub, sub-alpine scrub, dry temperate coniferous forest, and dry temperate evergreen oak scrub. Moreover, GB has an exceptionally complex mountain system, with approximately 90% of its area covered by rugged mountain ranges and glaciers, while the remaining area consists of arable land (Hussain & Bangash 2017).

The climate of Gilgit-Baltistan is influenced by both the monsoon season, which contributes up to 80% of the region’s summer precipitation, and westerly cyclones, which account for approximately 66% of the high-elevation snowfall. However, the steep topography of the Karakoram range diminishes the influence of these wind systems as one moves towards the northern parts of the region (Bolch et al. 2012; Rankl et al. 2014). Generally, the weather conditions are severe with cold winters and extremely hot summers. The region receives precipitation of approximately 200–2000 mm per year, varying in different elevation zones, whereas temperature ranges between 10 °C (in winters) and 40 °C (in summer) depending on the valley’s elevation range (Gilani et al. 2020; Nawaz et al. 2019).

Training Sample Selection

Nine LC classes (Table 1) were defined based on a detailed literature review of the study area (Ali et al. 2019a, b; Gumma et al. 2020; Khan et al. 2020a, b; Qamer et al. 2016; Rahim et al. 2018). Training and validation sample points for each LC class were selected through simple random sampling using high-resolution Google Earth imagery and the authors’ personal experience of the study area. During the preparation and verification of the sample points, the principles of ‘consistency’ and ‘reliability’ were carefully maintained (Hill et al. 2008). This involved minimizing the inclusion of mixed pixels by avoiding sample collection from the edges of LC class boundaries and fragmented landscapes (Hu & Hu 2019; Phan et al. 2020).

Table 1 General information about various land cover classes with labels identified in the study area

Full size table

Auxiliary data available in the GEE, such as the Advanced Land Observing Satellite (ALOS) Digital Surface Model (DSM) (Tadono et al. 2014), Shuttle Radar Topography Mission (SRTM) Digital Elevation Model (DEM) (Farr et al. 2007), and Defence Meteorological Satellite Program (DMSP) Operational Line-Scan System (OLS) night time imagery, were used to support the selection of training samples and improve the overall classification accuracy. Additionally, DEM-derived products, such as slope, aspect, and elevation, were utilized as supplementary data for visual interpretation during the training sample selection process.

In total, 2759 sample points were randomly divided, with 70% used for training the classification models and 30% reserved for validation and accuracy assessment. The distribution of sample points across the nine LC classes was as follows: water (341), forest (304), grasses (306), wetland (364), agriculture (389), barren land (467), built-up (197), glacier (176), and snow (215). The same training and validation datasets were used to evaluate the performance of each classification model.

Data Acquisition and Processing

The European Space Agency (ESA) and the European Commission (EC) launched the Sentinel-2 (S-2) satellites in 2015 and 2017 under the Copernicus program. The Sentinel-2A and 2B satellites have a revisit interval of 10 days individually and a combined revisit time of 5 days. The improved spatial and spectral resolution of the S-2 imagery has opened up new possibilities for environmental studies and monitoring (ESA 2015; Gargiulo et al. 2020). The twin S-2 satellites provide global coverage with 13 spectral bands, ranging from the visible to the short-wave infrared (SWIR) wavelengths, offering different spatial and spectral resolutions (Table 2). In the GEE catalogue, the Sentinel-2 Multispectral Instrument (MSI) data is available as Level-2A Surface Reflectance (SR) product, processed by running the sen2cor algorithm (COPERNICUS 2017). The Sentinel-2 Level-2A SR data within the GEE can be accessed using the code snippet ‘ee.ImageCollection(“COPERNICUS/S2_SR”)’, which provides 12 UINT16 spectral bands representing surface reflectance scaled by a factor of 10,000.

Table 2 Details of spatial and spectral resolution of Sentinel-2 satellite

Full size table

Figure 2 provides an overview of the image processing steps employed to map the LC of the study area. The methodology implemented in this study involved temporal aggregation of all available images within the study area to generate a composite image free from cloud cover. To mitigate the influence of monsoon rains and fresh snow on the classification outcome, a temporal frame ranging from 1st May to 30th September 2019 (152 days) and cloud coverage threshold of less than 20% was selected. This resulted in an image collection of 192 scenes, distributed across ten Sentinel-2 tiles. Subsequently, a 10-m spatial resolution cloud free image composite was generated by calculating the median pixel value from the entire image collection.

The median function was utilized as the temporal aggregation method to minimize the impact of missing or gapped cells in the Sentinel-2 imagery. This approach is commonly employed in the literature to reduce noise, particularly along scene borders (Carrasco et al. 2019; Rudiyanto et al. 2019). For each of the 10 spectral bands considered, any pixel values identified as blanks were replaced by the median value of that band across all images in the acquisition period. This ensures that a pixel identified as blank remains as such only if all images taken during the study period have a blank value for that location. However, in this study, no blank or void pixels were identified after the temporal aggregation process was applied.

For Sentinel-2 image classification, ten scenarios with different band combinations were used (Table 3) based on comprehensive literature review (Adepoju & Adelabu 2020; Alifu et al. 2020; Li et al. 2020; Xiong et al. 2017). These studies have suggested specific band combinations that have shown promising results in LC mapping applications. By evaluating these band combinations against ML classifiers in the GEE platform, we aimed to identify the best performing classifier without engaging in any hyperparameter tuning.

Table 3 Rationale and detail of experimental scenarios used in the study based on S-2 SR band combinations

Full size table

Classification

The ML classifiers included in the current study were classification and regression tree (CART), maximum entropy, minimum distance, support vector machine (SVM), and random forest (RF). All of these classification models were implemented without any hyperparameter tuning, using the default parameter values to ensure a consistent comparison across all models (Table S1), as suggested by Maxwell et al. (2018).

CART Classifier

The CART algorithm proposed by Breiman et al. (1984) is a decision tree construction method that works on the principle of a dichotomous recursive segmentation system. The CART algorithm utilizes the Gini coefficient as the criterion for identifying the ideal test variances and segmentation thresholds to create a binary tree-based decision tree (DT) for classification. The CART algorithm operates by recursively splitting the training data at each decision node, known as the greedy splitting approach, to increase the homogeneity of the data in the resulting nodes based on a statistical test such as the Gini index. The Gini coefficient is defined as follows in Eqs. (1, 2 and 3):

$$\text{Gini} \text{Index} =1-{\sum }_{j}^{.}{P}^{2}(j|h)$$

(1)

$$P(j|h)=\frac{{n}_{j}(h)}{n(h)}$$

(2)

$${\sum }_{j=1}^{.}P(j|h)=1$$

(3)

where P(j|h) is a randomly selected sample from a training set or relative frequency of category, n_j(h) corresponds to the number of samples in category j when the value of test variable in the training set is h node, where n(h) is the number of samples in the training dataset with the test variable value of h, and j denotes the category number.

To be precise, the CART algorithm takes the training dataset and partitions it into smaller subsets recursively. This partitioning process continues until the smaller cells are grouped based on the same class label, with the maximum accuracy of prediction validated by the pruning value (Hayes et al. 2015; Mondal et al. 2019). The CART algorithm does not require parameters and has the advantages of fast operating speed and easy manipulation (Shao & Lunetta 2012); different studies have successfully deployed CART for classification of satellite imagery with promising accuracy (Hu et al. 2018; Johansen et al. 2015).

Maximum Entropy Classifier

Maximum entropy (MaxEnt) or gmoMaxEnt classifier in GEE is based on the maximum entropy principle to select the data with maximum entropy from all training sets (Mcdonald et al. 2009). It works best in a condition where the prior distribution and conditional dependency are unknown making it difficult to perform prediction with any assumption. Therefore, the gmoMaxEnt classifier utilizes a machine learning approach to perform spatial predictions using incomplete or limited training data (Moreno et al. 2011).

Minimum Distance Classifier

The minimum distance classifier uses spectral characteristics of the training samples which have been selected as representatives of the different feature classes. The Euclidean distance between the selected pixel values and the mean values of each class is calculated. Later, the candidate or selected pixel is assigned to the class with which it has the shortest Euclidean distance (Hu & Hu 2019).

SVM Classifier

SVM was described by Cortes and Vapnik (1995) which is commonly used in a range of RS applications (Rudiyanto et al. 2019; Stromann et al. 2020). SVM is a supervised machine learning technique that aims to find an ideal hyperplane that discriminates different classes from their decision boundary. During classification, SVM classifiers use an iterative process to allocate candidate pixels to classes by maximizing class separability from the training set and labels each pixel according to their nearest class in feature space (Boser et al. 1992; Tsai et al. 2018). The selection of the support vectors mainly depends on the choice of cost parameter C, kernel functions, and Gamma. The most used kernel functions include linear, polynomial, and radial basis function (RBF). A detailed description of the SVM classifier can be found in Melgani and Bruzzone (2004). The mathematical equations of linear, polynomial, radial basis and sigmoid kernel functions are listed below as Eqs. (4–7).

$$k\left({x}_{i, } {y}_{i}\right)= {x}_{i}^{T }. {x}^{j}$$

(4)

$$k({x}_{i }{y}_{i)= }{(\gamma . {x}_{j }^{T}.{x}_{j}+ r)}^{d}$$

(5)

$$k({x}_{i }{y}_{i)= }{e}^{{-\gamma ({x}_{i}{-x}_{j)}}^{2}}$$

(6)

$$k({x}_{i }{y}_{i)= }\text{tan}h(\gamma .{x}_{j}^{T}.{x}_{j+ }r)$$

(7)

where k is kernel, j is the feature, ${x}_{i}$ are input data points, and ${y}_{i}$ are the corresponding output data points. In polynomial kernel, d shows the degree of polynomial, whereas $r$ in the polynomial and sigmoid function is considered as bias term.$r$ is the gamma term that presents in all types of function except linear which describes the impact of the training range. The model will be constrained and not be able to handle the complexity of data if the value of gamma is too small and contrariwise.

RF Classifier

Random forest (RF) is a nonparametric supervised ML algorithm (Lee et al. 2018). The RF classifier is a nonlinear, relatively fast classifier that acts robustly to noisy training data. The RF algorithm was developed as an extension of the CART decision tree method and generates multiple classification trees to improve the overall prediction performance of the model. It operates by using a number of decision trees (DTs), where each tree is created from an independently constructed random sample of the training data to assign classification labels to each class (McCord et al. 2017). The RF algorithm applies a bagging technique, randomly selecting a subset of features from the input observations for each decision tree to be grown (Belgiu and Drăgu 2016).

As mentioned in Ahmed et al. (2019), for each tree ‘d’ from total ‘D’ number of trees, select any random data from training dataset and create the random forest tree T_d by randomly select ‘i’ point. This process must be recursive until the minimum node size is achieved. Combining the outputs of all trees as we get Eq. (8):

$$\sum\nolimits_{d=1}^{D}={T}_{d}(i)$$

(8)

For the new prediction at any point i, the regression will be Eq. (9):

$${F}_{rf}^{d}(i)=\frac{1 }{d}\sum\nolimits_{d=1}^{D}{T}_{d}(i)$$

(9)

Methods for Accuracy Assessment

The accuracy of the LC classification was measured using various derivates of the confusion matrix, such as overall accuracy (OA), kappa coefficient, producer’s accuracy (PA), and user’s accuracy (UA) for each class. The OA represents the percentage of pixels that were correctly labelled by the classifier, while the kappa coefficient is a measure of the overall agreement between the classification results and the reference data, accounting for chance agreement. The OA estimation and the kappa coefficient were used to compare the classification accuracy of each machine learning algorithm across the different band combination scenarios, ultimately supporting the selection of the best-performing classifier. Additionally, the Pearson correlation (PC) coefficient was calculated between the OA and kappa values. The PC coefficient measures the statistical relationship or association between these two continuous variables, providing information about the magnitude and direction of the correlation, which can range from − 1 to + 1 (Benesty et al. 2009).

To further evaluate the performance of the LC classification models, we employed standard metrics including precision, recall, and F1-score (Saini & Singh 2024). These metrics were computed for each LC class, providing a comprehensive evaluation of the classifier’s ability to accurately identify and differentiate between distinct LC types. Precision quantifies the proportion of correctly classified samples within a specific class, while recall represents the proportion of samples from a given class that were correctly identified. The F1-score, a harmonic mean of precision and recall, offers a balanced measure of the classifier’s overall performance for each class, providing a robust indicator of the model’s effectiveness in classifying different LC types (Rapinel et al. 2023).

Results

Comparison of Classification Accuracy of GEE Classifiers

The results show that the various ML classifiers achieved differing levels of OA and kappa index under the different band combination scenarios (Table 4 and Fig. 3). The 10-band set (S1, as detailed in Table 3) of Sentinel-2 data, covering the visible to SWIR band regions, resulted in OAs ranging between 0.59 and 0.79 and kappa coefficients between 0.54 and 0.76 across the ML algorithms. In the S2 scenario (Table 3), which utilized a different band configuration, the CART classifier achieved the highest OA of 0.73 and kappa of 0.69 (Table 4). In contrast, the S9 scenario, which used only the combination of Sentinel-2 bands B2-B4, did not perform well, with no classification method exhibiting satisfactory accuracy. The comparative analysis of the ML algorithms and band combinations provides valuable insights for the final selection of the optimal approach for the LC mapping task.

Table 4 Overall accuracy and Kappa statistics for each classification method under investigation for each scenario. The best assessed statistical values are indicated in bold font and poor results are indicated in italic and bold (column wise)

Full size table

From the results presented in Table 4, it is evident that the RF classifier generally had the highest OA and kappa values across all the evaluated scenarios. The SVM classifier had the second-highest OA and kappa values in some scenarios, but its performance was generally lower than that of RF. In contrast, the minDistance classifier exhibited the lowest OA and kappa values across all scenarios and among all the classifiers. The gmoMaxEnt and CART classifiers generally had an intermediate level of performance, with OA and kappa values that were lower than RF but higher than minDistance. To better represent and explain the results, the relatively low-performing scenarios for all classifiers were dropped, and only the top five scenarios were selected for further analysis and discussion (Fig. 3). This approach allows for a more focused and informative presentation of the most promising classification outcomes for Gilgit-Baltistan.

A PC coefficient of 0.9 with a p-value of 0.0 was observed between the OA and kappa values across all ML classifiers. This indicates a strong positive and statistically significant relationship between these two-performance metrics. The close correspondence between OA and kappa suggests that the kappa index was closely tied to the overall classification accuracy, such that higher OA values were consistently associated with higher kappa scores, and vice versa.

The analysis of individual LC class accuracies revealed several notable trends. The RF exhibited strong performance, achieving high PA of 0.97 and UA of 0.88 for the water class. In contrast, the SVM and minDistance were unable to match the UA and PA values attained by RF for this class. RF also demonstrated high UA (0.77) and PA (0.75) for the forest cover, as well as for the grasses (PA = 0.71, UA = 0.78). Among the classifiers, CART and minDistance showed the least underestimation for the grasses class, with UA = 0.55 and PA = 0.48. For agriculture class, RF achieved a high PA of 0.89 and UA of 0.78, while minDistance recorded relatively lower UA and PA values (UA = 0.60, PA = 0.65). The RF classifier also performed exceptionally well for the wetland class, with UA of 0.85 and PA of 0.79, whereas minDistance exhibited considerably lower UA (0.53) and PA (0.69) values. Interestingly, SVM attained the highest PA (0.93) but the lowest UA (0.59) among all classifiers for the barren land class. Similarly, for the built-up areas, RF achieved the highest UA (0.89), while minDistance had the lowest UA (0.32). However, SVM had the lowest PA (0.07), whereas the gmoMaxEnt classifier recorded the highest PA (0.68) for the built-up class. In the case of glacier classification, CART achieved the highest PA (0.59), while gmoMaxEnt had the lowest PA (0.14). RF, on the other hand, attained the highest UA (0.67), and minimum distance had the lowest UA (0.39). For the snow class, minDistance obtained the highest UA (0.97) and gmoMaxEnt had the lowest UA (0.75), while RF achieved the highest PA (0.97), and CART had the lowest PA (0.89). These findings suggest that classifiers utilizing a higher number of spectral bands generally exhibit more favourable UA and PA performance across the various LC classes.

Moreover, misclassification of LC classes was observed across all classification results. For instance, many barren land validation points were incorrectly labelled as agriculture, and similar issues occurred for built-up areas being mislabelled as water (Fig. 4a–e). These misclassification patterns highlight the challenges in accurately discriminating certain LC types, likely due to spectral similarities or mixed pixels. Despite these issues, the comparison of OA and kappa index indicates that the RF classifier performed excellently among the five implemented algorithms, by exhibiting the highest and most consistent classification results.

Moreover, we also compared and evaluated the classification performance of ML algorithms across different LC categories using precision, recall, and F1-score metrics (Fig. 5). These findings reveal that RF demonstrated superior precision, recall, and F1-scores for water bodies, wetlands, agriculture, barren land, and snow LC type. It achieves precision scores of 0.96, 0.83, 0.85, and 0.91, respectively. SVM and gmoMaxEnt also exhibit competitive precision scores, particularly in the water, wetland, and agriculture categories. However, all algorithms struggle to accurately classify the forest category, with significantly lower precision scores across the board. The minimumDistance algorithm consistently performs the poorest in terms of precision for all LC categories. When considering recall, RF again shows strong performance in correctly identifying LC class of water (0.94), grasses (0.67), and snow (0.98). However, SVM and gmoMaxEnt exhibit relatively lower recall scores, especially for the forest and wetland categories. The minimumDistance algorithm consistently has lower recall scores compared to other algorithms, indicating difficulties in correctly classifying various land LC. Comparing the performance of CART, it also shows mixed results across different LC categories. In terms of precision, CART performs relatively well for water (0.82), agriculture (0.71), and barren land (0.75). However, its precision scores are lower compared with RF in most categories. CART struggles particularly in the forest category, achieving a precision score of 0.57, which is significantly lower compared to other algorithms. In terms of recall, CART performs decently for water (0.86) and barren land (0.68) categories. However, it falls behind RF in terms of recall scores across most LC classes. It demonstrates challenges in correctly identifying instances of forest and wetland, with recall scores of 0.59 and 0.73, respectively.

Additionally, focusing on the F1-score (Fig. 5c), RF consistently achieves high scores for water, agriculture, barren land, and snow categories, indicating a balanced performance in terms of precision and recall. SVM and gmoMaxEnt also exhibit competitive F1-scores for the water, wetland, and agriculture categories. Like the other metrics, the forest category poses a challenge for all algorithms, resulting in relatively lower F1-scores. The minDistance algorithm consistently performs the worst across all LC categories, while the CART achieves moderate F1-scores scores for water (0.75) and barren land (0.71) categories. However, its overall performance is lower compared to RF and SVM in most LC categories.

Comparing the performance among LC classes (Fig. 5), it is evident that the algorithms excel in different categories. RF consistently performs well in water, agriculture, barren land, and snow. SVM and gmoMaxEnt showcase strengths in water and wetland categories. However, all algorithms struggle with accurately classifying the forest category, indicating the inherent complexity of distinguishing forest cover. CART, on the other hand, lags behind RF, SVM, and gmoMaxEnt in terms of F1-scores across most LC categories. It struggles in accurately classifying the forest category, resulting in a relatively lower F1-score of 0.58. Moreover, the F1-scores algorithm consistently performs the poorest among all the algorithms, indicating limitations in accurately classifying LC categories.

These results suggest that RF is the most reliable algorithm across multiple LC categories, demonstrating superior precision, recall, and F1-scores. However, the performance of each algorithm varies depending on the specific LC class. These findings emphasize the importance of selecting appropriate ML models based on the target LC type. Further research and experimentation are required to uncover the underlying factors causing the observed performance variations and to refine the classification accuracy for challenging LC categories, such as forests.

Land Cover Classification of Gilgit-Baltistan

The classified maps (Fig. 3) based on ten different scenarios (Table 3) and five ML classifiers were visually the same and consistent, displaying all results was not possible due to limitation of space. Based on accuracy assessment (OA and Kappa), RF (scenario S1) was found to be the best ML classifier among all. Therefore, RF LC classification was selected as a final map and used for area distribution of each LC class in the Gilgit-Baltistan (Fig. 6).

The final RF-based LC results (Fig. 6 and Fig. 7) depict that barren land (33,201.77 km²) is the largest LC type classified in the study area, followed by snow cover (16,275.59 km²), glacier (5651.42 km²), grasses (5141.72 km²), water (3356.22 km²), wetland (2063.60 km²), build-up (1928.81 km²), agriculture (1316.73 km²), and forest (868.04 km²).

The study area is predominantly classified as barren land, which is distributed generally throughout the region, while the snow and glacier classes were found to be distributed primarily across the northern and eastern sides. The agriculture class has a distinct distribution, concentrated along the river and stream channels, which also contain the human settlements and built-up areas distributed throughout the valleys. Forest and wetland classes were found to be concentrated along the high-altitude ridges and rangelands, mostly between the south-west and north of the study area, while grasses were distributed in the northern, southern, south-eastern, and north-western parts, comprising the alpine pastures and rangelands that are usually covered with snow during the winter months (approximately 6 months). The built-up class in GB is typically distributed in the valleys, surrounded by agricultural and uncultivated lands, with the most concentrated areas observed in the main cities of Astore, Ghizer, Skardu, Chilas, and Gilgit (Ali et al. 2019a, b), providing insights into the landscape characteristics and human–environment interactions within the study area.

The district-wise area estimates of each LC class (Table 5) revealed that water is concentrated in Hunza district (1099.85 km²), district Diamer has the largest area of forest (353.66 km²), grasses (1162.57 km²), and wetland (644.36 km²), while the district Astore covers the largest area of agricultural land. Moreover, barren land and build-up were distributed largely in district Ghizer with an area of 7053.31 km² and 329.99 km², respectively, while glacier (1751.57 km²) and snow (4114.97 km²) classes contribute to the highest area in district Hunza and Shigar, respectively.

Table 5 District wise area estimates of each land cover class resulting from image classification using best performing RF algorithm with scenario S1. The colour bar indicates the highest (green) to lowest (red) and medium (orange to yellow) coverage of the land cover class area for each column in km²

Full size table

Discussion

Advantages and Opportunities of GEE Cloud Platform

Mapping LC features over large areas often faces challenges due to the limited availability and inconsistency of cloud-free satellite imagery. In this study, these challenges were addressed by employing temporal aggregation of Sentinel-2 imagery to create a comprehensive LC map for the Gilgit-Baltistan region of HKH. Previous classification efforts in this area had been hindered by the shortage of cloud-free images. The introduction of the web-based GEE cloud platform has significantly addressed the computational limitations that often hinder LC mapping in developing countries (Li et al. 2020; Zhou et al. 2020).

Few localized studies have been conducted in the study area, applying traditional approaches for LC mapping. However, as already mentioned, these previous efforts have utilized fewer LC classes or focused on easily distinguishable classes like barren land, forests, or water bodies in small sub-basin areas (Khan et al. 2020a, b; Qamer et al. 2016). Comparing our results with previous studies is challenging due to differences in methodologies and data sources. Previous studies in the area utilized Landsat-8 (L-8), Landsat-7/5/4, and MODIS products, while our study stands out as the first to utilize Sentinel-2 data. This is significant because Sentinel-2 offers shorter revisit times and higher-resolution imagery, enabling more accurate and practical LC classification in our target area.

The accuracy evaluation in this study demonstrates that the use of the GEE cloud platform enables robust and accurate regional scale LC mapping. GEE offers key advantages, such as the ability to quickly and precisely select sample points using supplementary data like socio-economic information, population, DMSP-OLS, DEM products, and satellite datasets, surpassing the capabilities of traditional tools. Moreover, GEE scripts can be easily enhanced for long-term monitoring of LC changes and driving indicators. Collaboration among stakeholders and agencies can further strengthen the capacity to address challenges like food security and flood mapping using open-source geospatial data. One such example is the SERVIR program, a joint NASA and USAID initiative that helps developing countries utilize Earth observation satellites and geospatial technologies (SERVIR 2005). A similar collaborative approach can be adopted in the Gilgit-Baltistan for improved planning and decision-making. The use of the GEE cloud platform has enabled the development of high-resolution, regional-scale LC products, overcoming the computational limitations that had previously hindered such large-scale mapping efforts (Faqe Ibrahim et al. 2023). This innovative approach provides a strong foundation for future LC monitoring and change analysis to support sustainable management in the study region.

Strengths and Implication of Our Land Cover Classification

The LC information on the nine classes mapped (Fig. 6) in this study is essential for improving our understanding of various earth surface processes in Gilgit-Baltistan. For instance, the delineation of water and glacier bodies can enable disaster monitoring of GLOFs, which are a frequent occurrence in the study area (Jamil et al. 2019).

During the current study, ML classifiers under investigation were not exhaustively tuned using their hyperparameters. Instead, the models were left to operate independently using their default input settings to map the land features based on the same training dataset. This approach identified the most efficient ML classifier, which upon further hyperparameter tuning could potentially achieve exceptional results. Choosing a set of optimal hyperparameters for a ML model is time exhausting process and is user-dependent which may affect the classification accuracy. Although default values for these parameters are usually suggested, to ensure that the accurate classification has been produced (Maxwell et al. 2018). The optimal parameters of the model vary from area to area depending on the quality of the dataset and the number of sample points, spatial distribution, and derivative features such as texture and spectral indices (Tsai et al. 2018). However, the limitation of less quantity of sample points in the current study was likely addressed and improved by the bagging method of RF classifier (Breiman et al. 1984) which performed very well in the current study. To overcome such limitations under various scenarios (Table 3), RF is considered one of the best ML classifiers and has been tested widely by many researchers (Gargiulo et al. 2020; Pradhan et al. 2020; Stromann et al. 2020). Thus, the current approach provides an opportunity for future studies in the Gilgit-Baltistan to select RF for LC classification, which upon hyperparameter tuning would provide excellent classification accuracy.

However, to estimate the extent of LC classes, various regional and global LC datasets exist which are prepared using different imagery (Landsat-5/7/8, S-2, and MODIS, etc.), algorithms, and with varying resolutions. Accuracy of these products is needed to be improved when applied at local or regional level (Wagle et al. 2020) and specially in the mountainous regions. Due to multiple differences such as input data, classifier type, data acquisition time, and spatial resolution, it creates poor agreement among different products when applied at the regional or global level. Thus, producing a reliable local or regional level LC classification products is essential and applicable. Figure comparing the multiple global LC models and current study results is included in the supplementary file (Fig. S1), which provide evidence for the stated argument.

During the study, few challenges were encountered, mostly pertaining to uncertainties between build-up, water, agriculture, wetland, and barren land mapping producing misclassification of these LC classes probably due to similar or alike spectral responses (Fig. 8 and 9). This problem was more related in villages where the houses are of masonry type with roofs covered with clay and sand (Rafi et al. 2016), causing misclassification among barren land and build-up class. Also, it was difficult to distinguish between croplands and seasonal grasslands using the S-2 imagery due to overlapping phenology signatures of agriculture, forest, and grasslands especially in case of their sparse presence (Fig. 8a). We tackled these difficulties by acquiring high-quality samples from secondary data, as well as using the high-resolution Google Earth data (as reference). However, higher mapping accuracy would likely be achieved with larger and more accurate training datasets along with hyperparameter tuning of classifier (Ka & Sa 2018; Tsai et al. 2018).

To establish a rigorous comparison and ensuring scientific validity of results of our LC model and global LC product, we utilized ESA WorldCover product (Chaaban et al. 2022; Zanaga et al. 2022) which has same spatial resolutions and is also produced using RF model with OA of 74.4% (Fig. 8). It is observed that our RF-based LC model achieves superior results at the local scale and can further improve with hyperparameter tuning. For instance, Fig. 8a–c reveals agricultural fields that are clearly visible and accurately mapped by our model. In contrast, the ESA WorldCover has inaccurately classified these areas as shrubland or grasslands. The reference image (Fig. 8j) acquired by an unmanned aerial vehicle (UAV) over the same area further corroborates the accurate mapping of agricultural fields by our model. Also, Fig. 8d–f illustrates an example where a barren land (braided river) was misclassified as water by WorldCover product (Fig. 7f), while the same extent is accurately mapped by current study model (Fig. 8e). Additionally, Fig. 8i highlights areas of misclassification compared to Fig. 8g–h, where the water class (water stream/river) in Fig. 8i was misclassified as shrubs, trees, and bare classes. Nevertheless, the ESA world cover product has better resolution as compared with other global products, but producing global scale LC maps still remains a challenge. In such cases, regional-level studies prove to be a better option, offering improved efficiency and OA of LC products. Practitioners at the local and regional level can utilize regional accurate data products for planning and designing new development programs for local communities.

Another aspect of current classification is the use of median operation for temporal aggregation of satellite imagery which might influence the overall classification accuracy. The use of median composite from Sentinel-2 data offers significant advantages LC mapping. To understand this, spectral response of median composite using all images (Fig. 9a) and a single image at same locations is provided (Fig. 9b). By combining data from multiple acquisitions, the median composite effectively reduces the impact of atmospheric conditions and minimizes temporal variability, resulting in more accurate and reliable reflectance values. The response of LC classes remained equivalent without showing irregular responses which suggests that the median composite has no erroneous data which might have negatively influenced the overall classification. However, there is an observed overlap in the median composite SWIR region (Band 12) for snow and water (Fig. 9a), which can be attributed the common physical properties of these two materials, such as their high reflectance in the NIR region and their relatively low absorption in the SWIR region (Shao et al. 2020). However, RF is well suited for tackling the challenge of overlapping reflectance values among various LC classes by employing ensemble learning. RF combines multiple decision trees trained on random subsets of the data from all bands, introducing diversity into the model. This allows the algorithm to capture a broader range of spectral patterns and features beyond the overlapping wavelengths, enhancing the discrimination between snow and water. Also, previous studies (Phan et al. 2020; Xie et al. 2019) have also validated the accuracy of median operation for preparing composite image.

Conclusions

The Sentinel-2 is exceptional among presently operating Earth-observation satellites due to its wide spectral wavelength, its 10 m spatial resolution, and revisit time. This study represents a first assessment and evaluation of five ML algorithms using GEE to classify LC classes using temporally aggregated Sentinel-2 data for the year 2019 (May–September) based on ten scenarios (50 LC products) over a complex mountainous environment. The five tested ML algorithms produced OA ranging between 0.59 and 0.79, without any hyperparameter tuning. Among these classifiers, two of the ML algorithms, RF and gmoMaxEnt performed exceptionally well with scenario S1, while CART (S2 scenario) and SVM (S1 scenario) performed ordinary with a difference of OA of 0.06 and 0.14 as compared with RF classifier (S1 scenario), respectively.

Moreover, the current study has compared GEE’s in-build ML algorithms using default input parameter values to remove biasness among classifiers, providing consistent environment. Doing so, the OA-based evaluation identified RF classifier best suitable for mapping mountainous areas like Gilgit-Baltistan with complex mountain system. Therefore, in the future, the best identified RF classifier with scenario S1 within GEE environment should be used for advance multi-source data image classification with hyperparameter tuning to increase overall OA and better prediction. Also, it is suggested to build the capacity of various stakeholders in Gilgit-Baltistan for better monitoring the LC changes and resource management using big data coupled with the GEE cloud platform.

Data Availability

The final land cover product and code is available at https://github.com/gomalhunzai/Gilgit-Baltistan-LandCover-GEE. Additional data will be made available on request.

References

Adepoju KA, Adelabu SA (2020) Improving accuracy evaluation of Landsat-8 OLI using image composite and multisource data with Google Earth Engine. Remote Sens Lett 11(2):107–116. https://doi.org/10.1080/2150704X.2019.1690792
Article Google Scholar
Ahmed N, Islam MN, Hasan MF, Motahar T, Sujauddin M (2019) Understanding the political ecology of forced migration and deforestation through a multi-algorithm classification approach: the case of Rohingya displacement in the southeastern border region of Bangladesh. Geol, Ecol, Landscapes 3(4):282–294. https://doi.org/10.1080/24749508.2018.1558025
Article Google Scholar
Ali K, Bajracharya RM, Chapagain NR, Raut N, Kumar B, Begum F, Khan MZ, Ali M, Ahmed A (2019) Analyzing Land Cover Change Using Remote Sensing and GIS : a Case Study of Gilgit 10(1):100–105
Google Scholar
Ali S, Biermanns P, Haider R, Reicherter K (2019) Landslide susceptibility mapping by using a geographic information system (GIS) along the China-Pakistan Economic Corridor (Karakoram Highway), Pakistan. Nat Hazard 19(5):999–1022. https://doi.org/10.5194/nhess-19-999-2019
Article Google Scholar
Alifu H, Vuillaume JF, Johnson BA, Hirabayashi Y (2020) Machine-learning classification of debris-covered glaciers using a combination of Sentinel-1/-2 (SAR/optical), landsat 8 (thermal) and digital elevation data. Geomorphology 369:107365. https://doi.org/10.1016/j.geomorph.2020.107365
Article Google Scholar
Amin G, Haroon E, Imtiaz I, Saqib NU, Shahzad MI (2021) Ecotourism potential assessment for Gilgit-Baltistan, Pakistan using integration of GIS, remote sensing AHP and crowd-sourced data. Geocarto Int 37(25):1–20. https://doi.org/10.1080/10106049.2021.2005157
Article Google Scholar
Bacha AS, Shafique M, van der Werff H (2018) Landslide inventory and susceptibility modelling using geospatial tools, in Hunza-Nagar valley, northern Pakistan. J Mt Sci 15(6):1354–1370. https://doi.org/10.1007/s11629-017-4697-0
Article Google Scholar
Belgiu M, Drăgu L (2016) Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogramm Remote Sens 114:24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Article Google Scholar
Benesty J, Chen J, Huang Y, Cohen I (2009) Pearson Correlation Coefficient BT - Noise Reduction in Speech Processing In: Cohen Y, Huang J, Chen J, Benesty (eds.); pp. 1–4. Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-00296-0_5
Beuchle R, Grecchi RC, Shimabukuro YE, Seliger R, Eva HD, Sano E, Achard F (2015) Land cover changes in the Brazilian Cerrado and Caatinga biomes from 1990 to 2010 based on a systematic remote sensing sampling approach. Appl Geogr 58:116–127. https://doi.org/10.1016/j.apgeog.2015.01.017
Article Google Scholar
Bolch T, Kulkarni A, Kääb A, Huggel C, Paul F, Cogley JG, Frey H, Kargel JS, Fujita K, Scheel M (2012) The state and fate of Himalayan glaciers. Science 336(6079):310–314
Article CAS Google Scholar
Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on Computational learning theory (COLT '92). Association for Computing Machinery, New York, 44–152. https://doi.org/10.1145/130385.130401
Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. CRC Press
Google Scholar
Carrasco L, O’Neil AW, Daniel Morton R, Rowland CS (2019) Evaluating combinations of temporally aggregated Sentinel-1, Sentinel-2 and Landsat 8 for land cover mapping with Google Earth Engine. Remote Sens 11(3):288. https://doi.org/10.3390/rs11030288
Article Google Scholar
Chaaban F, El Khattabi J, Darwishe H (2022) Accuracy assessment of ESA WorldCover 2020 and ESRI 2020 land cover maps for a region in Syria. J Geovisualization Spat Anal 6(2):31. https://doi.org/10.1007/s41651-022-00126-w
Article Google Scholar
Copernicus (2017) Sentinel-2 MSI: multispectral instrument, level-2A: Earth Engine Data Catalog. Google Earth Engine. https://developers.google.com/earth-engine/datasets/catalog/COPERNICUS_S2_SR_HARMONIZED
Cortes C, Vapnik V (1995) Support vector machine. Mach Learn 20(3):273–297
Article Google Scholar
Dang AN, Kawasaki A (2017) Integrating biophysical and socio-economic factors for land-use and land-cover change projection in agricultural economic regions. Ecol Model 344:29–37. https://doi.org/10.1016/j.ecolmodel.2016.11.004
Article Google Scholar
Delalay M, Tiwari V, Ziegler AD, Gopal V, Passy P (2019) Land-use and land-cover classification using Sentinel-2 data and machine-learning algorithms: operational method and its implementation for a mountainous area of Nepal. J Appl Remote Sens 13(01):1. https://doi.org/10.1117/1.jrs.13.014530
Article Google Scholar
ESA (2015) Sentinel-2 user handbook. European Space Agency Standard Document 1:1–64
Google Scholar
Faqe Ibrahim GR, Rasul A, Abdullah H (2023) Improving crop classification accuracy with integrated Sentinel-1 and Sentinel-2 data: a case study of barley and wheat. J Geovisualization Spat Anal 7(2):22. https://doi.org/10.1007/s41651-023-00152-2
Article Google Scholar
Farr TG, Rosen PA, Caro E, Crippen R, Duren R, Hensley S, Kobrick M, Paller M, Rodriguez E, Roth L (2007) The shuttle radar topography mission. Rev Geophys, 45(2). https://doi.org/10.1029/2005RG000183
Gargiulo M, Dell’aglio DAG, Iodice A, Riccio D, Ruello G (2020) Integration of Sentinel-1 and Sentinel-2 data for land cover mapping using w-net. Sensors (Switzerland) 20(10):1–16. https://doi.org/10.3390/s20102969
Article Google Scholar
Gilani H, ArifGoheer M, Ahmad H, Hussain K (2020) Under predicted climate change: distribution and ecological niche modelling of six native tree species in Gilgit-Baltistan Pakistan. Ecol Indic 111:106049. https://doi.org/10.1016/j.ecolind.2019.106049
Article Google Scholar
Gislason PO, Benediktsson JA, Sveinsson JR (2006) Random forests for land cover classification. Pattern Recogn Lett 27(4):294–300. https://doi.org/10.1016/j.patrec.2005.08.011
Article Google Scholar
Gorelick N, Hancher M, Dixon M, Ilyushchenko S, Thau D, Moore R (2017) Google Earth Engine: planetary-scale geospatial analysis for everyone. Remote Sens Environ 202:18–27. https://doi.org/10.1016/j.rse.2017.06.031
Article Google Scholar
Gumma MK, Thenkabail PS, Teluguntla PG, Oliphant A, Xiong J, Giri C, Pyla V, Dixit S, Whitbread AM (2020) Agricultural cropland extent and areas of South Asia derived using Landsat satellite 30-m time-series big-data using random forest machine learning algorithms on the Google Earth Engine cloud. Gisci Remote Sens 57(3):302–322. https://doi.org/10.1080/15481603.2019.1690780
Article Google Scholar
Hayes T, Usami S, Jacobucci R, McArdle JJ (2015) Using classification and regression trees (CART) and random forests to analyze attrition: results from two simulations. Psychol Aging 30(4):911–929. https://doi.org/10.1037/pag0000046
Article Google Scholar
He LM, Kong FS, Shen ZQ (2005) Multiclass SVM based land cover classification with multisource data. 2005 Int Conf Mach Learn Cyber, ICMLC 2005:3541–3545. https://doi.org/10.1109/icmlc.2005.1527555
Article Google Scholar
Hill J, Stellmes M, Udelhoven T, Röder A, Sommer S (2008) Mediterranean desertification and land degradation: mapping related land use change syndromes based on satellite observations. Global Planet Change 64(3–4):146–157
Article Google Scholar
Hu Y, Hu Y (2019) Land cover changes and their driving mechanisms in Central Asia from 2001 to 2017 supported by Google Earth Engine. Remote Sens 11(5):564. https://doi.org/10.3390/rs11050554
Article Google Scholar
Hu Y, Dong Y, Batunacun. (2018) An automatic approach for land-change detection and land updates based on integrated NDVI timing analysis and the CVAPS method with GEE support. ISPRS J Photogramm Remote Sens 146(October):347–359. https://doi.org/10.1016/j.isprsjprs.2018.10.008
Article Google Scholar
Hussain A, Bangash R (2017) Impact of climate change on crops’ productivity across selected agro-ecological zones in Pakistan. Pakistan Dev Rev 56(2):163–187. https://doi.org/10.30541/v56i2pp.163-187
Article Google Scholar
Jamil A, Khan AA, Bayram B, Iqbal J, Amin G, Yesiltepe M, & Hussain D (2019) Spatio-temporal glacier change detection using deep learning: a case study of Shishper glacier in Hunza. International Symposium on Applied Geoinformatics, 5. https://www.researchgate.net/publication/337720130
Jia M, Wang Z, Mao D, Ren C, Song K, Zhao C, Wang C, Xiao X, Wang Y (2023) Mapping global distribution of mangrove forests at 10-m resolution. Sci Bullet 68(12):1306–1316
Article Google Scholar
Johansen K, Phinn S, Taylor M (2015) Mapping woody vegetation clearing in Queensland, Australia from Landsat imagery using the Google Earth Engine. Remote Sens Applic: Soc Environ 1:36–49. https://doi.org/10.1016/j.rsase.2015.06.002
Article Google Scholar
Ka A, & Sa A (2018) Improved Landsat-8 Oli and Sentinel-2 MSI classification in mountainous terrain using machine learning on Google Earth Engine. October
Khan AA, ul Hassan SN, Baig S, Khan MZ, Muhammad A (2019) The response of land surface temperature to the changing land-use land-cover in a mountainous landscape under the influence of urbanization: Gilgit city as a case study in the Hindu Kush Himalayan Region of Pakistan. Int J Econ Environ Geol 10(3):40–49
Google Scholar
Khan AA, Jamil A, Hussain D, Taj M, Jabeen G, Malik MK (2020) Machine-learning algorithms for mapping debris-covered glaciers: the Hunza basin case study. IEEE Access 8:12725–12734. https://doi.org/10.1109/ACCESS.2020.2965768
Article Google Scholar
Khan A, Said A, Ullah I (2020) Landsat based distribution mapping of high-altitude peatlands in Hindu Kush Himalayas — a case study of Broghil Valley. Pakistan J Mountain Sci 17(1):42–49. https://doi.org/10.1007/s11629-019-5384-0
Article Google Scholar
Lee J, Cardille JA, Coe MT (2018) BULC-U: Sharpening resolution and improving accuracy of land-use/land-cover classifications in Google Earth Engine. Remote Sens 10(9):1–21. https://doi.org/10.3390/rs10091455
Article CAS Google Scholar
Li Q, Qiu C, Ma L, Schmitt M, Zhu XX (2020) Mapping the land cover of africa at 10 m resolution from multi-source remote sensing data with google earth engine. Remote Sens 12(4):1–22. https://doi.org/10.3390/rs12040602
Article Google Scholar
Mahmoodzada AB, Das P, Varade D, Akhtar MA, Shimada S (2024) High-resolution mapping of seasonal snow cover extent in the Pamir Hindu Kush using machine learning-based integration of multi-sensor data. Acta Geophys 72(2):1455–1470. https://doi.org/10.1007/s11600-023-01281-4
Article Google Scholar
Mansaray LR, Wang F, Huang J, Yang L, Kanu AS (2019) Accuracies of support vector machine and random forest in rice mapping with Sentinel-1a, Landsat-8 and Sentinel-2a datasets. Geocarto Int 35(10):1–21. https://doi.org/10.1080/10106049.2019.1568586
Article Google Scholar
Maxwell AE, Warner TA, Fang F, Maxwell AE, Warner TA, Implementation FF, Maxwell AE, Warner TA (2018) Implementation of machine-learning classification in remote sensing : an applied review. Int J Remote Sens 39(9):2784–2817. https://doi.org/10.1080/01431161.2018.1433343
Article Google Scholar
McCord SE, Buenemann M, Karl JW, Browning DM, Hadley BC (2017) Integrating remotely sensed imagery and existing multiscale field data to derive rangeland indicators: application of Bayesian additive regression trees. Rangel Ecol Manage 70(5):644–655. https://doi.org/10.1016/j.rama.2017.02.004
Article Google Scholar
Mcdonald R, Mohri M, Silberman N, Walker D, Mann G (2009). Efficient large-scale distributed training of conditional maximum entropy models. Adv Neural Inform Process Syst, 1231–1239. https://doi.org/10.1007/s13218-010-0039-x
Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing. IEEE Trans Geosci Remote Sens 42(8):1778–1790
Article Google Scholar
Mondal P, Liu X, Fatoyinbo TE, Lagomasino D (2019) Evaluating combinations of Sentinel-2 data and machine-learning algorithms for mangrove mapping in West Africa. Remote Sens 11(24):2928. https://doi.org/10.3390/rs11242928
Article Google Scholar
Moreno R, Zamora R, Molina JR, Vasquez A, Herrera MÁ (2011) Predictive modeling of microhabitats for endemic birds in South Chilean temperate forests using maximum entropy (Maxent). Eco Inform 6(6):364–370
Article Google Scholar
Nawaz MA, Khan AA, Khalid U, Buerkert A, Wiehle M (2019) Superfruit in the Niche—underutilized Sea Buckthorn in Gilgit-Baltistan. Pakistan Sustain 11(20):5840
Article CAS Google Scholar
Phan TN, Kuch V, Lehnert LW (2020) Land Cover Classification using Google Earth Engine and random forest classifier—the role of image composition. Remote Sens 12(15):2411. https://doi.org/10.3390/rs12152411
Article Google Scholar
Pradhan B, Al-Najjar HAH, Sameen MI, Tsang I, Alamri AM (2020) Unseen land cover classification fromhigh-resolution orthophotos using integration of zero-shot learning and convolutional neural networks. Remote Sens 12(10):1–26. https://doi.org/10.3390/rs12101676
Article Google Scholar
Qamer FM, Shehzad K, Abbas S, Murthy MSR, Xi C, Gilani H, Bajracharya B (2016) Mapping deforestation and forest degradation patterns in Western Himalaya. Pakistan Remote Sens 8(5):1–17. https://doi.org/10.3390/rs8050385
Article Google Scholar
Rafi MM, Lodi SH, Ahmed M, Kumar A, Verjee F (2016) Development of building inventory for northern Pakistan for seismic risk reduction. Int J Dis Resilience Built Environ 7(5):501–20
Article Google Scholar
Rahim I, Ali SM, Aslam M (2018) GIS based landslide susceptibility mapping with application of analytical hierarchy process in District Ghizer, Gilgit Baltistan Pakistan. J Geosci Environ Protect 06(02):34–49. https://doi.org/10.4236/gep.2018.62003
Article Google Scholar
Rankl M, Kienholz C, Braun M (2014) Glacier changes in the Karakoram region mapped by multimission satellite imagery. Cryosphere 8(3):977–989. https://doi.org/10.5194/tc-8-977-2014
Article Google Scholar
Rapinel S, Panhelleux L, Gayet G, Vanacker R, Lemercier B, Laroche B, Chambaud F, Guelmami A, Hubert-Moy L (2023) National wetland mapping using remote-sensing-derived environmental variables, archive field data, and artificial intelligence. Heliyon 9(2):1–17. https://doi.org/10.1016/j.heliyon.2023.e13482
Article Google Scholar
Rudiyanto MB, Shah RM, CheSoh N, Arif C, IndraSetiawan B (2019) Automated near-real-time mapping and monitoring of rice extent, cropping patterns, and growth stages in Southeast Asia using Sentinel-1 time series on a Google Earth Engine platform. Remote Sens 11(14):1–27. https://doi.org/10.3390/rs11141666
Article Google Scholar
Saini R, Singh S (2024) Land use land cover mapping and snow cover detection in Himalayan region using machine learning and multispectral Sentinel-2 satellite imagery. Int J Inf Technol 16(2):675–686. https://doi.org/10.1007/s41870-023-01673-1
Article Google Scholar
Satti Z, Naveed M, Shafeeque M, Ali S, Abdullaev F, Ashraf TM, Irshad M, Li L (2023) Effects of climate change on vegetation and snow cover area in Gilgit Baltistan using MODIS data. Environ Sci Pollut Res 30(7):19149–19166
Article Google Scholar
Satti Z, Naveed M, Shafeeque M, Li L (2024) Investigating the impact of climate change on trend shifts of vegetation growth in Gilgit Baltistan. Global Planetary Chang 232:104341. https://doi.org/10.1016/j.gloplacha.2023.104341
Article Google Scholar
Servir (2005) Servir global: connecting space to village
Shao Y, Lunetta RS (2012) Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points. ISPRS J Photogramm Remote Sens 70:78–87
Article Google Scholar
Shao D, Xu W, Li H, Wang J, Hao X (2020) Modeling snow surface spectral reflectance in a land surface model targeting satellite remote sensing observations. Remote Sens 12(18):3101. https://doi.org/10.3390/rs12183101
Article Google Scholar
Shetty S, Gupta PK, Belgiu M, Srivastav SK (2021) Assessing the effect of training sampling design on the performance of machine learning classifiers for land cover mapping using multi-temporal remote sensing data and Google Earth Engine. Remote Sens 13(8):1433. https://doi.org/10.3390/rs13081433
Article Google Scholar
Silveira EMO, Radeloff VC, Martinuzzi S, Pastur GJM, Bono J, Politi N, Lizarraga L, Rivera LO, Ciuffoli L, Rosas YM (2023) Nationwide native forest structure maps for Argentina based on forest inventory data, SAR Sentinel-1 and vegetation metrics from Sentinel-2 imagery. Remote Sens Environ 285:113391
Article Google Scholar
Singh R, Mishra AP, Kumar M, & Pande CB (2023) Classification of Vegetation Types in the Mountainous Terrain Using Random Forest Machine Learning Technique. In Climate Change Impacts on Natural Resources, Ecosystems and Agricultural Systems (pp. 615–628). Springer. https://doi.org/10.1007/978-3-031-19059-9_27
Stromann O, Nascetti A, Yousif O, Ban Y (2020) Dimensionality reduction and feature selection for object-based land cover classification based on Sentinel-1 and Sentinel-2 time series using Google Earth Engine. Remote Sens 12(1):76. https://doi.org/10.3390/RS12010076
Article Google Scholar
Tadono T, Ishida H, Oda F, Naito S, Minakawa K, Iwamoto H (2014) Precise global DEM generation by ALOS PRISM. ISPRS Annal Photogramm, Remote Sens Spat Inform Sci 2(4):71
Article Google Scholar
Tsai YH, Stow D, Chen HL, Lewison R, An L, Shi L (2018) Mapping vegetation and land use types in Fanjingshan National Nature Reserve using google earth engine. Remote Sens 10(6):927. https://doi.org/10.3390/rs10060927
Article Google Scholar
us Saqib N, Yaqub A, Amin G, Khan I, Ajab H, Zeb I, Ahmad D (2019) The impact of tourism on local communities and their environment in Gilgit Baltistan, Pakistan: a local community perspective. Environ Socio-Econ Stud 7(3):24–37. https://doi.org/10.2478/environ-2019-0015
Article Google Scholar
Wagle N, Acharya TD, Kolluru V, & Huang H (2020) Applied sciences multi-temporal land cover change mapping using Google Earth Engine and ensemble learning methods https://doi.org/10.3390/app10228083
Wang M, Mao D, Wang Y, Xiao X, Xiang H, Feng K, Luo L, Jia M, Song K, Wang Z (2023) Wetland mapping in East Asia by two-stage object-based random forest and hierarchical decision tree algorithms on Sentinel-1/2 images. Remote Sens Environ 297(August):113793. https://doi.org/10.1016/j.rse.2023.113793
Article Google Scholar
Xie Y, Lark TJ, Brown JF, Gibbs HK (2019) Mapping irrigated cropland extent across the conterminous United States at 30 m resolution using a semi-automatic training approach on Google Earth Engine. ISPRS J Photogramm Remote Sens 155(February):136–149. https://doi.org/10.1016/j.isprsjprs.2019.07.005
Article Google Scholar
Xiong J, Thenkabail PS, Tilton JC, Gumma MK, Teluguntla P, Oliphant A, Congalton RG, Yadav K, Gorelick N (2017) Nominal 30-m cropland extent map of continental Africa by integrating pixel-based and object-based algorithms using Sentinel-2 and Landsat-8 data on Google Earth Engine. Remote Sens 9(10):1–27. https://doi.org/10.3390/rs9101065
Article Google Scholar
Xu J, Grumbine RE, Shrestha A, Eriksson M, Yang X, Wang Y, Wilkes A (2009) The melting Himalayas: cascading effects of climate change on water, biodiversity, and livelihoods. Conserv Biol 23(3):520–530. https://doi.org/10.1111/j.1523-1739.2009.01237.x
Article CAS Google Scholar
Yuan H, Van Der Wiele CF, Khorram S (2009) An automated artificial neural network system for land use/land cover classification from landsat TM imagery. Remote Sens 1(3):243–265. https://doi.org/10.3390/rs1030243
Article Google Scholar
Zanaga D, Van De Kerchove R, Daems D, De Keersmaecker W, Brockmann C, Kirches G, Wevers J, Cartus O, Santoro M, Fritz S, Lesiv M, Herold M, Tsendbazar NE, Xu P, Ramoino F, Arino O, Zanaga D, Van De Kerchove R, Daems D (2022) WorldCover 2020:v100. https://doi.org/10.5281/zenodo.7254221
Article Google Scholar
Zhang D-D, Zhang L (2020) Land cover change in the central region of the Lower Yangtze River based on landsat imagery and the Google Earth Engine: a case study in Nanjing. China Sensors 20(7):2091. https://doi.org/10.3390/s20072091
Article Google Scholar
Zhou B, Okin GS, Zhang J (2020) Leveraging Google Earth Engine (GEE) and machine learning algorithms to incorporate in situ measurement from different times for rangelands monitoring. Remote Sens Environ 236:111521. https://doi.org/10.1016/j.rse.2019.111521
Article Google Scholar
Zurqani HA (2024) High-resolution forest canopy cover estimation in ecodiverse landscape using machine learning and Google Earth Engine: validity and reliability assessment. Remote Sens Applic: Soc Environ 33:101095. https://doi.org/10.1016/j.rsase.2023.101095
Article Google Scholar

Download references

Acknowledgements

The authors thank ESA for providing Copernicus Seninel-2 data and ESA WorldCover project [2020] and GEE team for provision of sample code and cloud computing platform for analysis. The data that support the findings of this study are openly available in Earth Engine Data Catalog. The authors would also like to thank JAXA team for provision of ALOS DSM (https://ror.org/059yhyy33) and NOAA’s National Geophysical Data Center for DMPS/OLS (https://ror.org/02z5nhe81) data and Google Earth team for high-resolution imagery. The author would also like to thank the respective institutions for providing environment and platform for conducting the research.

Funding

Open access funding provided by The Hong Kong Polytechnic University This work is supported by The Hong Kong Polytechnic University’s Start-up Fund for RAPs under the Strategic Hiring Scheme [Project ID: P0044784] and The Hong Kong Polytechnic University’s Research Institute for Sustainable Urban Development [Project ID: 1-BBG2].

Author information

Authors and Affiliations

Earth and Atmospheric Remote Sensing Lab (EARL), Department of Meteorology, COMSATS University Islamabad, Islamabad, 45550, Pakistan
Gomal Amin, Iqra Imtiaz, Ehsan Haroon & Muhammad Imran Shahzad
Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong, SAR, China
Gomal Amin & Majid Nazeer
Department of Environmental Sciences, Quaid-I-Azam University, Islamabad, 15320, Pakistan
Najum us Saqib
Research Institute of Land and Space, The Hong Kong Polytechnic University, Hong Kong, SAR, China
Majid Nazeer

Authors

Gomal Amin
View author publications
You can also search for this author in PubMed Google Scholar
Iqra Imtiaz
View author publications
You can also search for this author in PubMed Google Scholar
Ehsan Haroon
View author publications
You can also search for this author in PubMed Google Scholar
Najum us Saqib
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Imran Shahzad
View author publications
You can also search for this author in PubMed Google Scholar
Majid Nazeer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gomal Amin.

Ethics declarations

Ethics Approval

The researchers confirm that they are abided to the ethical standards of conducting research.

Informed Consent

We affirm that all research participants gave their full consent.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 616 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Amin, G., Imtiaz, I., Haroon, E. et al. Assessment of Machine Learning Algorithms for Land Cover Classification in a Complex Mountainous Landscape. J geovis spat anal 8, 34 (2024). https://doi.org/10.1007/s41651-024-00195-z

Download citation

Accepted: 19 August 2024
Published: 30 August 2024
DOI: https://doi.org/10.1007/s41651-024-00195-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Assessment of Machine Learning Algorithms for Land Cover Classification in a Complex Mountainous Landscape

Abstract

Similar content being viewed by others

Evaluating the impact of classification algorithms and spatial resolution on the accuracy of land cover mapping in a mountain environment in Pakistan

Spatial Resolution Impacts on Land Cover Mapping Accuracy

Land use and land cover classification using machine learning algorithms in google earth engine

Explore related subjects

Introduction

Material and Methods

Study Area

Training Sample Selection

Data Acquisition and Processing

Classification

CART Classifier

Maximum Entropy Classifier

Minimum Distance Classifier

SVM Classifier

RF Classifier

Methods for Accuracy Assessment

Results

Comparison of Classification Accuracy of GEE Classifiers

Land Cover Classification of Gilgit-Baltistan

Discussion

Advantages and Opportunities of GEE Cloud Platform

Strengths and Implication of Our Land Cover Classification

Conclusions

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics Approval

Informed Consent

Conflict of Interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 616 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation