1 Introduction

Landscape comprises heterogeneous elements with diverse ecological, biological, geological, hydrological, social, economic, and environmental characteristics. Local ecosystems with land cover are the primary elements of the landscape [1]. The structure of a landscape decides the functional capabilities which are reflected through bio-geo-socio-economic variables and the resources associated with them [2]. Landscape dynamics refers to the changes in the landscape structure due to natural and anthropogenic reasons, which is assessed through temporal land use (LU) and land cover (LC) analyses.

LC represents the biophysical cover of the land surface, like vegetation and non-vegetation. In contrast, land use denotes the anthropogenic use of land like built-up, agriculture, horticulture, mining, etc. A quantitative analysis of LULC determines the extent and changes in spatial distributions [3], which indicate human interaction with their local environment and its associated changes.

Rapid LU changes are dynamic and unsustainable, necessitating the analysis of the spatial patterns of quantitative changes [4] for planned interventions with minimal impacts of changes in forest cover, agricultural lands, etc. Land use changes leading to degradation and deforestation have been a prime influencer of changes in ecosystems with alterations in climate patterns, bio-geochemical cycles, etc., affecting the livelihood of people [5,6,7,8]. Forest ecosystems provide services supporting economic development and maintain ecology in balance [1, 9, 10]. However, after globalization in the nineties, unplanned developmental activities witnessed large-scale forest land transitions [11,12,13].

LU changes have altered the structure of the landscape, fragmenting contiguous forests into smaller patches which is detrimental to the sustenance of biodiversity, carbon sequestration potential, and other ecological services at local and global scales [14,15,16,17,18,19]. Fragmentation of forests creates disturbances in ecological and socio-economic processes [20] with habitat loss, distribution of habitats into patches, a decrease in habit patch size, loss of species diversity, etc., leading to disruptions in wildlife habitats and aggravating human-wildlife conflicts [21,22,23,24,25]. Hence, it necessitates analyzing and monitoring the forest ecosystem to restrict deforestation and degradation of forests.

The forest fragmentation analysis quantifies the area under five categories, namely interior, perforated, transitional, patch, and edge forest [26, 27], which aids in monitoring and quantifying spatial and temporal patterns of forest ecosystem transitions [28]. Forest fragmentation analysis in biodiversity-enriched regions would help in formulating conservation strategies for the prudent management of ecologically fragile habitats of endemic taxa [9, 18]. Hence, dynamics of LULC aids in planning, managing, and prioritizing the area for sustainable development, conserving biodiversity, and maintaining bio-geo-climatic processes to sustain socio-economic activities [29, 30] by mitigating the degradation and destruction of natural resources. [3, 31,32,33,34,35].

Depleting vegetation cover in forest areas has led to the decline of carbon sequestration potential and the ability to moderate micro-climate, evident from increased land surface temperature [36]. The spatial patterns of the landscape structure and composition are ascertained through spatial matrices.

Multiresolution (spatial, spectral and temporal) geospatial data provide information about ecosystem structure which help in resource management, resource monitoring, mapping, and change detection [34]. The availability of long-term, temporal multispectral remote sensing data helps to identify and quantify LU dynamics [33] and understand the effects of unplanned anthropogenic interventions on the natural LC [3]. Integrating LU information with collateral data through a Geographic Information System (GIS) aid in comprehending land use information with agents of change, including policy interventions.

Different spectral indices such as the Normalised Difference Vegetation Index (NDVI), Normalised Difference Water Index (NDWI), Normalised Difference Built-up Index (NDBI), Bare Soil Index (BSI), Thermal Index (TI) have been used to identify landscape structure, patterns of land use changes, changes in vegetation cover, and the impact of urbanisation and development, which help in the regional planning for sustainable management [3, 10, 37,38,39]. NDVI is the primary spatial index commonly used for classifying LC as vegetation and non-vegetation [15, 40,41,42].

Land uses (LU) in a region are discerned through the classification of spatial data based on supervised and unsupervised algorithms. The unsupervised classification is based on clusters of pixels without prior knowledge of common spectral signatures and characteristics. Commonly used techniques include K-Means Clustering [43, 44], ISODATA, and Principal Component Analysis. In contrast to these, supervised classifiers depend on prior knowledge through training datasets and techniques, including Random Forest (RF) [45,46,47], Support Vector Machine (SVM) [48], Classification and Regression Trees (CART) [49], Gaussian Maximum Likelihood (MLC) [17, 18, 40], Artificial Neural Network (ANN) [50, 51], Minimum Distance-to-means Classifier [6], and Mahalanobis Distance Classifier [14]. Recent advancements in the classification of big data (spatial) through machine learning algorithms based on learning aid in making an informed optimal decision due to the availability of accurate land use information in less time. Deriving LU information from spatial data depends on spatial and spectral characteristics of data, classification algorithm, and training or ground truth data [14]. Adopting appropriate classification algorithm aid in accurately assessing spatiotemporal changes in land use.

Random Forest classifier [7, 39, 43, 46, 47, 52,53,54,55,56,57,58,59] is the most widely used ML algorithm based on ensemble methods like bagging and boosting [60]. It provides accurate classification in a heterogeneous landscape through a set of decision trees from a randomly selected subset of the training set and aggregates decisions for deciding the final class. The accuracy of the classifier is assessed through samples (not used for training), which provide unbiased error estimates. RF randomly selects variables from training samples at each node to determine the best split to construct a tree based on the Gini index measure that gives a measure of impurity within a node.

Support Vector Machine classifier [50, 55, 57,58,59, 61,62,63,64] constructs a hyperplane or set of hyperplanes based on the number of features in a high or infinite dimensional space for classifying spatial data with a small amount of training data [60].

Gaussian Maximum Likelihood Classifier (GMLC) uses mean and covariance values with probability density functions for deriving LU information and is a parametric supervised classification technique [4, 6,7,8, 11, 32, 34, 50, 55, 59, 65].

1.1 Objectives

Objectives of the current study are:

  1. 1.

    Assessing LULC dynamics in Chikamagaluru district, Karnataka State, India using temporal remote sensing data.

  2. 2.

    Understanding spatial patterns of forest structure through assessment of fragmentation of forests.

  3. 3.

    Evaluation of the performance of classification algorithms.

2 Materials and method

2.1 Study area

Chikamagaluru district, which covers an area of 7101 sq. km (3.8% of Karnataka), is in the southwestern part of Karnataka between 12 °54′ 42″ and 13° 53′ 53″ N and 75° 04′ 46″ and 76° 21′ 50″ E. The district stretches 138.4 km from east to west and 88.5 km from north to south, and is divided into seven taluks—Chikamagaluru, Kadur, Koppa, Mudigere, Narashimharajapura, Sringeri, and Tarikere (Ajjampura and Tarikere taluks as per the 2021 statistical report). According to the 2011 Census, the district has a total population of 1,137,961, which accounts for 1.9% of the population of the state [66] and ranks 25th in Karnataka. The district has a population density of 158 per sq km, which is the 3rd least dense district in Karnataka. The literacy rate is 79.25%, and the Scheduled Caste and Scheduled Tribe populations make up 22.3% and 4% of the total population, respectively. About 81% of the population lives in rural areas, while 19% live in urban areas. The district is well-connected by road to Hassan, Mysore, Bangalore, Shivamogga, Udupi, and Mangaluru, and the nearest airports are Mysore and Mangalore. There are two railway junctions at Kadur and Birur. The district is divided into three agro-climatic zones (Fig. 1): the hilly zone (Chikamagaluru, Koppa, Mudigere, Narashimharajpura, and Sringeri), central dry zone (Kadur), and southern transition zone (Tarikere). The economy of the district is primarily based on rural agriculture and is supplemented by income from tourism. Chikamagaluru has a higher per capita income of 68,956 Rs (2008–09) than other districts in Karnataka, and the GDP of the district was 5222 cr. in the year 2012–13. The per capita annual income of the district was 66,366 Rs (2012–13) [67,68,69].

Fig. 1
figure 1

Study area (Chikmagalur district, Karnataka) with Agro-climatic zones

The district of Chikamagaluru is divided into two distinct regions, the western part being a forested hilly area known as the 'Malnad' area, while the eastern part is dominated by a plain region or ‘Maidan’ area. The forest cover in the district is managed by five divisions, including Chikamagaluru, Koppa, Bhadravathi, Kudremukh National Park, and Bhadra Wildlife Sanctuary. The vegetation in the district can be broadly categorized into four types: dry deciduous hill type, moist deciduous type, evergreen type, and Sholas and Grassland type. Mullayanagiri, the highest peak in the district, rises 1926 m above MSL. The major rivers in the district are Tunga and Bhadra, while other perennial rivers include Hemavati, Netravati, and Vedavathi [70].

The district has rocks formed during the Archaean age, with gneiss formations in the southwestern, northern, and eastern parts, and schist formations in the central and western parts. The district's soil is rich in iron, magnetite, and granite deposits, with black soil found around Baba Budan Giri hills, enriching the coffee plantation, and red and gravel soils found in the southern part of the district. Due to the presence of extensive hilly areas, the climate in the district is cooler, with April being the hottest month, with a mean daily maximum temperature of 36 °C and a mean daily minimum temperature of 19 °C. The average annual rainfall in the Chikamagaluru district is 1925 mm, ranging from 595 to 2379 mm. Sringeri taluk receives the highest rainfall in the district, amounting to 3773.2 mm, while Kadur taluk receives the lowest rainfall of 620.3 mm.

The major crops grown in the region include ragi, paddy, sunflower, jowar, Bengal gram, groundnut, and maize. The district has 5727 ha of canal-based irrigation facilities. The major horticulture crops grown in the district are areca nut, coconut, black pepper, banana, mango, cardamom, ginger, and vegetables. Coffee is primarily grown in the valleys of Baba Budan Giri hills.

2.2 Data

Temporal LULC analyses of the Chikamagaluru district were done using Landsat series data from 1973 to 2021 and the collateral data, which included a vegetation map of 1:250,000 of South India developed by the French Institute (1986) and Bhuvan LULC maps of 1:50,000 from 2005 to 2016. The vegetation map of South India (1986) was useful in identifying various forest cover types. Table 1 lists the data used for assessing LULC dynamics. Landsat ETM + data of 2005 were corrected for the SLC error by image enhancement and restoration techniques using the nearest neighbour interpolation.

Table 1 Data used for the study

2.3 Method

The analysis was performed in a series of steps as depicted in Fig. 2. Spatial data acquired through space-borne sensors corresponding to the study region were downloaded from the public domain and carried out pre-processing through geometric and radiometric rectification followed by the land cover and land use analyses [71].

Fig. 2
figure 2

Method followed for assessing the extent and condition of land cover and land uses

Land cover analysis was done to delineate the regions under vegetation and non-vegetation through computation of the Normalized Difference Vegetation Index (NDVI) as per Eq. 1. NDVI ranges from − 1 to 1, and lower values indicate regions under non-vegetation, while positive values indicate vegetation cover.

$$NDVI = \frac{(NIR- Red)}{(NIR + Red)}$$
(1)

Land use analyses are done through supervised machine learning algorithms using Google Earth Engine at an open-source cloud-based platform. The land use analyses using spatial data involved (i) identification of heterogeneous areas in the study region through false color composite (FCC) of spatial data acquired at NIR, Red, and Green wavelength. (ii) Polygons corresponding to heterogeneous patches in FCC and uniformly distributed throughout the region were digitised from FCC and supplemented by digitising polygons in virtual data (Google Earth). Digitised polygons are representative of heterogeneous patches (or likely land uses), uniformly distributed throughout the study region, and cover at least 15% of the spatial extent of the study region. (iii) Land use analysis was done through supervised non-parametric and parametric classifiers (1) Random Forest (Classifier.smileRandomForest), (2) Support Vector Machine or SVM (Classifier.libsvm) and (3) Maximum Likelihood classifier. Random Forest and Support Vector Machine classifiers were implemented in the Google Earth Engine platform and the Maximum Likelihood classifier was used in GRASS (Geographic Resources Analysis Support System) GIS. The performances of these classifiers (algorithms) have been evaluated through accuracy assessment and Kappa statistics.

2.3.1 Random forest

(RF) uses bagging or Bootstrap aggregation, which chooses a random sample from the data set (Fig. 3). So, each model is generated from the samples (bootstrap samples) provided by the original data with a replacement. The algorithm considers the prediction from each tree, and the final output is predicted based on the majority votes of predictions [52, 72, 73]. RF considers decision trees based on average values of data subsets that improve predictive accuracy. The final output depends on majority voting after aggregating the results of all models.

Fig. 3
figure 3

Random forest classifier

2.3.2 Support vector machine (SVM)

Is used for classification as well as regression analysis [61]. The SVM algorithm generates the best line or decision boundary (or hyperplane) that segregates n-dimensional space into classes so that the new data can easily be put in the correct category in the future. SVM chooses the extreme points or support vectors that help in creating the hyperplane (Fig. 4).

Fig. 4
figure 4

Support vector machine

The distance between the vectors and the hyperplane is called a margin. The goal of SVM is to maximize the margin between classes. The hyperplane with the maximum margin is called the optimal hyperplane. Linear SVM is a parametric model, but a radial-based kernel SVM is non-parametric.

2.3.3 Maximum likelihood classifier (MLC)

Qualitatively evaluates the variance and covariance of spectral reflectance for each class as per Eq. 2.

$$L= \frac{1}{{\left(2\lambda \right)}^\frac{n}{2}|{{\Sigma }_{k}|}^\frac{1}{2}}\mathrm{exp}|\frac{1}{2}(X-{\mu }_{k}){\sum }_{k}^{-1}(X-{\mu }_{k}{)}^{t}|$$
(2)

where

n = spectral resolution

L = likelihood of X belonging to class k

μk = mean vector of class k

Σk = variance–covariance matrix of class k

In the current study, the best classifier (the Random Forest classifier) was prioritized by assessing the relative performance of three chosen classification algorithms.

Spatial data were classified into 12 LU classes i.e., evergreen forest, moist deciduous forest, dry deciduous forest, shola forest, scrubland, waterbody, cropland, fallow land, open space, rocky area and built-up area. These classes were combined broadly into the forest, shola forest, scrubland, waterbody, agriculture, built-up and open space to understand the spatial extent of land uses.

Accuracy assessment of LU classification is done through computation of confusion matrix and Kappa statistics by comparing computed LU information with the training data. Overall accuracy represents the total classification accuracy (Eq. 3), which is calculated as the percentage of the ratio of the total number of correctly classified pixels and the total number of reference pixels. Producer and user accuracy are computed as per Eqs. 4 and 5.

$$Overall\;Accuracy = \frac{{no\;of\;correct\;classified\;feature}}{{total\;feature}} \times 100$$
(3)
$$User^{\prime }s \;accuracy = \frac{{no\;of\;features\;of\;specific\;class\;correctly\;classified}}{{row\;total}} \times 100$$
(4)
$$Producer^{\prime } s\;accuracy = \frac{{no\;of\;features\;of\;specific\;class\;correctly\;classified}}{{column\;total}} \times 100$$
(5)

The Kappa Statistic or Cohen's Kappa is a statistical measure of the reliability of categorical variables as per Eq. 6.

$$Kappa\;Statistics \left(\widehat{K}\right)= \frac{N\sum_{i=1}^{r} {x}_{ii} - {\sum }_{i=1}^{r} \left({x}_{i+}\times {x}_{+i}\right)}{{N}^{2}- {\sum }_{i=1}^{r}( {x}_{i+} \times {x}_{+i})}$$
(6)

where

r = the number of rows in the error matrix

xii = the number of observations in row i and column i

xi+ = total of observations in row i

x+i = total of observations in column i

N = total number of observations included in the matrix.

LU change is computed considering temporal land uses based on the base year (1973) and current year (2021) data as per Eq. 7.

$$Change\;Rate = \left(\frac{Land\;use\;area\;of\;current\;year - Land \;use\; area\; of\; base\; year}{Current\; year - base\; year}\right)*100$$
(7)

2.3.4 Forest fragmentation

Fragmentation of forests is assessed through fragmentation metrics, namely patch, transitional, edge, perforated, and interior forests [27] through computation of Pf and Pff considering 3 × 3 kernel with moving window approach Pf determines the proportion of forest pixels in the window (Eq. 8) and Pff determines the proportion of all adjacent or cardinal pixel pairs that include at least one forest pixel, for which both pixels are forested (Eq. 9).

$${P}_{f}= \frac{proportion\; of\; number\; of\; forest\; pixels}{total\; number\; of\; non\; water\; pixels\; in\; window}$$
(8)
$${P}_{ff}= \frac{proportion\; of\; number\; of\; forest\; pixel\; pairs}{total\; number\; of\; adjacent\; pairs\; of\; at\; least\; one\; forest\; pixel}$$
(9)

Figure 5 illustrates forest fragmentation classification into six fragmentation categories based on Pf and Pff.

Fig. 5
figure 5

Forest fragmentation metrics Pf and Pff in a fixed area window

In this study, forest class included evergreen, moist deciduous, dry deciduous, scrubland, and shola forest, whereas the non-forest category included agriculture, horticulture, built-up and open spaces.

3 Results and discussion

3.1 Land cover analyses

LC assessed through NDVI using temporal remote sensing data, given in Table 2 shows a decline in vegetation cover from 65.5% (1973) to 62.73% (2021) due to anthropogenic activities (Fig. 6). The area under non-vegetation areas increased from 34.5 to 37.27% due to agricultural and urban expansion. Unplanned developmental activities coupled with enhanced agriculture (crop lands and horticulture) activities were prime reasons for the irreversible loss of forest cover.

Table 2 Temporal land cover in Chikmagaluru from 1973 to 2021
Fig. 6
figure 6

Land cover based on NDVI of Chikamagaluru district from 1973 to 2021

3.2 Land use analyses

Temporal (1973 to 2021) LU were computed through a supervised classifier based on the Random Forest algorithm (Fig. 7), revealing a decline in forest cover of 32.77% and an increase in agricultural area by 6.31% and horticulture by 43.14%.

Fig. 7
figure 7

Change of land uses in Chikamagaluru district from 1973 to 2021

Temporal LU listed category-wise in Table 3 reveal a reduction in forest cover from 46.38 to 30.65% from 1973 to 2021. In the forest, evergreen forest covered 1293.59 sq km (1973), 1175.42 sq. km (1991), 1155.5 sq. km (1998), 1107.73 sq. km (2005), 1060.33 sq. km (2013) and 706.38 sq. km (2021). The main reason behind the decline of forest cover is the transition of forested areas into agricultural and horticultural categories. Cumulatively the forest cover has decreased from 46.38 to 30.65% during five decades. The Bhadra wildlife sanctuary and Kudremukh National Park mainly consist of the reserve forest.

Table 3 LU changes in percentage of Chikamagaluru from 1973 to 2021

Agricultural land has been increasing in this district from 1973 (29.71% or 2143.03 sq. km) to 2021 (32.65% or 2355.10 sq. km). There is a constant increase in agricultural land use of 6.13% from 1973 to 2021 (Fig. 8).

Fig. 8
figure 8

Category-wise LU changes of Chikamagaluru from 1973 to 2021

The change in the horticultural area increased from 4.45% or 313.43 sq km in 1973 to 25.05% or 1807.16% in 2021, showing an increase of 43.14%. Many parts of the district, forests, and agricultural lands transition into horticultural land.

The open space in the area covered around 187.04 sq km or 1.92% area in 1973 and 103.27 sq km or 1.43% area in 2021. The conversion of open spaces was seen in agricultural fields and built-up areas. The spatial extent of built-up also increased by 2.10% from 32.84 sq km (0.61%) in 1973 to 105.48 sq km (1.46% of the area) in 2021 (Fig. 9).

Fig. 9
figure 9

LU analysis of Chikamagaluru district from 1973 to 2021

LU Classifiers were evaluated based on the accuracy assessment (Table 4, Fig. 10). The comparative evaluation reveals that the supervised classifier Random Forest has performed better, evident from the kappa value of 0.8567 compared to SVM (0.6177) and MLC (0.8264) and higher overall accuracy of 90.22% in RF, compared to 66.32% (in SVM) and 85.18% (in MLC). The category-wise evaluation shows that commission error in RF ranged from 43.8 to 0.1 (highest in shola forest by 43.8) and omission error ranged from 65.5 to 8.9 (highest in built-up by 65.5). Built-up has been misclassified into open spaces. SVM shows a range of commission errors from 53.6 to 0.5 (highest in 53.6 in shola) and 59.1 to 0.5 omission errors (highest in 59.1 in shola). MLC provided a range of commission errors from 25.3 to 5.1 (25.3 as highest in scrub) and omission errors from 34.8 to 1.3 (34.8 as highest in dry deciduous). Table 5 provides category-wise LU statistics based on supervised classification algorithms (RF, SVM, MLC).

Table 4 Comparison of different classification algorithms on the basis of accuracy
Fig. 10
figure 10

Land use classification by RF, SVM and MLC

Table 5 Total area calculated by three different classification algorithms

The comparison shows that RF gave the highest Kappa coefficient and overall accuracy than SVM and MLC. Random Forest classifier outperformed the other classifiers based on kappa, and overall accuracy and results conform with the earlier studies [43, 49,50,51, 58, 74,75,76,77,78,79,80]. Classification parameters are further tuned in RF classifier by maintaining multi-class variance was maintained as the spectral bands are correlated and overlap spectral regions.

3.2.1 Forest fragmentation

Fragmentation analyses of forests in Chikmagaluru district reveal of decline in the contiguous forest from 55.72% or 4018.89 sq. km in 1973 to 28.05% or 2023.07 sq. km in 2021 (Table 6) with an increase in patch forests to 0.01% or 0.79 sq. km in 2021 (Fig. 11). Transitional forest increased from 0.45% or 32.37 sq. km in 1973 to 1.80% or 129.76 sq. km in 2021. Edge forest and perforated forest also increased from 1973 to 2021 from 3.61% or 260.04 sq. km to 5.11% or 368.33 sq. km and 1.38% or 99.37 sq. km to 2.71% or 2023.07 sq. km, respectively. The trend of fragmentation of forests in Chikmagaluru district is depicted in Fig. 12.

Table 6 Temporal (1973–2021) assessment of forest fragmentation (in sq. km) of Chikamagaluru district
Fig. 11
figure 11

Forest fragmentation analysis of Chikamagaluru from 1973 to 2021

Fig. 12
figure 12

trend of forest fragmentation in Chikamagaluru from 1973 to 2021

3.3 Discussion

The spatial extent of non-vegetation has increased from 65.5% (1973) to 62.73% (2021) due to urban expansions, etc. Land use analysis showed a sharp decline in native forest cover and an increase in agriculture and horticultural activities expanded into the forest areas. Also, the area under unpaved surfaces increased because of the increasing population and industrialization in the district. Assessment of forest fragmentation showed a decrease in interior or contiguous forest cover, emphasizing the need to maintain forest ecosystem integrity to sustain ecosystem services.

The evaluation of performance of Random Forest (RF), Support Vector Machine (SVM) and Maximum Likelihood (MLC) classifiers for LU classification in a heterogenous landscape like Chikamagaluru district showed that Random Forest provided the most accurate results evident from the higher accuracy and kappa coefficient.

4 Conclusion

Landscape dynamics in hilly terrain (Chikamagaluru district, Karnataka State) during five decades were assessed using temporal remote sensing data in the cloud platform using machine learning algorithms, namely Random Forest (RF) and Support Vector Machines (SVM) and compared with the parametric maximum likelihood classifier. The land cover analysis showed a decreasing trend of vegetation (209.58 sq. km) land cover in the study area from 65.5% in 1973 to 62.7% in 2021. The land use analysis showed a drastic decline in forest cover and an increase in agriculture and horticultural practice in five decades. Forest cover has decreased by 48.91% (3527.73 sq. km) due to the conversion of forest areas into horticulture, agricultural lands, and urban areas. Horticulture practice has increased from 4.35 to 25.05%, which showed a 3111.94 sq. km increase in land use. The built-up area has been expanded by 2.1% (151.34 sq. km) in the study area in five decades. Spatiotemporal analysis of the forest structure in Chikamagaluru district was analysed using fragmentation analysis. The result showed a decrease in the interior forest by 27.67% (1995.82 sq. km) within the five decades and an increase in non-forest by 23.5% (1694.68 sq. km).

The evaluation of performances of machine learning algorithms (Random Forest, Support Vector Machine) and parametric classifier Maximum Likelihood classifier, reveals that the supervised classifier based on the Random Forest algorithm has performed better, evident from higher overall accuracy and Kappa value.