1 Introduction

The Kingdom of Bahrain’s rapid development boom during the 1980s raised the demand for land, particularly in coastal areas, over the past 34 years (1986–2020). As a result, the pressure on the country’s overall area to develop urban centers and re-design the existing cities along the shore had intensified due to this growing land demand. Therefore, different information and statistics on LULC are required for sustainable land resource management and policymaking processes to monitor continually, model, and update changes.

Remote sensing science and techniques play an essential role in producing spatial information and detailed raw data on terrestrial phenomena using space-borne platforms (Jansen and Gregorio 2004). These data on different geographical locations facilitate the characterization and evaluation of the changes (Yuan et al. 2005). Remote sensing data’s spatial, temporal, and spectral characteristics are utilized to map LULC to aid in land resource management decision-making (Berlanga-Robles and Ruiz-Luna 2002). The emergence of satellite imagery has developed tools for systematic observation of land cover from space (Mollicone et al. 2003). Data required for large land areas are collected at different time intervals and used to monitor changes on the earth’s surface (Jensen 2005). Landsat data help assess LULC for different years, analyze changes, understand land-use patterns and factors, and build databases for long-term monitoring tools when combined with geographic information system data and various statistical analytical tools (Jensen 2005). GIS and Remote sensings are effective techniques for obtaining precise and timely data and information on the spatial distribution of LULC changes over broad areas (Guerschman et al. 2003; Rogan and Chen 2004; Zsuzsanna et al. 2005). Remote sensing and GIS data have been used to map land uses and land cover and detect changes in many studies over the past fifty years. These data have been used to study expansion and urbanization, quantify land changes, and study the effects of land changes in faster, straightforward ways than traditional methods in survey studies (Da Costa and Cintra 1999). GIS provides a versatile framework for gathering, storing, displaying, and evaluating digital data needed for change detection (Demers 2008; Wu et al. 2006). GIS relies heavily on remote sensing imagery as a data source. Satellite imagery is employed to recognize synoptic data of the earth’s surface (Ulbricht and Heckendorff 1998). Over the last 20 years, researchers concentrated on using remotely sensed image band characterization to study urban and nonurban area changes (Gadal and Ouerghemmi 2019; Vigneshwaran and Kumar 2018; Jacquin et al. 2008; Xu 2007; Zha et al. 2003). Due to their ability to provide immediate and synoptic usage of land cover, remotely sensed images are suitable tools for identifying, managing, and monitoring urban built-up areas, spatial distribution, and expansion (Hegazy and Kaloop 2015; Rawat and Kumar 2015). Many types of research used different classification techniques such as supervised, unsupervised, object-based, or deep learning classification in their studies for extracting urban areas from a multispectral satellite image (Ghosh and Siddique 2018; Bramhe et al. 2018; Forget et al. 2017; Hegazy and Kaloop 2015; Rawat and Kumar 2015; Zhang et al. 2014; Ndehedehe et al. 2013). With a spatial accuracy of 30 m, Landsat multispectral images became the backbone of large-scale land cover change studies due to the long period covered by satellite data over 45 years (Duan et al. 2020; Tewabe and Fentahun 2020; Lu et al. 2010; Wulder et al. 2008; Cohen and Goward 2004). Many satellite images are being used to monitor and study land cover changes, such as SPOT, ASTER, IR, IKONOS, QuickBird, WorldView1, 2, 3 (Allen et al. 2013; Lu and Weng 2009). The land cover information extracted from remote sensing data is an essential part of various applications, including land-use mapping and monitoring of variables. Different satellite-based LC data applications have been created in the last 30 years. The most important of which is the analysis of land cover dynamics that significantly impacts natural habitats (Yuan et al. 2005), as well as the creation of LC maps utilizing a variety of classification methods, all of which are linked to direct statistics to determine classification accuracy and error ratio. Land cover analysis and classification results are based on four main characteristics: temporal, spatial, radiometric, and spectral accuracy (Allen et al. 2013). Macleod and Congalton (1998) identified four manifestations of detection of land-use changes in the study of land resources, namely, the detection of changes that have occurred, the determination of nature of change, the measurement of the areal extent of change, and the assessment of the spatial pattern of change (Macleod and Congalton 1998). Landsat Multispectral Scanner (MSS), Thematic Mapper (TM), and Enhanced Thematic Mapper Plus (ETM+) data have been broadly employed in studies towards the determination of land cover since 1972, the starting year of the Landsat program, mainly in forest and agricultural areas (Campbell 2007). The availability of detailed data on land uses for urbanization is essential for planning processes (Jensen and Cowen 1999). Remote sensing data have become the cornerstone of all urban studies emphasizing mapping and analyzing the changes in area, extent, and patterns (Wang et al. 2020). Open access to earth observation satellite data provides spatially valid datasets over broad areas with great spatial detail and temporal frequency (Xiao et al. 2006). The availability of many free satellite images and GIS layers and improvements in remote-sensing data gathering with increased spatial accuracy allow and improve quantitative studies of the rate and pattern of urban LULC change (Epstein et al. 2002).

2 Study Area

The Kingdom of Bahrain is an archipelago of 36 low-lying islands that vary in size, surrounded by shallow waters no deeper than 20 m. The country is located in the southern part of the Arabian Gulf. Its territorial water covers roughly 9200 km2 and accounts for more than 90% of the total area. In 2015, the country’s land area was estimated to be 782.4 km2 (2020), including rocky coral atolls (Fashts), as shown in Fig. 1. The coastal length is more than 537 km, and the marine area is more than 9200 km (Information and eGovernment Authority 2020). The country has significant population and infrastructure growth, which has raised the demand for land (UNDP 2018). As a result, the country’s land area has increased from 697 km2 in 1987 to 782 km2 in 2019 (Information and eGovernment Authority 2020). Due to the increased demand for land, the increment rate reached its maximum point (21 km2/year) between 1997 and 2007 (UNDP 2018). The majority of the country’s population (more than 95%) lives in densely populated coastal areas (Mesaiqer and Al-Zayani 2008). From the 1950s until 2007, the population increased by 2.7 percent per year, reaching around 435,654 in 1987 and over 544,366 in 1994. In 2013, the estimated population was 1,253,191, which was expected to grow to 1,523,084 by 2020 (CIO (19862013); Information and eGovernment Authority 2020). According to CIO’s (19862013) predictions, the country’s population will reach 2.2 million by 2030 (Information and eGovernment Authority 2018; UNDP 2018). Increased demand for land, notably for urbanization and creating new cities and residential facilities, has increased population. More than half of Bahrain’s main island is almost entirely urbanized. Aside from considerable pressure on infrastructure, health, education, and social services, the country is vulnerable to some issues due to its arid desert climate and small land area (UNDP 2018; UNdata 2021).

Fig. 1
figure 1

The Kingdom of Bahrain Islands location between Saudi Arabia and Qatar in the Arabian Gulf. This (Satellite Image map download from https://earthexplorer.usgs.gov/, then Mosaic. ID: L1C_T39RVJ_A030419_20210419T071115, Acquisition Date: 2021/04/19, Platform: SENTINEL-2A, Tile Number: T39RVJ.AND, ID: L1C_T39RVK_A030419_20210419T071115, Acquisition Date: 2021/04/19, Platform: SENTINEL-2A, Tile Number: T39RVK)

3 Methodology

3.1 Data Acquisition

In this analysis, we used 30 m dry season six Landsat cloud-free datasets: Landsat-5, Thematic Mapper (TM) from August 14, 1986, Landsat-7 from July 11, 2000, Enhanced Thematic Mapper Plus (ETM+) from July 25, 2005, and Landsat-8 Operational Land Imager (OLI) from August 24, 2013, and August 11, 2020. Table 1 shows the specifics of Landsat satellite image data downloaded from the Earth Explorer website of the United States Geological Survey (USGS) (https://earthexplorer.usgs.gov/). Earth Explorer is an online website that allows searching, viewing, exporting information, and downloading earth science data from the USGS (EROS 2021). In addition, the images were spatially registered on the World Geodetic System (WGS84) and then re-projected the data to the Northern Hampshire zone 39N on Universal Transverse Mercator (UTM) projection. We used the Maximum Likelihood Classifier (MLC) algorithms to classify the images. All the maps were assigned to the WGS 1984 UTM Global Geodetic Coordinates System. We used the 2020 Landsat 8 OLI image as the master reference for all the images and data used in this study.

Table 1 Details of Landsat Band data used for this study collected from EarthExplorer

3.2 Ancillary Data

Ancillary data, also known as auxiliary data, are frequently utilized to improve image classification and accuracy. Due to the limitations of spectral information for defining change, researchers must rely on auxiliary data (Franklin 1995). To determine LULC Changes processes, many researchers used ancillary data as input features with multi-temporal images (Hurskainen et al. 2019; Feng et al. 2018; Zhu et al. 2016; Zoungrana et al. 2015; Yu et al. 2013). This study used high-resolution satellite images for different dates: IKONOS 2000, GeoEye1 2011, 2013, Worldview3 2019, and the multiresolution seamless image database-MrSID 1994, 1998, and ASTER 2012, 2013 to enhance the LULC-classification. We used auxiliary data, mainly Bahrain topographic scanned maps from the “Survey and Land Registration Bureau (SLRB)” scale 1: 50,000, 25,000, and 10,000, to adjust the decision of LULC-classes, Fig. 2. In addition, we assimilated the four digital governorates zoning maps; Capital, Muharraq, Northern, and Southern (UPDA 2022) that published by the general directorate of urban planning at the “Urban Planning and Development Authority” https://upda.gov.bh/ to validate the classification results (Harris and Ventura 1995).

Fig. 2
figure 2

Ancillary data: a IKONOS 2000, b MRSID 1994, c Bahrain Zoning Map, 2017, and d topographic map 1:25,000 of the same area from 1986

3.3 LULC Classes Definition

The classification of LULC is based mainly on its creation, with the levels being divided into numerous levels based on the level of accuracy and purpose necessary. We adapted the LULC level-1 classification from Anderson et al. (1976). Most of these categories were based on statistical data, aerial photos, satellite imagery data, or directly based on Anderson et al. (1976) or other land classifications. We created a classification scheme for this study based on prior knowledge of the study region and information derived from auxiliary data and information (Anderson et al. 1976). The accessible digital Bahrain Zoning maps (UPDA 2022) and other auxiliary data were the primary sources for identifying the research area’s main LULC classes. As a result, seven LULC classes covering the surface study area were identified and classified, including the reclaimed area (Rec), vegetation (Veg), built-up/urban area (Bul), rock outcrop (Roc), bare ground (Bar), gypsic soil (Gyp), and wetland/sabkhas region (Wetland/Sabkhas) (Wet). Table 2 displays the seven LULC classification schemes adapted for this study.

Table 2 Definitions of LULC classes (

3.4 Image Preprocessing

This study used multitemporal and multispectral Landsat satellite images acquired in 1986, 1994, 2000, 2005, 2013, and 2020. The changes in the LULC areas were detected using a Supervised Maximum Likelihood Classification algorithm for the six temporal dates. Remote sensing and GIS Software were used to handle the geometric and radiometric enhancement, mosaicking, and the sub-setting of Landsat 5 Thematic Mapper (TM), Landsat 7, Enhanced Thematic Mapper Plus (ETM+), and Landsat 8 Operational Land Imager (OLI) images. Furthermore, high-resolution satellite images of different dates IKONOS, 2000, GeoEye1 2011, 2013, Worldview3, 2019, ASTER, 2012, 2013 and multiresolution seamless image database-MrSID 1994, 1998 were used to enhance the LULC-classification. Image-preprocessing operations for this study include geometric correction (rectification), ETM+ Scan Line Corrector–SLC off and gap filling, radiometric, and atmospheric calibration (Lu et al. 2004). Geometric (rectification), ETM+ Scan Line Corrector–SLC off and gap filling, radiometric, and atmospheric calibration are a few of the image-preprocessing techniques used in this study (Lu et al. 2004). All the images were co-registered and radiometrically calibrated to compare different temporal dates of the study area. Radiometric calibration was done to reduce or correct errors in the digital numbers of the images. The calibrated Digital Numbers (DNs) were first transformed to absolute units of at-sensor spectral brightness and then to Top-Of-Atmosphere (TOA) reflectance (Chander et al. 2009). The DNs stored in the original image are converted into biophysical variables of standard significance (reflectance). For Landsat 5, 7, and 8, the TOA spectral radiance is calculated using the band-specific multiplicative rescaling factor. The TOA reflectance is then adjusted for the solar angle. TOA reflectance has three advantages over at-sensor spectral radiance when comparing images from different sensors. First, the time difference between data collection reduces the cosine effect of differing solar zenith angles. Second, TOA reflectance adjusts for spectral band changes in exo-atmospheric solar irradiance, resulting in varied values of exo-atmospheric solar irradiance. Third, the TOA reflectance compensates for differences in Earth–Sun distance between data gathering dates. These variances might be significant geographically and temporally (Chander et al. 2009).

3.5 Image Classification

We employed the supervised classification methodology to classify the LULC categories (sites) into the predetermined seven classes in this study, Table 2. Combining training samples (sites) with prior knowledge and familiarity with the study area is necessary to complete the supervised classification (Jensen 2005). The training sites were chosen based on visible areas in all the image sources. We began by defining polygons around representative portions of each LULC category (Sites) in each Landsat image as identified and defined in Table 2. Between 200 and 500 training sites were identified in each rectified satellite image. We employed the maximum likelihood classifier-MLC to produce spectral signatures to classify all the pixels in the image after the training sites had been established. Following the first classification, a majority filter was used to smooth the classification results by reducing noise from the classed raster maps, resulting in the final LULC raster maps utilized for further analysis. The 1:10,000 Bahrain topographic maps were utilized to construct ground signatures for the supervised categorization of the 1986 image. For the 1994 supervised classification, ground control signatures were created using MrSID orthophotos from 1994 and Bahrain topographic maps at a scale of 1:10,000. The ground signature was created for the 2000 Landsat ETM image using high-resolution IKONOS 2000. The Landsat 2005 ETM image and the Landsat 2013 OLI image, ASTER 2005 and 2013, were utilized to construct ground signatures. Ground signatures for the Landsat 2020 OLI were created utilizing Sentinel 2. The six images were classified into seven classes, namely built-up/urban area (Bul), bare land (Bar), gypsic soil (Gyp), reclaimed land (Rec), rock outcrop (Roc), vegetation (Veg) and wetlands/sabkhas (Wet) as displayed in Table 2. We then supervised all the images using the maximum MLC algorithm (Lillesand and Kiefer 1999).

3.6 Accuracy Assessment

The error matrix is the most used technique for determining accuracy (Congalton and Green 2019). The classification accuracy can be evaluated to produce an overall measure of the map’s quality, which can then be used to compare alternative change detection systems (Foody 2002). The minimum level of accuracy in interpreting remote sensing data to identify LULC classes should be at least 85% (Anderson et al. 1976). These standards error matrices (accuracy assessment statistics) are computed based on the same data references for each image to calculate the components of overall accuracy, user’s accuracy, producer’s accuracy, error of commission (EC), error of omission (EO), and kappa coefficient. Overall accuracy (OA) is the total number of successes compared to the total number of samples in the categorized image. It is calculated by summing the number of correctly classified values and dividing it by the total number of values in the confusion matrix in Eq. (1). User’s accuracy (UA) is the probability of classified pixel on each map representing the actual class on the ground or real-world location (Congalton 1991; Jensen 2005; Campbell 2007) and is calculated using Eq. (2). On the other hand, the producer’s accuracy (PA) measures the error of omission. It is the probability that a reference pixel is classified correctly. It is calculated by dividing the number of corrected classified samples of a specific category by the total number of reference samples using Eq. (3). The error of commission (EC) is the proportion of a pixel that is predicted to be in a class, but it does not. It is calculated using Eq. (4). The error of omission (EO) is the proportion of observed pixels on the ground that are not classified on the map, and it corresponds to the producer’s accuracy. The kappa coefficient (KC) is a measure of the difference between the actual agreement between reference data and an automated classifier and the chance agreement between the reference data and a random classifier, expressed using Eq. (5) (Lillesand et al. 2015). The percentage of correctly categorized pixels is calculated from the percentage expected by chance. It measures the difference between the actual agreement and chance (random) agreement between the map and the validation data on the ground (Congalton 2001). The higher the classification accuracy of the map, the more valuable it is for land administrators and land-use planners. The KC (Koc et al. 2012) is a discrete multivariate approach for determining the level of agreement or accuracy. Its value ranges from −1 to 1. However, it frequently falls between 0 and 1 (Zanotta 2018):

$$\mathrm{Overall Accuracy} (\mathrm{OA}) = \frac{\mathrm{Sum} {\rm of} {\rm diagonal} {\rm tallied}(\mathrm{correctly} {\rm indentified})}{\mathrm{Total} {\rm number} {\rm of} {\rm samples}}\times 100$$
(1)
$${\mathrm{User's}}\,\mathrm{Accuracy} \,\left(\mathrm{UA}\right)=\frac{\mathrm{Samples}\, {\rm correctly}\, {\rm identified}\, {\rm in}\, {\rm the}\, {\rm row}}{\mathrm{Row}\, {\rm total}}\times 100$$
(2)
$${\rm{Producer's}}\,\mathrm{Accuracy} \,(\mathrm{PA})=\frac{{\rm Samples}\, {\rm correctly}\, {\rm identified} \,{\rm in}\, {\rm the}\, {\rm column}}{\mathrm{Column}\, {\rm total}}\times 100$$
(3)
$$\mathrm{Error}\, {\rm of}\, {\rm Omission}\, (\mathrm{EO})= \frac{\sum \mathrm{off} \,\mathrm{Diagonal}\, {\rm element} \, {\rm of}\, {\rm Column} }{\mathrm{Column}\, {\rm Total}}\times 100$$
(4)
$$\mathrm{Kappa} \, {\rm Coeffient} \,(\mathrm{KC}) = \frac{N {\sum }_{i=1}^{r}xii - \sum_{i=1}^{r}{(x}_{i+} \times {x}_{+1}) }{{N}^{2} - {\sum }_{i=1}^{r}({x}_{i+}\times {x}_{+1})}$$
(5)

where r is the rows number in the matrix, xii is the number of observations in row i and column i (the diagonal elements), x+i and xi+ are the marginal totals of row i and column j, respectively, and N is the observations’ number.

This study used the error matrix (confusion matrix), the most efficient accuracy assessment method for the six-time period’s MLC raster classified maps: 1986, 1994, 2000, 2005, 2013, and 2020 (Roy and Inamdar 2019). We used this step to quantitatively assess how efficiently the classified remotely sensed data pixels were sampled into corrected LULC’s category. The resulting classified images were verified pixel by pixel for the accuracy assessment (Keshtkar et al. 2017). The seven classes, Table 2, were considered for accuracy assessment with a minimum of 71 sample points for each defined class, as recommended by Congalton (1991). We used the “Equalized stratified random” sampling method to create randomly distributed points, where each class has the same number of points. Four hundred and ninety-seven (497) systematic point samples for each classified image were used to test the precision by comparing them with specific points from the available ancillary data for this study. We calculated the confusion error matrix considering an accuracy level equal to or above 85% to be an excellent and reliable LULC classification as recommended by previous studies (Alrababah and Alhamad 2006; Manandhar et al. 2009; Koc et al. 2012). We used the confusion matrix for obtaining descriptive and analytical statistics of the classification accuracy assessment, as explained in Sup Tables 1–6 (Foody, 2002; Congalton, 1991; Jensen, 2005).

3.7 Change Detection

Change detection aims to compare the spatial representation of two points in time while controlling all variations due to differences in the variables of interest (Green et al. 1994). The ability to identify changes in the earth’s surface features in real-time and with high accuracy lays the groundwork for a greater understanding of the linkages and interactions between human and natural events, allowing for improved resource management and use (Lu et al. 2004). The most common data type used to detect changes is geographic data, usually in digital forms such as satellite imagery, analog format (earlier aerial images), and vector format (maps). Other data types, such as historical and economic data, can be employed (Singh 1989). Due to the availability of massive archival data sets in recent decades, several digital change detection algorithms and methodologies for evaluating and identifying LULC changes have been developed and evaluated (Dewidar 2004). These methods and procedures have been thoroughly examined, with great descriptions and summaries supplied (Haque and Basak 2017). Since digital change detection is significantly influenced by the temporal, spatial, spectral, and thematic resolutions of remotely sensed data, choosing the right change detection approach is critical for producing accurate findings (Lu et al. 2004). Image differencing, image rationing, PCA, CVA, and Post-Classification Comparison are the most common approaches for detecting changes (Xu et al. 2009; Bekalo 2009). The use of machine-learning techniques on remotely sensed imageries has recently received a lot of consideration (Maxwell et al. 2018; Adam et al. 2014). LULC classification applications have used different machine-learning advanced methods. For the supervised classification techniques, many researchers used support vector machine (SVM), spectral angle mapper (SAM), Mahalanobis distance (MD), random forest (RF), fuzzy adaptive resonance theory-supervised predictive mapping (Fuzzy ARTMAP), radial basis function (RBF), decision tree (DT), multilayer perception (MLP), naive Bayes (NB), maximum likelihood classifier (MLC), and fuzzy logic (Ma et al. 2019; Shih et al. 2019). The Affinity Propagation (AP) cluster method, fuzzy c-means algorithms, K-means algorithms, and ISODATA are examples of unsupervised classification approaches (iterative self-organizing data) (Maxwell et al. 2018; Camps-Valls et al. 2011). As a result, multiple studies on LULC modeling have been conducted utilizing various machine-learning techniques (Talukdar et al. 2020) and comparing machine-learning algorithms (Camargo et al. 2019). In addition, some research has been conducted to determine the most appropriate and accurate algorithm for LULC mapping among the many machine-learning classifiers (Camargo et al. 2019; Jamali 2019). In this study, we used the GIS-based change detection approach, which integrates GIS and remote sensing method (Lu et al. 2004) to explore and identify the changes that have taken place to the spatial extent and pattern of our study area (Gallego 2004). We used GIS because of its ability to incorporate a different source of data with different data accuracies and formats (Petit and Lambin 2001) for long-period intervals into LULC change detection (Lu et al. 2004). This approach helps analyze the direction, rate, and spatial pattern of LULC changes (Weng 2002). The magnitude of change (MC), the percentage of change (PC), and the annual rate of change (ARC) of the classified images were calculated based on the following equations:

$$\mathrm{MC} ({\mathrm{km}}^{2})= {A}_{i} - {A}_{f}$$
(6)
$$\mathrm{PC} (\%) = \frac{{A}_{i} - {A}_{f} }{{A}_{i}} \times 100$$
(7)
$$\mathrm{ARC} ({\mathrm{km}}^{2}{\mathrm{year}}^{-1}) = \frac{{A}_{i} - {A}_{f}}{n}$$
(8)
$$\mathrm{ARC} (\%) = \frac{{A}_{i} - {A}_{f}}{{A}_{i} \times n} \times 100$$
(9)

where Ai is the class area (km2) at the initial time, Af is the class area (km2) at the final time, and (n) is the number of years of the study period.

There are different techniques for change detection analysis, and the most common and used ones include post-classification comparison, image ratio, and manual on-screen digitization of change, principal components analysis, image regression, conventional image differentiation, and multi-date image classification (Ayele et al. 2018; Lu et al. 2004). We used the post-classification comparison techniques to integrate the six classified images, which were classified individually to produce a land cover raster map, Fig. 3a–f, Tables 3 and 4. Then, we compared the corresponding classes to identify areas where change has occurred (El-Hattab 2016). We build a series of “from-to” matrixes by comparing them on a pixel-by-pixel basis (Jensen 2005; Al-doski et al. 2013). It involves the overlay (or “stacking”) of two or more classified images. Change areas are simply those not classified the same at different times. The multi-date change detection uses a binary mask applied to two dates (Jensen 1996; Dobson et al. 1995) and quantifies the change rates and magnitude (Ukor et al. 2016). The degree of change detection success depends on the accuracy of each image classification (Jensen 2005). Many researchers utilizing USGS-Landsat achieves data (Islam et al. 2018; Matlhodi et al. 2019; Tena et al. 2019; Butt et al. 2015; Rawat and Kumar 2015) to undertake post-classification technique-based MLC for detecting LULC changes. Pairs of the produced classified raster maps of the six dates (1986, 1994, 2000, 2005, 2013, and 2020) were compared by applying Boolean logical “AND” operation using GIS Software. Three matrices were developed to understand the changing status and magnitude rate for 1986 to 2005, 2005 to 2020, and 1986 to 2020, Table 5. In addition, six chord diagrams were created using Microsoft Power BI to visualize the classified raster maps classes (Fig. 4a–f), and three to visualize the changes in the specified temporal dates (Fig. 5a–c). In addition, the data presented in Table 5 was used to create three-chord diagrams to visualize the changes for the specified years in this study: 1986–2005, 2005–2020, and 1986–2020 (Fig. 5a–c). These diagrams represent the changes in the seven LULC-classes in the study area: reclamation, vegetation, built-up, rock outcrop, bare soil, Gypsic soil, and wetland from the “initial year” to the “first year.” The data in the three tables were reconstructed and added to the program to fit the parameters’ criteria. The Chord visual offers three fields (From, To, and Values). The data from the initial year go into the first field (From), the data from the first year (year of change) go into the second field (To), and the change between classes goes into the third field (Values). Finally, the Chord diagram is displayed according to the parameters selected visually (Data Visualization | Microsoft Power BI, n.d.; Ferrari and Russo 2016). Different researchers have employed Chord-Diagram, as a graphical approach for visualizing the inter-relationships between data in building the transition matrices (e.g., Chen et al. 2017; Ferrara et al. 2021; Kacem et al. 2021; Xiao et al. 2021; Siddique et al. 2020; Vinatier and Arnaiz 2018; Viana and Rocha 2020).

Fig. 3
figure 3

Land Use Land Cover (LULC) classes for a 1986, b 1994, c 2000, d 2005, e 2013 and f 2020

Table 3 LULC area by class and its percentage to the total area (km2) 1986–2020
Table 4 LULC classes in 1986, 2005, and 2020 and the spatio-temporal changes of the study area
Table 5 LULC-classes change detection for 1986–2005, 2005–2020, and 1986–2020 (km2)
Fig. 4
figure 4

Chord diagrams visualize the total area of LULC-classes and the total area, by km2 for a 1986, b 1994, c 2000, d 2005, e 2013, and f 2020

Fig. 5
figure 5

a–c Chord diagrams explain the total area of LULC-classes transition by km2 during for a 1986–2005, b 2005–2020, and c 1986–2020

4 Results and Discussion

4.1 Accuracy Assessment

We used six confusion matrices to assess the classification accuracy in this study. Sup Tables 1–6 indicate that the image classification achieved satisfying accuracy results. The OA of the classified images was 88.73, 89.54, 89.13, 88.93, 90.34, and 95.77% for the temporal dates 1986, 1994, 2000, 2005, 2013, and 2020, respectively, thus ascertaining that the classification was within the excellent range according to Anderson et al. (1976), as he indicated that the minimum accuracy value for reliable LC-classification is 85%. The KA reached 86.85, 87.79, 87.32, 87.09, 88.73, and 95.07% for 1986, 1994, 2000, 2005, 2013, and 2020, individually. The results of the seven classes for all the temporal times have excellent individual user’s accuracy ranges between 80% and more than 94% for 1986. The user’s accuracy for all the classes is outstanding except for the “Rock outcrop,” about 80.28%, and “Gypsic soil,” which is 81.69%. In 1994, the UA ranged from 73% to more than 97%, and the results indicate that the “Vegetated” surfaces were most accurately classified (97.18). The less accurate class is “Reclaimed” land, 73.24%. In 2000, the UA ranged between 70 and 100%. The 2000 classified image results indicate that the “Vegetated” and “Rock outcrop” surfaces were most accurately classified (95.77%). The less accurate class is Gypsic, which is 83.1%. The 2005 classified image results indicate that the “Vegetation” surface revealed that UA reached 95.55%, while all the other classes of UA ranged between 83.1% and 87.32%. The UA of 2013 and 2020 Landsat8 OLI-classified images ranged between 91.55% and 98.59% and 87.32 and 100%, respectively. The producer’s accuracy (PA) has a nearly excellent result for all class-classified images. In 1986, the PA ranged between 76.47 and 100%. In addition, the results indicate that the “Reclaimed area” surfaces were most accurately classified (100%). The less accurate class is “Bare Land” land, 76.47%. In 1994, the PA ranged between 68% and more than 98.39%. Furthermore, the results indicate that the “Gypsic soils” surfaces were most accurately classified (98.39). The less accurate class is the “Bare land” area, 68%. The 2000 classified image results indicate that the “Rock outcrop.” Moreover, “Gypsic soils” surfaces were most accurately classified (100%). The less accurate class is the built-up area, which is 77.11%. The 2005 classified image results indicate that the “Gypsic soils” surface revealed a PA up to 96.72%, while all the other classes of PA ranged between 77.78 and 95.24%. The UA of 2013 and 2020 Landsat8 OLI-classified images ranged between 93.15 and 100% and 92.65 and 100%, respectively.

4.2 LULC Classified Maps

The six classified raster maps were used to reach the objectives of LULC change detection in this study. Table 3, Fig. 3a–f, and chord diagram Fig. 4a–f explicate the result of the six classified raster maps of 1986, 1994, 2000, 2005, 2013, and 2020 in detail. The chord diagram represents flows (connections) between the seven classes of LULC (reclamation, vegetation, built-up, rock outcrop, bare soil, gypsic soil, and wetland/sabkhas). A fragment on the outer part of the circular layout represents each class in the chord diagram, and then arcs are drawn between each class. The size of the arc is proportional to the significance of the change. The results of the 1986 LULC classified image Fig. 3a shows that Bare soil dominates a total area (41.4%), followed by wetland and sabkhas (15.9%), as illustrated in Fig. 4a. The built-up area covered 12.6% of the total area. More than 78 km2 of vegetation covered the ground, accounting for 11.2% of the total land surface. The rock outcrop takes up the minor land, accounting for only 4.7% of the total area. The 1994 LULC classified map results are demonstrated in Table 3, Figs. 3b, and 4b; the bare soil dominates a total area (29.1%), followed by wetland and sabkhas (21.9%). the built-up area covered 14.5% of the total area. More than 55.4 km2 of vegetation covered the ground, accounting for 7.8% of the total land surface. The rock outcrop takes up a minor land area, accounting for only 2.7% of the total area. In 2000, the results of the classified image Figs. 3c and 4c indicated that the rock outcrop covers a minor part of the total area, which is 3.4% or 24.3 km2. Bare soil covers a significant part of the total area, 37.2% (266.5 km2), whereas the wetland and sabkhas cover 13.2% (94.7 km2). The built-up area covers about (15.6%) 112.1 km2 of the total area. The reclaimed area covers the smallest amount of land, accounting for only 7.6% of the total area. According to the classification results of 2005, Figs. 3d and 4d, wetland and sabkhas account for 30.2% of the total area or 220.7 km2. Bare soil class accounts for 28.4% (208.1 km2) of the entire area. Nonetheless, the built-up area covers 19.6% (143.2 km2) of the land. Vegetation land covers 8.8% of the total land area (64.6 km2). The smallest class is the gypsic soils, which make up 5.9% of the total land area (42.9 km2). In the 2013 classified image results, Figs. 3e and 4e, the land area was covered by 28.6% wetland and sabkhas (220.8 km2), followed by built-up land (190.9 km2, 24.8% of the total area). The bare soil covered 21.2% or 163.7 km2. The reclaimed area covered 6.9% or 52.8 km2 of the total area. The LULC classes for 2020, Fig. 3f, indicate that the built-up land and bare soil area almost occupied the same total area of 312.2 and 301.1 km2, which account for 39.8% and 38.4% of the total area, respectively, Fig. 4f. On the other hand, the gypsic soils occupied the minor area this year and reached 0.8% or 6.3 km2 of the total area.

4.3 LULC Change Detection

The spatial analysis was conducted to assess the patterns of LC-change and overall LU-changes from 1986 to 2020. The LULC-classes of the study area were reclaimed, vegetation, built-up, rock outcrop, bare land, gypsic soils, and wetland/sabkhas. We designated three critical years: the starting year of the study, 1986, the mid-year 2005 (approximately), and 2020. The change detection analysis concentrated on three periodic times by creating three area matrices from 1986–2005 and 2005–2020, and the whole period of the temporal study dates from 1986 to 2020. The changing patterns were depicted using chord diagrams. Table 4 illustrates the LULC classes in 1986, 2005, and 2020 for each class, the total area, and the spatio-temporal changes in the study area for the last 34 years. In addition, three chords’ diagrams, Fig. 5a–c presents the changes from 1986–2005, 2005–2020, and 1986–2020. The total area of Bahrain has been increased by about 86 km2 from 1986 to 2020. Dredging and reclaiming the islands’ shallow water areas added nearly 33.8 and 52 km2 to the total land area between 1986 and 2005 and 2005 and 2020. The LULC change detection for the last 34 years in Bahrain revealed that the “Built Up” areas increased, whereas gypsic soils, rock outcrop areas, wetland/Sabkhas, and vegetation decreased, Table 4. Comparing the statistics of 1986 and 2005 (19 years) reveals that there is a decline in reclaimed, vegetation, rock outcrop, bare soil, and gypsic soil areas by 66.6, 13.76, 15.47, 80.84, and 14.90 km2, respectively, while there is an increase in built-up areas by 55.50 km2. Between 2005 and 2020 (15 years), the changes indicate a decline in wetland/sabkhas, Gypsic soil, and vegetation areas by 193.40, 36.60, and 10.17 km2, respectively. The built-up area has grown to reach 169.05 km2. The changes in the last 34 years indicate that the built-up area is the most dominant category-class. From 1986 to 2020, the built-up area increased from 88 km2 (12.6%) in 1986 to 224.54 km2 (39.8%) of the total area of Bahrain land in 2020, respectively. The wetland/sabkhas, gypsic soils, vegetation, and rock outcrop have lost 83.41, 51.50, 23.93, and 14.11 km2. The produced change maps and matrices were used to understand the changing status and magnitude rate for 1986–2005, 2005–2020, and 1986–2020.

Table 4 shows that four LULC classes have registered a negative annual change from 1986 to 2020. The vegetation, rock outcrop, gypsic soil, and wetland/sabkhas classes have recoded loss annually by −0.70, −0.42, −1.51, and −2.45 km2, respectively. The annual rate of change (ARC) of these classes were −1.59, −0.90, −1.26, −2.26, and −2.22%, respectively, for the same period. The built-up and bare soil gain was annually 6.6 and 0.36 km2, respectively, and their annual rate of change was 7.49 and 0.12%, respectively. Built-up areas have registered the highest annual growth rate. The wetland/sabkhas registered the lowest annual growth rate in the last 34 years.

4.3.1 LULC Change Detection from 1986 to 2005

Table 5 and Fig. 5a depict LULC change transitions from 1986 to 2005. The LULC class is stated in terms of km2. Most changes in the classes’ category between 1986 and 2005 were in negative values except for built-up and wetland/sabkhas classes. The reclaimed areas converted mostly to wetland (14.1 km2). During this period, the vegetation class lost 13.8 km2 and converted mainly to built-up. Furthermore, the rock outcrop class area lost 15.5 km2, which converted to bare soil in 2005. 78.9 km2 of the bare soil lands changed to wetlands in 2005, and 19.6 km2 of the wetlands in 2005 were Gypsic soil in 1986. Moreover, 14.4 km2 of the wetland in 1986 converted to built-up areas in 2005. Around 11.8 km2 of the built-up area in 1986 became wetland areas in 2005 (reconstruction of new urban area). The chord diagram transitions are colorized to see how LULC-classes changed from 1986 to 2005. In 1986, the dominant classes were bare soils, wetland/sabkhas, and built-up areas, while in 2005, the bare soil area lost more than 80 km2. The change detection from 1986 to 2005 indicates the shifting to the built-up area and reduction in vegetation land; however, the wetland/sabkhas were still the dominant class in 2005.

4.3.2 LULC Change Detection from 2005 to 2020

Table 5 and Fig. 5b describe the LULC-classes change transitions from 2005 to 2020. Three classes’ category changes were negative, which means loss of wetland/sabkhas, Gypsic, and vegetation. The built-up, bare soils and reclamation classes increased in the area. Reclaimed areas have become mostly built-up areas; at least 25 km2 from the reclaimed areas have been converted to built-up class. The vegetation class lost 10.2 km2 and converted mainly to built-up. The rock-outcrop class area lost 6.30 km2, which converted to bare soil in 2020. 25.8 km2 of the bare soil lands were changed to built-up areas by 2020, 102.5 km2 of the wetland in 2005 have converted to bare soil areas, and around 64.9 km2 became built-up areas by 2020. The chord diagram visualizes the transitions of LULC-classes changes from 2005 to 2020. In 2005, the dominant classes were wetland/sabkhas, bare soils, and built-up areas. The chord diagram shows the reduction in vegetation land. The change detection from 2005 to 2020 indicates that the shifting to the built-up area has become dominant. Built-up area is equal to 20% of the country’s total area in 2005.

4.4 LULC Change Detection from 1986 to 2020

Table 5 and Fig. 5c depict the LULC-classes change transitions from 1986 to 2020. Most changes in the classes’ category between 1986 and 2020 were directed to built-up areas. The reclaimed areas were converted mostly to built-up and wetlands/sabkhas, and 18.2 and 18 km2 were added to these classes. The vegetation class lost 43.6 km2, which converted mainly to built-up areas. The rock outcrop class area lost 14.1 km2, which converted mainly to bare soil in 2005. 60.8 km2 of the bare soil areas changed to built-up by 2020. In addition, 23.9 and 34.4 km2 from the gypsic soil wetland/sabkhas were added to built-up areas. 12.3 km2 of the rock outcrop areas in 1986 converted to built-up areas by 2020. Around 83.4 km2 of the wetland/sabkhas area in 1986 was lost. The change detection from 1986 to 2020 indicates the shifting of most LULC-classes to the built-up area and the reduction in vegetation and wetland/sabkhas land in the last 34 years. The change detection from 1986 to 2005 indicates the shifting to the built-up area and the reduction in vegetation land.

5 Conclusions

This study integrated remote sensing and GIS to quantify and analyze the LULC changes in Bahrain’s main islands over 34 years from 1986 to 2020. Moreover, LULC changes between 1986, 2005, and 2020 were demonstrated and visualized using statistics and chord diagrams. Furthermore, the identified seven LULC-classes revealed substantial change patterns in the study area. LULC-changes were determined using six multitemporal Landsat satellite imagery, and the classification accuracy was measured using the confusion matrix. The overall classification accuracy was acceptable. Conferring to the quantitative evidence from our study, Bahrain Island has witnessed significant land use and land cover changes since 1986. The study results showed a significant change in the LULC during the study period. During the study period, the built-up areas showed an increasing trend of 7.5% annually, while the Gypsic, wetland/sabkhas, and vegetation showed a decreasing trend of 2.26, 2.22, and 0.9%, respectively. The changes in LULC were effectively captured by the remote sensing Landsat satellite sensors with different spectral, spatial, and temporal resolutions with GIS analysis. The change detection analysis using GIS and remote sensing delivers valuable information to understand the annual patterns of land use dynamics for planners and decision-makers; therefore, sustainable land management planning is possible. The expansion of the built-up area in the study area was mainly at the expense of vegetation and wetland/sabkhas lands. The results indicate that the built-up area covered more than 40% of Bahrain’s main islands land. For the environment’s long-term sustainability, LULC changes should be constantly monitored in the future. Moreover, Landsat TM5, ETM+7, and OLI8 images archives and the newly invented remote sensing satellite imagery should be used to create and monitor accurate maps of LULC changes in Bahrain.