1 Introduction

The population of the world is projected to reach 8.5 billion inhabitants in 2030 [1]. It has also been forecasted that an additional 2 billion people to be added to the world, and 1.05 billion (52%) may come from sub-Saharan countries [1]. Urbanisation is a global phenomenon that has significant implications for the environment, particularly in relation to temperature dynamics within urban areas [2]. Urbanisation is the rise in both population and spatial expansion in urban areas [3]. It is a complex network of processes that leads to the transformation of the environment [4] and a multifaceted phenomenon encompassing demographic, ecological, sociological, and economic aspects [5]. Urbanisation results in the concentration of populations within urban areas and possesses the capacity to either stimulate or hinder the growth and advancement of these areas [6].

The magnitude of global urbanisation is remarkable, with the United Nations projecting that by 2050, nearly two-thirds of the world's population will reside in urban areas [4]. The fastest rate of urbanisation is found in Africa and it is projected that more than half of the continent’s inhabitants will be living in urban areas [7]. Urbanisation has positive sides that include human development, economic rise and reduction in the alleviation of poverty but it has been a major challenge for African countries including Ghana [8].

Many portions of Ghana have been classified as urban. These areas have a population of 5000 or more inhabitants [9]. In Ghana, 56.7% of the population live in urban/built-up areas in 2021 [10]. Ghana’s total land size is estimated at 238,539 square kilometres [11]; the urban/built-up areas covered 1460 square kilometres (0.6%) and 3836 square kilometres (1.6%) in 1975 and 2013, respectively, of the total area of Ghana [12]. Most cities in Ghana were founded for economic reasons during the pre-colonial era. In addition, a chain of coastal cities, primarily Elmina, Cape Coast and Accra (from 1860 to 1905), was established for trade reasons [13].

Cities in Ghana have increased both in population and areal extent [14, 15] and thus rapid spatial urban expansion has become a key feature of the major urbanised centres in Ghana [16,17,18]. The Accra metropolis is regarded as the most populous city in Ghana with 2,218,487 inhabitants [10]. It has experienced a lot of land use land cover changes (LULCC) due to urbanisation [16, 17]. Mapping using satellite imagery has led to a deeper understanding of the dynamics in urban/built-up areas [19] at the global level [20]; regional level [21] and city level utilising remote sensing techniques [22].

Remote Sensing (RS) is the art of obtaining observations from a phenomenon utilising instrument-based techniques [23]. It has been utilised to obtain meaningful data from the electromagnetic spectrum and has enabled features to be measured in urban/built-up areas [24]. Geographic Information Systems (GIS) and Remote Sensing (RS) have been used to generate Land Use Land Cover (LULC) maps from Landsat satellite images in various cities in Ghana [18, 22, 25,26,27,28] with results indicating a high transformation of other LULC classes especially, agricultural and forestlands into urban/built-up areas and has adversely impacted temperature [22, 27, 29]. These techniques have been adopted for both retrospective [28] and prospective studies [22].

Future land use land cover modelling techniques have been adopted around the world; in Florida, USA [30]; Beijing, China [31]; Daqahlia Governate, Egypt [32]; Abuja, Nigeria [33]; Kumasi [22] & Greater Accra Metropolitan Area [26] in Ghana. The Artificial Neural Network (ANN) based cellular automation technique was adopted for future modelling in several studies [34, 35]. The Cellular Automata – Markov chain [26, 36]; the CLUE-S model with the Markov model [31]; and the use of the Markov chain model [22, 37]. The results from the works of [22, 26, 37] predicted increases in the urban/built-up areas for the predicted years of the study areas investigated.

Seminal studies have documented the relationship between LULC and Land Surface Temperature (LST) worldwide [38,39,40,41]. In Ghana, the relationship between LULC and LST has been studied in the Sekondi-Takoradi Metropolis and revealed that LST was higher in the urban/built-up areas than in the city's non-urban/built-up areas. The study concluded that there was an indirect relationship between the Normalised Difference Vegetation Index (NDVI) and LST [42]. The works of [17] and [43] investigated the extent of warming trends in Tarkwa and the Greater Accra Region respectively. Results revealed that urban/built-up areas yielded increasing warming trends in both study areas.

Ordinary Least Squares (OLS) regression is regarded as a straightforward technique and generates an equation to represent a process [44]. It has been used to determine the impact of biophysical parameters on LST [45,46,47]. The Geographically Weighted Regression (GWR) is also a linear regression used to model spatially varying relationships. It constructs individual equations for each component in the dataset [48]. The correlation between LULC and land surface temperature [49]; the relationship between LST and NDVI, Normalised Difference Built-up Index (NDBI) [50]; assess the spatial correlation between temperature and environmental factors [51]. Results have revealed a negative relationship between LST and NDVI and a positive relationship between LST and NDBI [50, 51].

Previous literature have characterised the magnitude and the spatial coverage in Ghana. The research of [52] used a short duration of 12 years (2005–2017) to investigate the relationship between Land Use Land Cover (LULC) and Land Surface Temperature (LST) in Accra. It provided limited insights into the magnitude, intensity and spatial coverage in the selected cities. The research of [53] investigated rising temperatures resulting from urbanisation in Accra. However, the focus was not hinged on LULCC and spatial extent. The scientific work of [17] provided insights into increasing temperatures in the Greater Accra Region but put less focus on the nexus between LST and (Normalised Difference Vegetation Index (NDVI) and Normalised Difference Built-up Index (NDBI) which are good indicators of urban warming within an area according to the research of [54, 55].

Again, the studies of [56,57,58] focused on the characterisation and quantification of urban/built-up areas in Ghana whilst the research of [59] focused on environmental pressures exerted by urban expansion and introduced the spatial transformation of the various land cover classes in the metropolitan areas in Ghana. Most reviewed studies in Ghana have not focussed on the increasing temperatures resulting from spatial urban expansion [26, 43, 60]. Works cited used the traditional classifier, especially, the Maximum Likelihood Classifier which is based on probability [16, 17, 26] whilst the advent of state of the art, Machine Learning Algorithms has improved image classification [61,62,63] but little attention has been on the utilisation of machine learning algorithms for satellite image classification in Ghana.

In our research, we seek to fill this knowledge gap by utilising RS images from 1986 to 2022 to analyse the regression between LST and (NDVI and NDBI) not addressed in previous research in the Accra Metropolis. Moreover, the research will provide an insightful comprehension of spatial urban expansion in the Accra Metropolis in Ghana.

2 Materials and methods

2.1 Study area

Accra acts simultaneously as the capital of the Greater Accra Region and Ghana. The capital is in the South-Eastern portion of the coast of the Gulf of Guinea. The centre of the Metropolis is coordinated as 5.56°, -0.21°. The average monthly temperature ranges from 24.7 to 33 °C. The Local Government Act, 1993 (Act 462) and the Legislative Instrument (LI) 1615 established the Accra Metropolitan Assembly (AMA). The AMA consists of the Ablekuma (North, South and Central), Ayawaso (East, West and Central), Okaikoi (North and South), Ashiedu Keteke, Osu Klottey and Kpeshie (Fig. 1).

Fig. 1
figure 1

Map showing the study area (A = Map of Ghana showing the Greater Accra Region; B = Map of Greater Accra region showing the Accra Metropolis)

2.2 Dataset

The essential Landsat satellite imagery with a cloud cover of less than 10% was downloaded, acquired and assembled as described in Table 1.

Table 1 Datasets used in the research

The supplementary dataset utilised were the Open Street Map (OSM), World Topographic Map (WTM) and the World Street Maps (WSM). The OSM road network dataset was obtained from Geofabrik´s website. The WTM and the WSM were Base maps in the ArcGIS software.

The generalised methodology adopted for the overall of this study is shown in Fig. 2. The entire workflow is segmented into two portions;

Fig. 2
figure 2

Methodological framework adopted for this research

  • The utilisation of satellite images for LULC changes and accuracy assessment, urban modelling for 2030 and urban expansion analysis

  • The usage of satellite images for remote sensing indices and LST analysis (Fig. 2)

2.3 Satellite image processing and classification

Landsat images were resampled utilising the Nearest Neighbour technique. The Nearest Neighbour (NN) resampling technique was utilised. The original values of the original image are preserved in comparison to other techniques [64]. The satellite images were projected to the World Geodetic System (WGS) 84, Universal Transverse Mercator (UTM) Zone 30 N and the Ghana Datum War Office. The area of interest i.e. Accra Metropolis was extracted from the World Topographical Map and the Landsat images for onward processing. The different bands (excluding the thermal and panchromatic bands) were stacked to form a composite band.

According to [65], four LULC groups were classified for the selected Landsat images regarding earlier works on LULC classification in Accra [16]. The LULC classes were urban/built-up areas, agricultural, forestlands and water. A concise description of the classes is illustrated in Table 2.

Table 2 LULC classes

The classification technique utilised was supervised classification. It was utilised for identifying, classifying and evaluating LULC groups. The assigning of pixels to their chosen classes was done using the Random Forest technique. The Random Forest algorithm is utilised to classify satellite images through the creation of trees [66]. This algorithm offers improved accuracy in classification and reduced computational time compared to other methods [67, 68]. The input data consisted of spectral bands, and the output was the dominant land use and land cover (LULC) class determined by majority voting. Each tree operated independently, enabling simultaneous training and testing processes. The technique is not limited by statistical assumptions was thus the reason for choosing the technique [66].

2.4 Assessment of the accuracy of the classified satellite images

The training areas of the various LULC types were assessed for the 2022 Landsat image. The ground control points were evaluated and validated utilising the GPS etrex 64 s. The fieldwork for validating the classified image of 2022 was done in January 2023 for ten days. The field visit was untaken in the dry season since the Landsat images were also obtained during the same period. This made it suitable to correlate satellite images to the ground features. The equalised stratified random sampling technique was used for the generation of the field points because it created the same number of random points for each LULC class. These ground control points were 200 sample points for the four LULC classes (i.e.) 50 samples for each LULC class. The supplementary map datasets aided in the identification of the different LULC groups.

2.5 Land use land cover modelling and validation for 2030

The methodological flow of the adopted techniques for land use land cover modelling and validation is illustrated in Fig. 3.

Fig. 3
figure 3

Flowchart for future LULC modeling

Before modelling and validation, the conditions to be met are carried out and these are; initial checks to ensure that the temporal satellite images were harmonised, had the same spatial features, legend and the same consistent categories and finally, the backgrounds of the satellite images had the same value of zero.

The temporal images of 2003 and 2015 were designated as t1 and t2 respectively. Both temporal images were used as the base maps. The first step was the change analysis. The ‘from—to’ identified were the transitions from one land cover type to another. The analysis of the prediction was achieved by utilising an aggregate of empirically transitioned sub-models. The driving variables utilised in this research were; Slope, Elevation, Distance to urban/built-up areas and Distance to major roads.

The driver variables were considered the main variables that influenced spatial urban expansion. The selection of the driver variables was based on the review of initial studies [25, 69]. The slope and elevation were used to account for the geophysical factors. The distance drivers indicated the nearness of pixels to forces that act as motivation for spatial urban expansion. In 2003, the distance to urban/built-up areas was set as a dynamic variable. The distance to the urban variable indicated the proximity of a pixel to existing urban/built-up areas in 2015. The modelling of the distance to urban/built-up areas was set up as a dynamic variable because the distance of urban pixels changes with time.

The Multi-Layer Perceptron handles multiple transitions at a time. The neural network was trained to build a model utilising the information from the data such that the network could hypothesise and forecast the outputs. The error in the network was minimised by adjusting the weights between the nodes (utilising a proportion of the input data), and utilising the backpropagation algorithm. After iterations, the calculation of the error across the neural network was executed. The model was trained utilising sample pixels in the two LULC maps that experienced changes from one land use to another, as well as pixels that remained unchanged. The sample size was set to the least number of pixels that changed from one LULC type to another by default. The pixels were divided into two groups, one for training the model and the other for testing or validating the model's accuracy. The Markov Chain Analysis was utilised to model the quantitative changes in each transition for the projected year (2030). The potential maps generated were utilised to forecast the future scenario of 2022. The forecasted map with the same LULC classes as the input maps were produced.

Validation enabled the determination of the quality of the predicted maps of 2022 compared to the real 2022 LULC maps. Visual and statistical approaches were used to validate the models (Pontius & Chen, 2006). This resulted in a map consisting of four main categories:

  1. i.

    ‘Hits’ meant the changes predicted by the model were correct.

  2. ii.

    ‘False alarms’ meant the model predicted changes but no changes occurred.

  3. iii.

    ‘Misses’ signified that the model predicted no changes, but changes happened.

  4. iv.

    ‘Null success’ meant areas the model predicted no changes and no changes occurred.

Finally, after the verification, the model's predictive potential was performed between (2003 and 2015). Then, the simulation was done to predict the LULC map of 2022 utilising (2003 and 2015 LULC maps) as the base maps. The entire procedure was repeated to obtain the projected map of 2030.

2.6 Remote sensing indices

The NDVI is the most widely used remotely sensed vegetation index [70] and thus was extracted for this study. It is among the extensively used vegetation indices to communicate vegetation information [71]. The NDVI reveals the greenness of an area of interest due to its higher sensitivity in the presence of vegetation [72]. NDBI was used to extract urban/built-up areas from the satellite imagery. It indicated the intensity of development and impervious surfaces of urban/built-up areas. Both indices are indicators of Urban Heat Islands (UHIs) effects by determining their relationships with LST [55]. Thus, the rationale for analysing NDVI, NDBI and LST in this research. We adopted a pragmatic approach by using each one of the vegetation indices and built-up indices despite the variety of indices available. This enabled a precise investigation into the study area utilising the RS indices.

The Near Infra-Red (NIR) and the Red (R) bands were utilised for the computation of NDVI as;

$$\text{NDVI }= \text{ } \frac{\text{NIR }\text{- }{\text{R}}}{\text{ NIR }+ \text{ } {\text{R}}}$$
(1)

The NDBI was computed using the Short Wave Infra-Red (SWIR) and the NIR bands as shown in the equation:

$$\text{NDBI}=\frac{\text{SWIR}-\text{NIR}}{\text{SWIR}+\text{NIR}}$$
(2)

2.7 Retrieval of land surface temperature

To obtain the LST, the following computations were done. The thermal Digital Numbers (DNs) were converted into spectral radiance using the Equation based on [73].

$${\text{L}}{\lambda}= \text{ } \frac{{\text{Lmax}}{\lambda }\text{- Lmin}{\lambda}}{\text{ Qcalmax - Q calmin}}\left(\text{DN - Qcalmin}\right)\text{ + Lmin}{\lambda}$$
(3)

where: Lλ = Spectral Radiance, Lmax = Spectral radiance of DN value 255, Lmin = Spectral radiance of DN value 1, Qcalmax = Maximum numerical value of the image (retrieved from header file), Qcalmin = Minimum numerical value of the image (retrieved from header file).

For the Landsat 8 satellite image of 2015, computation into spectral radiance from DNs was done using the equation:

$$L_{\lambda } \, = \,M_{L} *{\text{ }}Q_{{cal}} \, + \,A_{L} .$$
(4)

where: Lλ= Spectral radiance (W/ (m2*sr*μm)), ML= Radiance multiplicative scaling factor for the band (RADIANCE_MULT_BAND_n from the metadata)

AL = Radiance additive scaling factor for the band (RADIANCE_ADD_BAND_n from the metadata)

Qcal = Level 1 pixel value in DN [74].

However, the flow for the calculation of LST remained the same. Thus, the spectral radiance was then converted into brightness temperature using the equation:

$${\text{T}}_{\text{sat}}\text{ = }\frac{{\text{K}}_{2}}{\text{ln (}\frac{{\text{K}}_{1}}{\text{L}\mathrm{\lambda}}\text{+1)}}$$
(5)

where: Tsat = Brightness Temperature, Lλ = Spectral Radiance, K1 = Calibration constant 1 (607.76 for TM, 666.09 for ETM+ and 774.89 in Landsat 8 Band 10), K2 = Calibration constant 2 (1260.56 for TM, 1260.56 for ETM+ and 1321.08 in Landsat 8 Band 10)

Finally, the surface temperature was converted from Kelvin Tsat to Degrees Celsius Tc by applying the equation:

$${\text{Tc }} = {\text{ Tsat }}{-}{ 273}$$
(6)

The emissivity was calculated from the NDVI [75] by applying the equation:

$$\varepsilon \, = { 1}.00{94 } + \, 0.0{\text{47Ln }}\left( {{\text{NDVI}}} \right)$$
(7)

hence, LST based on object reflectance using was calculated using the algorithm:

$${\text{LST }} = \frac{{{\text{T}}({^\circ } {\text{C)}}}}{{{\text{(1 + (}}\lambda {\text{* T (}}{^\circ } {\text{C)/}} \rho {\text{) *Ln(}}\varepsilon {\text{)))}}}}$$
(8)

where: T (°C) = calculated brightness temperature, λ = Raw wavelength of emitted radiance, ρ = h* c/σ (1.4038*10−2mk); σ = Stefan Boltzmann’s constant; h = Plank’s constant; c = light velocity.

2.8 Correlation analysis

The satellite images from the Landsat series were employed. It was used to provide a general overview of NDVI, NDBI and LST of the Accra metropolis. A total of 260,005 were generated using an approximated factor of 0.09. This optimum number of points (260,005) was used to determine the NDVI, NDBI and LST value for each point created. This was done to ascertain the correlation amongst them. Thus, the Accra metropolis with 23,400 ha was divided by the approximated factor (0.09) to obtain the total number of points.

2.9 Regression analyses of LST, NDVI and NDBI

Both global (Ordinary Least Squares) and local (Geographic Weighted Regression) were used to evaluate the relationships between LST, NDVI and NDBI.

2.9.1 Ordinary least squares

OLS based on the assumption that there are independent observations and thus fails to encompass the spatial dependence of the data when applied to geo-referenced data analysis [51].

The Ordinary Least Squares (OLS) was computed as;

$${\text{Y}} = \lambda + \beta _{1} {\text{X}}_{1} + \beta _{2} {\text{X}}_{2} {\text{ + }}\beta _{3} {\text{X}}_{3}$$
(9)

2.9.2 Efficiency Criteria for the OLS

The coefficient for each explanatory variable reflected the strength and relationship the explanatory variable has with the dependent variable. If the coefficient was positive, the relationship was positive and vice versa. The model performance was evaluated using the R-squared and the adjusted R-squared values. The values ranged from 0.0 to 1.0. It was regarded as an accurate measure of model performance [76].

The assessment of each explanatory variable was done using the Variance Inflation Factor (VIF). It measured multicollinearity. A high VIF value was an indication of high collinearity. Mostly, the VIF threshold was set at 7.5 [77]. The joint F-statistic and joint Wald statistic evaluated the statistical significance of the model. For a 95% confidence level, a p-value less than 0.05 indicated a statistically significant model [78].

The Koenker (BP) Statistic tested if the explanatory variable in the model had a consistent relationship with the dependent variable in geographical space. At a 95% confidence level, a p-value (probability) smaller than 0.05 indicated statistically significant heteroscedasticity and/or non-stationarity [78]. The evaluation of model bias was evaluated using the Jarque-Bera statistic that indicated whether the residuals were normally distributed. If the residuals were not normally distributed it implied that the model was biased [77]. The Akaike's Information Criterion (AICc) provided additional information on the model performance [78].

2.9.3 Geographic weighted regression

GWR used in this research was mathematically equated as;

$$y_{{\text{i}}} \; = \;{\text{a}}_{0} \left( {{\text{u}}_{{\text{i}}} ,\;{\text{v}}_{{\text{i}}} } \right)\; + \;~\sum {\text{ka}}_{{\text{k}}} \left( {{\text{u}}_{{\text{i}}} ,\;{\text{v}}_{{\text{i}}} } \right){\text{x}}_{{{\text{ik}}}} \; + \;~\sum {\text{i}}$$
(10)

where (ui, vi) represented the coordinates of the i-th point in space and ak (ui, vi) was a realisation of the continuous function ak (u, v) at the point i” [79].

2.9.4 Efficiency criteria for GWR

The Bandwidth or Number of Neighbors was utilised for the local estimation and was regarded as the most essential parameter for GWR. The bandwidth was automatically generated as 205 neighbours [78]. The sum of the squared residuals in the model was the residual squares. The smaller the residual squares, the closer the fit of the GWR model to the observed data [78].

Effective Number reflected the trade-off between the variance of the fitted values and the bias in the coefficient estimates and was related to the choice of bandwidth [78]. Sigma was the value of the square root of the normalised residual sum of squares when the residual sum of squares was divided by the effective degrees of freedom of the residuals. Smaller values of this statistic were preferable. Sigma was used for AICc computations [78].

The AICc measured model performance and was useful for the comparison of different regression models. If the AICc values for the two models differ by more than 3, the model with the smaller AICc was regarded as better [78]. The R-squared and the Adjusted R-squared were measures of goodness of fit. Its value varied from 0.0 to 1.0 with higher values being preferable [76].

3 Results

3.1 Satellite image processing and Classification

The classified image of 1986 revealed that the urban/built-up areas began along the coast and spread upwards and outwards to the other portions of the Metropolis (Fig. 4). Again, most of the urban/built-up areas were close to the coastline (the southern boundary of the Metropolis). The forestlands were surrounded by a large number of agricultural lands generally in the northern parts of the city. The classified image of 2003 revealed dominance by urban/built-up areas. Most of the agricultural lands were gradually depleting. Moreover, in the classified image of 2015, a higher proportion of urban/built-up with agricultural lands decreasing was observed. The visual inspection indicated a rise in forestlands. The water environment visually appears to have maintained its shape throughout the study.

Fig. 4
figure 4

LULC maps for Accra for the years 1986, 2003 and 2015

The urban/built-up areas dominated the other land cover types for the entire study. It covered more than half of the land size of Accra. The water environment occupied the smallest area. However, agricultural lands continued to diminish until the last year. There was a decrease from 1986 to 2003 for forestlands but a gradual increase was observed in 2015 (Fig. 4)

Similar patterns in the increasing trends in urban/built-up areas continued unabated in the last year of study. It is also evident that agricultural and forestlands have decreased in 2022 (Fig. 4).

The comparison of LULC for the years 1986, 2003 2015 and 2022 revealed that urban/built-up areas increased throughout the study period. There was a reduction in agricultural lands and a decrease in forestlands in 2003 but an increment was observed for forestlands in 2015 and a decline in the last year of the study. An upward increase in urban/built-up from an initial 5.22% (1986 – 2003) and a further upward adjustment of 8.74% (2003 – 2015) was observed. From 2015 to 2022, 8.30% of urban/built-up areas were added. However, agricultural lands had the highest rate of decrease of − 4.37% between 1986 and 2003 and an appreciable decline of − 9.02% between 2003 and 2015. Agricultural lands decreased by 2.52% in 2022. Forestlands had a percentage change of − 1.42% between 1986 and 2003 and a marginal percentage increase of 0.71% from 2003 to 2015 and finally, decreased by 1.78% in 2022. The water environment experienced the least changes as aforementioned in the general description (Table 3).

Table 3 Distribution of LULC

Furthermore, the annual rate of change (Ceteris paribus) revealed that urban/built-up increased throughout the study period. It recorded the highest rate of change (0.92) between 2015 and 2022. Agricultural lands decreased drastically, especially between the period (2003–2015). The highest rate of changes occurred for forestlands occurred between 1986 and 2003 whilst the water environment recorded the least changes throughout the study period (Table 4).

Table 4 Changes in area and percentages

3.2 Accuracy assessment

The quality of any thematic map is determined by its accuracy. Kappa statistic was used to measure the agreement between the classified satellite images and the training samples or the reference data.

The kappa statistic using the RF classifier produced accuracies for 1986, 2003, 2015 and 2022 of 85.3%, 96.0%, 94.7% and 96.0% respectively. The producer and user and accuracies are outlined in Appendix 1.

3.3 Land use land cover modelling

The urban/built-up was the major transition model studied, and as a result, the vital transitions from other LULC classes were explored. Cramer’s V indicated the potential explanatory strength of a driving factor in spatial urban expansion (Table 5). Cramer’s V value greater than or equal to 0.15 for any driving factor was regarded as valuable, but if values are higher than or equal to 0.4, the Cramer’s V value was considered very good [80]. The next procedure was to run the sub-model of important transitions, and the transition maps were generated. The LULC map of 2015 was simulated. It yielded quite satisfactory LULC predictions.

Table 5 Cramer’s V driving factors for the potential Land Use Land Cover Changes in 2030

3.3.1 Validation of LULC maps

Figure 5 depicted the validated projection that illustrated the accuracy accomplished by the model. The maps were composed of hits, misses, false alarms and null successes. The visualisation revealed that null success was dominant. The misses were observed, mostly in the validated map. However, the false alarms were less observed in the validated map but the hits were sparsely distributed.

Fig. 5
figure 5

Validated maps of 2022

3.3.2 Projection of land use land cover into 2030

The Land Use Land Cover (LULC) projection into the year 2030. The dominant class for the projected maps was urban/built-up areas. The water environment on the western boundary maintained its shape for Accra. However, the agricultural and forestlands were mostly found in the upper boundary of Accra and gradually spread into the central portions of Accra (Fig. 6).

Fig. 6
figure 6

Land Use Land Cover projections into 2030

3.3.3 Areal extent of land use land cover in 2030

The urban/built-up areas are projected to occupy 19053 ha (81.4%). Agricultural lands are tipped to cover 1897 ha (8.1%). However, forestlands are expected to have 1509 (6.5%) and 941 (4.0%) in 2030. Nevertheless, the water environment is forecasted to maintain almost its areal extent in 2030 (Table 6).

Table 6 Area in hectares and percentages

3.4 Spatial urban expansion analysis

To expedite the analysis of the trend of expansion of the urban/built-up areas, the pixels of urban/built-up were extracted from the classified images. The overlay of the urban/built-up areas of the selected years showed the extent of spatial urban expansion that occurred from 1986 to 2022. Figure 7 revealed the urban/built-up areas throughout the study period.

Fig. 7
figure 7

Spatial urban expansion in Accra

3.4.1 Spatial urban expansion in the sub-metropolitan zones

The Accra Metropolis was segmented into 5 sub-metropolitan zones: Ablekuma, Ayawaso, Kpeshie, Okaikoi and Osu Klottey. The spatial urban/built-up maps were overlaid with the respective zones to depict the spatial occurrence and the area covered in each zone (Fig. 8).

Fig. 8
figure 8

Spatial urban/built-up areas in the sub-metropolitan area

3.4.2 Spatial urban extent in each sub-metropolitan area

The years 1986, 2003, 2015 and 2022 yielded urban extents of 14,707, 16,159, 16,997 and 19,497 ha respectively. Kpeshie had the highest urban extent whilst Osu Klottey had the lowest urban extent for the entire study period. In 1986, the urban extent ranged from 1524 to 6421 ha. However, in 2003, it ranged from 1242 to 6482 ha. In 2015, the values were from 1586 to 6608 ha. In 2022, the urban extent ranged from 1573 to 7308 ha. Between the years 1986 and 2003, the Ayawaso zone recorded the highest rise of 713 ha, followed by Okaikoi (366 ha), Ablekuma (293 ha), Kpeshie (61 ha) and finally Osu Klottey (18 ha). Interestingly, between 2003 and 2015, the zones ranked the same way as between 1986 and 2003 except for variations in values (Table 7).

Table 7 Urban Extent in each sub-metropolitan in Accra

3.4.3 Urban expansion analysis along a major road

The highway used was the George Walker Bush Highway with a road stretch of 35 km. The expansion was mainly found in the southern parts of the created buffers. The significant area in the northern parts of the study area were mainly forestlands (Fig. 9).

Fig. 9
figure 9

Urban extent in 500 m buffer along a major road

3.4.4 Density of spatial urban expansion

The total area covered by urban/built-up areas ranged from 1253 to 8542 from the 500 m to 4000 m buffer. The total area was from 1388 ha to 14,499 ha. There was a decreasing trend in the density of spatial urban expansion from the 500 m to the 4000 m buffer as the values ranged from 0.90 to 0.59 (Table 8).

Table 8 Density of spatial urban expansion

3.4.5 Density decay curve

The relationship between the distance to a major road and the density of urban expansion showed a decreasing trend. The linear regression also indicated an inverse relationship between the two sets of variables (Fig. 10).

Fig. 10
figure 10

Density Decay Curve of Accra

3.5 Remote sensing indices

3.5.1 General overview of NDVI

The NDVI values ranged from -0.38 to 0.54. Matching the NDVI and LULC maps of Accra (Fig. 2), the visual interpretation broadly indicated that the lowest values for NDVI were recorded in the water environment located on the western boundary of Accra. It was observed that the urban/built-up areas produced low values for NDVI, followed by agricultural lands. Visually, the highest values were recorded in the forestlands. The above pattern was uniform for the NDVI maps of 1986, 2003, 2015 and 2022 (Fig. 11).

Fig. 11
figure 11

General overview of Normalised Difference Vegetation Index

3.5.2 General overview of NDBI

The NDBI values ranged from − 0.25 to 0.43. The visualisation revealed that the lowest values were recorded in the water environment followed by the agricultural and forestlands in the NDBI maps. The portions classified as urban/built-up yielded the highest values for NDBI. The 2003 imagery produced similar patterns as the 1986 satellite imagery. The 2015 and 2022 NDBI maps had the forestlands also record lower values in the water environment whilst the agricultural lands produced moderate values and ultimately, urban/built-up recorded the highest values for NDBI (Fig. 12).

Fig. 12
figure 12

General overview of Normalised Difference Built-up Index in Accra

3.6 Retrieval of land surface temperature

The daily temperature values ranged from 19.3 to 33.3 °C. It was noticed that temperature values for 1986 were the least compared to the other study years. It was also observed that the water component and the forestlands produced very low temperatures. Visually, the urban/built-up areas produced higher temperatures in comparison to the other LULC types. The agricultural lands produced moderate temperatures throughout the study period (Fig. 13).

Fig. 13
figure 13

General overview of land surface temperature

3.7 Correlation analyses

It was observed that the coefficients showing the relationships between LST and NDBI were higher than between LST and NDVI. There was a negative correlation between NDVI and LST, likewise NDVI and NDBI throughout the entire study period. However, a higher positive correlation was observed between NDBI and LST (Table 9).

Table 9 Correlation analysis amongst the RS indices

3.8 Comparison of OLS and GWR

Comparing model diagnostics was considered one of the best ways to assess the performance of models and thus, OLS and GWR were compared using their adjusted R squared and AICc values.

Generally, GWR yielded higher values for the adjusted R squared compared to OLS. Similarly, it also produced smaller values for AICc when ranked with the OLS. It was therefore evident that the GWR models performed better than the OLS models (Table 10).

Table 10 Evaluation of OLS and GWR based on model diagnostics

The coefficient for the three years was positive as observed from the equations. Although both coefficients produced positive numbers, the coefficients of NDBI produced higher values compared to NDVI.

The following equations were derived to showcase the relationship between the dependent variable (LST) and the independent variables (NDVI and NDBI).

The equation for the satellite image for 1986 was;

$${\text{LST }} = { 21}.{39 } + \, \left( { - {5}.{57}*{\text{NDVI}}} \right) \, + \, \left( {{12}.{15}*{\text{NDBI}}} \right) \, + {\text{ E}}$$
(11)

The equation for the satellite image for 2003 was;

$${\text{LST }} = { 27}.{22 } + \, \left( { - {1}.{17}*{\text{NDVI}}} \right) \, + \, \left( {{8}.{34}*{\text{NDBI}}} \right) \, + {\text{ E}}$$
(12)

However, the equation for the satellite image for 2015 was;

$${\text{LST }} = {\text{ 27}}.{\text{59 }} + {\text{ }}\left( { - {\text{8}}.{\text{39}}*{\text{NDVI}}} \right){\text{ }} + {\text{ }}\left( {{\text{21}}.0{\text{9}}*{\text{NDBI}}} \right){\text{ }} + {\text{ E}}$$
(13)

Finally, the equation for the satellite image of 2022 was;

$${\text{LST }} = { 28}.{95 } + \, \left( { - {9}.{42}*{\text{NDVI}}} \right) \, + \, \left( {{24}.0{5}*{\text{NDBI}}} \right) \, + {\text{ E}}$$
(14)

where E is the error term.

4 Discussion

4.1 Classification of satellite imageries and their assessed accuracy

The random forest classification algorithm utilised the pixel values to delineate the different Land Use Land Cover types. The evaluation criteria for the classified satellite images were based on their classification accuracies. The kappa statistics of the classified LULC maps were above 85% regarded as the minimum standard set by the USGS classification scheme [65]. The user and producer accuracies and kappa statistics produced excellent results. These results were compared to the work of [61] that used RF classification. The kappa statistics obtained in the works of [16, 25, 60, 81] in Ghana using the Maximum Likelihood Classifier were lower compared to the results obtained using the RF classifier in this research. The initial works of [82, 83] revealed the superiority of the RF classifier over other types of classification algorithms. The extraction of the different LULC types using Landsat satellite imagery provided insights that sought to contribute to the existing knowledge on the usage of random forest algorithm for satellite image classification. Therefore, it upgraded the traditional extraction techniques classifiers utilised in satellite image classification in Accra and other study areas in Ghana. Therefore, attention should be shifted to the utilisation of machine learning algorithms for the classification of LULC classes by researchers, especially in Ghana.

4.2 Land use land cover

Spatial urban expansion began along the coast of Accra before 1986 and spread upwards and outwards to the other areas of Accra as observed from the classified images. The LULC maps revealed that most urban/built-up areas were close to the coastline (southern boundary of the Metropolis) in Accra. The depletion of agricultural lands in the classified image of 2003 could be attributed to the increased demand for land for residential and commercial hubs. The reduction in the forestlands may have resulted from the illegal logging of trees in Accra. This assertion supports the earlier works of [84, 85] that concluded that forestlands were depleted in the Metropolis. The classified image of 2015 also revealed a higher proportion of urban/built-up areas with agricultural lands decreasing. The visual inspection indicated a rise in forestlands and it might be attributed to the afforestation activities and stringent measures put in place by the Forestry Commission of Ghana through its afforestation programmes [86]. The water environment visually appeared to have maintained its shape throughout the study.

The analysis of the LULC maps revealed a continuous increase in the urban/built-up areas at the expense of agricultural and forestlands. The increase in population is a distinct factor that could have triggered spatial urban expansion in supporting the outcome of the works of [26, 87, 88]. The increased nexus between urban/built-up areas and population could be attributed to the architectural style of buildings that are dominated by the horizontal spread of houses (one-storey houses) without making use of vertical space (two or more storey buildings) [89, 90]. In Ghanaian society, house ownership apart from the provision of shelter serves as a higher indicator of one’s social status and prominence in society [91]. Moreover, for most Ghanaians, the ownership of lands and houses is key and necessary for attaining pride which is an integral part of their traditions and culture [92]. Therefore, spatial urban expansion could be due to the uncontrolled individual usage of land as most people prefer to buy lands at the urban fringes where lands for development into urban/built-up areas have relatively lower prices [93].

The results indicate that the focus has been on the transformation of different LULC into urban/built-up areas. The scramble for land for the construction of housing units could lead to the continuous transformation of other LULC types into urban/built-up areas. It might also increase the conflict among the populace over the ownership of lands. It is urged that the lands commission of Ghana, traditional authorities and communities and the Ministry of Works and Housing may have to collaborate to control the accelerated spatial urban expansion through education and enforcement of the policies related to land use in Accra and other developing cities in Ghana.

4.3 Spatial urban expansion

Accra's spatial urban expansion trend was found to be outwards and northwards. Demand for residential land may be the primary cause of spatial metropolitan growth. According to the findings of the study, urban/built-up areas have been under intense pressure [17, 27, 94]. It was recorded that 50% of the structures in the Metropolis were single-story compound houses, with the majority of multi-story buildings not exceeding the third floor [95, 96].

Osu Klottey had the least geographic urban growth among Accra's sub-metropolitan zones. This could be ascribed to the fact that the first Accra settlers resided in Osu Klottey, leaving smaller sections for future spatial urban growth. The presence of most governmental organisations in Osu Klottey during the colonial and post-independence periods could be ascribed to spatial urban growth [97]. The spatial urban growth was most noticeable in the northern sections of the Ablekuma, Okaikoi, and Kpeshie sub-metropolitan zones, which were heavily transformed from other LULC types to urban/built-up areas. The spatial urban expansion revealed in this study backs up the findings of [98], who found that Accra witnessed earlier spatial urban expansion.

The declining tendency in the density decay curve supported the previous work of [99, 100] that Accra had been growing before the 1980s. Some of the noted effects of spatial urban growth included intrusion on farming and forestlands for domestic and business purposes. Public open spaces, particularly green spaces, have been converted into urban/built-up regions [101].

The distance to a main road had a negative association with the density of spatial urban growth. For places further away from the George Bush Highway, the population decreased. The distance to a main route had a negative association with the density of spatial urban growth. The greater the distance from the George Bush Highway, the lower the intensity of spatial metropolitan growth. The closeness of social facilities to the highway may explain the trend. City planners prefer to build near highways to take benefit of established facilities and social amenities. As the extent of the buffer grew, non-urban regions were converted into urban/built-up areas.

This suggests that the scramble for land near a major highway appears to be unabated. Other types of flora and fauna may be impacted, emphasising their numbers and ecological significance to the ecosystem's equilibrium. The impacts of these conversions are not examined with a primary emphasis on the growth of the lands for business or residential uses.

4.4 Analysis of Remote Sensing Indices (NDVI, NDBI) and LST

The general overview of NDVI and NDBI revealed that urban/built-up areas produced lower values for NDVI but higher values for NDBI. This situation is the reverse for both agricultural areas and forestlands. In the case of the water environment, it recorded the lowest values for both NDVI and NDBI. This outcome was similar to the works of [38, 102] in Abuja (Nigeria); Florence and Naples (Italy) respectively.

The highest temperature values were recorded in the urban/built-up areas because these areas store more solar energy, as was also observed by (Adeyeri et al., 2017). Agricultural and forestlands reflect a moderate amount of solar radiation and play a vital role in the modification of land surface temperature (Adeyeri et al., 2017; Li et al., 2016). Therefore, it was expected that the temperature around the south-western boundary of Accra covered by water produced the least temperature for the city-level evaluation of LST due to a higher reflection of radiation [38, 40, 42, 105]. Temperature values in 1986 were the least whilst the highest values were recorded in 2022. However, LST in 2003 was higher than in 2015. This was quite surprising and could be attributed to the different months during which the satellite images of 2003 and 2015 were taken. The satellite images of 1986 2015 and 2022 were taken in December, whilst that of 2003 was taken in February.

The results imply that there could be ineffective and inefficient ways of buffering environmental temperature throughout the study period. The reduction of the agricultural and forestlands without proper planning to incorporate the various LULC types may have generally led to decreased green areas (agricultural and forestlands) as NDVI decreased throughout the study. It is therefore important for city planners to incorporate practical environmental sustainability policies. When these policies are not implemented, there could be an increase in land surface temperature which might affect human lives and other living organisms in the environment.

4.5 Correlation analysis

The correlation analysis also revealed similar patterns as the relationships between LST and NDBI, which was positive. This outcome is comparable to the works of [106, 107]. The relationship may be described as stronger. However, the correlation between LST and NDVI was negative. Similar correlation patterns were revealed in the works of [102, 108] and illustrated that NDVI has a negative correlation with LST. The correlation between NDVI and NDBI was negative throughout the study years and could be described as strong. This is also similar to earlier works [40, 109].

Therefore, it is imperative to protect the forestlands, especially the Achimota forest in Accra, by the Forest Commission as further encroachment into the forest reserve in Accra could further increase and decrease the correlation between LST/NDBI and LST/NDVI, respectively. This could have adverse effects on the balance of the provision of ecosystem services.

4.6 Ordinary least squares and geographical weighted regression

The relationship between LST and (NDBI/NDVI) was assessed. The outcome of OLS regression revealed a significant positive correlation between LST and NDBI, whilst a significant negative correlation was revealed between LST and NDVI. This outcome agreed with the negative linear relationship between mean LST and NDVI values in the works of [45, 110,111,112]. The variable NDBI had a larger influence on LST prediction based on the equations for OLS regression. This corroborated with the findings of [45, 47] that concluded that the most influential biophysical parameter of LST variation was NDBI. The inland water environment of Accra yielded a negative correlation between LST and NDVI and was comparable to the findings of the work of [46]. This outcome was in line with the findings related to the UHI effect, in Quanzhou City [113] and some megacities of Southeast Asia [114]. These initial works indicated a significant positive correlation between mean LST and the density of impervious surfaces, whilst this correlation was negative for green areas [115].

It can be argued that the GWR model’s prediction accuracy was better than OLS as the works of [50, 102] revealed similar results compared to this work. It was found that GWR performed better, and it is attributed to the fact that there was the inclusion of the spatial component as emphasised in the research of [116].

The findings from the regression analysis between LST and (NDVI/NDBI) will serve as a base study for further research by the scientific community, especially for related studies in Ghana and other tropical countries. The regression analysis also revealed that more attention had been given to the spatial urban expansion of urban/built-up areas without consideration for green areas (agricultural and forestlands). This could lead to exposure to high radiation from the sun, which may not be well for the general well-being of humans and other mammals in urban/built-up areas.

5 Conclusions

The Accra Metropolis was used as the test site to address the relationship between spatial urban expansion on temperature by integrating RS, temperature observations and GIS modelling approaches. The limited cloud-free Landsat series of sensors (TM, ETM, and OLI) and temperature observations were mainly employed for this research. The LULC classification scheme grouped the study area into urban/built-up areas, agricultural, forestlands and the water environment. The classified images of all the study areas revealed that urban/built-up areas dominated the other LULC classes throughout the timeframe of the study.

The techniques used in tracking spatial urban expansion (i.e. spatial urban expansion in sub-metropolitan zones, buffer (500 m), density of spatial urban expansion (density decay curve, correlation between density of spatial urban expansion and distance from a major road) revealed that there was an imbalance for lands for spatial urban expansion (housing units, infrastructural projects, and economic activities) on one side and the preservation of forested areas on the other).

Another vital part of this research was on the intense impervious surfaces occurring that have increased urban temperature. The relationship was addressed in this research to ascertain the alterations in temperature and urban expansion effects on temperature. The satellite images were employed to retrieve the RS indices and Land Surface Temperature. The RS indices and the statistical approaches were useful for the assessment of the spatiotemporal distribution of temperature. The OLS Regression and the GWR yielded satisfactory results in modelling LST with (NDBI and NDVI). It was found that the urban/ built-up areas accelerated temperature and that is in sharp contrast with agricultural, forestlands and the water environment that decreased temperature regarding the developed equations (11-13). The study proved beneficial for the investigation of the adverse effects of urban expansion on the local temperature.

The interplay of the RS indices revealed rudimentary relationships among them. The correlation analysis between the extracted land surface temperature and NDVI revealed a negative correlation and a positive relationship between LST and NDBI. It aided in showcasing the roles LULC changes most notably urban expansion plays in the alteration of the urban temperature.

Decision-makers can utilise this information to evaluate the effectiveness of existing land use policies and regulations in managing urbanisation impacts on temperature. It can help identify areas where land use practices have contributed to temperature increases and guide the development of targeted interventions to promote sustainable land use planning and design. The findings of this research may guide the implementation of green infrastructure, urban greening initiatives, and the promotion of climate-responsive urban design to enhance thermal comfort, reduce energy consumption, and improve overall liveability.

This study, therefore, concludes that there is a causal relationship between spatial urban expansion and temperature. The limitation of this study included the inability to obtain very high-resolution satellites for the study period. Again, the same interval of years for the study was not possible. It is recommended that Ghana should be decentralised to decrease the influx of people into Accra and also forestlands should be protected in the metropolis. It is therefore imperative that policymakers and land use planners in the Accra Metropolis should encourage urban forestry.

Future scientific investigations could enhance their findings by delving into additional factors with the potential to exert an influence on urban climate, population density and the implementation of land use policies. Moreover, subsequent works may be directed towards examining the ecological and social ramifications associated with land use transformations, aiming to identify effective strategies for ameliorating the adverse effects stemming from urbanisation. Such studies may prioritise the mitigation of the adverse consequences of spatial urban expansion on temperature. Future guidelines may include the development of an environmental monitoring protocol that could be monthly, seasonal or yearly utilising multispectral or hyperspectral satellite images.

The integration of RS and GIS provided this research with gainful insights into spatiotemporal patterns resulting from urbanisation. It was an efficient technique used in tracking urban expansion and accessing its relationship with temperature using Landsat satellite imageries.