1 Introduction

The rapid urbanization trend in Sub-Saharan Africa as seen in recent decades is expected to continue in the coming years due to increasing disparities between rural and urban environments, as well as climate change induced hazards, that force rural to urban migration (Barrios et al. 2006; Nawrotzki et al. 2017). Such fast and uncontrolled urban growth can lead to a very heterogeneous urban landscape with wealthy districts characterized by modern houses, big green gardens and asphalted roads, but also with informal settlements usually composed of densely built shacks made of corrugated metal sheets that are only accessible via small murrum alleys, typical unpaved but tamped roads (Vermeiren et al. 2012; Hemerijckx et al. 2020). In an observational study, Scott et al. (2017) demonstrated that such heterogeneous urban landscape results in intra-urban temperature variations in the city of Nairobi, Kenya, highlighting the presence of hotter air in informal settlements and suggesting heterogeneous hazards and vulnerability to urban heat in Sub-Saharan African cities.

The heat island effect of cities has been extensively studied worldwide, dating back to the early nineteenth century with pioneering studies by, e.g., Howard (1833), Renou (1868) and Hann (1885) (for a complete overview of classic urban heat island studies, see Stewart (2019)). Nevertheless, according to Stewart (2011a, 2011b), less than 5% of the 177 urban climate studies he reviewed focused on African cities—most likely due to the data scarcity in the region (Roth 2007; Acuto and Parnell 2016). Most of them were based on mobile or stationary in situ observations (Louw and Meyer 1965; Nakamura 1966; Gogh 1979; Okoola 1979; Okpara 2002; Jonsson et al. 2004; Offerle et al. 2005; Ndetto and Matzarakis 2013; 2015). Yet, the additional urban heat can affect (i) human health by increasing extreme heat and drought probability and by facilitating the transmission of some diseases (Frumkin 2002; Coccolo et al. 2016; Wouters et al. 2017); (ii) energy consumption, by increasing cooling demand and reducing heating needs (Martilli 2014; Santamouris et al. 2015; Ramon et al. 2020); and (iii) urban biodiversity (Beninde et al. 2015). Hence, more African urban climate studies are needed.

One of the first steps to be taken for coping with the regional data scarcity is to provide urban climate scientists with detailed urban canopy parameters (UCPs)—referring to parameters describing the urban characteristics between the surface and the highest roof level, typically varying at a local length scale of hundreds of meters to several kilometres (Oke et al. 2017). Such detailed UCPs are needed for a realistic representation of cities, which helps understanding the relations between the urban characteristics and the resulting urban climates (Stewart and Oke 2012). They are also required by urban climate models where they lead to an improved performance (Demuzere et al. 2017; Hammerberg et al. 2018; Ching et al. 2018; Varentsov et al. 2020). However, obtaining such detailed UCPs is a labour-intensive work that normally requires substantial technical and financial efforts Muller et al. (2013, 2015), Kamusoko (2017) and Ching et al. (2018). Information about building morphology, typology or materials is therefore rarely available for developing African cities, hence calling for cost-friendly and easily applicable data gathering methods (Acuto and Parnell 2016). Great effort has been made through the development of Local Climate Zones (LCZ) as a universal land use/cover classification for urban climate studies to provide a range of suitable UCPs for built and natural environments (Stewart and Oke 2012). The UCPs are not site-specific, but meant to be generally applicable to cities worldwide, having been sourced globally from in situ measurements, modelling studies, existing land-cover classifications, and urban climate literature reviews. Ultimately, the UCPs are placed into lookup tables for a variety of uses (e.g. Ching et al. 2018). However, for cities in Sub-Saharan Africa, and more particularly in Central and Eastern Africa, the representativity of these universal parameters remains unclear. Therefore, further investigation is needed to specify their values to the context of Central and Eastern Africa.

Brousse et al. (2019, 2020) already used these universal UCPs over the city of Kampala, Uganda. In particular, they demonstrated that recently developed open-source techniques can already provide morphological UCPs for the TERRA_URB urban parametrization scheme embedded in the COSMO-CLM regional climate model (Rockel et al. 2008; Wouters et al. 2016). This was done by deriving spatially varying UCPs out of the LCZ map of Kampala (Brousse et al. 2019), following the World Urban Database and Access Portal Tools (WUDAPT) framework (Ching et al. 2018). But, as explained before, site-specific information might also be of added value for this East-African city. For example, site-specific morphological UCPs could be retrieved directly from remote sensing techniques, and climate-relevant radiative and thermal properties for a variety of building materials could be obtained using in situ, airborne or laboratory techniques (Herold et al. 2004; Nasarudin and Shafri 2011; Loridan and Grimmond 2012; Kotthaus et al. 2014). Yet, gathering site-specific information is best done by a visit to the field sites in person (Stewart and Oke 2012).

Therefore, the primary goal of this study is to compare universal UCPs with site-specific field-derived UCPs for the city of Kampala, taken as a case study. To this end, we describe the fieldwork methodology for efficiently gathering relevant parameters at low cost that best represent the urban heterogeneity in terms of morphology and building materials for urban climate studies. Moreover, we elaborate a framework for the extrapolation of the field data using LCZ, and show the possibility to upscale the field data to a grid at the kilometre scale. To test the applicability of our field data and upscaled fields, a climate model is run for the short dry season of 2017–2018 and (near-) surface temperatures are evaluated against remotely sensed and in situ observations.

2 Methods

The methodology is divided into four parts. First, the general procedure to derive universal UCPs is described, explaining both the WUDAPT initiative and the LCZ framework. Second, we describe the details of the fieldwork, resulting in a large dataset exceeding 1300 measuring points. Third, we explain how this data can be used (i) to obtain an alternative fieldwork-derived LCZ map and (ii) to derive site-specific UCP values. Fourth, we exemplify how to use this site-specific fieldwork-derived data for urban climate modelling.

2.1 Procedure for universal urban canopy parameters

The World Urban Database and Access Protal Tools (WUDAPT, Ching et al. (2018)) project is a community effort aiming at acquiring climate-relevant information on cities worldwide and developing tools to support climate research. An important part of the WUDAPT project relies on the use of the universal LCZ classification (Stewart and Oke 2012) that distinguishes typical urban landscapes that are known to have recognizable effects on and interactions with the local climates. Therefore, the LCZ classification is divided in 7 natural and 10 built classes (Stewart and Oke 2012) that offer universal ranges of parameters representing the urban environment in terms of impervious coverage, building density and height, anthropogenic heat fluxes and heat storage capacities. Former studies already demonstrated that the derivation of some UCPs through the WUDAPT framework are very helpful for urban climate modelling, though usually implementing morphological parameters only (e.g. Brousse et al. 2016, Alexander et al. 2016, Verdonck et al. 2017, Hammerberg et al. 2018, Wong et al. 2019, Zonato et al. 2020). This UCP derivation typically consists of associating the mean value of the universal ranges of Stewart and Oke (2012) to each built LCZ that is present in the city. These ranges were defined based on data gathered for cities worldwide (Stewart and Oke 2012). Yet, local climate work in tropical African cities could potentially benefit from additional measurements of site-specific UCPs.

The WUDAPT framework offers an innovative open-source method for mapping LCZ globally. Following this framework, the first step for obtaining an LCZ map consists of manually creating polygons in Google Earth that exemplify each LCZ type present in the selected domain. These polygons then serve as a training set for a random forest classifier—implemented in the SAGA software (Breiman 2001; Bechtel et al. 2015)—for mapping a predefined domain out of Landsat 8 images, possibly combined with other satellite sources retrieved from the Google Earth Engine (Gorelick et al. 2017). In addition, accuracy measures of the LCZ map are obtained by repeatedly creating the LCZ map with a random sample of 70% of the training data, while the remaining 30% is used for validation. This bootstrap procedure is repeated 25 times, ultimately providing accuracies per LCZ class as well as overall accuracy measures Bechtel et al. (2017, 2020). The LCZ mapping approach has been used to map Europe and the Continental United States Demuzere et al. (2019a, 2020) in the form of LCZ. Brousse et al. (2019, 2020) also applied this procedure for mapping the city of Kampala.

2.2 Fieldwork

The LCZ map and corresponding universal UCP values produced by this generic procedure will be compared to site-specific in situ measurements, to test how the latter can improve the physical representation of African cities. Therefore, a fieldwork exercise has been set up. In particular, for a well-considered sample of points, quantitative measurements of building heights and canyon widths are taken, and estimates are made by two independent observers for material fractions of roads, walls and roofs.

2.2.1 Preparation

Multiple locations within Kampala are selected for sampling the heterogeneous city during the field work, considering three main criteria. First, the LCZ map for Kampala from Brousse et al. (2019) is used as an initial guideline. The fraction of measuring points per LCZ is chosen proportional to the area fraction of the LCZ map, with approximately 1% compact mid-rise (LCZ 2), 17% compact low-rise (LCZ 3), 77% open low-rise (LCZ 6), 3% informal settlements (LCZ 7) and 2% large low-rise (LCZ 8). Second, measuring points are selected along short transects crossing parts of the city, making the work logistically feasible. Third, the accessibility and safety of each of the mapped transects is approved by local guides. The total distance of each transect is aimed at 3 to 5 km and includes an observation every 120 m. This length scale is a compromise between capturing the heterogeneity and maximizing the number of measurements per walkable distance, and follows the optimal horizontal resolution of 100 m proposed by Bechtel et al. (2015) for digital LCZ mapping. Transects are drawn in Google Earth (GE) and transferred to Basecamp software that allows uploading the route to a Global Positioning System (GPS) device. This resulted in a total sample of more than 1300 measurement locations across the city (Fig. 1, the raw data with the exact locations is freely available in Van de Walle et al. (2020a)).

Fig. 1
figure 1

Google Earth image of the evaluation domain, centred around the city of Kampala, Uganda, indicated by the red box on the inset figure, while the bigger green box presents the simulation domain of the high-resolution regional climate model. The transects along which the measurements are gathered are indicated in red

2.2.2 In the field

Fig. 2
figure 2

Examples of some measuring locations in different LCZ environments throughout the city, seen from above (first column) and from eye height (second and third columns). The former image is obtained from Google Earth, with the observation location centred and the image edges set equal to 2 km. The latter pictures are taken during the fieldwork, showing two buildings at each side of the road. They are labelled with their measurement date by day and month, followed by the measurement number and the side of the road (L/R) relative to our walking direction. This labelling is consistent with the raw dataset

Two teams are assembled to survey both sides of the street canyon. Each team consists of at least three members taking care of specific tasks. First, the straight transects on the GPS have to be followed as close as possible, taking either big roads or small alleys and avoiding dead ends. The actual transects are saved using the Locus smartphone application (Fig. 1). Second, at each stop the two closest buildings are selected at both sides of the street canyon. In theory, these two buildings should be representative of the local-scale environment. In practice, buildings can vary at much smaller scales and the predefined buildings at fixed distances can thus result in unrepresentative buildings. However, we can rely on the large sample of measurements to assume that the derived statistics do represent the variety within one LCZ environment. Several examples of such stop locations are shown together with images viewed from Google Earth (Fig. 2). When the buildings are selected, quantitative measurements related to those buildings are obtained using a Nikon Forestry Pro laser distance meter. The measurements include building height, road width and distance from the road to the building, measured perpendicular to the road (Fig. 3). Road width and distances from the road to the building at both sides of the street canyon are summed to obtain the canyon width. Together with the stop location and timing, these measurements are entered in an Open Data Kit (ODK-Collect) smartphone application. Third, two independent observers make estimates for material fractions of the street, the wall and the roof of the two selected buildings. For simplicity, only three major material categories are considered for this study: asphalt, earthenware and metal. The asphalt group includes road asphalt, asphalted roofs and concrete walls. In the earthenware group, murrum roads as well as red bricks for walls and tiles for roofs are included, all originating from compressed earth. Corrugated metal is an important building material in Kampala, particularly for roofs. The resulting information from the estimations of both observers is collected in the ODK-Collect app, each specifying fractions for the parameters road_asphalt, road_murrum, wall_concrete, wall_brick, wall_metal, roof_asphalt, roof_tiles and roof_metal. Despite the fact that the estimates are prone to the observer’s interpretation, the estimates of both independent observers are checked afterwards. For very different results (percentages off by more than 20%), pictures are consulted if available to check for errors, possibly leading to the removal of the data at this location. For similar results, the mean of both estimates is computed. All collected information is schematically represented in Fig. 3 and online available (Van de Walle et al. 2020a).

Fig. 3
figure 3

Overview of the measurements taken during each stop. Material fractions (abbreviated as Mat Fr, consisting of metal, asphalt or earthenware) for roof, wall and street are estimated separately by two independent observers. A laser distance meter is used to measure building height and canyon width. Note that roof area fraction information is added afterwards from Open Street Map and Google Earth calculations

2.3 Processing the fieldwork data using local climate zones

The fieldwork dataset includes morphological parameters and building material fractions for each of the 1312 measuring points. The use of this data is twofold. First, the LCZ map from Brousse et al. (2019) is upgraded by supplementing fieldwork data into the default WUDAPT LCZ mapping procedure. Second, site-specific UCP values are derived from the data for each LCZ separately.

2.3.1 Fieldwork-derived local climate zone map

The detailed information gathered during the fieldwork are used as training data for the built LCZ, supplemented with the natural LCZ training areas from Brousse et al. (2019). Hence, a built LCZ label is assigned to each fieldwork point, based on (i) measurements of the building height, (ii) the roof fraction derived from both Open Street Map and Google Earth and (iii) ground knowledge about easily recognizable large low-rise environments. In particular, low-rise and mid-rise classes are separated at 10 m building height, while open and compact classes are differentiated by 40% roof fraction. Very small (building height < 4 m) and very compact (roof fraction > 60%) environments are classified as LCZ 7. Large low-rise (LCZ 8) environments have building heights below 10 m and typical roof fractions within 30–50%. The parameter values that are used to assign the LCZ classes are based on the definitions of Stewart and Oke (2012) and provided in Table 1. Because the characteristics for LCZ 8 partly overlap with the compact and open low-rise classes, an objective differentiation based on raw measurements is impossible. However, the large low-rise (LCZ 8) class is easily recognizable in the field as it consists of light industrial as well as commercial buildings, mainly used as storage facilities. All buildings of this LCZ 8 class take up a large area and are well separated by big roads. This allows us to easily classify them in the field. Once all points are classified, a circle around each point builds the urban training polygons. A radius of 50 m is chosen to avoid overlap between different polygons. However, two measurements at both sides of the road result in two LCZ labels per training circle. Because training areas have to be uniquely labelled, conflicting labels are manually removed from the training set. It is important to note, however, that this removal does not suggest unreliable measurements, as some streets indeed form the boundary between different LCZ neighbourhoods.

Table 1 Morphological requirements for building height H and roof fraction R that define the built LCZ of each measurement. The values in the third and fifth column follow (Stewart and Oke 2012) (abbreviated as SO12), while our definitions (columns four and six) only consider an upper or lower boundary, sufficient to distinguish the different classes from each other. Large low-rise (LCZ 8) are easily recognizable, and therefore distinguished from the other classes subjectively in the field

This new LCZ training data set then feeds into the random forest classifier implemented in Google’s Earth Engine as prescribed by the adaptive WUDAPT protocol Demuzere et al. (2019a, 2019b). By using the same earth observation input features used in Brousse et al. (2019) for the random forest classifier (composites of Landsat 8 bands, Sentinel 1 backscatter, minimum and maximum Normalized Difference Vegetation Index from Landsat 8, and the Normalized Difference Urban Index as given by Zhang et al. (2015)) the simulation domain is mapped into LCZ at 100 m horizontal resolution.

2.3.2 Site-specific urban canopy parameters

The fieldwork data set includes building height, road width and building-to-road measurements per point; thus, UCP values for H and \(\frac {h}{w_{c}}\) can be derived. Moreover, roof fraction R per measurement location is calculated from Open Street Map and Google Earth. Additionally, radiative and thermal UCP are derived from the estimates of average material fractions (χ) per building combined with a lookup table of the material properties from Jackson et al. (2010) (Table 2). This is done separately for streets, walls and roofs:

$$ \phi_{street/wall/roof} = \underset{i}{\sum} \chi_{i} \phi_{i}, $$
(1)

with ϕ representing the four radiative and thermal UCPs (α, 𝜖, Cv and λ), while i loops over the different material categories: metal, asphalt and earthenware. The resulting road, wall and roof information are then combined to form bulk UCPs, following (Wouters et al. 2016):

$$ \begin{array}{@{}rcl@{}} \phi_{\text{rad, bulk}} & =& (1-R) \frac{\phi_{street} + 2 \frac{h}{w_{c}} \phi_{wall}}{1+2\frac{h}{w_{c}}} \exp{\left( -0.6 \frac{h}{w_{c}} \right)} \\&&+ R \phi_{roof} \end{array} $$
(2)
$$ \begin{array}{@{}rcl@{}} \phi_{\text{th, bulk}} & =& (1-R) \left( \phi_{street} + 2\frac{h}{w_{c}} \phi_{wall} \right) + R \phi_{roof}, \end{array} $$
(3)

with Eq. 2 being valid for both α and αth = 1 − 𝜖, and Eq. 3 being valid for both Cv and λ. The ISA estimates from the fieldwork were unusable, since large inconsistencies were detected between estimates from different independent observers. Also remote sensing products are too uncertain over the region at such high resolution (Kaspersen et al. 2015; De Colstoun et al. 2017). Therefore this UCP is derived following the standard WUDAPT protocol, and are taken as the median value per LCZ class suggested by Stewart and Oke (2012).

Table 2 Radiative and thermal material properties: albedo (α), emissivity (𝜖), volumetric heat capacity (Cv) and heat conductivity (λ) for three material categories. From: Jackson et al. (2010)

2.3.3 High-resolution local climate zone and urban canopy parameter maps

Estimates for each climate-relevant bulk UCP are available per measurement location, to which also an LCZ class has been assigned. Combining this information produces UCP distributions per LCZ, from which the mean is taken as representative value. Each UCP is then extrapolated to the model domain of interest by applying the mean values to the corresponding classes of the fieldwork-derived LCZ map, resulting in UCP maps at 100 m resolution. These site-specific high-resolution UCP maps can be considered as a basis input for many urban climate studies. As an example, we show the application to the COSMO-CLM regional climate model including the TERRA_URB scheme.

2.4 Urban climate modelling

2.4.1 Regional climate model

To demonstrate the effects of site-specific urban surface information on the local climate, the regional climate model COSMO-CLM 5.0 (Rockel et al. 2008) is used. In the dynamical core of the model, prognostic variables are calculated on an Arakawa C-grid by numerically solving the set of primitive equations (Doms et al. 2011). Furthermore, the dynamical core of the model is fed with source and sink terms from physical parametrization schemes accounting for subgrid-scale processes that describe radiative transfer (Ritter and Geleyn 1992), turbulence (Mironov and Raschendorfer 2001), microphysical cloud processes as well as shallow convection (Tiedtke 1989). All surface–atmosphere interactions are parametrized by one-dimensional parametrization schemes: lake processes by FLake (Mironov et al. 2010), land processes by TERRA-ML (Schulz et al. 2015), and urban–atmosphere interactions by TERRA_URB (Wouters et al. (2016), see Section 2.4.2).

The COSMO-CLM model with FLake coupled has been extensively used in the Lake Victoria region, first by Thiery et al. (2015) at 7 km resolution, after extensive offline testing of FLake Thiery et al. (2014b, 2014a). Recently, a multi-year convection-permitting (\(\sim 2.8\) km) simulation was performed, focusing on the complex effects of the lake, synoptic wind and mountains on the regional climate (Van de Walle et al. 2020b). The first performance tests of COSMO-CLM including TERRA_URB in this region was accomplished by Brousse et al. (2020), simulating at 1 km grid spacing over the city of Kampala.

Here, the model is used with an identical configuration as described in Brousse et al. (2020), based on its tropical setup (Panitz et al. 2014) with an extension of the soil-moisture heat conductivity formulation (Schulz and Vogel 2020). The latter represents vegetation in the surface energy balance, thereby introducing a temperature for the vegetation layer and accounting for insulating effects by the vegetation in the default surface land parametrization scheme TERRA-ML (Schulz et al. 2015). A three-step downscaling strategy is applied from ERA-Interim (Dee et al. 2011) over coarse-resolution (12 km) nesting step to a high-resolution (1 km) domain centred on the city of Kampala. The coarse-resolution multi-year simulation is available from January 2005 to February 2018, but only a short period has been further downscaled to 1 km, in particular the three dry season months from December 2017 to February 2018 (Brousse et al. 2020).

2.4.2 Urban parametrization scheme

The urban parametrization scheme TERRA_URB (Wouters et al. 2016) supplements the default TERRA-ML surface scheme (Schulz et al. 2015). It improves the city representation by indirectly accounting for shadowing and multiple scattering of radiation, heterogeneous surface–atmospheric interaction in terms of turbulent momentum, heat and moisture transport and the inner-building energy budget. Instead of explicitly calculating these processes, a Semi-empirical URban canopy dependencY algorithm (SURY; Wouters et al. (2016)) condenses three-dimensional canopy information to a limited number of bulk properties, thereby strongly reducing the computational cost without loss of performance Wouters et al. (2016, 2017), Demuzere et al. (2017). The TERRA_URB urban parametrization scheme is fed with the following morphological, thermal and radiative UCPs in the form of surface bulk input fields at the climate model’s resolution: impervious surface area fraction (ISA), building height (H), roof area fraction (R), height-to-width ratio (\(\frac {h}{w_{c}}\)), volumetric heat capacity (Cv), heat conductivity (λ), shortwave albedo (α) and thermal albedo (αth). The latter is defined as one minus thermal emissivity (αth= 1-𝜖).

2.4.3 Implementation of site-specific fieldwork-derived urban canopy parameters

All these UCPs are available from the fieldwork data. However, to deliver the fieldwork data in the form of surface bulk input fields to the urban parameterization scheme TERRA_URB, an additional step is required: the remapping of the high-resolution UCP maps to the coarser grid of the climate model (1 km). For ISA, the high-resolution pixels (100 m) are averaged over an area of 1 km while being remapped to the coarser grid. This is called an Area-weighting. For other UCPs however, an ISA-weighted mean is applied, ensuring the conservation of the city characteristics after remapping to coarser resolution:

$$ \mathit{UCP}_{1\text{km}} = \frac{{\sum}_{\text{j} \in 1\text{km}} \mathit{UCP}_{100\text{m,j}} \mathit{ISA}_{100\text{m,j}}}{{\sum}_{\text{j} \in 1\text{km}} \mathit{ISA}_{100\text{m,j}}} , $$
(4)

where j selects each 100 m resolution cell within the coarse 1 km pixel. This aggregation methodology has been applied by Varentsov et al. (2020), and is formalized in a freely available online tool (https://github.com/matthiasdemuzere/WUDAPT-to-COSMO).

2.4.4 Model experiments and evaluation

To assess the impacts of the site-specific data on the modelled surface temperature (Ts) and near-surface air temperature (T2m), two regional climate simulations are performed. The input fields for the first control simulation (CTL) are derived following the standard WUDAPT protocol. Except for the Area-weighted and the ISA-weighted remapping, the approach is identical to the one described by Brousse et al. (2020), that used a conservative remapping of second-order (Jones 1999) to upscale UCPs. A summary of the data flow is given in the upper part of Fig. 4. The data flow resulting in the input fields for the second, site-specific simulation (FW) was described in Section 2.3 and is schematically summarized in the lower half of Fig. 4.

Fig. 4
figure 4

Data flow chart for both control (CTL) and fieldwork (FW) simulations. The CTL procedure follows the standard WUDAPT protocol: the user starts creating an LCZ training set that exemplifies LCZ areas based on Google Earth (GE) imagery. The training set serves the random forest algorithm (implemented as a Google Earth Engine script) that extrapolates the training set to the model study domain. The resulting LCZ map has a resolution of 100 m. Next, the user derives urban canopy parameter (UCP) values, for instance from UCP ranges per LCZ proposed by Stewart and Oke (2012), or from city-invariant, fixed values proposed by Wouters et al. (2016). Combining the LCZ map and UCP values per LCZ results in UCP maps at 100 m resolution, which are then remapped to a coarser resolution to match the climate model grid, here at 1 km resolution. These UCP maps contain all surface information necessary for the urban parametrization scheme of the climate model (TERRA_URB embedded in COSMO-CLM, abbreviated as CCLM-TU). The input, and therefore also the FW procedure, differs in many aspects. Building height H and height-to-width ratio \(\frac {h}{w_{c}}\) are gathered during site-specific fieldwork measurements, as well as estimates of material fractions. Roof fraction R is derived from Open Street Map (OSM) and GE, and the value for ISA is derived following the standard procedure. Derivations for morphological (UCPmor) and radiative and thermal urban canopy parameters (UCPthr) are different. The text box colours refer to figures discussed later. Red: both LCZ maps are compared in Fig. 5. Blue: UCP values per LCZ are shown in Fig. 6. Green: the UCP maps serving as input to the climate model are summarized in Fig. 7, while Fig. S3 shows the UCP distributions. They are discussed in Section 3.1, Sections 3.2 and 3.3. Orange: climate comparisons are provided in Figs. 8 and 9. These results build Section 3.4

Simulation results for surface temperature are evaluated against the Moderate Resolution Imaging Spectroradiometer (MODIS) data from the polar-orbiting TERRA and AQUA satellites (Wan et al. 2004; Wan 2007). Each satellite overpasses twice a day over Kampala around 10:30 local time (TERRA) and 13:30 (AQUA) during daytime, and around 22:30 (TERRA) and 01:30 (AQUA) during nighttime. The high horizontal resolution (\(\sim \)1 km) is in good agreement with the simulation resolution. Differences between the CTL and FW simulation are assessed over the CTL urban extent (ISA> 0.1). Furthermore, near-surface temperatures are evaluated against nine Trans-African Hydro-Meteorological Observatory (TAHMO, van de Giesen et al. (2014)) automatic weather stations located within a radius of 100 km around the centre of Kampala. We note that all stations except Entebbe and Makerere are located outside the CTL urban extent. Due to this sparse network, the evaluation only applies to the overall model performance, not to intra-urban variations.

3 Results

3.1 Spatial distribution of local climate zones

The LCZ map derived following the generic WUDAPT protocol (named CTL) and the map derived directly from our fieldwork (named FW) are compared. Visual inspection of both maps reveals clear differences (Fig. 5a and b). First, the overall urban extent, defined as ISA> 0.1 and shown by the black contour line, is lower in FW. Second, the fraction of compact built LCZ classes types is reduced and replaced by open low-rise (LCZ 6). All similarities and changes are quantified in the comparison matrix (Fig. 5c). As an example on the interpretation of this matrix, we consider the upper left cells. Only 21% of compact mid-rise (LCZ 2) pixels in CTL close to the city centre are similarly classified in FW, while 35% has changed to compact low-rise (LCZ 3). Over three quarter of the compact low-rise areas (LCZ 3) is reclassified as open low-rise (LCZ 6). Besides this reclassification of compact low-rise (LCZ 3), 14% of LCZ 2 in the central business district, 68% of the informal settlements (LCZ 7) and 59% of the large low-rise (LCZ 8) are changed to LCZ 6 in the FW map. The overall reduction of urban extent as seen in the maps, also appears in this comparison matrix, as one-third of the open low-rise areas (LCZ 6) changes to vegetated areas with low plants (LCZ D). The informal settlements of CTL have almost completely been reclassified, as only 4% of the LCZ 7 pixels are identical in both maps.

Fig. 5
figure 5

Comparison of the LCZ maps from (a) CTL and (b) FW over the evaluation domain. The black contour lines show the urban extension for both maps, defined as ISA> 0.1. (c) Percentage similarity/change for each LCZ class, to be read from CTL (ordinate) to FW (abscissa). Each row in this comparison matrix adds up to (rounded) 100%, except the open mid-rise class (LCZ 5) which was not considered in CTL. The overall accuracies of the map, as well as the accuracies of each class separately are shown in Fig. S1

The substantial changes in both LCZ maps are also depicted in the accuracy measures (Fig. S1). For the central business district, the informal settlements and large low-rise (LCZ 2, LCZ 7 and LCZ 8, respectively), accuracies increase by 10–20%. The open low-rise class (LCZ 6) continues to have the highest accuracy (\(\sim \)0.8). From all built classes, only the accuracy of compact low-rise (LCZ 3) decreases with almost 20%. Training areas for open mid-rise (LCZ 5) were not identified, excluding this class in the CTL map. However, some fieldwork measurement locations were classified as LCZ 5, adding this class to the FW LCZ map. The mean accuracy of approximately 0.1 suggests that the LCZ mapping experiences difficulties in recognizing and distinguishing this class, probably caused by the lack of proper building height data in the classification process (Vandamme et al. 2019). Despite these clear accuracy changes per LCZ, the overall accuracy (OA) is very similar between CTL and FW LCZ maps.

3.2 Urban canopy parameters per local climate zone

The differences between the CTL and FW LCZ maps also alter the UCP values per LCZ class (Fig. 6, for which the means are summarized in Table S1). Except for ISA, distributions of fieldwork measurements are summarized as boxplots per LCZ class. For the morphological UCPs (building height, roof fraction and height-to-width ratio), the first and third quartiles of the distributions are compared against the universal UCP ranges of Stewart and Oke (2012), while the mean is compared against the values derived following the CTL procedure, also applied in Brousse et al. (2020). Even though the morphological UCP values for CTL and FW generally agree very well, mean values for height-to-width ratio substantially differ, especially for the densely built environments (LCZ 2, 3 and 7). These differences can probably be explained by wider site-specific street canyons measured in Kampala compared to the universal values. This result suggests that the LCZ classification in Kampala could potentially benefit from the use of subclasses as introduced by Stewart and Oke (2012).

Fig. 6
figure 6

(a-d) Comparison of site-specific fieldwork (FW) observed urban canyon parameter distributions (UCP, shown as boxplots) with (i) values used in the control (CTL) approach, copied from Brousse et al.(2020, orange filled dots), and (ii) universal ranges from Stewart and Oke (2012, SO12, blue bars). (e-h) Distributions for radiative and thermal UCP are derived from material fraction estimates (Fig. S2) and material properties (Table 2), following Eqs. 12 and 3. FW UCP distributions are compared against the CTL values proposed by Wouters et al. (2016, W16, orange open dots) and applied by (Brousse et al. 2020). The UCP include impervious surface fraction (ISA), building height (H), roof fraction (R), height-to-width ratio (h/wc), albedo (α), emissivity (𝜖), heat capacity (Cv) and conductivity (λ) for the six built LCZ clases compact mid-rise (LCZ 2), compact low-rise (LCZ 3), open mid-rise (LCZ 5), open low-rise (LCZ 6), informal settlements (LCZ 7) and large low-rise (LCZ 8)

For radiative and thermal UCPs, larger differences between CTL and FW are observed, which are analysed considering both material properties (Table 2) and site-specific material fraction estimates (Fig. S2). These material fraction estimates suggest that street canyons in the city centre (LCZ 2) are constructed almost completely out of concrete/asphalt. Asphalted roads are also observed in LCZ 5 and LCZ 8, the open mid-rise and large low-rise areas. Especially in LCZ 3 and LCZ 7, corrugated metal is an abundant material, mainly used for roofs. While earthenware in these classes mainly appears from compressed earth (murrum) roads, the earthenware fraction in LCZ 5 and LCZ 6 is attributed to brick walls and red tile roofing. These fraction estimates are combined following Eqs. 12 and 3, resulting in the radiative and thermal FW distributions (Fig. 6), which are compared against the city-invariant values proposed by Wouters et al. (2016) and applied in Brousse et al. (2020). While albedo and heat capacity are substantially higher in each class, changes in emissivity and thermal conductivity are most pronounced in compact low-rise and informal settlements (LCZ 3 and LCZ 7). In these classes, corrugated metal was shown to be an abundant material in the street canyon and is characterized by a very low emissivity and a very high conductivity (Table 2).

3.3 Spatial distribution of urban canopy parameters

Following Eq. 4, UCP fields are remapped to a coarser resolution grid (Figs. 7 and S3). For the FW-derived results, ISA is lower throughout the city, mainly caused by the changes in LCZ classes: the open low-rise class (LCZ 6) to vegetation types in the suburbs and compact low-rise (LCZ 3) to the more open LCZ 6 neighbourhood in the city (Fig. 5). Decreases in building height, roof fraction and height-to-width ratio are the combined results of changes in LCZ classes and changes of UCP values per LCZ. Systematic changes in the radiative and thermal UCP maps are large. For albedo and heat capacity they even exceed intra-urban variations. Overall, we conclude that the derivation of urban parameters via the LCZ universal parameters (CTL) or via site-specific measurements (FW) results in substantially different UCP input fields.

Fig. 7
figure 7

Urban canopy parameter (UCP) fields at the remapped 1 km resolution grid for both the control (CTL) and fieldwork (FW) approaches. The UCP include impervious surface fraction (ISA), building height (H), roof fraction (R), height-to-width ratio (h/wc), albedo (α), emissivity (𝜖), heat capacity (Cv) and conductivity (λ)

3.4 (Near-)surface temperature modelling

Given the differences in LCZ and UCP maps between the CTL and FW approaches (Figs. 5 and 7), changes in modelled (near-)surface temperatures are expected. Overall, the implementation of site-specific UCPs leads to an overall cooler modelled urban environment (Fig. 8). While Ts is reduced by approximately 1 K on average and down to 2 K in the outskirt of the urban area, urban near-surface air temperatures are reduced by approximately 0.2 K to 0.6 K. Differences are most significant over the outskirts, where open low-rise (LCZ 6) urban pixels in CTL are replaced by vegetation class LCZ D (Fig. 5), reducing the ISA fraction (Fig. 7). On the contrary, temperature differences over the central parts of the city are minor and mostly insignificant.

Fig. 8
figure 8

Hourly averaged modelled surface temperatures (Ts) and near-surface temperatures at 2 m (T2m) of the control simulation (CTL) in the left panels and their respective mean differences between the fieldwork (FW) and the control simulation in the right panels. Both simulations were run for the short dry season from December 2017 to Februari 2018. Significant differences in temperatures at the 5 % confidence level are hatched in green

The overall cooler urban surface leads to an improved performance for Ts of the FW simulation against CTL. In fact, when evaluating the modelled urban surface temperatures against MODIS, both the mean bias and the root mean squared error are significantly reduced, especially during daytime (Table 3). Air temperature at 2 m in the broader Kampala region is evaluated against observations from nine TAHMO weather stations (van de Giesen et al. 2014), resulting in very similar scores between CTL and FW (Table S2). This result is not surprising, as all stations except Entebbe and Makerere are located outside the CTL urban extent, and those two fall outside the region characterized by statistically significant differences between CTL and FW near-surface temperatures (Fig. 8). This evaluation therefore only applies to the overall model performance, not to intra-urban near-surface temperature variability.

Table 3 Mean bias, root mean squared error (RMSE), and Perkins Skill Score (PSS) for simulated surface temperature (Ts) against MODIS observations from both TERRA and AQUA satellites, each overpassing the domain twice a day. Differences between the biases from the two model runs are all significant at the 5 % confidence level. Note that this evaluation is performed over the same evaluation domain as Brousse et al. (2020; Fig. 2 herein), with latitude [0.0,0.6 N] and longitude [32.3 E, 32.9 E]. Hours are in local time

Averaged over the CTL urban extent (ISA> 0.1), surface temperature differences between CTL and FW are statistically significant throughout the day, with largest differences during daytime when FW is up to 2 K cooler (Fig. 9 a,c). Mean near-surface temperature differences are smaller, with FW being \(\sim \)0.2 K cooler than CTL during nighttime and morning, but nearly identical in the afternoon (Fig. 9 b, d).

Fig. 9
figure 9

(a,b) Diurnal cycle of surface and near-surface temperature for both CTL and FW runs. (c,d) Difference of FW against the CTL simulation, averaged over the CTL urban extent (ISA> 0.1). This difference is the combined effect of (i) an updated LCZ map, (ii) the site-specific morphological and (iii) site-specific spatially variable thermal and radiative UCP per LCZ. Statistically significant (5% confidence level) differences between the simulation are indicated per hour by grey stars. (e,f) Diurnal cycles of the FW simulation distinguishing different LCZ. The open mid-rise (LCZ 5) and large low-rise (LCZ 8) had no or too few common pixels between CTL and FW, thus excluded from the figure. (g,h) Differences between FW and CTL per LCZ. Note that only pixels are selected with identical LCZ classes between CTL and FW

Intra-urban temperature differences between CTL and FW are investigated by selecting pixels that share a common LCZ class in both approaches. Shared open low-rise (LCZ 6) locations are clearly cooler than others, during daytime and nighttime for surface temperature, and especially during nighttime for near-surface temperature (Fig. 9 e, f). While not present in the open low-rise (LCZ 6) signal, (near-)surface afternoon temperatures are higher in FW compared to CTL in compact mid-rise (LCZ 2), compact low-rise (LCZ 3) and informal settlements (LCZ 7) (Fig. 9 g, h). In the latter classes, the FW introduces higher values for the thermal properties Cv and λ (Fig. 6), possibly forming a larger and easier accessible heat reservoir. As this reservoir might store more heat before noon and release more heat in the late afternoon, it can explain this particular Ts and T2m signal.

4 Discussion and conclusions

In this study we demonstrated that site-specific field-derived observations of the urban forms and building materials provide a different physical description of the city in comparison to a universal approach. To properly interpret this result, we discuss the advantages and limitations of the data gathering and processing procedures as well as the universal approach, keeping in mind its potential usage in urban climate studies.

First, the measurements taken at fixed stops along transects unavoidably include nonrepresentative locations within one LCZ environment. As an example, the walks also pass big streets through compact LCZ classes. These big streets might thus be included in the site-specific UCP distribution of that class. However, such open areas are not included in the universal UCP ranges (Stewart and Oke 2012), since they do not exemplify the compact zone. This mismatch might partly explain the lower measured height-to-width ratio in densely built environments compared to the universal UCP ranges.

Second, applying a different methodology to obtain a training set can lead to a different resulting LCZ map. Two factors play a role in explaining those differences. On the one hand, the site-specific LCZ training set probably has the highest quality, thanks to the objective and three-dimensional information about the street canyon. This is advantageous over the standard WUDAPT classification procedure that is prone to subjective biases and that uses training polygons based on two-dimensional Google Earth images (Bechtel et al. 2017; Verdonck et al. 2019). On the other hand, the standard WUDAPT procedure is user-friendly and allows the user to quickly produce a large training set, while the site-specific LCZ training set is limited to small areas enclosing the measuring points. Possibly, the optimal LCZ map has to benefit from the advantages of both approaches.

Third, this different LCZ classification has a direct impact on the spatial distribution of the ISA fraction, a very important parameter for urban studies (Weng 2012). Impervious surface has emerged as indicator of the degree of urbanization, but also as indicator of environmental quality (Arnold and Gibbons 1996). Therefore, the uncertainty about the spatial distribution of the ISA fraction has to be reduced. Yet, deriving ISA from fieldwork campaigns faces an important challenge which we experienced during our fieldwork exercise. Two observers independently estimated ISA at each measuring location, but often resulted in very inconsistent values due to the interpretation of the perviousness of unpaved but tamped earth. Therefore, impervious surface fractions could alternatively be retrieved from remote sensing techniques (Weng 2012; Kaspersen et al. 2015; Kamusoko 2017), providing products up to \(\sim \)30 m resolution such as the Global Man-made Impervious Surface (GMIS, De Colstoun et al. 2017). Though the latter is also available for the city of Kampala, they acknowledge the challenge of proper ISA representation in cities with areas of bare soil, similar to our fieldwork experience.

Fourth, the thermal properties of materials are characterized by larger uncertainties than morphological parameters. For example, all urban canyon materials are divided into only three groups (metal, asphalt and earthenware), but more groups would naturally result in more precise thermal characteristics. In addition, some thermal properties are highly sensitive to the exact composition of that material. For painted walls for example, it is not trivial to see whether the underlying material is concrete or bricks. Moreover, material fraction estimates per selected building at each measuring location are prone to individual observer interpretations. Attempting to reduce this subjectivity, two observers estimate independently from each other. More independent estimates or better protocols could lead to more reliable results, but also access to information from building companies or dedicated fieldwork specifically on materials would be beneficial. Nevertheless, our study is the first attempt in tropical Africa to acquire such information on the building materials.

Despite these limitations, our method for deriving site-specific morphological, thermal and radiative UCPs showed its relevance over the data-scarce city of Kampala while using our results in an urban climate modelling exercise. The fieldwork-derived LCZ map and the site-specific parameters resulted in an overall lowering of near-surface temperatures, especially during the day. This result was mainly attributed to a higher vegetation fraction in the city outskirts. The importance of vegetation fraction in determining local temperature variations in tropical African cities was already suggested by observations (Scott et al. 2017). In addition, the new representation of Kampala in the climate model resulted in higher afternoon temperatures in the densely built environments (LCZ 2, 3 and 7) compared to the control run not including any site-specific data. These high temperatures are hypothesized as an effect of more heat storage at noon and more efficient heat transfer from the ground in the afternoon, due to the use of metal as building material.

Three important outcomes follow from this study. First, up-to-date and detailed parameters on the neighbourhood typologies can be obtained by fieldwork measurements, accomplished with a cost-friendly, easily applicable, efficient and practically designed methodology. Second, through the use of LCZ maps, both morphological and thermal parameters can be extrapolated to the entire city, making them particularly relevant for urban climate studies. Though we recognize that the simple averaging per LCZ of the morphological and thermal characteristics that were measured in the field can lead to uncertainties, it already provides a more site-specific influence of the tropical African city of Kampala on the local climate than the universal LCZ parameters. Third, implementing detailed site-specific measurements significantly impacts the modelled urban surface temperature, and to a lesser extent the near-surface air temperature.