Introduction

Urban system models aim to describe the urban system consisting of individuals, society, morphology, and environmental stressors (von Szombathely et al. 2017) with its complex relations to model, for instance, health-related urban well-being. One important aspect of health-related urban well-being is the outdoor activity of urban dwellers. Not only does this positively affect individual health, but also the city as a whole can benefit from outdoor activities in physical, environmental, economical, and social ways (Chen and Ng 2012). To promote outdoor activities, an attractive design of public spaces is needed (Chen and Ng 2012). One aspect thereof is thermal comfort (Chen and Ng 2012). Thermally comfortable designs can be evaluated by thermal indices, since they summarize the effect of the thermal environment on the human body into one value (ASHRAE 2001; Parsons 2014). In total, at least 165 indices have been proposed (de Freitas and Grigorieva 2016). However, not all of them can be applied to assess outdoor thermal comfort because they neglect important parameters such as solar radiation or wind speed.

There are several reviews on the assessment of the thermal environment (Cheng et al. 2012; Djongyang et al. 2010; Goshayeshi et al. 2013; Walgama et al. 2006), including the reviews by Monteiro (2005) and Coccolo et al. (2016) that focus on the outdoor environment. However, no paper reviews the application of thermal indices in combination with obstacle-resolving numerical models of the atmosphere (from here on shorted ORM) such as ENVI-met (Bruse and Team 2015; Bruse and Fleer 1998) or MITRAS (Salim et al. 2018; Schlünzen et al. 2003), although those are increasingly used for urban planning (Goldberg et al. 2013) to assess different design strategies (e.g., implementation of vegetation and building orientation) as suggested by the German Engineering guideline (VDI 2008a).

ORMs are helpful to design public spaces in a comfortable way since different design scenarios can be evaluated for various meteorological conditions. Typically, ORM simulations have a domain size between 0.1 and 5 km with a spatial resolution of 0.1 to 100 m (Blocken 2015). Due to this high resolution, ORMs not only explicitly resolve buildings, but also the thermal conditions on temporal and spatial scales where people actually experience it.

Considering the large number of proposed indices and the characteristics of ORMs, this study addresses the overall question: which thermal indices can be used globally in their current form to evaluate the outdoor thermal environment in ORM applications? The term “in ORM applications” in this paper refers to an evaluation of the thermal situation simulated by the ORM by thermal indices, both online during the simulation and offline using model output. Since ORMs are applicable to design public spaces for urban residents, this study focusses on the typical urban resident and might not be directly applicable to outdoor workers or tourists.

This paper is structured as follows. In the section “Thermal indices and application demands,” the principles of thermal indices are introduced, and the characteristics of human environmental heat exchange of outdoor urban environments and of ORMs are summarized from a literature analysis. Based on those characteristics, criteria for suitable thermal indices are determined. The criteria are evaluated for the 165 indices included in the catalog by de Freitas and Grigorieva (2015) by reviewing their original literature and using the existing literature review by de Freitas and Grigorieva (2016) in the “Results” section. The “Discussion and conclusions” section summarizes and discusses the results and indicates prospects for further research.

Thermal indices and application demands

This section describes the characteristics of human environmental heat exchange and the related concept of thermal indices (“Thermal indices and related definitions”), characteristics of outdoor urban environment (“Outdoor urban environments”), and characteristics of ORMs (“Obstacle-resolving atmospheric models (ORMs)”). Based on these characteristics, criteria for suitable thermal indices in ORM applications are derived along with additional features of suitable indices (“Evaluation procedure for suitable indices”).

Thermal indices and related definitions

Human thermal environments and thermal indices

The human body exchanges heat with its surroundings by different processes: radiation, convection, evaporation, respiration, and, if a significant area is in direct contact with solid material, via conduction (Fiala and Havenith 2015). How much heat is exchanged via the different processes depends on four environmental variables, namely air temperature (T), humidity (H), wind speed (v), and long and shortwave radiation (Q, often summarized in the integrating variable mean radiant temperature, Tmrt), and two human-related factors: activity and clothing. Activity controls the amount of heat produced by the body, and clothing insulation determines the resistance to heat exchange. All six factors together are referred to as the “six basic parameters” (Parsons 2014). The specific combination of the six basic parameters makes up the human thermal environment a person experiences. How a person feels in such a human thermal environment is defined as thermal sensation, e.g., hot, cold, or neutral. Thermal sensation cannot be expressed directly in physical or physiological terms as it is a psychological phenomenon. However, thermal sensations have been shown to correlate with environmental conditions and physiological responses of the human body (Parsons 2014).

A useful technique for the assessment of a thermal environment is the thermal index. The term “thermal index” is rarely defined in literature. Parsons (2014) defines an assessment of the thermal environment as an index, if it maps the factors that influence the human response to thermal environments to a single value that varies with the human response. This definition is applied in the present study.

Categories of thermal indices

Based on the measured human response, indices can be categorized into comfort or stress indices. Thermal comfort is defined as “that condition of mind which expresses satisfaction with the thermal environment” (ASHRAE 2001), whereas thermal stress quantifies the effect of the six basic parameters in terms of thermal strain experienced by the person (Parsons 2014).

Another possibility to categorize thermal indices has been proposed by MacPherson 1962, who discriminates direct, empirical, and rational indices. Direct indices are based on direct measurements of environmental variables, either by using integrated measurement devices, which model a human body, or by combining measured meteorological parameters using an algebraic weighted expression (MacPherson (1962), Eissing (1995)). In contrast, empirical indices are developed by exposing people to different environmental conditions (e.g., in a climate chamber) and measuring physiological parameters such as heart rate or rectal temperature. By means of multiple regression analysis, the different environmental conditions and possibly different clothing and activities are linked to the physiological reactions (MacPherson 1962). The third category, rational indices, formalizes the heat exchange mechanisms of the human body (“Human thermal environments and thermal indices”) to yield the heat balance equation (Eq. 1) of the human body (ASHRAE 2001; VDI 2008b):

$$ M+W+{Q}^{\ast }+{Q}_H+{Q}_L+{Q}_{SW}+{Q}_{Re}+S=0 $$
(1)

where M denotes metabolic heat; W mechanical work accomplished; Q radiation budget; QH, QL, and QSW the turbulent flux of sensible heat, of latent heat by diffusion, and of latent heat by sweat evaporation; QRe the respiratory heat flux (sensible and latent); and S the rate of storage of heat. Individual heat fluxes are calculated from gradients between physiological variables such as skin temperature and environmental variables. The physiological state of a person results from regulation mechanisms in the body in response to the environmental conditions. The regulation mechanisms are simulated with different complexity in one-node, two-node, multi-node, and multi-element models (Cheng et al. 2012). Not only individual differences, such as gender and age (ASHRAE 2001; Rida et al. 2014), but also acclimatization (Froehle 2008) have been noted to influence the physiological thermoregulation. In addition to the thermoregulatory system of the body, people adapt to a stressful environment by changing their behavior (e.g., change in activity or exposure, Jendritzky and de Dear (2009)). Rational indices either refer to equilibrium conditions (S = 0), or to dynamic, transient conditions, or changing activities (S ≠ 0). Out of the three categories of indices, they have the most objective basis, since they are based on the first law of thermodynamics. However, empirical relationships are used to calculate the regulation mechanisms within the body (ASHRAE 2001).

Many indices apply the concept of a standard or reference environment. These thermal indices calculate the air temperature that would result in the “equivalent effect” for a person as the actual environment does, which consists of the six basic parameters (Parsons 2014). What is defined as “equivalent effect” depends on the individual index, e.g., some require the core temperature to be equal in both environments. These so-called equivalent temperatures have the same unit as air temperature and can therefore be understood by laypeople (Höppe 1999).

Assessment scales for indices

A thermal index value itself is not necessarily meaningful, since it depends on the assumptions of the underlying equations. It is not clear, for instance, whether an equilibrium temperature of 10 °C is desirable in terms of thermally optimal design, or a value of 25 °C is better. Therefore, an assessment scale is needed that maps individual index values into categories of similar and generally understood thermal sensations or thermal stresses.

Different types of assessment scales can be identified based on (1) strain reactions of the human body (e.g., Bröde et al. (2012)), (2) regression between accepted scales from climate chambers and index values (e.g., Matzarakis and Mayer (1996)), or (3) regression between thermal sensation votes (denoted TSV in the following) from surveys and index values (e.g., Watanabe et al. (2014)). The first two scales aim to predict the value of the thermal index for a clearly defined reference person who chooses a place freely without specific expectations and before it adapts to this particular thermal environment (Staiger et al. 2012). In contrast, scales derived from TSVs represent the thermal perception after adaptation and include cultural norms and expectations for people attending the place at a specific time without free choice (Staiger et al. 2012). Although TSVs are important to identify regional particularities, they are unsuitable for ORM applications, since they are valid only for the regional climatic context where they have been derived. The standardization initiative of thermal comfort studies (Johansson et al. 2014) may lead to a globally standardized database of TSVs. Those may be dense enough to be used in ORM applications; however, people who are deliberately avoiding the place due to uncomfortable environmental conditions are still not included in the TSVs, and thus TSVs may lead to skewed results (Staiger et al. 2012).

Outdoor urban environments

Outdoor air temperature range

Outdoor thermal environments exhibit a much wider range of environmental parameters than controlled indoor environments (Jendritzky and de Dear 2009). To derive the air temperature range people are exposed to when being outdoors, two data sets have been combined. First, a global data set of observation-based monthly mean 2-m-air-temperature values (T) over land covering the period from 1986 to 2015 (Fan and van den Dool 2008), and second, a global data set for the population count (P) for the year 2000 (Center for International Earth Science Information Network - CIESIN - Columbia University et al. 2005). Both data sets have a resolution of 0.5° × 0.5°. To estimate the air temperature range people are exposed to, an air temperature weighted population distribution is derived by calculating the population exposed (PE, Eq. 2) to a specific 5K-ΔT-range between 1986 and 2015:

$$ PE\ \left({T}_{\mathrm{min}}\right)=\frac{1}{N}\cdotp \left(\sum \limits_{m=1}^M\sum \limits_{i=1}^Nf\left({T}_{\mathrm{min}}\right)\cdotp {P}_i\right) $$
(2)

with

$$ f\left({T}_{\mathrm{min}}\right)=\left\{\begin{array}{l}1,{T}_{\mathrm{min}}\le T<{T}_{\mathrm{min}}+5K\\ {}0,\mathrm{else}\end{array}\right. $$

M is the number of months (m) between January 1986 and December 2015 (M = 360), N is the number of grid cells (index i), and Tmin is varied between − 60 °C and 60 °C in 5 K-steps. For the air temperature data from 1986 to 2015 T lay between − 55.0 °C and 62.6 °C.

Due to slight differences in the land-sea-mask of the two data sets, about two million people (0.03% of the world population) could not be considered in the analysis. Most of them live on islands in the Pacific Ocean.

Figure 1 shows PE for each ΔT-range. Only few people exposed to monthly mean air temperature values below – 25 °C or above 40 °C (less than 0.1% of the world’s population per range). Ninety-five percent of the world population lives in an air temperature range of − 5 °C to 35 °C. The hatched bars in Fig. 1 mark the two ranges enclosing 95% of the population.

Fig. 1
figure 1

Percentage of the world population exposed to a specific 5 K monthly mean 2-m-air-temperature range. To each range below − 25 °C and above 40 °C less than 0.1% of the world’s population is exposed. Gray-colored bars indicate 95% of world population, hatched bars indicated the range containing the accumulated upper and lowermost 2.5% of the world population, and black-colored bars indicate the outer 5% of world population. Basic data have been taken from GHCN Gridded V2 data (Fan and van den Dool 2008) and the Gridded Population of the World dataset, Version 3 (see text)

Radiation fluxes and wind speed

A particular feature of outdoor environments is the presence of direct solar short-wave radiation fluxes. These include direct, diffuse, and reflected radiation fluxes. In an urban environment, long-wave radiation is not only emitted from the sky and the ground, but also from surrounding building walls. These walls, in turn, can shade areas and shield people from direct short-wave radiation.

The mean radiant temperature (Tmrt), which is usually applied to express the effect of radiation (Kantor and Unger 2011), is the most variable parameter within an urban street canyon (Ali-Toudert and Mayer 2006; Chen et al. 2016; Jendritzky et al. 2007; Lee et al. 2016, 2014; Mayer et al. 2008). The second most variable parameter is the wind speed due to drag and advection effects. Radiation and wind are also those parameters that can best be modified for a thermally comfortable design (Barry and Blanken 2016) and affect thermal perception most (Moonen et al. 2012).

Urban residents’ activities and clothing behavior

Urban activities include standing, e.g., while smoking or talking, or walking, e.g., while shopping or commuting. Although activities vary for different types of urban spaces (Thorsson et al. 2007), in the current study, standing and walking are considered as typical urban activities as they reflect the typical behavior outside parks.

Clothing behavior in urban areas has been shown to vary seasonally (Havenith et al. 2011; Nikolopoulou et al. 2001) but within certain limits: Even in hot conditions, a minimum of 0.2 clo (1.0 clo is equivalent to a thermal resistance of clothing of 0.155 m2 K W−1, ASHRAE (2001)) has been observed, which corresponds for instance to short-sleeve shirt and short trousers (de Freitas 1987). Those limits might be due to cultural rules and norms (Knez et al. 2009). Urban clothing behaviors may differ significantly from clothing of beach tourists or workers wearing special protective clothes. Therefore, the indices selected in this study might not be applicable for those groups.

Persistence of outdoor environmental conditions

Today, many urban activities usually take place indoors: In most industrialized countries, people spend about 90% of their time inside buildings (Höppe 2002). Thus, the time spent outdoors is usually too short to achieve thermal equilibrium, especially as people tend to stray between different microclimates (Thorsson et al. 2007). Furthermore, the meteorological conditions are changing: A quasi-steady state, e.g., a state for which the thermal conditions of the body per time unit change only marginally, may be achieved for a certain microclimate within 2 h if the weather is constant, but not only the person might move, usually also the meteorological situation changes in that time (e.g., diurnal cycle). Therefore, an index considering dynamic conditions would be most suitable (“Categories of thermal indices”). However, such an index strongly depends on the thermal history of a person, e.g., exiting from a sauna or from an air-conditioned building. Therefore, to evaluate a certain design, simulations of an ensemble of people with different thermal histories would be required. However, getting that kind of information is difficult, and even then ensemble simulations are computationally intensive. Therefore, although dynamic indices are more realistic, steady state indices offer advantages for urban planning applications.

Obstacle-resolving atmospheric models (ORMs)

Time scales

ORMs simulate thermal and dynamic atmospheric processes by numerically solving partial differential equations for conservation of energy, mass, and momentum. These so-called Navier-Stokes equations cannot be solved directly due to computational limitations (Blocken 2015). Hence, the equations are filtered and approximated. Nowadays, the time and space averaged so-called Reynold-Averaged-Navier-Stokes Equations (RANS) are used for simulating flows within urban areas (Blocken 2015). Those RANS models simulate the temporal mean flows in detail but with a typical time average of 10 to 20 min that mainly results from the parameterization of turbulent motion. The spatial resolution depends on the grid size used. RANS models are applied for studying urban areas (e.g., Ali-Toudert and Mayer 2006; Salim et al. 2015). For specific applications, quality guidelines are established (Franke et al. 2011; VDI 2017).

Input variables for thermal indices

By solving the RANS equations, ORMs simulate the temporal evolution and spatial distribution of several meteorological variables (e.g., air temperature and flow field, Bohnenstengel et al. 2004), which can then serve as an input for thermal indices. For the two human-related factors, clothing insulation and activity, standardized input tables have been established (e.g., ASHRAE 2001) which can be used to derive input values. In contrast, physiological input parameters such as heart rate or rectal temperature would require expert knowledge or a suitable thermophysiological model. Only if such a model exists for a particular index, it can be used in ORM applications.

Calculation of thermal indices in ORM applications

The calculation of thermal indices from ORM outputs requires either a set of equations or a suitable calculation program, since the manual estimation of index values from nomograms or tables is not feasible due to the high number of grid points in ORMs. In the past, several integrated measurement devices have been proposed for a convenient estimation of direct indices (“Categories of thermal indices”). Indices derived from those devices can be used in ORMs if either a methodology to model the device within the ORM or an equation fitted from standard meteorological parameters exists.

Indices can be calculated either on-line during the simulation or off-line using model output. From a physical point of view, an on-line calculation would only be necessary, if the heat released by a person impacts the surrounding atmosphere. Outdoors, a person’s impact on the thermal environment is small because the wind speed is large and the air is often well mixed. Indoors, the impact of persons on the air is commonly larger due to smaller exchange rates of air, and thus on-line coupling is attempted (e.g., Cropper et al. 2010). From a computational point of view, off-line calculation is favorable because the effect of a set of meteorological conditions can be estimated for different personal characteristics without the need to rerun the ORM. However, Buzan et al. (2015) showed for global simulations that infrequent model output can cause an underestimation of thermal stress experienced. To avoid this effect in ORMs, the output needs to be frequent enough to reflect the changing air temperature and wind conditions (e.g., about 20 min). The output might have to be even more frequent to capture changes in meteorology if the ORM is nested (Schlünzen et al. 2011).

Fields of application for ORMs

ORMs are applied for design and performance analysis of building components, pollutant dispersion, and wind and thermal comfort (Moonen et al. 2012). In terms of thermal comfort, various studies assess the impact of different urban features (vegetation, albedo, etc.) or building configurations on the human thermal environment (Jännicke et al. 2015; Lee et al. 2016; Moonen et al. 2012). To do so, it is essential that the index can evaluate the thermal environment at a specific location (not only relative to a different location) for a specific meteorological situation (no climate average values required as inputs). Frequently applied thermal indices allow for a comparison of thermally comfortable designs in different climatic zones.

Evaluation procedure for suitable indices

From the characteristics described in Sections “Thermal indices and related definitions, Outdoor urban environments, and Obstacle-resolving atmospheric models (ORMs),” the following 11 selection criteria for determining indices suitable for ORM application are derived. A pre-condition for all indices selected is that they shall provide only one output value (“Human thermal environments and thermal indices”). The criteria cover input demands (C1, C2), calculation demands (C3–C9), and interpretation demands on the index (C10, C11). The numbering follows the order how an index would be applied in an ORM application:

  1. C1.

    The input of the index is retrievable from ORMs or from standardized tables (i.e., for activity and clothing; “Input variables for thermal indices”).

  2. C2.

    The index exploits meteorological input values on the same temporal scale as typical for output time scales of ORMs (“Fields of application for ORMs”).

  3. C3.

    The index is computable using a formula or a numerical model (“Calculation of thermal indices in ORM applications”).

  4. C4.

    The index assesses the local thermal environment at a specific location within an urban area (“Fields of application for ORMs”).

  5. C5.

    The index considers the influence of all six basic parameters (Temperature (T), humidity (H), wind speed (v), and radiation (Q), clothing and activity) in the calculation and includes both long-wave and short-wave radiative fluxes (“Human thermal environments and thermal indices” and “Radiation fluxes and wind speed”).

  6. C6.

    The index considers long-wave radiative fluxes from all directions (“Radiation fluxes and wind speed”).

  7. C7.

    The index considers the average air temperature range in which a large proportion of mankind lives (− 5 °C to 35 °C, “Outdoor air temperature range”).

  8. C8.

    The index considers typical clothing behavior and activities of urban residents (“Urban residents activities and clothing behavior”).

  9. C9.

    The index assesses thermal conditions for an exposure time of 10 min and more; instantaneous reactions should not be assessed (“Time scales”).

  10. C10.

    An assessment scale exists for the thermal index (“Assessment scales for indices”).

  11. C11.

    The assessment scale of the index is not derived from thermal sensation votes in a specific region (“Assessment scales for indices”).

The criteria are applied in the order given above (C1 to C11) to the 165 indices listed in the catalogue by de Freitas and Grigorieva (2015), which is the most comprehensive list of indices existing so far. If an index does not fulfill a specific criterion, subsequent criteria are not further assessed. To assess the criteria, the original literature of the indices has been reviewed. For 21 indices, the original literature could not be obtained and therefore secondary sources have been used. In our review of the original literature, differences have been found compared to the review by de Freitas and Grigorieva (2016). Those differences are described in Appendix B in ESM 1. For the analysis, indices are evaluated according to our review. For the three indices (“Perceived Temperature according to Linke,” “Physical saturation deficit,” and “Thermal Insulation of Clothing according to Aizenshtat”), the cited reference by de Freitas and Grigorieva (2016) did not contain such an index. Therefore, those indices had to be excluded from the analysis (Appendix B in ESM 1). For the index “Respiratory Heat Loss,” neither the original publication nor sufficient secondary literature could be obtained. Here, the review by de Freitas and Grigorieva (2016) was used, although it only allows an evaluation of some criteria (Appendix A in ESM 1). For criterion C7, the air temperature ranges given by de Freitas and Grigorieva (2016) are used for all indices.

After the 11 criteria are applied, for all remaining indices, six additional index features, also derived from “Thermal indices and related definitions” section to “Obstacle-resolving atmospheric models (ORMs)” section, are analyzed:

  1. F1.

    Unit of the thermal index (“Categories of thermal indices”)

  2. F2.

    Type of human response evaluated by the index (“Categories of thermal indices”)

  3. F3.

    Temporal resolution considered (“Persistence of outdoor environmental conditions”)

  4. F4.

    Implementation of index calculation in ORM applications (“Calculation of thermal indices in ORM applications”)

  5. F5.

    Available methods for the calculation of the index (“Calculation of thermal indices in ORM applications”)

  6. F6.

    Application frequency of the index in ORMs (“Fields of application for ORMs”)

The features F1 to F5 serve as information but do not lead to an exclusion of an index. The features are assessed by reviewing the original literature of the indices. For F6, a systematic literature review was performed using the databases “Scopus” (https://www.scopus.com/home.uri) and “Web of Science” (https://apps.webofknowledge.com) with the keywords “numerical model,” “thermal index,” “urban” including all fields in Scopus, and the topic in Web of Science on 15th November 2016. A total of 116 publications between 2000 and 2016 were obtained of which 106 were left after duplicates had been removed. By screening, 74 records were excluded because of at least one of the following reasons: (1) no ORM application, (2) study of a different spatial scale, (3) did not estimate a thermal index, or (4) were not published in a peer-reviewed journal. In total, 32 studies with different thermal indices remained to evaluate the application frequency (F6). The flow diagram and the 32 studies ordered by applied indices and by climatic zone are shown in Appendix C.

Results

The assessment criteria derived in “Thermal indices and application demands” section are applied using the method described in “Evaluation procedure for suitable indices” section in order to identify suitable thermal indices for ORM applications.

Application of criteria

From the 165 analyzed indices, two entries do not meet the definition of thermal indices used in this paper (“Human thermal environments and thermal indices”), since they provide more than one output (pre-condition for selected index): the Predicted effects of heat acclimatization (Givoni and Goldman 1973) and the Predicted Heat Strain (Malchaire et al. 2001). Therefore, they are not further analyzed.

All indices excluded because of criterion C1 to C7 are shown in Appendix A in ESM 1 including their abbreviations, references, equations for their calculation as far as possible, as well as reasons for their exclusion. As noted before, the criteria are applied in the order given in “Evaluation procedure for suitable indices” section. If an index fails a criterion, subsequent criteria are not assessed. Figure 2 shows the number of indices excluded by C1 to C7 and the remaining number indices. Most indices do not consider all six basic parameters (C5). After C1 to C7 are applied, 13 indices remain. For those indices, the air temperature design ranges and restrictions for other meteorological variables are shown in Table 1.

Fig. 2
figure 2

Number of indices excluded by criterion C1 to C7 (bars) and remaining number of indices (line). A detailed table of excluded indices is given in Appendix A in ESM 1

Table 1 Air temperature design ranges (ΔT) of thermal indices meeting criteria C1 to C7. Ranges of wind speed in persons height (v) or 10 m (v10), relative humidity (RH), and mean radiant temperature (Tmrt) are indicated as far as documented in the original publications. Air temperature ranges have been taken from de Freitas and Grigorieva (2016)

All 13 remaining indices (Table 1) clearly cover the air temperature range of − 5 °C to 35 °C (C7), where 95% of the world population lives (“Outdoor air temperature range”). For PST and UTCIapp, additional restrictions concerning wind speed (both), relative humidity, and mean radiant temperature were found. Although no restrictions for the other indices were found, it is likely that their application range is also constrained, since the underlying parameterizations have been derived only for a limited number of conditions. C8 to C11 are applied to the indices in Table 1 (Table 2). Since all remaining indices are rational indices based on thermophysiological models for the human heat budget, they can be applied for every combination of clothing and activity. Therefore, no index is excluded due to criterion C8.

Table 2 Indices excluded from further analysis due to criterion (C). Full index names and references are shown in Table 1

After assessing the indices with respect to C1 to C11, five indices, PMVO, PTJ, PET, PST, and UTCI (and UTCIapp) are found suitable for applications in ORMs. Since PTJ is an extension of PMVO and improves the limited humidity-sensitivity in warm situations (Staiger et al. 2012), PMVO is excluded from further analysis.

Evaluation of index features

The indices PTJ, PET, PST, UTCIapp, and UTCI are analyzed with respect to their additional features (F1 to F6); the results are compared in Table 3. All remaining indices have a temperature unit (°C, F1). PTJ uses PMV to measure the equivalent effect and is therefore comfort-based (F2). Additionally, PTJ was linked to stress categories (Table 4, Staiger et al. (2012)). PET and UTCI also evaluate thermal stress (F2) since they use strain reactions to measure the equivalent effect in the reference environment and in the actual environment. PET is linked to the PMV scale via a linear regression (Matzarakis and Mayer 1996) and can therefore also be viewed as comfort-based (Blazejczyk et al. 2012). The validity of the regression method was questioned (e.g. Lee and Mayer 2016). Consequently for PET, other scales from TSVs have been derived for various climates (e.g., Lin and Matzarakis (2008); Holst and Mayer (2010); Kantor et al. (2012); Cohen et al. (2013)). However, these scales differ from the original scale (“Assessment scales for indices”) in terms of their implications. The categories of the UTCI assessment scale (Table 4) are derived from the occurrence of strain reactions such as the onset of shivering (Bröde et al. 2012). PST estimates thermal sensation (F2), but in contrast to the other indices is not an equilibrium temperature. Instead, PST is defined as the temperature established around the skin surface (under clothing) after 15–20 min of adaptation to maintain homeothermy. Therefore, the temporal resolution (F3) considered for PST is much more detailed than for UTCI (average over 2 h) and PTJ and PET, which estimate steady state conditions.

Table 3 Thermal indices for ORM applications fulfilling criteria C1 to C11. For entries related to features F1 to F6, the corresponding feature is given The following abbreviations are used: ADu is body surface area, BF is body fat content, e is water vapor pressure, ea is water vapor pressure under actual conditions (related to reference environments), H is a general measure for humidity with specification such as relative humidity (RH), h is height, Icl is clothing insulation, M is metabolism, m is weight, T is air temperature, Tcr is core temperature, Tmrt is mean radiant temperature, Tsk is skin temperature, v is wind speed in person’s height, vw is walking speed, v10 is wind speed in 10 m, and W is work metabolism. Superscripts have the following meaning: a Regression version of UTCI, b Look-up-table version of UTCI, and c full model version of UTCI. For index abbreviations see Table 1
Table 4 Assessment scales of thermal indices suitable for ORM applications based on criteria C1 to C11. For index abbreviations see Table 1. Physiological stress categories refer to PTJ, PET, and UTCI but not to PST

Whether an index is capable to be applied on-line (F4) depends primarily on the computational cost required for the calculation. The computational cost can be estimated from the evaluated temporal state (e.g., more calculations are needed to reach thermal equilibrium) and the complexity of the thermophysiological model (e.g., more complex multi-elements models require more calculations). Temporal state, the thermophysiological models, and the derived on-line or off-line application type are shown in Table 3. The thermophysiological model of PST (MENEX) is a one-node model. Due to the nature of a one-node model, PST cannot account for thermophysiological regulation processes within the body, e.g., heat exchange between different body parts. To consider these processes, at least two nodes are necessary (“Categories of thermal indices”) as considered in thermophysiological models of PET and PTJ. Out of the thermophysiological models of four suitable indices, the UTCI-Fiala model is the most sophisticated. Due to its multi-element structure, it predicts the state of individual body parts, although the UTCI index itself currently represents an entire body value.

The software (F5) to calculate the indices is indicated in Table 3. The source code is only publically available for UTCIapp. For PET and PTJ, source code is available from VDI (2008b).

PTJ, PET, PST, and UTCI not only differ with respect to the index features but also regarding the treatment of clothing and activity (criterion C8). PET uses a fixed clothing insulation of 0.9 clo for the definition of the assessment scale (Table 3). Hence, it is a purely climatic index independent of individual behavior (Höppe 1999). However, other clothing values may be used in MEMI, although the assessment scale is technically applicable only for 0.9 clo. The three other indices account for a behavioral adjustment of clothing. In the calculation of UTCI, a full clothing model is incorporated (Havenith et al. 2011), which considers typical clothing behavior of urban residents, derived from studies in Europe and Russia. By considering adjustable clothing, behavioral adaptation (“Categories of thermal indices”) is accounted for. For the wide range of atmospheric conditions experienced outdoors, fixed clothing is unlikely to represent the clothing behavior of the population during all seasons. However, to be able to compare thermal climates at two locations, fixed clothing may be preferred. PET considers a very light activity (standing still), which represents the lowest expected outdoor body heat production under normal circumstances. The three other indices consider walking at 4 km h−1. The UTCI index is currently further developed to include other clothing and activity levels (Bröde et al. 2016).

The indicated assumptions and limitations of individual indices must be kept in mind by the user when applying these indices. Despite the differences between the indices, they have been shown to be strongly correlated (e.g., Blazejczyk et al. (2012); Staiger et al. (2012); Park et al. (2014); Matzarakis et al. (2014); Fröhlich and Matzarakis (2015)). The correlations, however, were found to be regime dependent (Staiger et al. (2012); Fröhlich and Matzarakis (2015)) because of the sensitivity of the indices to specific meteorological parameters and the different clothing models.

For the evaluation of F6, a systematic literature review was conducted as described in “Evaluation procedure for suitable indices” section. Figure 3 shows the results of the 32 identified studies (references are given in Appendix C in ESM 1). PET is the most widely applied index in ORM applications (Fig. 3a). It remained popular even after the development of UTCI in 2012. Similar results were obtained by Coccolo et al. (2016), who did not focus on the microscale. PET has been applied in all three climatic zones, whereas most other indices have been applied only in some zones (Fig. 3b). Most studies have been conducted for the subtropics, followed by temperate climate and the tropics. No study for polar climate was found in the systematic review.

Fig. 3
figure 3

Number of ORM applications using the different indices published in different years (gray-colored) (a) and per climate zone (indices gray-colored) (b). Appendix C in ESM 1 summarizes the studies included in the analysis derived from the method in “Evaluation procedure for suitable indices” section. For abbreviations of indices see Appendix A in ESM 1 and Table 1. Note that some studies applied several indices and that PMV and SET* are used here to summarize studies that apply these indices in their original or derived form

This statistical analysis shows that of the selected indices in this study, only PET has been applied in different climatic zones. PET is also the most frequently applied index. Therefore, PET is most suitable for comparing simulation results for different cities around the globe. No studies were found to apply PST or PTJ, although with PMV, a precursor of PTJ was applied. The two indices without a rational basis (THI, WBGT; Appendix A) are least frequently applied; Morakinyo et al. (2016) use them in addition to PET.

Discussion and conclusions

This paper identifies thermal indices suitable for ORM applications in urban environments for evaluating thermally comfortable designs. For the identification, 11 criteria have been derived based on the characteristics of human-environmental heat exchange (“Thermal indices and related definitions”), of outdoor urban environments (“Outdoor urban environments”), and of ORMs (“Obstacle-resolving atmospheric models (ORMs)”). The criteria were developed to identify indices that can be applied in ORMs in their current version globally without too much consideration by the users. The criteria have been evaluated for 165 indices listed by de Freitas and Grigorieva (2016) by inspecting the original literature. In total, four indices (PET, PTJ, PST, and UTCI) with different characteristics (“Evaluation of index features”) have been found to fulfill all criteria.

Indices not selected in this analysis are not “bad” but not targeted for the intended type of application. Indices developed in form of nomograms, for instance, can be transformed to a program but that would require extra work from the user (C3). This may change, however, if the index is further developed. The indices selected in this study were targeted for the average urban resident with typical urban clothing and activity. To assess the thermal environment of beach tourists or workers wearing special protective clothes, different indices may be needed than those selected in this study. Additionally, only those indices were selected that can be applied to an air temperature range of − 5 °C to 35 °C. This range was determined to cover the climatic air temperature range 95% of the world population experiences. If only warm conditions shall be thermally assessed, all indices discarded by C7 in Appendix A in ESM 1 might be usable. For those indices criteria, C8 to C11 should be evaluated before use. Besides the temperature design range also the ranges for humidity, wind, and radiation to which the world population is exposed to should be evaluated. However, only for very few indices, the applicable design ranges for those parameters are given in the original literature. Therefore, this has not been attempted here. For future index developments, the design range of all input parameters should be clearly defined.

In urban planning, a design should result in thermal comfort for the average population. The thermal indices selected in this study assess the average population by considering a reference person (Table 3). As a result, however, the individual perception of a specific environment may differ from the assessment calculated by the index. Individual perception depends on the thermal history (“Persistence of outdoor environmental conditions”), the expectations of an individual and the interaction with other stressors such as noise or odors (Coccolo et al. 2016). Although important, this multitude of factors currently cannot be taken into account when designing thermal comfortable spaces for the general population. However, with increasing computational power and increased knowledge on human behavior, new methods for thermal environmental assessment in ORM applications in the context of urban planning may be established. Computational power may favor application of turbulence resolving ORMs, for which a suitable index should be able to consider the unsteadiness of the flow (Fanger et al. 1988). Furthermore, ensemble simulations for individuals with different personal characteristics and thermal histories could be used to evaluate the environment dynamically, as recommended for outdoor applications by Höppe (2002) and Coccolo et al. (2016). First steps in this direction have been taken by Bruse (2007). In the modeling framework of urban system models (von Szombathely et al. 2017), all those interactions could be combined to model health-related urban well-being. By extending studies such as by Hoffmann et al. (2018) to realistic cases, ORMs along with the found suitable thermal indices can make up one component in a suite of different multi-sectorial models to model the entire urban system.