Introduction

For analysis of continent-scale human energy use, a key question is whether people build better-insulated cities in places where superior thermal insulation would have the greatest effect. We argue in this paper that, for quantifying city thermal insulation, it may be possible to treat the city as the unit of measure by applying methods developed for the analysis of insulation in animal homeotherms. Our specific interest is in insulation as a defense against heat loss in cold environments.

At present, there is great multidisciplinary interest in the thermal insulation of the built environment in cities. Cities are where much of human energy use takes place (Burnside et al. 2012; Burger et al. 2017), and in the contemporary quest for improved energy efficiency to mitigate global warming, the thermal insulation provided by the urban built environment in cold climates is a pivotal consideration (Wilkinson et al. 2007; Rickwood et al. 2008). Recognizing the current multidisciplinary interest in insulation, we use this Introduction to explain our concepts for measuring city insulation in terms accessible to a multidisciplinary audience.

Hill et al. (2013) demonstrated that individual cities behave like thermoregulating superorganisms: the total rate of energy use by the buildings in a city varies with daily outdoor air temperature (Ta) in much the same way as the metabolic rate of a homeothermic animal varies with Ta (Fig. 1). As a shorthand expression, we will often call the relationship between the rate of heat production and Ta the “energy-Ta” curve. In both cities and homeothermic animals, the curve has a similar shape because of similar underlying mechanisms, and because of this similarity, the extensive body of theory developed to understand the energetics of animal homeotherms can in principle be applied to understanding city energetics.

Fig. 1
figure 1

Rate of heat production as a function of air temperature (Ta) in a mammal and a city. A Total individual metabolic rate as a function of Ta in winter-acclimatized rabbits, Sylvilagus audubonii. Each symbol represents one individual at one Ta. B Total city rate of heat production by use of electricity and natural gas as a function of outdoor Ta in Ames, Iowa, in 2009. Each symbol represents one calendar day [A from (Hinds 1973); B from (Hill et al. 2013)]

The thermal insulation, I, provided by the exterior construction of an individual building is a major determinant of the energy cost of heating the living space inside the building in winter. An entire city can be considered to have a city-wide value of I, which is a function of the values of I for the hundreds or thousands of individual buildings in the city. To estimate city-wide I, one approach would be reductionistic: treat individual buildings (or other living units) as the units of measure, estimate I for each, and synthesize. Such an approach would be fraught with privacy concerns and expensive. An alternative approach—far less fraught and expensive—is to treat the entire city as the unit of measure and calculate city-wide I from the city energy-Ta curve.

Figure 2 depicts a standard model of the energy-Ta curve of an individual mammal (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). There is a range of Tas in which the animal’s rate of heat production (its metabolic rate) is constant regardless of Ta. This range, the thermoneutral zone (TNZ), is bounded at its lower limit by the lower critical temperature (TLC) and at its upper limit by the upper critical temperature (TUC). The rate of heat production in the TNZ is the basal metabolic rate, which we call the resting metabolic rate (RMR) in application to cities. Within the TNZ, an animal varies its value of thermal insulation, I, as Ta varies. However, according to the standard model, an animal’s I is maximal and constant at Tas below the TLC. Because of this constancy of I below the TLC, the animal’s rate of heat production increases linearly as Ta decreases below the TLC. The absolute value of the slope of the energy-Ta curve at Tas below the TLC is equal to conductance, C, which is the inverse of I (I = 1/C) (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). Specifically, according to the standard model, the inverse of the absolute value of the slope is equal to the animal’s maximal value of thermal insulation, I. The insulation of a city is also expected to be constant and maximal at Ta < TLC because it depends on the relatively unchanging physical structures of buildings that, at such Tas, are configured to minimize heat loss. Thus, the model developed for animal homeotherms (Fig. 2) can in principle be applied to cities.

Fig. 2
figure 2

Standard model of the relation between the rate of heat production and Ta in an individual homeotherm. Within the thermoneutral zone (TNZ), the animal’s rate of heat production (metabolic rate), termed its BMR or RMR, is low and independent of Ta. The lower and upper limits of the TNZ are the lower (TLC) and upper (TUC) critical temperatures. At Ta < TLC, the animal uses shivering and nonshivering thermogenesis (NST) to increase its rate of heat production as the environment becomes colder. Moreover, according to the standard model, at Ta < TLC, body insulation I is approximately constant, and accordingly the rate of metabolic heat production required for thermoregulation increases linearly as Ta falls. The absolute value of the slope of this increase is conductance C = 1/I. According to this model, extrapolation of the line segment below TLC intersects the x-axis at a Ta equal to the animal’s deep body temperature, a consideration discussed later in this paper. In a city, rather than shivering and NST, furnaces or other building-heating devices are employed to increase the rate of heat production as the environment becomes colder at Ta < TLC. At Ta > TUC, an animal’s rate of heat production increases for complex reasons (Hill et al. 2016), including that panting and other mechanisms of increasing evaporative cooling require an investment of metabolic energy. In a city at Ta > TUC, the rate of heat production increases because of the energy cost of air conditioning and cooling fans

Thermal insulation is defined in a standardized way for multiple applications. For any object that is warm on the inside and surrounded on the outside by cooler air, two factors determine the rate of heat loss. One is the magnitude of the temperature difference, Tinside − Toutside. The other is the resistance to heat transfer posed by the physical structures that lie between inside and outside. Insulation (I), which formally refers to this resistance, is defined operationally by the following equation, analogous to Ohm’s law:

$$ I = \frac{{T_{inside} - T_{outside} }}{H} $$
(1)

where the numerator and denominator are measured simultaneously in steady-state and H represents the rate of internal heat production required to replace heat lost. In an animal, H is the animal’s metabolic rate (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). In a building, H is the rate of heat production by the furnace or other devices that add heat to the building interior. Equation (1)—besides being used to understand the energy-Ta curves of animals and cities—is incorporated into all common measures of insulation. The equation is used to measure the insulative effectiveness of mammal furs (Scholander et al 1950b), the clo values that express clothing insulation (Gagge et al. 1941; Burton and Edholm 1955; Yan and Oliver 1996; Hill et al. 2013), and the R values that express the insulative effectiveness of construction materials (ASTM International 2017).

A critically important attribute of I is that it is a holistic measure of resistance to heat transfer. In a mammal, I depends on all the “resistive” factors that affect how readily the body loses heat in a cold environment. These factors include not only the animal’s pelage, but also its body posture and the vigorousness of blood flow to its body surface. When I is measured using Eq. (1), the effects of all these factors are taken into account and reflected in the value of I obtained. This holistic attribute is especially important for the study of city insulation. In a city, the city-wide I depends on a great many “resistive” factors: the types of materials built into the walls and attics of hundreds of variously shaped houses constructed by different people at different times, the types of window designs used in these varied houses, the spacing of living units (e.g., the proportion built with shared walls), the quality of structural maintenance over time, and other structural features. When I is measured using Eq. (1), all of these factors are taken into account and reflected in the value of I obtained, meaning that I provides a measure of the overall resistance to heat loss from a city’s buildings.

In this paper, we use the methods developed by Hill et al. (2013) to obtain energy-Ta plots for 42 North American cities that vary widely in thermal climate. To quantify thermal climate we use the average Ta during all days of the year for a 10-year period, T10-year, a metric that ranges from − 10.8 to + 25.5 °C in the 42 cities studied. We ask the question: does city-wide I vary significantly with T10-year? Our a priori null hypothesis is that there is no significant relationship between I and T10-year. Our alternative hypothesis is that I increases as T10-year decreases. This alternative relationship is what would be expected if people constructing cities in cold climates have incorporated more features that impede heat loss than people constructing cities in thermally moderate climates.

Materials and methods

We approached the electric and natural-gas utilities of over 150 mid-sized cities in North America (population: 2000–155,000) to request daily, city-wide usage data for an entire year, and here we report on all 42 cities in which the primary energy sources for home heating are electricity and natural gas, and for which the utilities agreed to provide data. In the process of gathering these data, we sought cities with widely different exposure to low Ta, as judged by published data on annual average air temperature, measured specifically as T10-year (we calculated T10-year for all cities using a standardized method described later). We recognized six categories of published annual average temperature: < − 4 °C, − 4 °C to + 2 °C, 2–8 °C, 8–14 °C, 14–20 °C, and 20–26 °C. Then, as we sought cities for inclusion in this study, we maintained two objectives: (1) obtain data on as many cities as possible, but also (2) include, to the extent possible, approximately equal numbers of cities in the six temperature categories. We limited our search to mid-sized cities because large cities often have indefinite physical and functional boundaries, and because the inclusion of large cities would have substantially increased among-city heterogeneity in the urban-heat-island effect (Oke 1973, 1982).

Prior to approaching the utilities in a city, we used publicly available satellite images and narratives (e.g., city descriptions at city websites) to assess whether large, energy-intensive factories—or other industrial or military facilities—were present in the city. We did not approach cities in which we were able to identify such facilities. We also focused on city boundaries and did not pursue cities that we became aware were seamlessly connected to adjacent cities. If we approached the utilities in a city, we queried utility personnel about the possible presence of heavy industrial or military customers and withdrew our request if we learned of their presence.

Approached candidate cities were often unusable because one of the utilities could not or would not provide the data needed. If the utilities expressed willingness to cooperate but could not provide the exact types of energy-use data we sought, we sometimes modified our request. Most notably, in two far-northern cities (Inuvik, Utqiagvik) we obtained data on energy use at weekly or monthly resolution because data at daily resolution were unavailable.

Our target year for data was 2013, and for 35 cities we report data for that year. We also include (and reanalyze) the six cities we earlier studied (Hill et al. 2013), for which data apply to 2009 or 2010, and one city with data for 2014.

For each city, we calculated city-wide, within-building heat production during each 24-h calendar day—symbolized Hcity—as the sum of the heat equivalents of electric and natural-gas energy used that day (or, analogously, if the resolution was weekly or monthly, we summed heat production over all 24-h days in a week or month). Some cities (particularly in relatively warm regions) do not use natural gas, and in those cases we calculated daily heat production from electric data alone. To produce an energy-Ta plot, daily Ta must be known for each calendar day. To obtain data on the average Ta during each day, we used MapServer at the Utah Climate Center (Utah State University) to access temperature records for cities in the United States. For cities in Canada, we used either the Government of Canada Historical Climate Data database or the Environment Canada database. For a typical city, using these methods, we obtained 365 x, y data points (one per calendar day in a year), where x is daily average Ta and y is daily city heat production, Hcity. In Utqiagvik (formerly Barrow), AK, we obtained 49 weekly data points. In Inuvik, NWT, we obtained 36 monthly data points by combining data for three adjacent years (2013–2015).

All statistical methods were planned in detail a priori. After data were obtained, the data were processed strictly according to the a priori plan.

To calculate city-wide I, we use two methods, which differ in how effectively they address two types of limitations in the data available (see “Discussion”). We term these the “point-by-point” and “slope” methods.

Point-by-point method to calculate city-wide I

The point-by-point method is a straightforward application of Eq. (1), which—in the study of cities—takes the form: I = (TLSTa)/Hcity, where TLS is the air temperature in the living spaces of buildings. As already described, for each city we have daily data on Ta and Hcity for an entire year. Accordingly, we can calculate city-wide I for each day (thus, “point-by-point” method), limiting ourselves to days when Ta ≤ TLC because, as described earlier, I is expected to be constant at such Tas, meaning all such estimates of I are estimating the same value. The obvious problem with the application of this method is the lack of empirical knowledge of the temperature maintained in living spaces, TLS. However, we do know that people in North America tend to set thermostats near 20 °C in the cold seasons of the year (Huchuk et al. 2018). We carry out the calculations three times with three TLS estimates, looking for convergences.

Slope method

The slope method explicitly addresses a different limitation: the problem that city population size, P, affects I, but there is no established theory for taking P into account. The slope method addresses this problem by calculating a derivative, unitless measure of insulation that is independent of P. To apply this method to a city, we calculate C, the absolute value of the slope of the linear regression of Hcity versus Ta below TLC (see Fig. 2). C itself provides a measure of city-wide I because I = 1/C. With the regression, however, we can calculate the regression-based ratio of energy use at two different Tas, a unitless measure of insulation independent of city size. We use the regression to calculate the ratio: estimated Hcity at Ta = − 20 °C/estimated Hcity at Ta = 10 °C, symbolized Hcity(-20) /Hcity(10) (see “Discussion” for explanation of Tas chosen). In addition to being unitless, this ratio provides a measure that is easily understood by nonspecialists.

All statistical calculations were carried out in IBM SPSS Statistics (version 26) except that we used the Segmented package (for segmented regression; Muggeo 2008) in R (R Project for Statistical Computing; R Core Team 2021) to estimate the lower critical temperature, TLC, of each city (SPSS lacks a procedure for segmented regression). To keep the calculation of TLC as straightforward as possible and minimize the chances of confounding factors (see “Discussion”), we wanted the data analyzed by Segmented to include only one breakpoint. Thus we used the data for Ta ≤ 19 °C to calculate TLC and specified one breakpoint in the application of Segmented. We ran Segmented three times for each city, seeding the iterative calculations with three starting approximations of TLC, 10 °C, 12 °C, and 14 °C. Where the estimate of TLC differed because of starting value, we used minimization of the Akaike information criterion (AIC) to determine which estimate of TLC to use.

To calculate the regression of the energy-Ta relationship below thermoneutrality for a city, we first calculated an upper Ta limit for the data to be used. Specifically, to minimize chances of including data points from the TNZ (see Fig. 2), we calculated the upper Ta limit for the regression as our estimated TLC minus 1 SE of the TLC estimate. We carried out linear least-squares regression on the data for all days in which Ta was less than or equal to this upper limit.

Although the energy-Ta relationship at high Tas is not of central concern for this paper, we estimated the upper critical temperature of each city, TUC, and the slope above TUC in ways analogous to the procedures just described. To estimate TUC we used the data for Tas ≥ 14 °C and specified one breakpoint in Segmented. We ran Segmented three times for each city (seeding with starting approximations of 18 °C, 20 °C, and 22 °C) and resolved ambiguities using AIC. To estimate the regression of the energy-Ta relationship above thermoneutrality, we calculated a lower Ta limit for the data to be used as TUC plus 1 SE. We then carried out linear least-squares regression, using the data for all days when Ta was equal to or higher than this lower limit.

To calculate RMR, the mean daily energy consumption in the TNZ of each city, we averaged the data for all days when the Ta was (1) more than 1 SE above TLC (using the SE of the estimate for TLC) and (2) more than 1 SE below TUC (using the SE of the estimate for TUC). There were many cities for which Segmented detected no TUC (i.e., no breakpoint representing TUC). In such cases, we calculated the mean daily energy consumption in the TNZ using the data for all days meeting criterion (1).

Throughout the rest of this paper, we use a short-hand terminology for referring to data boundaries. Where we speak of data gathered on days when Ta was “below the TLC,” in fact the days were those with Ta below TLC minus 1 SE of the TLC. Similarly, when we speak of data “above the TUC,” the data were actually above the TUC plus 1 SE of the TUC.

For each city, we obtained three additional types of information used in our analysis: population size, number of electric accounts, and T10-year. For cities in the United States, we obtained population size from the US Census Bureau (2010 census); for cities in Canada, we used Statistics Canada. To obtain the number of electric accounts, we asked the electric utility.

To calculate T10-year we used data acquired through MapServer (Utah Climate Center), the Government of Canada Historical Climate Data database, or the Environment Canada database. Barring lapses in data records, our target 10-year period was typically 2004–2013, and we averaged the Ta data for all days in that period. If data were missing for some of those years, to obtain 10 years of data, we included data (if available) from the adjacent years for which data were available and that were closest in time to the target 10-year period. For cities in which long-term Ta records have not been maintained, we used data for the closest cities, of similar size and altitude, where data are available. For three cites, we could not obtain 10 years of data near our target years and used 8 years (Fort Macleod) or 5 years (Cairo; Thomasville); for Inuvik, the only suitable data were “climate normal” data for 1981–2010.

Results

The energy-Ta curves for all 42 cities are presented in Fig. 3, where the cities are plotted at their geographic locations on a map that delineates zones of average annual air temperature based on historical climate records. Table 1 presents additional information on each city: altitude, T10-year, and population P. Although the zones of average annual air temperature in Fig. 3 provide useful background information, for the statistical analysis of the data each city’s specific T10-year (Table 1), not zone, was used.

Fig. 3
figure 3

Energy-Ta plots for 42 North American cities. Each plot depicts the city rate of heat production Hcity normalized to population size P (i.e. W/ person) as a function of average daily outside air temperature (Ta) for an entire year. For most cities, the plot covers ca. 365 days, with each symbol representing one day. Axes of all plots are scaled identically to the scaling shown in the enlarged plot for Timmins. To display the geographical distribution of the cities (important information addressed in the “Discussion”) and to help identify cities that are found in similar thermal climates, the locations of cities are plotted on a map color-coded to show average annual air temperature based on historical climate records (Wang et al. 2016): dark blue, < − 4 °C; medium blue, − 4 to + 2 °C; light blue, 2–8 °C; yellow, 8–14 °C; orange, 14–20 °C; and red, 20–26 °C. These are the same six categories of annual average air temperature employed in our data acquisition protocol as described in “Materials and methods”. Annual average air temperature depends on altitude and proximity to seacoasts as well as latitude

Table 1 Altitude, T10-year, and population size of the cities

We analyzed the potential correlation between T10-year and P for the 42 cities. The Pearson correlation coefficient between the two variables is 0.04 (p > 0.8), indicating no relationship. A scatterplot of the two variables likewise suggests that T10-year and P are completely unrelated in our dataset.

Point-by-point method

For each city, we calculated I = (TLSTa)/Hcity for each day when Ta was below TLC, assuming in three sequential analyses that TLS was 20 °C (Huchuk et al. 2018), 17 °C, or 23 °C (see “Materials and methods”, and “Discussion”, for explanation). Table 2 lists for each city the TLC and the number of days, N, when Ta was below TLC; N is the number of estimations of I provided by the point-by-point method at each TLS.

Table 2 Results of the two methods

To analyze the relation between the daily measures of city-wide I and T10-year, our initial plan had been to remove effects of city population size, P, by analysis of covariance. However, scatter-plots revealed that assumptions of analysis of covariance (e.g., normality, homoscedasticity, and linearity) were not always met. Thus, following our a priori strategy, we calculated the average P-normalized I [see Eq. (1)] for each city by dividing (TLSTa) by Hcity/P for each day and then averaging the data over all N days at each assumed TLS. Given that thermostats are commonly set near 20 °C during the cold seasons of the year in North America (Huchuk et al. 2018), we first analyzed the data with TLS = 20 °C (Table 2). The datum for one city (Summerside, PEI) was a statistical outlier and thus not included in further analysis. After removing it, we fitted a linear regression (Fig. 4), which indicates a statistically significant inverse relationship between city-wide I and T10-year, although the regression explains just 9.6% of variance in I (Table 3). We repeated the analysis assuming TLS = 17 °C and TLS = 23 °C. Assumed TLS affects the statistical relationship between I and T10-year (Table 3). As TLS is set to higher values, the absolute value of the estimated slope decreases, the fraction of variance explained by the regression decreases, and certainty in a non-zero slope decreases (Table 3).

Fig. 4
figure 4

Results of the point-by-point method: average P-normalized I plotted versus T10-year for 41 cities. Each symbol represents one city. For each city, P-normalized I = (TLSTa) ·P/Hcity was calculated for each day when Ta was below TLC, assuming TLS = 20 °C, and all days were averaged. TLC and the number of days analyzed are tabulated in Table 2. The coefficients of the regression are in Table 3

Table 3 Results of the point-by-point method

Slope method

To implement this method, we first calculated the linear regression for the total rate of heat production (Hcity) as a function of Ta, at Ta < TLC, for each of the 42 cities (see Supplementary Table 1 for slope, intercept, and r2). In all cases, the regression had a statistically significant slope different from 0 (p < 0.0025, except p = 0.01 for Fort Pierce).

Using the Hcity-versus-Ta regression for each city, we calculated the ratio of predicted Hcity at Ta = − 20 °C divided by predicted Hcity at Ta = 10 °C, symbolized Hcity(-20)/Hcity(10), to obtain a unitless index of city insulation (Table 2). Figure 5A shows this index as a function of T10-year for all 42 cities. The slope of the regression (− 0.022 per °C) is significantly different from 0 (p = 0.044, one-tailed), although the adjusted r2 is 0.048, indicating that the regression explains just 5% of variance. According to the standard model (Fig. 2), the slope of the Hcity-versus-Ta regression of a city at Ta < TLC is approximately equal to 1/I if the regression extrapolates to intersect the x-axis at a temperature approximating TLS (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). The x-intercept for some cities is too high to plausibly approximate TLS. Following our a priori plan, we thus winnowed the set of cities analyzed to include only those 26 cities for which x-intercept < 30 °C, presented in Fig. 5B. Within this subset of cities, there is no statistically significant relationship between Hcity(-20) /Hcity(10) and T10-year (p = 0.95).

Fig. 5
figure 5

Results of the slope method: Hcity (-20) /Hcity (10) plotted in relation to T10-year. Each symbol represents one city. For each city, the linear regression of Hcity as a function of Ta, for all days when Ta < TLC, was calculated (see Supplementary Table 1). Using the regression, predicted Hcity at Ta = − 20 °C divided by predicted Hcity at Ta = 10 °C was calculated as a measure of city insulation. A includes 42 cities. The slope of the regression is significantly different from zero: Hcity (-20) /Hcity (10) = 2.95−0.022·T10-year (p = 0.044). B includes the subset of 26 cities for which the x-intercept of the city regression is < 30 °C

Related to the slope method is the concept that the predicted value of Hcity at a specific low Ta could also serve as an index (inverse) of city resistance to heat loss. Thus, for each of the 42 cities in our full dataset, we calculated Hcity at Ta = − 20 °C using the Hcity-versus-Ta regression (Supplementary Table 1). Recognizing that city population size, P, helps determine Hcity, we carried out an analysis of covariance to examine the potential relationship between Hcity at Ta = − 20 °C and T10-year. P explains a highly significant part of the total variance (p < 0.005). However, there is no evidence of a significant relationship between Hcity at Ta = − 20 °C and T10-year (p = 0.18).

Metrics in the TNZ and at T a > T UC

The Segmented routine in R identified a breakpoint corresponding to TUC in the energy-Ta curves for 25 cities, but not the other 17 cities. Supplementary Table 2 summarizes the estimates of TUC, estimates of RMR/person (mean Hcity/P in the TNZ), and—for cities with a defined TUC—the regression coefficients for regression between Hcity and Ta at Ta > TUC.

For statistical analysis of RMR, the regression of mean total city-specific RMR (mean Hcity in the TNZ) as a function of T10-year was examined with P as a covariate. Although the effect of P was significant (p < 0.001), that of T10-year was not (p = 0.61).

Discussion

The summary of data in Fig. 3 abundantly confirms that, as first demonstrated by Hill et al. (2013) (see also Meehan 2012), individual North American cities typically exhibit energy-Ta curves similar to those of animal homeotherms. Cities do this because the basic principles of heat exchange that apply to homeotherms also apply to cities. General trends correlated with climate zones are readily apparent (Fig. 3). Virtually all cities manifest segment ① of the full energy-Ta curve (see Fig. 6) because this part reflects the increasing energy cost of maintaining a steady internal temperature (TLS) by use of heating systems as Ta decreases below TLC. However, southern Florida is so warm that Ft. Pierce and Key West barely express segment ①. Conversely, cities in the Far North such as Inuvik and Utqiagvik (previously Barrow) experience such cold environments that they do not express segment ②, the part of the TNZ at Tas immediately above the TLC; that is, these cities do not express the TNZ. Cities in high mountains (see Table 1 for altitude data) tend to express a narrower part of the TNZ than lowland cities at similar latitudes. Segment ③ of the energy-Ta curve (see Fig. 6) reflects the use of air conditioning and its increasing energy cost as Ta increases above TUC. Cities in the two warmest climate zones (orange and red, Fig. 3) vary in how vividly they express this part of the curve, some cities expressing it vividly (e.g., Key West), whereas others barely express it. Probably, a significant factor in this variation is the percentage of residences in a city that employ air conditioning; an entire city will strongly express segment ③ only if a substantial subset of residences employ air conditioning. In the set of 42 cities, a few exhibit atypical curves (e.g., Lawrenceville, Summerside). In this study, we did not reject cities because of their energy-Ta curves. We applied rigorous quality standards in sourcing data and calculating results from data, but if a city met these quality standards, it has been included in our analysis. As stressed earlier, we defined every step of our data analysis a priori; thus, the nature of results was never a factor in the results reported.

Fig. 6
figure 6

Line segments in the energy-Ta relationship of a city. Shown is a generic energy-Ta curve on which three parts are identified and numbered so that they can be referenced specifically in the accompanying discussion: ①, the part of the curve below the thermoneutral zone (TNZ); ②, the initial (i.e., low Ta) part of the TNZ; and ③, the part of the curve above the TNZ. Compare with Fig. 2

The paucity of northern cities in our dataset is evident in Fig. 3. This paucity reflects a fundamental limitation for studies using our methods. Namely, energy data at defined temporal resolution are not equally available from the relatively cold and relatively warm parts of North America. To a dramatic degree, as we turned to colder and colder parts of the continent in our search for city data, we found the data to be more and more difficult to obtain. The reason is that in many cold-climate regions, households rely principally on fuel oil or other heating fuels delivered in bulk. In such cases, the times of use of fuel to produce heat are not sufficiently defined to be correlated with Ta records.

Relation between city-wide insulation below the TNZ and thermal climate

Considered in its entirety, this research indicates that in North American cities built in a wide range of thermal climates (T10-year between − 11 °C and + 26 °C), city-wide I tends to be higher, the colder the climate.

To analyze the data with the point-by-point method, we used three estimated values of TLS (living-space temperature). An underlying assumption is that TLS is independent of Ta at Ta < TLC, an assumption supported by the data for the heating seasons in Huchuk et al. (2018). The regression between population-normalized city-wide I and T10-year is statistically significant for TLS = 17 °C (p = 0.0012) and TLS = 20 °C (p = 0.014), although moderately short of statistical significance at TLS = 23 °C (p = 0.062) (Table 3). Certainly, 17 °C and 20 °C would seem to be more plausible values than 23 °C for average thermostat settings in cold seasons of the year. The results of our analysis with the unitless slope method also tend to point to a significant inverse relationship between city-wide I and T10-year. In that analysis, we quantified I as the ratio of two regression-calculated values of Hcity: Hcity(-20) /Hcity(10). The regression for all 42 cities (Fig. 5A) is statistically significant (p = 0.022). Although analysis with just cities that adhere to the x-intercept expectation of the standard Hcity-Ta model does not yield a significant regression (Fig. 5B), the fact remains that—quite apart from any model—the index Hcity(-20) /Hcity(10) is an instructive, accessible metric in its own right, reflecting the extent to which a city’s energy use must be elevated for thermoregulation at Ta = − 20 °C relative to Ta = 10 °C. Thus the full set of data on Hcity(-20) /Hcity(10) is pertinent (Fig. 5A).

Considering all the data available and the results of both the point-by-point and slope methods, we conclude that as people have constructed cities in various climates in North America, they have—at a community level—tended to build better-insulated living spaces in cold climates than moderate ones: a trend that aids energy conservation. Variance is high, however, and statistically speaking, T10-year explains only a small proportion of total variance in city-wide I: a mean of 11% in the point-by-point method and 5% in the slope method. Thus, the evidence for adaptive modulation of city-scale thermal insulation might be aptly described as suggestive but tentative. The high variance might indicate that cities in North America have varied a great deal in the extent to which insulation has been a priority in city development. The high variance certainly presents an obstacle to the accurate elucidation of any trends that exist.

Focusing on our estimates of city-wide I in the point-by-point analysis (estimates expressed in units of measure, rather than being unitless), we can address a significant question: Is the slope of the regression between I and T10-year (Fig. 4) great enough that the increasing I at low T10-year entirely offsets the effect of the increasing cold? Here we contribute just a relatively simple analysis based on the definition of I [Eq. (1)], assuming TLS to be 20 °C (Huchuk et al. 2018). As a starting point, if we exclude extremes—notably Key West and the two extreme-cold cities (Inuvik and Utqiagvik)—we can calculate from published databases (for the U.S., U.S. Climate Data; for Canada, Environment Canada Climate Normals 1981–2010) that the average daily low Ta in December-February at our five warmest cities (average T10-year = 21.0 °C) is 7 °C, whereas at our five coldest cities (average T10-year = 3.1 °C), the average daily low Ta in December-February is − 15 °C. Combining the energy-Ta plots of the five warmest cities, we calculate a combined city-wide Iwarm cities of 4200 °C·person/MW and estimate the energy cost (W/person) at Ta = 7 °C for those cities taken as a group. We then ask what the value of I in the five coldest cities, Icold cities, would need to be for the energy cost in the cold cities (W/person) at Ta = − 15 °C to be the same; this is the I in the cold cities that would be required to completely offset the increased cold there so they would have no higher energy cost than the warm cities. By this analysis, Icold cities would need to be 2.7 times higher than Iwarm cities to achieve this end. In actuality, the regression we calculated from our city energy-Ta curves (Fig. 4) indicates that the average P-normalized I is 1.4 times higher in our cold cities (again excluding Inuvik and Barrow) than our warm cities (excluding Key West). The trend we detect is thus only about half as great as would be required for increased city-wide I to offset fully the effects of thermal climate.

Prior to this research, there were already reports in the literature that, taking a reductionistic approach, focused on particular variables in home construction that affect insulation. For example, Healy (2003) reported that 85–100% of houses in Finland, Norway, and Sweden have cavity insulation in the walls, whereas only 6–68% in other European countries do. Holden and Norland (2005) presented evidence that per-capita energy use in Norway is lower in living units with shared walls than in stand-alone living units, although the difference is small in modern (post-1980) construction.

However, for energy analysis on regional or continental spatial scales, it is important to know holistically the resistance to heat loss of the large assemblages of houses and other occupied buildings in entire cities. Moreover—and just as important—this resistance does not depend solely on individual reductionistic properties (e.g., insulating material in house walls). Resistance to heat loss at city scale depends on many factors, including the totality of materials used to build dwellings at various stages in a city’s history, dwelling geometry, and maintenance. As explained in the Introduction, city-wide I—as we use the concept in this report—is a holistic measure of resistance to heat loss that takes all these considerations into account. Thus, city-wide I is the operationally relevant property that energy analysts need to know for understanding how efficiently energy consumed for thermoregulation is used to create desired temperatures in living spaces. Our methods estimate this city-wide I.

Statistical methods for estimating city-wide I

Our methodological approach has two great strengths. It (1) uses publicly available information to (2) quantify a hard-to-measure property, city-wide I, of considerable potential importance for understanding city energy use and the effectiveness of energy-conservation efforts. Our approach to measuring city-wide I evades intrusive house-by-house insulation measurements that would be complex, expensive, and politically tenuous. Because of its practicality and relatively low cost, our approach provides perhaps the only immediately available and realistic means of measuring city-wide I on regional or continental scales.

However, difficult challenges [e.g., taking account of the influence of P (Packard and Boardman 1988)] are encountered in extracting information on city-wide I from energy-Ta plots. Thus, this paper is as much an exploration into techniques for calculating city-wide I from energy-Ta plots as it is a study of the relationship between I and T10-year.

From the outset, we have put a high priority on rule-based data processing, using rules articulated a priori. We tested and refined analytical methods by working on a subset of 6 cities prior to carrying out any analysis on the other 36 cities. Then, when we analyzed the full dataset, we followed the well-defined protocols established by the preliminary work.

A critical first step in any analysis is the estimation of TLC. Only a subset of a city’s data are pertinent to calculating city-wide I, and that subset is defined by the calculated value of TLC because I is expected to be constant only at Ta < TLC. We, in fact, calculated city-wide I using the subset of data for Ta < TLC − 1 SE of TLC (see “Materials and methods”). This approach was adopted to minimize the odds of unintentionally including data from the TNZ in the data subset used to calculate I.

In both the present report and Hill et al. (2013), we have used procedures for segmented regression to estimate TLC. These procedures define breakpoints in a city’s energy-Ta data. Hill et al. (2013) used SAS (version 9.2) to estimate TLC and TUC simultaneously, and the procedure behaved in a nuanced manner, rarely producing a result when the human eye suggested there was no breakpoint. Here we used the Segmented package in R, which behaved differently. Comparing the two procedures, both of which are iterative, Segmented is notably more deterministic; instructed to find one breakpoint, it nearly always finds one, even in cases where the human eye can barely perceive a breakpoint; instructed to find two breakpoints, it finds two. We used separate analyses to identify TLC and TUC (see “Materials and methods”) as a way to minimize the identification of nonsense breakpoints. Despite the differences between SAS and Segmented, the two programs were impressively similar in the TLCs they identified for the six cities studied in Hill et al. (2013) that have been re-analyzed for inclusion in this study. The estimates of TLC by SAS and Segmented differed by ≤ 0.6 °C for four of the six cities. For Timmins, whereas SAS estimated 11.9 °C, Segmented estimated 10.2; and for Key West, whereas SAS found no breakpoint, Segmented—true to its tendency to always find something—estimated a value.

Comparing the point-by-point and slope methods we used to extract insulation information from the energy-Ta plots, the point-by-point method rests on a particularly solid theoretical foundation. It is based directly on the defining equation for calculation of insulation, Eq. (1), which in the present context takes the form I = (TLSTa)/Hcity. The method has two limitations: (1) TLS is unknown, although it is constrained by normative thermostat-setting behavior (Huchuk et al. 2018), and (2) something must be done to take account of differences in P among cities. Analysis of covariance would be the ideal method for taking account of P (Packard and Boardman 1988), but—as we learned—the data may fail to meet assumptions of covariance analysis. We calculated the average P-normalized city-wide I for each city: a procedure entailing the use of Hcity/P, rather than Hcity, in the denominator of Eq. (1). We set TLS to three estimated values, and the results of the three resultant analyses are reasonably convergent—pointing to a statistically significant inverse relation between I and T10-year (Table 3). For future research using the point-by-point method, the most pressing methodological concerns are to better understand strategies for addressing the two limitations.

Population size, P, may not be the only useful variable for taking city size into account. A logical alternative would be to use the number of households in the city. The number of households can be estimated as the number of electric utility accounts, and in fact, Hill et al. (2013) used that approach. However, self-reports by electric utilities are the only way of knowing the number of electric accounts, and we have discovered in doing the present research that electric utilities sometimes report implausible numbers of accounts. The US Census Bureau (Lofquist et al. 2012) states that the average number of people per household in the United States is 2.6. We, therefore, expect P to be 2.6 times the number of electric accounts. In our full dataset of 42 cities for this report, P averages only 2.0 times the number of electric accounts, and part of the reason is that the electric utilities of five cities implausibly reported numbers of accounts close to population size, even when asked a second time. Our present view is that P (obtained from government census) is probably, in general, the more reliable measure of city size for purposes of energy analysis.

The slope method for extracting insulation information from energy-Ta plots is more indirect than the point-by-point method because, rather than being based directly on the definition of insulation [Eq. (1)], it is based on the standard model of city energy use (Fig. 2) (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). To address the potential confounding effects of P, we used a unitless slope method by expressing the slope as a ratio of the calculated energy values at two predetermined Tas. The use of a unitless slope method has a long history. In many ways, the unitless slope method we use is a direct descendant of the method that Scholander et al. (1950a) employed in their much-cited classic paper that started the modern comparative study of thermoregulatory energetics in mammals. For each species in their study, Scholander et al. expressed metabolic rates at various Tas as multiples of the species’ RMR or BMR (they used both “resting” and “basal” to refer to this rate), thereby expressing heat production in a unitless way. Scholander et al. then demonstrated the unique usefulness of the unitless approach for comparing species of widely different body sizes: their unitless method allowed them to define major trends in body insulation across species and environments. Our unitless measure, although different from that of Scholander et al., shares some of the same virtues. For our purposes in this paper, the unitless measure has three signal advantages. First, the estimation of I as the ratio of two regression-calculated values of Hcity removes the simple, direct effect of city size (P), permitting cities of many different sizes to be compared straightforwardly. Second, as a corollary, the ratio remains the same regardless of how Hcity might be normalized for city size; whether city size is measured using P or the number of utility accounts, the ratio stays the same. Third, recognizing that the present report is intended to be read by non-specialists in thermoregulatory energetics as well as specialists, the unitless measure Hcity(-20) /Hcity(10) has straightforward meaning for any reader (unlike the units of measure for slope; see Table 3).

A limitation of the unitless slope method is that the Tas in numerator and denominator must be defined. We chose Ta = 10 °C in the denominator because 10 °C is a value that, although relatively high, is below the TLC of 38 of the 40 cities that have a defined TLC (Table 2). We chose Ta = − 20 °C in the numerator because it is a relatively low but not extreme value, considering North America as a whole. For each city the estimates of Hcity in numerator and denominator were calculated from the city regression equation (Supplementary Table 1). Thus, the calculated ratio reflects all the data available for the city, and a city did not actually need to experience a Ta as low as − 20 °C for the ratio to be calculated. These are assets of the method. On the other hand, the calculations (by not taking within-city variance directly into account) lose information, and they provide just a single value for each city, thereby reducing the sample size for statistical analysis to equal the number of cities.

The thermoneutral zone (TNZ)

Hill et al. (2013), in their analysis of six cities, noted a marked tendency for RMR/person (Hcity/P in the TNZ) to be a function of thermal climate, cold-climate cities having higher size-normalized values than warm-climate cities. Now that we have data on 42 cities (see Supplementary Table 2), it seems clear that that apparent trend was an artifact of the small sample size. We find no evidence for a relationship between RMR/P and T10-year.

In a mammal, the TNZ represents a range of variable insulation. Specifically, the TNZ is a range of Ta in which body insulation increases as Ta decreases between TUC and TLC (Scholander et al. 1950a; McNab 2002; Hill et al. 2013, 2016). The analogy between the TNZ in a city and that in a mammal is complicated and needs to be examined in more detail than heretofore. Here we simply offer brief thoughts on the concept of variable city-wide I. Buildings—lacking a mechanism like piloerection (fluffing out of the pelage)—would seem to be far less poised to vary their insulation than a mammal. Nonetheless, two mechanisms exist by which city-wide I might, in fact, be variable in the TNZ. First, people in traditional houses do things like close the windows to an increasing extent as Ta decreases and often ultimately add extra window protection (e.g., storm windows). Second, people vary their personal clothing insulation (e.g. switch tee-shirts for sweaters). One way to think of the TNZ for a city is that it is the range of outdoor temperatures in which people in their dwellings feel no need to use energy-demanding devices to feel comfortable. This formulation helps emphasize that human behavior is an important factor (Rickwood et al. 2008).

SAS and Segmented were impressively similar in the TUCs they identified for the six cities studied in Hill et al. (2013) that have been re-analyzed for inclusion in this study. Neither program identified a TUC for Flagstaff or Timmins. For Ames, Dothan, and Kissimmee, the programs agreed to within 0.5 °C. Whereas SAS estimated the TUC of Key West as 21.4 °C, Segmented estimated 22.3 °C.

Future directions

The premise of this research is that city-wide insulation—an important property for energy analysis—need not be measured in a reductionistic fashion, house by house. Instead, more-efficient methods might be developed to treat the entire city as the unit of measure for quantifying city-wide I. Hill et al. (2013) demonstrated that the energy-Ta plot of an individual city (e.g., Fig. 1B) can be constructed from data routinely gathered by weather stations and energy utilities. The energy-Ta plot contains information on city-wide insulation. In this paper, we have taken the first steps to develop methods for extracting and applying this insulation information.

We have found that variance is high. Considering any large set of cities, intrinsic differences in relevant city properties undoubtedly contribute to this high variance. Artifacts introduced by the nature of the data available probably also contribute, recognizing that protocols for recording, consolidating, and archiving utility energy data are not highly standardized. The high variance is an impediment to quantifying trends, and for overcoming this impediment, the most obvious next step will be to greatly increase the number of cities investigated.

This study has been self-funded by the authors. As we gathered data from cities, we were in effect acting as private citizens, cold-calling energy utilities to ask for their data. Thus it is not surprising that electric and natural-gas utilities often declined to participate. Hopefully, funding by government agencies will become possible in the future. Utilities might be more willing to provide data when asked under the aegis of a government-funded project, and increased funding would enlarge numbers of personnel available to process the data. In these ways, the number of cities investigated could be increased dramatically, helping to clarify trends despite high variance.

Another important goal that might be realized with more formalized authority and increased personnel would be the analysis of utility substation data within cities to estimate insulation in residential and commercial districts separately. City variance in corporate energy use is undoubtedly a significant contributing factor to city variance in measured insulation. If substation data could be acquired and analyzed separately, it might prove possible to compare cities on the basis of residential districts alone, providing less-confounded, more actionable results.

With the enhancements noted, the study of city-wide I—using the city as the unit of measure—might make important contributions to the ongoing project of understanding and managing energy efficiency in the built environment as a potential target of climate-change mitigation (Wilkinson et al. 2007; Rickwood et al. 2008). This research provides the first, tentative insight into how city wide I varies with thermal climate in North America, and it suggests more can be done in enhancing I to reduce the increased energy usage, and CO2 production, incurred for living-space heating in cold climates.