# Quantifying the Differences in Geometry and Size Distributions of Buildings Within Cities

- First Online:

DOI: 10.1007/s00004-014-0191-y

- Cite this article as:
- Mohajeri, N. & Gudmundsson, A. Nexus Netw J (2014) 16: 417. doi:10.1007/s00004-014-0191-y

- 2 Citations
- 256 Downloads

## Abstract

There have been many studies on the spatial configuration of cities, but few attempts to quantify the difference in building patterns between the old and new parts of cities. This may be partly attributable to lack of suitable study methods. This paper presents a new application of statistical methods for quantifying the geometric difference between different parts of a city using, as a case study, the old (historical) and new parts of the city of Yazd in Iran. We measured 341 edge lengths of 4 bazaars, 302 edge lengths of 5 mosques and tombs, and 239 edge lengths of 3 schools. We also measured 6,804 edge lengths and the areas of 1,243 well-preserved courtyard houses in the old part and 4,948 edge lengths and the areas of 1,237 houses in the new part of the city. In the old part, all edge-length and house-area frequency distributions, to a first approximation, follow power laws, indicating that there are many small and very few large buildings. By contrast, in the new part the edge-length and house-area frequency distributions follow bimodal (two-peak) distributions. The calculated entropies (measures of dispersion) of the house edge lengths and areas in the old part are much higher than of those in the new part and provide a clear, quantitative measure of the geometric differences between the built-up structures of old and the new parts of the cities. The power-law distributions in the old part indicate a gradual and continuous variation in sizes of edge lengths and house areas, whereas the bimodal distributions in the new part indicate abrupt (discontinuous) changes in the edge lengths and house areas. The entropy results imply that the size distributions of houses in the old part are more dispersed than those in the new part, indicating more interconnected structures in the old part of the city. The results also demonstrate quantitatively that there is a lack of coherence between the structures of old and new parts of city.

### Keywords

Urban design Spatial configuration Building geometries Size distribution Statistical analysis Entropy## Introduction

There have been many studies of the geometric patterns of cities. Some use fractal analysis (e.g., Batty and Longley 1994; Bovill 1996; Salingaros and West 1999; Salingaros 2005; Liang et al. 2013) while others focus on the relation between the geometric patterns and energy (Kasmaii 1983; Steemers et al. 1998; Ratti et al. 2003; Johansson 2006; Kämpf et al. 2009; Fuller and Crawford 2011). City morphology, socio-cultural patterns, and associated landscape architecture (Clark and Costello 1973; Bonine 1979; Kostof 1991; Kheirabadi 2000; Memarian and Brown 2003; Hu 2008; Habib et al. 2012), street patterns (Marshall 2005; Scellato et al. 2006; Mohajeri and Gudmundsson 2012; Gudmundsson and Mohajeri 2013), and size distributions (Batty et al. 2008; Mohajeri et al. 2013) have also been the subject of recent studies.

Yazd is particularly suitable for this kind of study because a large fraction of its old part has maintained its original structure for thousands of years, whereas a large fraction of the new part dates from the last decades. The city has expanded enormously in the past decades: in 1975 its population was about 136,000, whereas in 2009 its population was about 465,000 (Iranian Statistical Centre 2009).

A quantitative study of Yazd is important in that it throws light on the textural development of cities that (a) are of a medium size (population between 100,000 and 500,000), (b) have had very rapid recent urban growth, (c) are located in hot-arid regions, and (d) have new parts that have developed largely independently of, and are geometrically different from, the old (historical) parts. Examples of point (d) include the cities of Tunis in Tunisia, Sana’a in Yemen, and Bukhara in Uzbekistan (http://whc.unesco.org).

While it is commonly clear from aerial images that the new and old parts of a city do not fit smoothly together as regards their geometric patterns, little attempt has been made to quantify the geometric difference between the parts. One reason for so little work in this direction may be the lack of suitable methods for quantifying the textural difference. The principal aim of this paper is to show, using the example of Yazd (Fig. 1a), how geometric differences between parts of a city can be quantified by methods that have not, previously, been used for such a study. The focus is on detailed measurements of the edge lengths and the areas (sizes) of various built structures, so as to detect their underlying order and to compare the geometric patterns of the old part with those of the new part of the city. We select edge lengths and areas for the present study because these are easily and accurately measureable quantities (through AutoCad/GIS), but in a further development of the approach used here other parameters could be used, such as, for example, the heights of the buildings. Using calculated entropies, length ranges (differences between the maximum and the minimum length), and the scaling-exponents of the size distributions, we are able to quantify the geometric patterns of the old and the new parts of the city of Yazd and show how they are fundamentally different.

## Geographical and Historical Background of Yazd

The city of Yazd (31°54′ N, 54°22′ E) is located in the eastern central part of Iran, the heart of the main desert area in the country, in a wide dry valley between the southwestern and northeastern parts of the mountains of Shirkooh (Fig. 1b). The maximum daily summer temperature commonly reaches 40 °C, during which time there is hardly any rain at all. The total annual rainfall is 50–60 mm. The city climate is characterised by hot and very dry summers and cooler and dry winters.

Because of the shortage of surface water, Yazd depends very much on the groundwater channelled through a system of qanats. The system is based on underground tunnels or galleries to extract groundwater and has been described as the greatest contribution by Persians to the science of hydraulics (Lambton 1992; English 1998). Because of the difficult climatic conditions and the shortage of water, agriculture is not of primary importance for present-day Yazd, which is primarily a trading and industrial city.

The historical part of the city is more than 3000 years old and regarded as the second oldest continuously inhabited city in the world after the city of Venice in Italy (http://whc.unesco.org). It is culturally, historically, and architecturally a remarkable city. The present-day city is of medium size, with a population about 465,000 in 2009 (Iranian Statistical Centre 2009).

*ab*-

*anbar*) and the residential quarters around them. The old part has also a city centre which consists of the Jameh Mosque, a tomb, a bazaar, a religious school, a public bath, and residential quarters (Habibi 1998; Tavasoli 1989, 2001; Sharifi and Murayama 2013). The location of the old part within the modern city is shown in Fig. 2.

The courtyard houses, the most common built-up structures of the old part, vary considerably in form and size. These types of houses date back to the earliest settlements in the Middle East (Lampl 1968; Lavas 1974; Bonine 1979; Memarian and Brown 2003). Most houses are single storey (one floor) and are densely clustered along narrow and curved pathways or streets. The traditional courtyard houses are inward looking with a courtyard in the centre (Fig. 3), arranged so that they are side by side and back to back (Tavasoli 1989, 2001; Memarian and Brown 2003). The courtyard houses have thick walls, commonly 400–800 mm thick (Kasmaii 1983; Memarian and Brown 2003; Tavasoli 1989, 2001). Commercial activity was traditionally confined to bazaars, often arranged by trade, with stalls opening directly onto the streets, covered by brick vaults or domes. Open public spaces are mainly associated with mosques, tombs, and schools for the creation of several small squares and the widening of some alleys. The new part of the city, by contrast, has a completely different structure. Buildings are outward looking and the streets, which are designed mainly for vehicles, are wide and follow a grid pattern, thereby forming a regular network (cf. Fig. 2b).

## Data

We analysed the geometric properties of built-up areas in both the old and the new parts of the city using a two-dimensional Auto-CAD/GIS model of Yazd based on datasets from the National Cartographic Centre of Iran (2005). This is a digital model of all the buildings in the selected sample areas, namely an area of 793,734 m^{2} (about 0.80 km^{2}) from the old part and an area of 972,418 m^{2} (about 0.97 km^{2}) from the new part of the city. To digitalise the data we used the Auto-CAD maps of Yazd and converted them into GIS-shape files. The Auto-CAD dataset of Yazd was generated in 2003 by the National Cartographic Centre of Iran using a UTM projection system and WGS-84 datum for the entire urban area of Yazd based on aerial photography at the scale 1:4,000.

The numbers of main structures measured in the historical part of the city are as follows: 4 bazaars, 5 mosques and tombs, and 3 schools. The numbers of edge-lengths measured for these structures are 341 for bazaars, 302 for mosques and tombs, and 239 for schools. In the historical part we measured 6,804 edge-lengths of 1,243 courtyard houses and the same number (1,243) of house areas. In the new part, we measured only ordinary houses, a total of 4,948 edge-lengths of 1,237 houses and the same number (1,237) of house areas.

## Measurement Methods

### Circular Statistics and Rose Diagrams

Circular statistics, which are primarily visualised by the rose diagrams, show the frequencies of data with different trends (Fig. 5). Rose diagrams provide a particularly clear visualisation of the variation in trend in two ways: as complete circles or as half circles (semi-circles). Rose diagrams are used to show the trends of either a certain process, such as the wind directions at a certain locality over a certain period (a wind rose), or the trends of certain lineaments such as fractures or streets (Fisher 1993; Swan and Sandilands 1995; Smith et al. 2009). The present analysis of the edge-length trends uses the program GEOrient (http://www.holcombecoughlinoliver.com).

When analysing lineament trends, both directional and oriented data can be used. In directional data we can distinguish one end of the lineament from the other, or left from the right. This applies, for example, to the flow in a river or a dominating wind direction. Oriented data, by contrast, relate to phenomena without a directional distinction. These data include crustal fractures and streets in a city (Fisher 1993; Swan and Sandilands 1995). Thus, when the data are directional, then the rose diagram shows a unidirectional or asymmetric trend distribution (Fig. 5b). By contrast, when the data are oriented, then the rose diagram shows a bidirectional or symmetrical trend distribution (Fig. 5a). For directional data, the measured data azimuths range from 0° to 360°. For oriented data, however, the opposite directions, 180° apart, are equivalent. Thus, for oriented data, the graphical presentation should either be restricted to a semi-circle (Fig. 5c) or have a rotational symmetry, in which case the opposite classes or sectors in the rose have the same frequency (Swan and Sandilands 1995). In this study, we use oriented data (edge lengths) so that the rose diagrams show bidirectional or symmetric trend distribution on a complete circle (Fig. 5a).

An additional point is that rose diagrams can be constructed using either normalised (weighted) or non-normalised (unweighted) data. The trends of houses are non-normalised when their edge lengths are not considered, in which case short edges and long edges have equal weight in the rose diagram. By contrast, when the edge trend is normalised by the length of the shortest edge, more weight is given to the long edges because they are then considered as composed of many short parts.

### Size Distributions

Here we give a brief overview of normal distribution and power laws and their use in statistical size-distribution analyses. More details can be obtained from numerous books (e.g. Schroeder 1991; Liebovitch 1998; Peitgen et al. 2004; Brown and Liebovitch 2010).

*The normal (Gaussian) distribution.* The normal or bell-shaped distribution is widely regarded as the most important probability distribution in statistics (Fig. 6a; Ebdon 1985; Shaw and Wheeler 1985). In a normal distribution, the horizontal scale along the bottom of the distribution represents values of a variable, for example, the tallness (height) of people (Spiegel and Stephens 2011). The midpoint is the mean (arithmetic average) of the values. For a normal distribution, the data can be described in terms of the mean and the standard deviation (σ), the latter being a measure of spreading. The normal-distribution mean, median (the middle value), and mode (the most common value) all coincide. A mixture of two normal distributions creates a bimodal frequency distribution or curve, with two different modes and means, which shows two distinct peaks or maxima (Spiegel and Stephens 2011).

Although normal distributions are very common, heavy-tailed power-law distributions are increasingly recognised as the fundamental statistical distributions in many physical, biological, social, economic, business and organisational phenomena (Newman 2005; Andriani and McKelvey 2009; Pisarenko and Rodkin 2010). In a normal distribution the variability of the sample data points are similarly distributed and generally clustered around the mean (Fig. 6a). Power-law distributions, by contrast, have many more data points with extreme values than the corresponding normal distribution. They thus exhibit a long or heavy tail (Fig. 6b). Normal distributions have a limited variance and stable means. Therefore, for a normal distribution, the data can be described through a typical value namely, the mean value. Power-law distributions have potentially infinite variance. When extreme events occur they also alter the value of the mean, pulling it towards the tail where the extreme events or object are located. For this reason, power-law means are unstable and this distribution has no typical size to describe the data (in contrast to the mean of a normal distribution). A power-law distribution implies that there are many small objects/processes and very few large ones.

*The power-law size distribution.*As indicated above, power-law size distributions are very common in artificial (man-made) and natural processes and structures (Fig. 6b). The populations of cities, the sizes (intensities) of earthquakes, word frequencies in literature, and the frequencies of family names all give rise to power-law distributions (e.g., Schroeder 1991; Peitgen et al. 2004; Newman 2005, 2010). As mentioned, power-law distributions imply that the number of small events, processes, or objects of a particular type is large in comparison with the number of large events, processes, or objects of the same type. The concept of a fractal, based on self-similarity, where the shapes or geometric patterns do not change when observed at different scales, reveals itself in a power law (Batty and Longley 1994; Bovill 1996; Liang et al. 2013). When applied to a cumulative frequency (probability) distribution, a power law has the form:

In a power law, the straight line is, however, rarely observed over the entire range of the values or sizes of x. More commonly, there are one or more breaks in the log–log plot, yielding two or more straight lines with different slopes over different length ranges (values on the horizontal X-axis of the plot). These slopes are referred to as the scaling exponents of the power laws. When the log–log plot yields two such straight lines, the distribution is referred to as a double power-law (Newman 2005, 2010; Han et al. 2011).

### Entropy Analysis

*k*), which gives the entropy for a general probability distribution as:

*k*is a constant, t is the number of classes or bins that contain edges or house areas in the frequency distribution, that is, the number of bins with nonzero probabilities of edge lengths/house areas, and p

_{i}is the frequency or probability of edges belonging to the i-th bin, that is, the probability of the i-th class or bin.

## Results

### Trend Analysis of Houses

Figure 3 shows the location of the main areas within the old part where the bazaars, mosques, tombs, and schools were measured. The figure also shows the connection of these structures with the surrounding houses through irregular edges and open spaces such as pathways, squares, and yards. The irregular shape of each structure, and the way the structures connect to each other and their surroundings, is partly due to the land divisions and ownerships and partly to the irregularity of the open spaces, mainly pathways (Memarian and Brown 2003).

The dominating trends of houses in the old and new parts are similar, mostly northeast-southwest and northwest-southeast, that is, roughly orthogonal. There is, however, a greater spread in the trend of houses in the old part. This follows because the houses in the new part are associated with a strict grid-like street network, whereas in the old part the houses are associated with a street network which is more varied in geometry. It is likely that the trends reflect partly the climatic conditions, and partly the trends of the streets. It has been suggested that the dominating northeast-southwest trend, in particular, is well suited as regards the division of the houses into summer and winter quarters (Memarian and Brown 2003). Other factors that are likely to affect house trends include the pre-urban boundaries of fields, land divisions, and orchards. These lines (boundaries, division lines), in turn, are determined by the slope of lands and direction of water channels (Bonine 1979; Kostof 1991).

### Size Distributions of Edge-Lengths of Houses

Structures | Number of edges | Scaling exponent (D) | Entropy (S) | Length range | Range |
---|---|---|---|---|---|

Bazaar | 341 | 2.219 | 2.253 | 0.41–48.52 | 48.11 |

A | 275 | 0.824 | 1.799 | 0.41–12 | 11.59 |

B | 66 | 3.783 | 2.207 | 12–48.52 | 36.52 |

Mosque | 302 | 1.673 | 2.617 | 0.31–48.52 | 48.21 |

A | 257 | 0.758 | 2.309 | 0.3–20 | 19.69 |

B | 45 | 5.101 | 2.164 | 20–48.52 | 28.52 |

School | 239 | 1.904 | 2.581 | 0.38–61 | 60.62 |

A | 163 | 0.501 | 1.876 | 0.38–12 | 11.62 |

B | 76 | 3.141 | 2.496 | 12–61 | 49 |

House | 6,804 | 1.989 | 2.622 | 1.5–125 | 123.5 |

A | 3,121 | 0.319 | 1.924 | 1.5–12 | 10.5 |

B | 3,683 | 3.488 | 2.464 | 12–125 | 113 |

The edge-length distributions are consistent with power laws that have different slopes for different edge-length ranges, that is, are double power laws (Figs. 8c, 9b). The breaks in the slopes of the edge-lengths occur at about 12 m for all the structures, except for mosques, where the break is at about 20 m (Table 1). The shallow-sloping straight lines range in scaling exponents (slopes) from 0.319 (houses) to 0.824 (bazaars), whereas the steep-sloping lines range in scaling exponents from 3.141 (schools) to 5.101 (mosques).

Using the scaling exponents, each population can be divided into three (not all mutually exclusive) subpopulations, namely: (1) short edges (with shallow straight-line slopes and low scaling exponents), (2) long edges (with steep straight-line slopes and high scaling exponents), and (3) the whole edge population for the specific structure. In Table 1, whole population is given its specific name (bazaar, mosque, etc.), and the subpopulation of short edges is denoted by *a,* and that of long edges by *b*. For each of these populations, the length ranges and the entropies (Eq. 3) were calculated (Table 1).

The bimodal distribution is easily explained for the edge lengths of the new part. Most of the measurements are from (mainly) rectangular buildings (Fig. 4) where the edge lengths form pairs. This means that the two edges that form the one pair are longer than the two edges that form the other pair. If the buildings are mostly similar in sizes (in square metres), as for the measured new part of the city of Yazd, there will be two peaks for the edge-length size distribution: one corresponding to the longer edges (or sides) of the buildings, the other to the shorter edges (or sides) of the buildings. The results also show that extremely long edges and extremely short edges are rare (Fig. 8). This follows because most of the buildings are similar in size; few are either very small or very large.

The entropy of the edge lengths of houses in the new part, S = 2.234 (Fig. 10), is less than that of the houses in the old part, S = 2.622 (Fig. 9a). The results imply that, despite having a bimodal distribution, the edge lengths of houses in the new part are less dispersed than those in the old part of the city. The results also demonstrate quantitatively that there is a lack of coherence, in other words, a dissimilarity, in texture between the house structures of old and new parts of Yazd.

### Size Distributions of House Areas

The calculated entropies for the area-size distributions show very different results for the new and the old parts. The entropy for the areas of houses in the old part is S = 2.638 (Fig. 11c), whereas that of the areas of houses in the new part is S = 1.629 (Fig. 12b). Clearly, the entropy of the area-size distribution of houses in the old part is much higher than that of houses in the new part. The results fit well with the entropies calculated for the edge-length size distributions, although the entropy difference between the old and new part is larger for the area distribution than for the edge-length distribution.

## Discussion and Conclusions

The old parts of many Iranian cities consist of several basic structures including a mosque and tomb, a bazaar, a square, residential quarters, a religious school, and a public bathhouse (Habibi 1998; Tavasoli 2001; Karimi 2000; Sharifi and Murayama 2013). All these physical structures are multifunctional and have important characteristics such as socio-economic, cultural, environmental, and political (Clark and Costello 1973; Kheirabadi 2000). In particular, they have been, to a large extent, successful in accommodating social and environmental needs (Kasmaii 1983; Tavasoli 2001; Soflaee and Shokouhian 2005; Mobaraki et al. 2012; Sharifi and Murayama 2013). The fine integration and cohesion of the physical structures in the old part of many Iranian cities is one of their most prominent characteristics (Tavasoli 2001; Sharifi and Murayama 2013).

However, the new developments of the city structures often seem to lack the detailed integration and cohesion of the old part—and this lack may be partly the result of modernisation during the last century (Habibi 1998; Karimi 2000; Fanni 2006; Sharifi and Murayama 2013). Many of the old city structures mentioned above lost some of their characteristics as well as their functionality and gradually became parts of much larger cities with poor connection with the new parts. The old and new parts of cities are to a degree almost separated and are poorly connected. In addition, the physical characteristics as well as the social and environmental characteristics of the old and new parts are commonly totally different (Tavasoli 2001; Sharifi and Murayama 2013). In this study we have shown quantitatively some differences between the physical characteristics of the old and the new (most recent) parts, focusing on the bazaar, mosque, school from the old part, and residential quarters in both old and new parts. Other aspects of these differences could be explored in the future when more data is available.

A comparison between the edge-length size distributions in the old part and new parts shows a striking difference. In the old part, the distributions follow power laws for all the structures studied here (bazaars, mosques, and schools) including courtyard houses (Figs. 8, 9). By contrast, the edge lengths of houses in the new part follow a bimodal distribution (Fig. 10), apparently the result of the mixture of two (roughly) normal distributions. Generally, structures in the old part are irregular, that is, have uneven edge lengths (Fig. 4), which makes them highly interconnected and coherent. The high degree of interconnectivity at the small scale is also reflected in the coherence of the overall structure of the old part of the city at a large scale (Figs. 2, 3, 4). The edges in the newer parts are regular and less interconnected in a statistical sense (Fig. 7) than in the old part.

The area-size distributions of the houses in the old and the new part of the city yield similar results to those of the edge-length size distributions. The areas in the old part show power-law size distribution (Fig. 11), whereas those in the new part show bimodal size distribution (Fig. 12).

The calculated entropies of the size-frequency distributions, both for edge lengths and for house areas, also show significant differences between the old and the new parts. For the edge lengths, the entropies are 2.622 (old part) and 2.234 (new part), whereas for the house areas the results are 2.638 (old part) and 1.629 (new part). The edge lengths and house areas in the old part follow power laws (Figs. 8, 9, 11) that yield greater dispersal or spreading, and thus higher entropies (Baierlein 1971; Blundell and Blundell 2006; Kardar 2007), than the bimodal distributions in the new part.

^{2}= 0.552, but R

^{2}= 0.750 when the houses are omitted from the data set (Fig. 13b). We decided to use the latter also because the number of measured houses is so many times greater than those of measured bazaars, mosques, tombs, and schools combined and thus dominates the correlation determination. Generally, the linear correlations mean that as the length range increases, so does the entropy of the population.

The results show a clear geometric and quantified (through edge-length size and house-area size distributions) differences between the old and the new parts of Yazd. These methods, presented here (to the best of our knowledge) for the first time, should be very useful in analysing and quantifying the urban morphology and the geometric patterns in cities in general. They should be particularly useful for studying those cities that have evolved over long periods of time and contain new parts that have evolved largely independently of the older parts.

The present results show that, perhaps in contrast to what we could have expected, the edge lengths and house areas in the old part of the city of Yazd show clear power-law size distributions. By contrast, the edge lengths and house areas in the new part of the city show bimodal size distributions. The power-law distributions indicate a gradual and continuous change in sizes of edge lengths and house areas based in the old part of city, whereas bimodal distributions indicate abrupt (discontinuous) changes in the edge lengths and house areas in the new part. This gradual change in the old part may be regarded as more natural, as indicated by the power-law size distributions of many landscape forms and lineaments (Turcotte 1997). One suggested implication of the results presented here is that the structures in the old part of Yazd are well interconnected, form a coherent whole, and provide a quantitative measure of the clear geometric difference between the old and the new parts of the city.

The present new methods for analysing the geometric differences between new and old parts of cities have, for the case study of Yazd, shown clear textural differences between the old and new part of the city. Since these methods are completely general, they should be applicable to similar analyses worldwide and be particularly suitable for the analysis of fast-growing small to medium-size cities. One development of the methods presented here is to measure different quantities, such as the heights or tallness of the houses, and relate them to the edge lengths and areas of the houses. Also, further expansion of the analysis presented here could, in addition to quantifying the physical differences, include the socio-economic and environmental differences between the old and new parts of cities. These aspects of the present methods might suggest solutions for making the old and new (more recent) parts better integrated and are worth exploring in future studies.

The methods presented here can also be used to make quantitative studies of different cities, in different environments, and relate the results to energy considerations and general building physics. Such a development should provide results of great value for authorities and decision makers on environmentally sustainable policies regarding urban design and planning.