Introduction

Research suggests an association between the physical configuration of the street network and the spatial distribution of crime. Typically, more (less) accessible streets, for which urban movement and public awareness of them will be highest (lowest), are found to experience more (less) crime (e.g., Bevis and Nutter 1977). However, previous studies concerned with how the configuration of the street network influences crime pattern formation have mainly focused on property crime (particularly residential burglary), largely neglecting the study of violent crime, which limits the generalizability of the findings. Prior research has also varied in terms of methodological sophistication, often using simple indices of accessibility (Bevis and Nutter 1977) or using solely descriptive statistics or correlation analyses to test theory (Alford 1996; Hillier and Shu 2000), which limits statistical conclusion validity. The influence of other important characteristics of the urban environment, such as variation in land uses that feature in most people’s routine activities, have also been ignored in most studies of this kind. Since routine activity nodes affect urban flows and the places with which people will become familiar, their exclusion in previous studies raises the possibility that criminogenic effects attributed to the street network might be spurious.

The current research extends the existing literature in three ways. First, we estimate the association between characteristics of the street network—measured using space syntax, a systematic methodology developed in the field of urban studies—and crime, whilst simultaneously considering the influence of other features of the urban environment. Second, unlike many previous studies, we employ an appropriate statistical framework to test hypotheses, and to estimate the influence of unobserved factors. And, third, we do so for a type of crime rarely studied within this context: outdoor serious violence (OSV). This type of offending refers to violent offenses where the victim(s) sustained lethal or near-lethal injuries (to include murders, attempted murders, as well as serious robberies, sexual attacks, and aggravated assaults) that took place in an open, public space. Such crimes are often the result of an expressive act and consequently it is possible that the configuration of the urban environment will have little part to play in its distribution. However, as with other types of crime, there are good reasons, articulated below, to expect the distribution of incidents of OSV to be influenced by the configuration of the street network. The research reported here thus extends criminological understanding of the influence of urban form on crime risk and does so using a method imported from another discipline.

The paper is organized as follows: in the next section we discuss why the street network would be expected to influence the clustering of crime from the perspective of environmental criminology. After reviewing the relevant empirical literature, we consider the related issue of how variation in land use might be expected to affect the likelihood of crime occurrence. A series of hypotheses are formulated and subsequently tested using a sample of data from one UK police force area. A brief description of the space syntax methodology, used to quantify the characteristics of the street network, is provided before presenting empirical analyses. We conclude the paper with a discussion of the findings and what they mean for criminological theory and crime prevention.

Crime Pattern Theory and Offender Collective Familiarity Surfaces: How Streets Might Shape Crime Risk

Crime pattern theory (Brantingham and Brantingham 1984, 1993) suggests that rather than actively searching for crime opportunities in locations that are unfamiliar, offenders commit offenses in areas that are already known to them. Like everyone else, offenders have routine activities (Cohen and Felson 1979) and, as a consequence of engaging in these, visit some places frequently (e.g., their home, places of work and recreation) and develop an awareness of them, the places that surround them, and the paths they must travel to move between them (Brantingham and Brantingham 1993). Offenders are predicted to offend at locations encapsulated by their activity spaces that furnish exploitable criminal opportunities. Some aspects of an offender’s awareness space may be highly idiosyncratic. However, since the street network shapes the possible paths from one location to another, there will be particular street segments that, purely based on their position within the street network, are more accessible and thus more likely to be part of any offender’s activity space (e.g., Beavon et al. 1994).

The implication of this is that, all else equal, street segments which are more accessible and/or likely to lie in people’s everyday paths will be expected to be more familiar to everyone and, consequently, more likely to be frequented and targeted by offenders. Therefore, from a crime pattern theory perspective, more crime would be expected to occur on (or near to) more “central” (to use the terminology of graph theory) street segments. Considering different types of crime, this should apply to both spontaneous and predatory violence. To explain, many predatory offenders (e.g., stranger rapists) are likely to choose a crime location in advance, based on factors such as typical levels of formal and informal surveillance, likely escape routes, and so on—factors that would influence their perceived risk of apprehension. Such an assessment, however, would not be possible for locations they do not frequent or are unfamiliar with. For such streets there would be greater uncertainty about the risks involved which—according to crime pattern theory—would reduce the likelihood that such locations would be selected, or even considered (Brantingham and Brantingham 1993; Cornish and Clarke 1986). In contrast, spontaneous or unplanned violence, by definition, relates to a crime location that is not chosen a priori. However, also implied in the term is that the crime occurs while the offender is engaged in everyday non-criminal routine activities (e.g., walking home after having gone out for a drink), in places that are typically familiar to the offender. Familiarity in this case is assumed because individuals have been shown to spend most of their time in or around just a few locations, regardless of the total number of locations they visit or the distances they typically travel (Song et al. 2010). Therefore, according to crime pattern theory, one would expect both predatory (planned) and spontaneous (unplanned) violence to occur in places the offender is familiar with.

Most studies that have examined the influence of the street network on the spatial distribution of crime at a micro level have focused either on residential burglary (e.g., Armitage 2004, 2006, 2007; Armitage and Smithson 2007; Bevis and Nutter 1977; Brantingham and Brantingham 1984; Chang 2011; Hillier 1988; Johnson and Bowers 2010; Reis and Rosa 2012; Shu 2000, 2009; Shu and Huang 2003; Ward et al. 2014; White 1990; Yang 2006; Young 1999; Young et al. 2003; Zaki and Abdullah 2012), or a combination of residential burglary and other forms of property crime such as willful damage (e.g., Beavon et al. 1994), vehicle crime (e.g., Hillier and Shu 2000; López and van Nes 2007; van Nes and López 2010), and theft (Lee et al. 2007). Some authors have included violent crime in the data analyzed, but with a few exceptions (see below) failed to disaggregate by crime type (e.g., Armitage et al. 2011; Dhiman 2006; Fanek 1997; Greenberg and Rohe 1984; Jones and Fanek 1997; Long and Baran 2006; Nubani and Wineman 2005). In general, at least in the case of residential burglary, the evidence suggests that crime is more likely to occur on more accessible street segments. However, residential burglary is a crime of stealth with rather different precipitators to OSV offenses, and so notwithstanding the above discussion, patterns for the two types of crime might plausibly differ.

In terms of how the configuration of the street network might influence the spatial distribution of OSV, what we currently know comes from a small number of studies that have employed the space syntax methodology. In the main, space syntax studies of violent crime have focused on street robbery. Such studies are of clear relevance to the current research, but the outcomes of such offenses will (by definition) be less serious than those considered in the current study. These studies are reviewed below but, before doing so, we provide a brief introduction to the space syntax methodology—used to quantify the characteristics of the street network—for those unfamiliar with the approach (a more detailed exposition of the full procedures of space syntax is beyond the scope of this paper; interested readers are referred to Bafna 2003, Hillier 2007, Hillier and Hanson 1984, and Summers and Johnson 2015).

Space Syntax

The overall aim of the space syntax methodology is to estimate how urban movement varies across the street network for both pedestrian and vehicular journeys. It employs techniques based on the mathematical field of graph theory, which involve an analysis of the street network, to generate a series of street network accessibility and usage measures at the street segment level. The most commonly used space syntax measure is integration, which measures accessibility or mathematical closeness centrality to estimate (pedestrian or vehicular) to-movement or destination potential. The integration value of a street segment is based on the mean depth or topological distance (the number of street segments that must be traveled) to all other segments in the network. Integration is calculated using shortest path algorithms and is adjusted for factors such as the size and shape of the network. It estimates how likely particular street segments are to be the destination for those using the network, based on the position of a street segment in the network. A second frequently reported accessibility measure is connectivity, which represents the number of street segments a given segment intersects. This is analogous to the order of a vertex in graph theory and it is this metric that has most commonly been used in studies of environmental criminology (e.g., Beavon et al. 1994; Johnson and Bowers 2010). Finally, choice is a measure of through-movement potential or mathematical betweenness centrality. This is derived by identifying the shortest paths for all possible origin–destination pairs in the network, and then counting how frequently each street segment features in these. While there is only one possible connectivity value for a given street segment, integration and choice may be calculated at varying radii (which determine which origin–destination pairs are considered in the analysis), to approximate movement at different scales (e.g., to reflect pedestrian vs. vehicular movement). Both integration and choice have been shown to be highly correlated with pedestrian and vehicular flows (e.g., Hillier and Iida 2005).

The space syntax methodology differs from the graph theoretical metrics on which it is based in that it accounts for what is known about human wayfinding (Peponis et al. 1990). For example, when calculating the shortest paths between two locations, purely graph theoretical approaches consider only the network metric (or topological) distance between them. In contrast, drawing on the fact that when completing everyday journeys people prefer to travel along relatively linear routes, space syntax metrics penalize turns with larger angular deviations from an existing trajectory or path. Based on the same principle, the underlying street network used in studies of space syntax is represented by a graph of solely straight lines (visual lines derived from the existing street network), referred to as axial lines (more details below). This contrasts with other analytical approaches, which tend to employ existing road network databases where some of the street segments may be non-linear.

The main findings from space syntax studies that have examined the influence of the configuration of the street network on the spatial distribution of violent crime are as follows: first, counts of street robbery and violent crime tend to be greatest on segments with high integration and/or choice values, particularly at night (Alford 1996; Farooq 2007; Hillier and Sahbaz 2005; Sahbaz and Hillier 2007). Second, connectivity also appears to be associated with greater robbery counts but this relationship does not always appear as strong as for integration (Hillier and Sahbaz 2009; Nubani 2006; Reis et al. 2007). However, when the length of street segments (which accounts for the opportunity of offenses per unit of street length) is controlled for, inconsistent findings emerge. Some authors report a positive association between accessibility and crime (Baran et al. 2006, 2007), while others state that it is less accessible segments that are associated with greater numbers of robberies per unit of street length (Hillier and Sahbaz 2005).

Some of these inconsistencies may be attributable to the methodological and analytic approaches adopted. For instance, Baran et al. (2006, 2007) performed regression analyses and controlled for street segment length by including this as an independent variable in the model; their results were consistent with crime pattern theory and revealed integration (to-movement) to be positively associated with logged crime counts. In contrast, in their study, Hillier and Sahbaz (2005; also see Hillier and Sahbaz 2012) aggregated the street segments examined into 45 “bands” of similar length and calculated an aggregated number of robberies per street meter for each band, by adding up all crimes within the band and dividing by the total segment length. They reported a negative association between the aggregated number of robberies per street meter and the mean integration value of the band. Aggregating segments based on segment length, however, does not really control for any confounding effects that length may exert, and may also obscure relationships between features of the street network and crime at the segment level. Perhaps more importantly, the approach is problematic in that it ignores the fact that there is an explicit spatial structure to the street network. By aggregating the data as discussed, street segments are assumed to be independent units of analysis, which clearly they are not (Tobler 1970).

As noted above, much of the research concerned with how the street network might shape the spatial distribution of crime has focused on property crime, or analyses have been conducted for aggregated categories of offenses. To test the general utility of the theory, it is important to explicitly examine the patterning of offenses that might not be thought of as being the outcome of routine everyday activity, such as outdoor serious violent offenses. Given the paucity of research concerned with the configuration of the street network that has focused on incidents of OSV, and the variation in research methods employed hitherto, we examine this issue here. Based on crime pattern theory, which states that offenders are more likely to commit crime in places they are aware of, our first hypothesis is that:

H 1

Street segments that facilitate movement to (as estimated by connectivity and integration) or through (choice) the network will be more likely to contain incidents of OSV, as offenders—just like anyone else—will be more likely to be familiar with such segments.

A potential shortcoming of space syntax is the extent to which its measures—which relate to the physical characteristics of places—are confounded with other characteristics of such places, particularly in relation to land use (Batty et al. 1998; Ratti 2004). In other words, if a significantly greater amount of crime takes place on more permeable street segments, is this simply because such segments host more (crime-attracting or -generating) land uses? Research has shown that particular land uses, such as entertainment, commercial and transport-related facilities, are related both to the spatial distribution of crime (e.g., Browning et al. 2010; Kinney et al. 2008; McCord and Ratcliffe 2009; Stucky and Ottensmann 2009; Weisburd et al. 2012, 2014) and the configuration of the street network (Hillier 1996, 2007; Peponis 2004). However, the joint influence of these two factors—land use and street network configuration—on crime has not previously been examined, raising the possibility that the conclusions of previous research might be spurious. Consequently, we examine the role of both influences here.

Land Use

While the street network influences the routes that people take from origins to destinations, it is the location of people’s activity nodes that determines what these origins and destinations will be. Some activity nodes will be particular to an individual, while some will be shared by many. Several non-residential facilities have been found to be associated with the spatial distribution of violent crime (for a recent review, see Groff and Lockwood 2014). The facilities that have received most attention are those where alcohol can be purchased, for consumption either on- or off-site (e.g., Jennings et al. 2014; Pridemore and Grubesic 2013; Ratcliffe 2012; Roncek and Maier 1991). However, the spatial distribution of violent crime has also been found to be associated with commercial and recreational land uses more generally (e.g., Browning et al. 2010; Groff and McCord 2012; Kinney et al. 2008; Kurland et al. 2014), as well as the amount of commercial zoning in an area (Anderson et al. 2013).

As such facilities can both generate or attract crime (Brantingham and Brantingham 1995), they may exert an influence on the likelihood of crime occurrence that exceeds any contribution that might be associated with the influence of the street network on movement flows. Alternatively, and in the extreme, the location of commercial and recreational land uses—rather than the configuration of the street network—might largely explain movement flows. However, Hillier (2007) has suggested that it is the influence of the street network on pedestrian and vehicular movement that shapes the distribution of land uses, rather than the other way round; in his own words (Hillier 2007: 125),

“far from explaining away the relation between grid structure and movement by pointing to the shops, we have explained the location of the shops by pointing to the relation between grid and movement.”

Of course, just because a segment has a high potential of containing certain land uses due to its position within the street network, this does not mean that this is necessarily realized. Therefore, to accurately measure the independent influence of the street network on spatial crime distributions, a land use measure is included in the analyses, and a second hypothesis is formulated, as follows:

H 2

OSV will be more likely to occur on street segments with more commercial land uses after controlling for how accessible they are.

So far, we have considered how estimated movement flows along a particular street segment are expected to affect the level of crime on that street segment. According to crime pattern theory, individuals are thought to become familiar not only with the routes that they travel, but also those areas that are immediately adjacent to them (Brantingham and Brantingham 1982, 1993; also see Groff 2011). Moreover, Angel (1968) suggested that for robbery and other predatory offenses, offenders might favor a “critical intensity zone” that, while being isolated, is situated in close proximity to busy locations, thus yielding a “spillover” effect. One scenario might involve an offender identifying potential (moving) targets within a busy area and following them until they both reach a more isolated location, where the offender perceives the risk of apprehension as acceptable. This would be much more efficient than trying to identify a suitable target from an isolated spot, where the offender might also be more likely to appear conspicuous. In line with this, Wilcox (1973) found that there was an elevated risk of victimization in the alleyways behind shops and the periphery of nearby car parks half an hour after the shops had closed. For these reasons, we hypothesize that the risk of crime will not only be high on street segments with the highest estimated to- and through-movement flows, but also on those that are situated near such segments. That is:

H 3

Street segments in close proximity to the connectivity core (i.e., the segments most directly connected to others), the integration core (i.e., the segments with the highest integration, indicative of high to-movement potential), or the choice core (i.e., the segments with the highest through-movement potential) of the network will be more likely to contain OSV than segments situated further away from these cores.

In the next section, we describe the data used to test the above hypotheses and explain how the space syntax metrics were calculated.

Methodology

Data

Data were analyzed for one of the 32 boroughs in Greater London, England. This particular borough was selected because Metropolitan Police Service (MPS) figures indicated that it was one of the areas most affected by serious violence. It is an outer London borough where the percentage of minority ethnic residents is considerably higher than the Greater London average, but which has largely mainstream rates of unemployment, crime and deprivation as compared to the rest of the city. Its estimated population for 2013 was just under 330,000, with a population density of about 75 residents per hectare (this is higher than the average for Greater London, which is about 55 residents per hectare).Footnote 1

Crime Data

Data were available for crimes recorded by the MPS over the 5-year period 1 April 2002 to 31 March 2007. The offenses considered consisted of all outdoor murders, attempted murders and other forms of violence where the injuries inflicted on the victim(s) were severe. In the first instance, only homicides and attempted murders were considered, as the analyses presented here formed part of a larger research project focusing on the situational determinants of homicide and attempted murder in Greater London. The use of space syntax called for a smaller geographical area but this resulted in a crime count that was too small for meaningful analysis (i.e., 20 murders and 10 attempted murders). For this reason, the sample was expanded to also include other violent incidents where the injuries to the victim(s) were classified as severe, which is the second most serious category on a five-point scale used by London’s MPS (ranging from lethal to no injury). In this way, this research was concerned with lethal and near-lethal violent crime.

Other authors have advocated for the inclusion of incidents of this type when studying homicide. For instance, Gottfredson and Hirschi (1990: 34) stated “the difference between homicide and assault may simply be the intervention of a bystander, the accuracy of a gun, the weight of a frying pan, the speed of an ambulance or the availability of a trauma centre.” In a similar way, Harries (1990: 48) argued “the legal labels ‘homicide’ and ‘assault’ represent essentially similar behaviors differing principally in outcome rather than process.”

The resulting sample consisted of a total of 512 crimes for the 5-year period for which data were available. However, 13 % of these offenses had to be excluded from the analyses, as no geographical coordinates were available for them.Footnote 2 This produced a final sample of 447 offenses. Each offense was assigned to the nearest street segment using a Geographical Information System (GIS; the shortest distance was measured as the perpendicular to anywhere along the line, rather than the distance to the closest vertex). 95 % of offenses occurred on a public pathway (e.g., street, alleyway; see Table 1).

Table 1 Outdoor serious violence offenses recorded by the London Metropolitan Police in the research area between April 2002 and March 2007, by crime and location type

Land Use

To allow for the estimation of the influence of land uses at the segment level, point-level land use data were obtained from UKMap. There were approximately 390,000 land use data points within the research area, of which approximately three quarters were residential. The remaining points were recorded as belonging to one of 12 categories from the National Land Use Database (NLUD) classification system.Footnote 3 A “commercial” land use super-category was created by combining land uses from the “Recreation and leisure” and “Retail distribution and servicing” categories, which amounted to 4.2 % of all land uses. It was not possible to determine the exact nature of each of these land use points (e.g., grocery store, restaurant) from the available data; the “Recreation and leisure” category consists of amenity, amusement and show places, libraries, museum and galleries, as well as land and water sport places, while the “Retail distribution and servicing” category includes retail distribution places and retail centers. Although the two types of land uses are different in their nature, it was assumed that both types would increase the likelihood of people (including offenders) becoming familiar with the street segments on which they were located. To calculate the total number of commercial land uses associated with each segment, each commercial land use point was assigned to the nearest segment using a GIS (again using the perpendicular to anywhere along the segment, rather than the distance to the closest vertex). Only those land uses within 100 m of the nearest segment were included in the analyses (97 % of all commercial land uses; N = 14,351); land use points excluded under this criterion largely consisted of facilities within larger parks and sports complexes that were actually some distance away from the street network (e.g., cafes, running tracks, etc.).

Calculation of Space Syntax Measures

The starting graph was an axial (fewest) straight line map covering the whole Greater London area (N = 99,476 axial lines), which was obtained from Space Syntax Ltd. This graph had been created using an established procedure which can be summarized as follows. First, the continuous open space of the area is delineated; this covers any space that is not occupied by buildings and thus represents all potential movement (for an example, see Fig. 1a). This space is then broken down into a set of fewest and fattest convex spaces. The axial map can then be generated by identifying “the least set of straight lines which passes through each convex space and makes all axial links” (Hillier and Hanson 1984: 92; see Fig. 1b). The last part of the definition simply indicates that every line should be connected to every other line in the network (i.e., the network is continuous).

Fig. 1
figure 1

Process of generating a street segment network in space syntax: a the continuous open space of the relevant area is delineated and broken down into convex spaces; b an axial map is created by identifying the smallest connected network of straight lines passing through such convex spaces; c axial lines are split at intersections to create a street segment map; and d each street segment becomes a vertex with edges representing its direct links to other vertices

Although axial analyses can be performed at this stage, it is common to break down the network further and create a segment map. This is done by sectioning all axial lines—which represent lines of visibility—at the points they intersect others (see Fig. 1c). The resulting street segment map can then be converted into a graph by treating the segments as nodes and connecting those that intersect each other (see Fig. 1d). Once the street segment network graph is generated, syntactic analyses may be performed.

All street segments within the research area and a 3-km buffer around it were considered in the first instance. This buffer was applied to avoid an “edge effect” in the calculation of the integration and choice measures (for the segments in the research area) that can arise with the use of artificially imposed boundaries. The 3-km limit was chosen as the maximum radius as research suggests that this encompasses most pedestrian and routine vehicular journeys in London (e.g., Hillier and Iida 2005).

Space syntax measures were calculated using Depthmap (see Turner 2004). Following Hillier (1996, 2007, 2012), integration and choice were calculated using a radius of 800 m (i.e., all origin–destination journeys used to compute the space syntax metrics for pedestrian movement were equal to or less then 800 m) to estimate local movement (akin to pedestrian flows), and a radius of 3 km to estimate global movement (akin to vehicular flows). In line with most previous space syntax studies, a decision was made to consider both local and global measures, as they are thought to represent different levels of movement and, consequently, spatial awareness.

Shortest path algorithms were used to calculate the integration and choice measures, which can be applied using either angular deviation, metric distance, or the number of topological steps. Angular analysis—which accounts for the fact that people prefer to move in relatively straight lines—has been shown to result in more accurate estimates of movement levels when compared to the other two methods (Hillier and Iida 2005). For this reason, all space syntax measures were calculated using this method. Table 2 shows that while the space syntax measures are correlated, they clearly measure different things. In particular, the correlations between the connectivity and other space syntax metrics are quite modest.

Table 2 Spearman’s rank correlations for the space syntax measures (N = 7176 segments)

To determine if any effects of accessibility on OSV extended to nearby segments, the integration, choice and connectivity cores of the network were identified. In line with previous research (Hillier 2007; Hillier et al. 1993), the integration and choice cores of the street network were identified by selecting the highest scoring 10 % of all segments. Segments within 100 m of these cores were selected as being “nearby” to these cores of the network.Footnote 4 In the case of the “connectivity core,” street segments with a connectivity value of six or higher (N = 730, or 10.2 % of all segments) were classified as belonging to the connectivity core.

All 27,549 street segments (located in the research area and the 3-km buffer) were considered when calculating the space syntax measures, but only those street segments within the research area (N = 7176) were included in the statistical analyses. To control for the fact that longer segments are likely to contain more origin and destination points, segment length was included in all analyses that follow.

Results

Most segments in the research area (N = 6806, or 95 % of all segments) contained no crime. This was to be expected considering the large number of segments and the volume of offenses available for analysis. Having said this, a small number of segments contained more than one offense, up to a maximum of five crimes per segment, indicating some degree of spatial concentration (see Table 3). As expected, segments containing no crime (N = 6806; hereafter zero-crime segments) were significantly shorter than those where at least one crime occurred (N = 370; hereafter crime-targeted segments; mean lengths 73.2 and 132.8 m, median 56.5 and 116.4 m, standard deviation 71.3 and 85.7 m, respectively; Mann–Whitney test results U = 1,849,321, Z = 15.21, p < .001).

Table 3 Number of street segments in the research area by the number of incidents of outdoor serious violence offenses (between April 2002 and March 2007) they contained

Analytic Strategy

A series of regression analyses were conducted to estimate the relative influence of the space syntax measures on crime occurrence. As most segments contained no crime, we employed zero-inflated negative binomial models, with the street segment as the unit of analysis. Zero-inflated regression models (Greene 1994; Lambert 1992) provide a means to deal with overdispersion and the existence of excessive zeros. In our case, zero-inflated negative binomial models were employed—as opposed to zero-inflated Poisson models—because the variance of our dependent variable (i.e., the number of crimes occurring on street segments; 0.086) was greater than the mean (0.062). Post-estimation tests also confirmed that the models selected provided the best fit to the data.

In the first instance, a series of models were performed, each assessing the influence of a single space syntax measure. After doing this, a full regression model including all space syntax measures was conducted, so that the relative influence of such measures could be determined.

The data analyzed have an explicit spatial structure and it is important to account for any effect this may have that is separate from the network structure already specified in the models. Failing to do so would potentially violate the assumption of independence common to most statistical tests. In the current case this is important because statistical models usually omit some variables that might explain observed patterns, and some of these variables may exhibit a spatial pattern such that locations that are near to each other will be more likely to share similar characteristics on important factors than those that are far apart. A number of approaches can be taken to estimate the influence of unobserved variables that vary spatially in statistical models. Here, we use a spatial lag (see Anselin and Bera 1998) calculated for the dependent variable as follows:

$$ SLag_{i} = \sum\limits_{j = 1}^{N} {\frac{{c_{j} }}{{d_{ij}^{{}} }}} $$

where i represents the segment for which the spatial lag value is being calculated; j represents all other segments; N is the total number of segments; c j is the number of crimes on segment j; and d ij is the distance (in meters) between the centroids of segments i and j.Footnote 5

Assessing the Influence of Single Space Syntax Measures

For all zero-inflated negative binomial models reported here and in the next section, the unit of analysis was the street segment (with N = 7176) and the dependent variable was the number of crimes on the segment. Unless otherwise stated, diagnostic tests indicated no evidence of multicollinearity in the analyses that follow. Prior to examining the influence of the street network on crime, a “baseline” model that included only two predictors—the spatial lag variable and the number of commercial land uses in the segment—was estimated (see Table 4). While not included as a predictor variable, street segment length was included in the model as an exposure variable to account for the fact that (more) crime would be more likely on longer street segments. The estimated model yielded a statistically significant McFadden pseudo-ρ2 value of 0.160. Zero-inflated models have two components (for a discussion, see McDowell 2003). The first is used to estimate the probability of excess or structural zeros and is often estimated using a logistic regression (hereafter, the binary component). The second is used to estimate the likelihood of non-excess (or sampling) zeros and non-zero counts, and is typically estimated using a Poisson or—as is the case here—a negative binomial model (hereafter, the count component). Both independent variables had statistically significant coefficients indicative of a positive association with crime. This was true for both the binary and count components of the model. Considering the values themselves, an increase of one commercial unit was associated with an additional 0.014 crimes per unit of street length (the unstandardized coefficient was 0.0142 in the count part of the model) and a 15.4 % reduction in the odds of a segment being “immune” from crime (the unstandardized odds ratio from the binary part of the model was 0.846).

Table 4 Standardized incidence rate ratios (sIRR) and odds ratios (sOR) of the baseline zero-inflated negative binomial model with segment length as exposure variable (N = 7176)

In the case of the spatial lag variable, crime was more likely on street segments that were closer to other street segments on which (more) crimes occurred. Considering this variable in a little more detail, the spatial lag of a given street segment is influenced both by the number of crimes on other segments as well as how far away these segments are. In this way, a unit increase in the spatial lag variable may result either from, say, an additional 70 crimes on a segment that is 70 m away (the typical distance between adjacent street segments in 100-m blocks), or an additional 700 crimes in a segment that is 700 m away (because the spatial lag calculation involves dividing the total number of crimes on a segment by the distance that separates it for the segment for which the spatial lag is being calculated). Put differently, an additional single crime in an adjacent street segment (assuming this was 70 m away) would be associated with an increase of 0.014 in the spatial lag value (i.e., 1 divided by 70). According to the model estimates, a unit increase in the spatial lag variable was associated with a 0.0033 increase in the crime count per unit of street length (0.0033 was the unstandardized coefficient for the spatial lag variable in the count component of the model) and a decrease of 1.4 % in the odds of a segment being “immune” to crime (the unstandardized odds ratio from the binary part of the model was 0.986). Based on this, an additional crime on an adjacent street segment 70 m away would be associated with (an increase of 0.014 in the spatial lag variable and) a decrease of 0.02 % in the odds of a segment being “immune” to crime, and an additional 0.00005 crimes per unit of street length. Table 4 shows the parameter estimates as standardized coefficients to allow the relative contributions of the two variables to be more easily compared.

Table 5 shows the results from the five separate models conducted to examine the influence of each of the space syntax variables independently (see Table 5). For the index of connectivity, neither the incidence rate ratio (IRR) for the count part of the model, or the odds ratio (OR) in the binary part were statistically significant. In contrast, for the remaining four models, statistically significant effects were observed either for the predicted count of crime (for the local integration, local choice, and global choice measures), or for the odds of a street segment being “immune” from crime (for global integration). All were in line with expectation and essentially suggest that crime was more likely on street segments for which movement to or through was higher. As shown, this was the case after accounting for the influence of the control variables (the spatial lag variable and the number of commercial land uses).

Table 5 Standardized incidence rate ratios (sIRR) and odds ratios (sOR) of the full zero-inflated negative binomial model with segment length as exposure variable (N = 7176), with separate models for each space syntax measure

As discussed, Tables 4 and 5 show standardized coefficients. In contrast to the raw IRRs and ORs, these allow the relative influence of the independent variables included in each model to be directly compared. As expected, all else equal, there was a statistically significant and positive association between the location of commercial land uses and crime occurrence. Where the space syntax measures were significantly associated with the predicted crime count, however, it seemed that the influence of these variables was greater than the effect of the commercial land uses variable (e.g., for local integration, the standardized IRR or sIRR was 1.30, while the land use variable yielded a 1.11 sIRR value in this model). However, the number of commercial land uses did appear to be a better predictor of immunity from crime in that the estimated ORs were consistently significant, and the effects stronger in every case.

Analysis of the influence of proximity to the “movement cores” of the network yielded non-significant coefficients for the space syntax variables in most cases. The exception was for local integration, where an increase in this variable was found to be associated with a decrease in the odds of the segment being immune from crime. The changes to the McFadden pseudo-ρ2 values (as compared to the baseline model) were modest.

Assessing the Relative Influence of Space Syntax Measures

After estimating the influence of each space syntax measure one at a time, we conducted an analysis that simultaneously considered all of the independent variables to determine their relative contributions. Two of the measures (local and global integration) led to high variance inflation factor (VIF) values (i.e., >10), which are indicative of multicollinearity problems. After excluding these two variables, all VIFs were <10. The results of this final model are shown in Table 6.

Table 6 Standardized incidence rate ratios (sIRR) and odds ratios (sOR) of the full zero-inflated negative binomial model with segment length as exposure variable (N = 7176)

For this model, the number of commercial land uses on a segment and the spatial lag were again statistically significant and positively related to the number of crimes. In addition, an increase in the spatial lag—but not the number of commercial land uses—was associated with a decrease in the odds of a segment being immune from crime. Having accounted for these influences, significant increases in the predicted crime count were also significantly associated with increases in global choice (through-movement), as predicted. Local integration (to-movement) and local choice (through-movement at a smaller scale) yielded significant coefficients in the single-measure models reported earlier, but failed to reach significance in the full model (see Fig. 2 for an illustration of how global choice relates to the spatial distribution of OSV). Also contrary to hypotheses, connectivity was significantly and negatively associated with predicted crime counts, so that an increase in connectivity was related to a decrease in crime. No space syntax measures yielded significant ORs in the binary part of the model.

Fig. 2
figure 2

Section of research area displaying the spatial distribution of crimes against street segments, colored based on the segments’ global choice values. N.B.: Classification based on natural breaks

The criminogenic effect of global choice appeared to extend to nearby segments. This is apparent in the statistically significant and positive sIRR for the variable that indicated proximity to those segments situated within 100 m of the global choice core (i.e., the 10 % of segments with the highest global choice values; sIRR = 1.20, p < .05). Being situated within 100 m of the local integration core was also found to be associated with a significant reduction of the odds of being immune from crime (sOR = 0.52, p < .05). None of the other core proximity variables yielded significant coefficients. The model, as presented in Table 6, yielded a pseudo-ρ2 value of 0.179, which was the highest of all those tested.Footnote 6

Summary and Discussion

The results presented here provide mixed support for the hypotheses formulated. When the space syntax measures were examined one at a time, as hypothesized, predicted crime counts appeared to be greater for segments that had high levels of integration (to-movement potential) and choice (through-movement potential), both at local (which approximates pedestrian movement) and global (which approximates vehicle movement) levels. In the case of connectivity, which provides the most localized estimate of to-movement, the estimated coefficients were non-significant but, if anything, and in contrast to expectation, suggested that crime was less likely on street segments for which local to-movement would be high. In the full model, this effect was statistically significant. Due to problems of multicollinearity, it was not possible to examine the unique influence of every space syntax variable simultaneously. However, the analysis conducted indicated that at the global scale, through-movement was positively associated with crime risk and that this effect extended to nearby street segments. At the local scale, the direction of the estimated effect of through-movement was consistent with the previous analysis, but was non-significant after accounting for the influence of the other variables.

As noted, the independent contributions of local and global integration could not be evaluated in the full model (due to multicollinearity problems), so these results are harder to interpret. However, assuming the positive associations between the integration measures and crime occurrence had remained in the full model, a scenario would emerge where an increase in accessibility at the most local level—as estimated through connectivity—would be associated with a reduction in the likelihood of victimization, while an increase in accessibility in relation to local (pedestrian) and global (vehicular) journeys—as estimated by the integration measures—would have the opposite effect.

Why the different measures of accessibility might have opposing effects will be for future research to establish, but we offer one interpretation here. Before discussing this, it is important to consider how the different measures are constructed. The index of connectivity is calculated by counting how many other segments each street segment is directly connected to. Thus, no consideration is given to those segments to which a segment is not directly connected. This ignores the position and influence of a segment in the wider network within which it is situated. In contrast, the other measures of space syntax (i.e., integration and choice) do take account of the position of a street segment within the wider network. This difference is partly illustrated by the fact that the correlations between connectivity and the other space syntax measures were not as strong as those among the other measures (see Table 2, presented earlier).

Thus, it is possible that connectivity and the space syntax measures of integration and choice measure rather different things. Hillier and Iida (2005) have shown both integration and choice to be highly correlated with pedestrian and vehicular flows, but no such data are available relating to connectivity. One possibility is that since connectivity is not strongly correlated with the other space syntax measures, it may not represent a good estimate of the general movement of people through or to a street segment (note how differently the segments with high values of global choice and connectivity are distributed among the network, in Figs. 2 and 3, respectively). Instead, being a very local indicator of accessibility, it may represent a proxy for the ambient movement of “locals” within an area—those who perhaps have the most incentive to act as guardians against crime in their neighborhood (see Jacobs 1961; Newman 1972). In contrast, the more global estimates of integration and choice—which consider the wider network in their computation—may better identify the places which larger volumes of people (including offenders) from across the wider city will frequent and become aware of. This increase in usage and awareness may increase opportunities for crime without generating a complementary increase in active guardianship. Further research might aim to test such hypotheses.

Fig. 3
figure 3

Section of research area displaying the connectivity values of the street segments against the spatial distribution of crimes

As stated in the introduction, separate bodies of literature have considered the influence of either the street network or land use in isolation, but no previous studies have tested their joint contribution. Since one factor may mediate or even explain the role of the other, in our view one of the most important contributions of this study was the simultaneous modelling of these two aspects of urban form. Our analyses suggest that both factors have a part to play. Of particular interest is the finding that—having accounted for the distribution of commercial land uses—incidents of OSV were more likely to be found on or nearby (within 100 m) to those street segments for which global through-movement was high. This is consistent with the findings of Angel (1968) and Wilcox (1973), who found the streets and alleys behind or near busy roads to be at greater risk of crime, and suggests a spillover effect of the influence of the street network on crime.

The current findings inform criminological understanding and also have implications for crime prevention. For example, one recommendation is that interventions (e.g., police patrols or CCTV cameras) intended to suppress OSV should focus not only on (more accessible and/or traveled) main streets, but also on those that are adjacent to or within close proximity of them, particularly those that contain commercial land uses.

This study was not without limitations. For instance, due to our focus being on a low-incidence crime type, the small sample size did not allow for disaggregation by time of the day or by day of the week. Such disaggregation would enable the differentiation of spatial awareness (the focus of this paper, which crime pattern theory is concerned with) and space–time-specific crime opportunities, which result from the routine activities of individuals (Cohen and Felson 1979). Previous research has shown ambient population levels to vary over time, with pedestrian flows peaking at around 1 pm and dipping at 4 am, and with weekdays tending to be busier than weekends (Aultman-Hall et al. 2009). For this reason, it is possible that the relationships observed between the configuration of the street network and crime occurrence may vary in strength (and significance) at different times of the day (see also Johnson and Bowers 2013). In the same way, movement flows driven by commercial land uses may also be time-dependent, as different businesses will be open at different times of the day. The land use data examined here was not sufficiently detailed to allow us to disaggregate on this basis, but future researchers with access to better land use data and crime data from higher-crime areas may be able to conduct such analyses.

Another limitation of the current research is that data for other variables that may influence the likelihood of crime occurrence were not available for analysis, and hence not explicitly included in the models (though we attempted to estimate their influence using a spatial lag term). For example, in a series of studies that involved the collection of rich data at the street segment level, Weisburd et al. (2012) found that factors such as ethnic heterogeneity and other indicators of social disorganization were associated with crime risk (see also Weisburd et al. 2014). These latter studies, however, did not examine the influence of the street network in the way described here, and hence future studies might aim to combine these two approaches to provide a still more complete understanding of what factors influence crime risk at the street segment level, and if and how these factors interact.

Neighborhood-level variables and their effect on OSV might also be examined in the future, using a hierarchical modeling framework. Some authors have argued this approach better approximates the spatial decision making process of offenders, which is thought to consist of “a series of spatially nested choices” (Taylor and Gottfredson 1986: 395–396). In this way, an offender may select a neighborhood in the first instance, and then make subsequent decisions as to which street, and eventually individual target, to select. Such an approach has been adopted for the crime of burglary (Davies and Johnson 2015; Johnson and Bowers 2010) but no studies have used such an approach for other forms of crime. This was not possible in the current study as such models are difficult to specify for zero-inflated models and not available in most software packages.

As with any other study, the issue of external validity applies. It is possible that the findings reported are not applicable to other crime types, or locations. The fact that most of the results are largely consistent with previous research is encouraging, but replication will be required before the external validity of the findings can be established. To conclude, the current study suggests that, after controlling for other factors (observed and estimated), both the configuration of the street network and the spatial distribution of commercial land uses systematically influence the placement of OSV. This is an important finding since previous research has either considered only the effect of the street network or that of land use on the spatial distribution of crime, but not both within a single study. Overall, the research provides further support for theories of environmental criminology and, importantly, suggests their utility in explaining the spatial distribution of expressive as well as acquisitive crimes.