Is Gun Violence Contagious? A Spatiotemporal Test



Existing theories of gun violence predict stable spatial concentrations and contagious diffusion of gun violence into surrounding areas. Recent empirical studies have reported confirmatory evidence of such spatiotemporal diffusion of gun violence. However, existing space/time interaction tests cannot readily distinguish spatiotemporal clustering from spatiotemporal diffusion. This leaves as an open question whether gun violence actually is contagious or merely clusters in space and time. Compounding this problem, gun violence is subject to considerable measurement error with many nonfatal shootings going unreported to police.


Using point process data from an acoustical gunshot locator system and a combination of Bayesian spatiotemporal point process modeling and classical space/time interaction tests, this paper distinguishes between clustered but non-diffusing gun violence and clustered gun violence resulting from diffusion.


This paper demonstrates that contemporary urban gun violence in a metropolitan city does diffuse in space and time, but only slightly.


These results suggest that a disease model for the spread of gun violence in space and time may not be a good fit for most of the geographically stable and temporally stochastic process observed. And that existing space/time tests may not be adequate tests for spatiotemporal gun violence diffusion models.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4


  1. 1.

    Previous typologies of gun violence (Fagan et al. 2007) use the term epidemic to describe both clustered gun violence with diffusion and clustered gun violence without diffusion. Since each of these two types of clustered gun violence are potentially the product of different mechanisms, for clarity, we refer to diffusing gun violence as epidemic and non-diffusing gun violence as endemic (Christoffel 2007).

  2. 2.

    Johnson and Bowers (2004); Johnson et al. (2007) used a non-cumulative Knox test in analyzing burglary data.

  3. 3.

    Interestingly, a recent hotspot experiment suggests that the distinction between place-based and offender-based interventions may not always hold in practice (Groff et al. 2015).


  1. Bartlett MS (1964) The spectral analysis of two-dimensional point processes. Biometrika 51(3/4):299–311

    Article  Google Scholar 

  2. Blumstein A (1995) ‘Youth violence, guns, and the illicit-drug industry’. The J Crim Law and Criminol (1973) 86(1):10–36

    Article  Google Scholar 

  3. Braga AA (2005) Hot spots policing and crime prevention: a systematic review of randomized controlled trials. J Exp Criminol 1(3):317–342

    Article  Google Scholar 

  4. Braga AA, Papachristos AV, Hureau DM (2010) The concentration and stability of gun violence at micro places in Boston, 1980–2008. J Quant Criminol 26(1):33–53

    Article  Google Scholar 

  5. Branas CC, Cheney RA, MacDonald JM, Tam VW, Jackson TD, Ten Have TR (2011) A difference-in-differences analysis of health, safety, and greening vacant urban space. Am J Epidemiol 174(11):1296–1306

    Article  Google Scholar 

  6. Branas CC, Jacoby S, Andreyeva E (2017) Firearm violence as a diseasehot people or hot spots? JAMA Intern Med 177(3):333–334

    Article  Google Scholar 

  7. Chicago Police Department (2012) 2011 Chicago Murder Analysis. Technical report, City of Chicago, Chicago, IL

  8. Christakis NA, Fowler JH (2007) The spread of obesity in a large social network over 32 years. N Engl J Med 357(4):370–379

    Article  Google Scholar 

  9. Christoffel KK (2007) Firearm injuries: epidemic then, endemic now. Am J Public Health 97(4):626–629

    Article  Google Scholar 

  10. Cohen J, Tita G (1999) Diffusion in homicide: exploring a general method for detecting spatial diffusion processes. J Quant Criminol 15(4):451–493

    Article  Google Scholar 

  11. Coleman J, Katz E, Menzel H (1957) The diffusion of an innovation among physicians. Sociometry 20(4):253–270

    Article  Google Scholar 

  12. Cressie N, Wikle C (2011) Statistics for spatio-temporal data. Wiley, Hoboken

    Google Scholar 

  13. de Tarde G (1903) The laws of imitation. H. Holt and Company, New York

    Google Scholar 

  14. Diggle PJ, Chetwynd AG, Hggkvist R, Morris SE (1995) Second-order analysis of space-time clustering. Stat Methods in Med Res 4(2):124–136

    Article  Google Scholar 

  15. Diggle PJ, Moraga P, Rowlingson B, Taylor BM (2013) Spatial and spatio-temporal log-gaussian cox processes: extending the geostatistical paradigm. Stat Sci 28(4):542–563

    Article  Google Scholar 

  16. Diggle P, Rowlingson B, Su T-L (2005) Point process methodology for on-line spatio-temporal disease surveillance. Environmetrics 16(5):423–434

    Article  Google Scholar 

  17. Dodge KA (2008) Framing public policy and prevention of chronic violence in american youths. The Am Psychol 63(7):573–590

    Article  Google Scholar 

  18. Fagan J, Wilkinson DL, Davies G (2007) Social contagion of violence. In: Flannery DJ, Vazsonyi AT, Waldman ID (eds) The cambridge handbook of violent behavior and aggression. Cambridge University Press, New York, pp 688–726

    Google Scholar 

  19. Gelman A, Jakulin A, Pittau MG, Su Y-S (2008) ‘A weakly informative default prior distribution for logistic and other regression models’. The Ann Appl Stat, 360–1383.

  20. Green B, Horel T, Papachristos AV (2017) Modeling contagion through social networks to explain and predict gunshot violence in chicago, 2006–2014. JAMA Intern Med 177(3):326–333

    Article  Google Scholar 

  21. Groff ER, Ratcliffe JH, Haberman CP, Sorg ET, Joyce NM, Taylor RB (2015) Does what police do at hot spots matter? The philadelphia policing tactics experiment. Criminology 53(1):23–53

    Article  Google Scholar 

  22. Hemingway D (2004) Private guns. Public Health, University of Michigan Press

    Google Scholar 

  23. Johnson SD (2008) Repeat burglary victimisation: a tale of two theories. J Exp Criminol 4(3):215–240

    Article  Google Scholar 

  24. Johnson SD, Bernasco W, Bowers KJ, Elffers H, Ratcliffe J, Rengert G, Townsley M (2007) Space-time patterns of risk: a cross national assessment of residential burglary victimization. J Quant Criminol 23(3):201–219

    Article  Google Scholar 

  25. Johnson SD, Bowers KJ (2004) The stability of space-time clusters of burglary. Br J Criminol 44(1):55–65

    Article  Google Scholar 

  26. Knox EG (1964a) The detection of space-time interactions. J Royal Stat Soc Ser C (Appl Stat) 13(1):25–29

    Google Scholar 

  27. Knox G (1964b) Epidemiology of childhood leukaemia in northumberland and Durham. Br J Prevent Soc Med 18(1):17–24

    Google Scholar 

  28. Loftin C (1986) Assaultive violence as a contagious social process. Bull N Y Acad Med 62(5):550–555

    Google Scholar 

  29. Mantel N (1967) The detection of disease clustering and a generalized regression approach. Cancer Res 27(2 Part 1):209–220

    Google Scholar 

  30. Mares D, Blackburn E (2012) Evaluating the effectiveness of an acoustic gunshot location system in St. Louis, MO’. Policing 6(1):26–42

    Article  Google Scholar 

  31. Messner SF, Anselin L, Baller RD, Hawkins DF, Deane G, Tolnay SE (1999) The spatial patterning of county homicide rates: an application of exploratory spatial data analysis. J Quant Criminol 15(4):423–450

    Article  Google Scholar 

  32. Metropolitan Police Department (2006) A Report on Juvenile and Adult Homicide in the Distict of Columbia. Technical report, Metropolitan Police Department, Research and Resource Development Department, Washington, D.C

  33. Meyer S, Warnke I, Rossler W, Held L (2016) Model-based testing for space-time interaction using point processes: an application to psychiatric hospital admissions in an urban area. Spat Spatio-temporal Epidemiol 17:15–25

    Article  Google Scholar 

  34. Mohler G (2013) Modeling and estimation of multi-source clustering in crime and security data. Ann Appl Stat 7(3):1525–1539

    Article  Google Scholar 

  35. Mohler G (2014) Marked point process hotspot maps for homicide and gun crime prediction in chicago. Int J Forecast 30(3):491–497

    Article  Google Scholar 

  36. Mohler GO, Short MB, Brantingham PJ, Schoenberg FP, Tita GE (2011) Self-exciting point process modeling of crime. J Am Stat Assoc 106(493):100–108

    Article  Google Scholar 

  37. Morenoff JD, Sampson RJ, Raudenbush SW (2001) Neighborhood inequality, collective efficacy, and the spatial dynamics of urban violence*. Criminology 39(3):517–558

    Article  Google Scholar 

  38. Ornstein JT, Hammond RA (2017) The burglary boost: a note on detecting contagion using the Knox test. J Quant Criminol 33(1):65–75

    Article  Google Scholar 

  39. Papachristos A (2009) Murder by structure: dominance relations and the social structure of gang homicide. Am J Sociol 115(1):74–128

    Article  Google Scholar 

  40. Papachristos A, Anthony B, Piza E, Grossman L (2016) The company you keep? The spillover effects of gang membership on individual gunshot victimization in a co-offending network. Criminology 53:624–649

    Article  Google Scholar 

  41. Patel DM, Simon MA, Taylor RM (2012) Contagion of violence–workshop summary, technical report. Institute of Medicine and National Research Council of the National Academies, Washington

    Google Scholar 

  42. Petho A, Fallis DS, Keating D (2013) Investigations. In: Gunfire detection system captures about 39,000 shooting incidents in the District. The Washington Post.

  43. Philadelphia Police Department (2014) Murder/Shooting Analysis 2013. Technical report, Philadelphia Police Department, Philadelphia, PA

  44. Ramlau-Hansen H (1983) ‘Smoothing counting process intensities by means of kernel functions’. The Ann Stat, 453–466.

    Article  Google Scholar 

  45. Ratcliffe JH, Rengert GF (2008) Near-repeat patterns in Philadelphia shootings. Secur J 21(1):58–76

    Article  Google Scholar 

  46. Reiss AJ, Roth JA, Miczek KA, National Research Council (US). Panel on the understanding and control of violent behavior (1993), understanding and preventing violence. National Academy Press-1994, Washington, DC

  47. Rosenfeld R, Bray TM, Egley A (1999) Facilitating violence: a comparison of gang-motivated, gang-affiliated, and nongang youth homicides. J Quant Criminol 15(4):495–516

    Article  Google Scholar 

  48. Sallai J, Hedgecock W, Volgyesi P, Nadas A, Balogh G, Ledeczi A (2011) Weapon classification and shooter localization using distributed multichannel acoustic sensors. J Syst Archit 57(10):869–885

    Article  Google Scholar 

  49. Shalizi CR, Thomas AC (2011) Homophily and contagion are generically confounded in observational social network studies. Sociol Methods Res 40(2):211–329

    Article  Google Scholar 

  50. Short MB, Brantingham PJ, Bertozzi AL, Tita GE (2010) Dissipation and displacement of hotspots in reaction-diffusion models of crime. Proc Nat Acad Sci 107(9):3961–3965

    Article  Google Scholar 

  51. Short M, Mohler G, Brantingham PJ, Tita G (2014) Gang rivalry dynamics via couple point process networks. Disc Contin Dyn Syst 19(5):1459–1477

    Article  Google Scholar 

  52. Skogan W, Hartnett S, Bump N, Dubois J (2009) Evaluation of ceasefire-Chicago, technical report. US, Department of Justice, Washington

    Google Scholar 

  53. Stan Development Team (2016) ‘Stan: A c++ library for probability and sampling, version 2.9.0’

  54. Tita G, Cohen J (2004) Measuring spatial diffusion of shots fired activity. In: Goodchild MF, Janelle DG (eds) Spatially integrated social science. Oxford University Press, New York, pp 171–204

    Google Scholar 

  55. Tita G, Ridgeway G (2007) The impact of gang formation on local patterns of crime. J Res Crime and Delinquency 44(2):208–237

    Article  Google Scholar 

  56. Tita G, Riley J, Ridgeway G, Grammich C, Abrahamse A, Greenwood P (2003) Reducing gun violence: results from an intervention in east Los Angeles. RAND Corporation, Santa Monica

    Google Scholar 

  57. Townsley M, Homel R, Chaseling J (2003) Infectious Burglaries. A Test of the Near Repeat Hypothesis. British Journal of Criminology 43(3):615–633

    Article  Google Scholar 

  58. Webster DW, Whitehill JM, Vernick JS, Curriero FC (2012) Effects of baltimores safe streets program on gun violence: a replication of Chicagos ceasefire program. J Urban Health 90(1):27–40

    Article  Google Scholar 

  59. Weisburd D, Groff ER, Yang S-M (2012) The criminology of place: street segments and our understanding of the crime problem. Oxford University Press, Oxford

    Google Scholar 

  60. Wooditch A, Weisburd D (2015) Using space-time analysis to evaluate criminal justice programs: an application to stop-question-frisk practices. J Quant Criminol 32(2):1–23

    Google Scholar 

  61. Zimring F (1967) Is gun control likely to reduce violent killings. Univ Chicago Law Rev 35(4):721

    Article  Google Scholar 

Download references


SRF was supported by ERC (FP7/617071) and EPSRC (EP/K009362/1). The authors thank Richerd Berk, Shawn Bushway, John MacDonald, Theresa Smith, George Tita, and three anonymous reviewers for helpful comments on this manuscript.

Author information



Corresponding author

Correspondence to Charles Loeffler.



Hawkes Process

Given a temporal point pattern \(t_1, \ldots , t_n \in {\mathcal {R}}\), observed on a time window [0, T], the Hawkes process is a so-called self-exciting point process. It has a parsimonious form, parameterized by a conditional intensity function with two parts, a background rate \(\mu _t(t)\) which may or may not vary with t and a self-excitatory rate which is based only on events which occurred before time t:

$$\begin{aligned} \lambda (t) = \mu _t(t) + \theta \cdot \sum _{i: t_i < t} \omega \cdot \text{ exp }(-\omega (t-t_i)) \end{aligned}$$

Here the triggering kernel is given by an exponential with inverse lengthscale \(\omega\) and weight \(\theta\), which characterizes the strength and duration of an event’s influence on future events. The likelihood of the Hawkes process is simply that of an inhomogeneous Poisson process with intensity \(\lambda (t)\):

$$\begin{aligned} {{\mathcal {L}}}(t_1, \ldots , t_n | \theta , \omega , \mu ) = \prod _i \lambda (t_i) \exp \left( -\int _{t=0}^T \lambda (t) dt\right) \end{aligned}$$

To include a spatial dimension, we consider a spatiotemporal point pattern with events at locations \((x_i,y_i,t_i)\) observed on a window \(S \times [0,T]\). We further pick a spatial kernel \(k(s,s')\), and then we have the following conditional intensity:

$$\begin{aligned} \lambda (s,t) = \mu (x,y,t) + \theta \sum _{i: t_i < t} \omega \cdot \text{ exp }(-\omega (t-t_i)) k((x,y),(x_i,y_i)) \end{aligned}$$

The likelihood is as follows:

$$\begin{aligned} {\mathcal {L}}(s_1, t_1, \ldots , s_n, t_n | \theta , \omega , \mu ) = \prod _i \lambda (s_i,t_i) \exp \left( -\int _S \int _{t=0}^T \lambda (s,t) ds dt\right) \end{aligned}$$

Then we can explicitly calculate the log-likelihood corresponding to Eq. (5):

$$\begin{aligned}&\log {\mathcal {L}} = \sum _j \mu (x_j,y_j,t_j) + \theta \sum _{i: t_i < t_j} \omega \cdot \text{ exp }(-\omega (t_j-t_i)) k((x_j,y_j),(x_i,y_i)) \end{aligned}$$
$$\begin{aligned}&- \int _S \int _{t=0}^T \left( \mu (x,y,t) + \theta \sum _{i: t_i < t} \omega \cdot \text{ exp }(-\omega (t-t_i)) k((x,y),(x_i,y_i))\right) ds dt \end{aligned}$$

Now for simplicity, we assume that \(\int _S k((x,y),(x_i,y_i)) ds = 1, \,\, \forall i\) which we can ensure by properly normalizing the spatial kernel, and also assuming that the lengthscale is much shorter than the size of the spatial domain. We use a Gaussian kernel for \(k((x,y),(x',y'))\) with lengthscale \(\sigma\). Since we are in two dimensions, we have the following normalized kernel:

$$\begin{aligned} k(s,s') = \frac{1}{2\pi \sigma ^2} \exp \left(-\frac{\Vert s-s'\Vert ^2}{2\sigma ^2}\right) \end{aligned}$$

We estimate the background intensity \(\mu (x,y,t)\) using kernel smoothing with an Epanechnikov kernel:

$$\begin{aligned} k(d) = \frac{3}{4}(1-d^2) I(|d| < 1) \end{aligned}$$

We separately fit \({\hat{\mu }}_s(x,y)\) and \({\hat{\mu }}_t(t)\) and include a weighting term \(m_0\) to enable the model to downweight this background intensity:

$$\begin{aligned} {\hat{\mu }}(x,y,t) = m_0 \cdot {\hat{\mu }}_s(x,y) {\hat{\mu }}_t(t) \end{aligned}$$

We performed a sensitivity analysis in Section to investigate the impact of the kernel bandwidth.

In order to learn the parameters of the spatiotemporal Hawkes process, we use Bayesian inference implemented with Markov Chain Monte Carlo methods to find the posterior distribution of the parameters, where we place weakly informative priors Gelman et al. (2008) on them:

  • \(m_0 \sim {\mathcal {N}}_+(0,1)\)

  • \(\theta \sim {\mathcal {N}}_+(0,10)\)

  • \(\sigma \sim {\mathcal {N}}_+(0,10)\) where distance is measured in kilometers

  • \(\omega \sim {\mathcal {N}}_+(0,10)\) where time is measured in 1/days

with \({\mathcal {N}}_+\) denoting the Normal distribution truncated to be positive.

We coded our model in the probabilistic programming language Stan Stan Development Team (2016), which implements Hamiltonian Monte Carlo sampling to efficiently explore the posterior distribution ove all of the parameters of our model. We ran four chains for 1000 iterations each. Convergence diagnostics are discussed in Sect.  A.7.

Dealing with Holidays

As shown in Fig. 5 (left), many extra shootings are observed in our dataset on the days before and after New Year’s Eve and the Fourth of July. There were 5653 total shootings on these days, accounting for 36% of all shootings. We considered these shootings to be “celebratory” and thus removed July 1st–6th and December 29th-January 2nd from our dataset for each year. The time series without these extra shootings is shown in Fig. 5 (right). We performed a sensitivity analysis to see whether our main findings were robust to including the holidays. The posterior over the lengthscales was slightly longer—19 min and 210 m and the parameter \(\theta\) (which characterizes the number of additional shootings we expect to see after a single shooting) was higher: 0.43 (vs 0.13 in our main model) and correspondingly \(m_0\) the weight placed on the underlying intensity was 0.57 (vs 0.87 in our main model). All of these findings make sense, as the holiday shootings are by their nature spatiotemporally clustered and the model represents this by increasing the weight and spatiotemporal extent of the self-excitatory component.

Fig. 5

Smoothed time series of shots fired in Washington DC, 2011, comparing the original data (left) to the data after removing July 1st–6th and December 29th-January 2nd for each year (right)

Crime Offense Reports Data

Conventional reported crime data on firearm assaults, as seen in Fig. 6, manifests a similar spatial distribution and temporal distribution as AGLS-detected gunshots. However, the intensity levels are markedly different. As a result of these intensity differences between measures, heterogeneity in the intensity of incident clusters is much more visible in the AGLS spatial distribution than the conventional firearm assault distribution. In addition, as seen in the bottom right graph in Fig. 6, firearm assault data lags AGLS data by several hours. A lag of several minutes would be consistent with existing knowledge on reporting practices, but a lag of hours likely reflects a recording practice associated with duty shifts or a similar process. For this reason, 911 calls for service data on “shots fired” will provide a better source for estimating spatiotemporal models if AGLS data is not present and binning is observed in publicly reported crimes data.

Fig. 6

Spatial and temporal distributions of firearm assaults and shots fired in Washington DC, 2011. The spatial distribution of aggravated assaults a 911 “shots fired” calls b and acoustically detected gunshots c is shown subfigures ac. Monthly and day-of-week temporal distributions are shown in subfigures de

To verify that our spatiotemporal models could be used on 911 calls for service data, we refit our final model using the 911 calls for service data for “sounds of gunshots” for an overlapping temporal period beginning in 2011 and ending in 2013. The spatial lengthscale was 221 [214, 228] m. The temporal lengthscale was 9 [8.4, 9.4] min. And the self-exciting paramters, corresponding to the fraction of events attributable to the conditional as opposed to background intensity, was 0.15 [0.14, 0.16]. These results closely matched those obtained using AGLS data. The degree of similarity is likely a product of the high fraction of gunshot events reported to police both by civilians and the AGLS system. In cities with more limited reporting or deployment, models fit on different data sources may not yield as consistent results.

Sensitivity Analysis for Smoothing Kernel Bandwidth

We investigate the choice of the bandwidth for the smoothing kernel for estimating the underlying intensity \({\hat{\mu }}(x,y,t)\). For spatial bandwidths in [0.5, 1, 1.6] km and temporal bandwidths in [0.5, 1, 7, 14] days we fit our model using only data from 2011–12. The learned spatial lengthscales were between 130 and 210 m, matching our main model’s posterior (126 m), while the learned temporal lengthscales were between 12 and 20 min, slightly longer than in our main model (10 min).

Sensitivity Analysis for Inclusion of Duplicate Events

In our main analysis, we merged shootings which occurred within 1 min and 100 m together, treating them as a single shooting (after also removing holidays). We thus removed 415 shootings which we considered to be duplicates (4% of our sample). As a sensitivity analysis, we fit our model to the original dataset, without holidays, but with each of the near-repeats included. All of our findings were consistent with our main findings: the lengthscale was 100 m in space (slightly shorter than the main model) and 11 min in time, and \(m_0\), the weight placed on the endogeneous component was 0.86, while \(\theta\) was 0.14. This underscores the very local and limited extent to which we are able to find evidence of diffusion in our dataset.

Samples from the Posterior for Lengthscale Parameters

Inference for our model for the AGLS data resulted in a posterior concentrated around its mode, as shown in Fig. 7.

Fig. 7

2000 Draws from the posterior of the main model are shown for the temporal and spatial lengthscale parameters. The posterior is concentrated around its mode

Traceplots and Convergence Diagnostics

We implemented the model in Sect. A.1 in the probabilistic programming language Stan Stan Development Team (2016), which uses Hamiltonian Monte Carlo to approximately sample from the posterior of the distribution over the parameters. We ran four chains for 1000 iterations each, discarding the first 500 draws as burn-in, for a total of 2000 draws. We show traceplots in Fig. 8. The Gelman-Rubin statistic \({\hat{R}} = 1\) for every parameter, with effective sample sizes above 1000. Thus we conclude that the chains have mixed and converged, and we are sampling from the true posterior of the model.

Fig. 8

Traceplots for the parameters of the model fit with HMC, showing good convergence and mixing

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Loeffler, C., Flaxman, S. Is Gun Violence Contagious? A Spatiotemporal Test. J Quant Criminol 34, 999–1017 (2018).

Download citation


  • Gun violence
  • Contagion
  • Spatiotemporal methods