Skip to main content

Advertisement

Log in

A conditional machine learning classification approach for spatio-temporal risk assessment of crime data

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Crime data analysis is an essential source of information to aid social and political decisions makers regarding the allocation of public security resources. Computer-aided dispatch systems and technological advances in geographic information systems have made analysing and visualising historical spatial and temporal records of crimes a vital part of police operations and strategy. We look at our motivating crime problem as a spatio-temporal point pattern. Using a conditional approach based on properties of Poisson point processes, we transform the spatio-temporal point process prediction problem into a classification problem. We create spatio-temporal handcrafted features to link future and past events and use machine learning algorithms to learn behavioural patterns from the data. The fitted model is then used to carry out the reverse transformation, i.e. to perform spatio-temporal risk predictions based on the outcomes of the classification problem. Our procedure has theoretical formalism from point process theory and gains flexibility and computational efficiency inherited from the machine learning field. We show its performance under some simulated scenarios and a real application to spatio-temporal prediction and risk assessment of homicides in Bogota, Colombia.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Baddeley A, Rubak E, Turner R (2015) Spatial Point patterns: methodology and applications with R. CRC Press, Boca Raton, Florida, Chapman & Hall Interdisciplinary Statistics Series

    Book  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  • Catlett C, Cesario E, Talia D, Vinci A (2019) Spatio-temporal crime predictions in smart cities: a data-driven approach and experiments. Pervasive Mob Comput 53:62–74

    Article  Google Scholar 

  • Clark NJ, Dixon PM (2018) Modeling and estimation for self-exciting spatio-temporal models of terrorist activity. Annals Appl Stat 12:633–653

    Article  Google Scholar 

  • Clark NJ, Dixon PM (2020) Spatially correlated self-exciting statistical models with applications to modeling criminal activity. Statistical modelling Forthcoming

  • Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1):21–27

    Article  Google Scholar 

  • Curry A, Latkin C, Davey-Rothwell M (2008) Pathways to depression: the impact of neighborhood violent crime on inner-city residents in Baltimore. Soc Sci Med 67(1):23–30

    Article  Google Scholar 

  • Diggle PJ (1990) A point process modelling approach to raised incidence of a rare phenomenon in the vicinity of a prespecified point,. J Royal Stat Soc Ser A (Sta Soc) 153(3):349–362

    Article  Google Scholar 

  • Diggle PJ (2008) Point Source Modeling. Am Cancer Soc 3:49–1351

    Google Scholar 

  • Diggle PJ (2013) Statistical analysis of spatial and Spatio-Temporal point patterns. Chapman & Hall monographs on statistics & applied probability, 3rd edn. CRC Press, Boca Raton, Florida

    Book  Google Scholar 

  • Diggle PJ, Morris SE, Wakefield JC (2000) Point-source modelling using matched case-control data. Biostatistics 1(1):89–105

    Article  CAS  Google Scholar 

  • Diggle PJ, Rowlingson BS (1994) A conditional approach to point process modelling of elevated risk. J R Stat Soc A Stat Soc 157(3):433–440

    Article  Google Scholar 

  • Hart TC (2020) Hot spots of crime: methods and predictive analytics, geographies of behavioural health, crime, and disorder. Springer, USA, vol 126, pp 87–103

    Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction, 2nd edn. Springer, New York

    Book  Google Scholar 

  • Hu Y, Wang F, Guin C, Zhu H (2018) A spatio-temporal kernel density estimation framework for predictive crime hotspot mapping and evaluation. Appl Geogr 99:89–97

    Article  Google Scholar 

  • Leng J, Li G (2018) Big data-driven predictive policing innovation. In: Proceedings of the 2018 7th international conference on energy, environment and sustainable development (ICEESD 2018). Atlantis Press, pp 123–127

  • Lundberg SM, Lee S-I (2017) A unified approach to interpreting model predictions. In: Advances in Neural Information Processing Systems, vol 30, pp 4765–4774

  • Matérn B (1986) Spatial variation. Vol 36 of lecture notes in statistics, Springer-Verlag, Berlin

  • McAssey MP (2013) An empirical goodness-of-fit test for multivariate distributions. J Appl Stat 40(5):1120–1131

    Article  Google Scholar 

  • McCallum A, Nigam K et al. (1998) A comparison of event models for naïve Bayes text classification. AAAI-98 workshop on learning for text categorization Vol 752, pp 41–48

  • McClendon L, Meghanathan N (2015) Using machine learning algorithms to analyze crime data. Machine Learn Appl Int J (MLAIJ) 2(1):1–12

    Article  Google Scholar 

  • Mohler G (2014) Marked point process hotspot maps for homicide and gun crime prediction in Chicago. Int J Forecast 30(3):491–497

    Article  Google Scholar 

  • Mohler GO, Short MB, Brantingham PJ, Schoenberg FP, Tita GE (2011) Self-exciting point process modeling of crime. J Am Stat Assoc 106(493):100–108

    Article  CAS  Google Scholar 

  • Mohler GO, Short MB, Malinowski S, Johnson M, Tita GE, Bertozzi AL, Brantingham PJ (2015) Randomized controlled field trials of predictive policing. J Am Stat Assoc 110(512):1399–1411

    Article  CAS  Google Scholar 

  • Mukhopadhyay A, Zhang C, Vorobeychik Y, Tambe M, Pence K, Speer P (2016) Optimal allocation of police patrol resources using a continuous-time crime model. International Conference on decision and game theory for security Springer, pp 139–158

  • Nadaraya EA (1989) Nonparametric estimation of probability densities and regression curves. Kluwer, Dordrecht

    Book  Google Scholar 

  • Ramirez-Alcocer UM, Tello-Leal E, Mata-Torres JA (2019) Predicting incidents of crime through LSTM neural networks in smart city domain, SMART 2019: the eighth international conference on smart cities, systems, devices and technologies IARIA pp 32–37

  • Ratcliffe J (2014) What is the future of predictive policing. Trans Criminol 6(2):4–5

    Google Scholar 

  • Reinhart A (2018) A review of self-exciting spatio-temporal point processes and their applications. Stat Sci 33(3):299–318

    Google Scholar 

  • Rish I, et al. (2001) An empirical study of the naïve Bayes classifier, IJCAI 2001 workshop on empirical methods in artificial intelligence Vol 3, pp 41–46

  • Rodrigues A, Diggle P, Assuncao R (2010) Semiparametric approach to point source modelling in epidemiology and criminology. J Roy Stat Soc: Ser C (Appl Stat) 59(3):533–542

    Google Scholar 

  • Rodrigues A, Diggle PJ (2012) Bayesian estimation and prediction for inhomogeneous spatiotemporal log-Gaussian Cox processes using low-rank models, with application to criminal surveillance. J Am Stat Assoc 107(497):93–101

    Article  Google Scholar 

  • Rosser G, Cheng T (2019) Improving the robustness and accuracy of crime prediction with the self-exciting point process through isotropic triggering. Appl Spat Anal Policy 12:5–25

    Article  Google Scholar 

  • Rummens A, Hardyns W, Pauwels L (2017) The use of predictive analysis in spatiotemporal crime forecasting: building and testing a model in an urban context. Appl Geogr 86:255–261

    Article  Google Scholar 

  • Shirota S, Gelfand AE (2017) Space and circular time log Gaussian Cox processes with application to crime event data. Annals Appl Stat 11(2):481–503

    Article  Google Scholar 

  • Silverman BW (1986) Density estimation for statistics and data analysis. CRC Press, Boca Raton, Florida, Chapman & Hall Monographs on Statistics & Applied Probability

    Google Scholar 

  • Steinwart I, Christmann A (2008) Support vector machines. Springer, USA

    Google Scholar 

  • Wu S, Wang C, Cao H, Jia X (2020) Crime prediction using data mining and machine learning, in Q. Liu, M. Mısır, X. Wang and W. Liu (eds), The 8th international conference on computer engineering and networks (CENet2018), Springer International Publishing pp 360–375

  • Yuki JQ, Sakib MMQ, Zamal Z, Habibullah KM, Das AK (2019) Predicting crime using time and location data. Proceedings of the 2019 7th international conference on computer and communications management, ICCCM 2019, Association for computing machinery, pp 124–128

  • Zhang H (2004) The optimality of naïve Bayes. In: Proceedings of the seventeenth international Florida artificial intelligence research society conference, Miami Beach, Florida, USA, pp 562–567

  • Zhuang J, Mateu J (2019) A semiparametric spatiotemporal Hawkes-type point process model with periodic background for crime data. J R Stat Soc A Stat Soc 182(3):919–942

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jonatan A. González.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was partially funded by grant PID2019-107392RB-I00 from the Spanish Ministry of Science and Innovation.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rodrigues, A., González, J.A. & Mateu, J. A conditional machine learning classification approach for spatio-temporal risk assessment of crime data. Stoch Environ Res Risk Assess 37, 2815–2828 (2023). https://doi.org/10.1007/s00477-023-02420-5

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-023-02420-5

Keywords

Navigation