Abstract
Crime data analysis is an essential source of information to aid social and political decisions makers regarding the allocation of public security resources. Computer-aided dispatch systems and technological advances in geographic information systems have made analysing and visualising historical spatial and temporal records of crimes a vital part of police operations and strategy. We look at our motivating crime problem as a spatio-temporal point pattern. Using a conditional approach based on properties of Poisson point processes, we transform the spatio-temporal point process prediction problem into a classification problem. We create spatio-temporal handcrafted features to link future and past events and use machine learning algorithms to learn behavioural patterns from the data. The fitted model is then used to carry out the reverse transformation, i.e. to perform spatio-temporal risk predictions based on the outcomes of the classification problem. Our procedure has theoretical formalism from point process theory and gains flexibility and computational efficiency inherited from the machine learning field. We show its performance under some simulated scenarios and a real application to spatio-temporal prediction and risk assessment of homicides in Bogota, Colombia.
Similar content being viewed by others
References
Baddeley A, Rubak E, Turner R (2015) Spatial Point patterns: methodology and applications with R. CRC Press, Boca Raton, Florida, Chapman & Hall Interdisciplinary Statistics Series
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Catlett C, Cesario E, Talia D, Vinci A (2019) Spatio-temporal crime predictions in smart cities: a data-driven approach and experiments. Pervasive Mob Comput 53:62–74
Clark NJ, Dixon PM (2018) Modeling and estimation for self-exciting spatio-temporal models of terrorist activity. Annals Appl Stat 12:633–653
Clark NJ, Dixon PM (2020) Spatially correlated self-exciting statistical models with applications to modeling criminal activity. Statistical modelling Forthcoming
Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1):21–27
Curry A, Latkin C, Davey-Rothwell M (2008) Pathways to depression: the impact of neighborhood violent crime on inner-city residents in Baltimore. Soc Sci Med 67(1):23–30
Diggle PJ (1990) A point process modelling approach to raised incidence of a rare phenomenon in the vicinity of a prespecified point,. J Royal Stat Soc Ser A (Sta Soc) 153(3):349–362
Diggle PJ (2008) Point Source Modeling. Am Cancer Soc 3:49–1351
Diggle PJ (2013) Statistical analysis of spatial and Spatio-Temporal point patterns. Chapman & Hall monographs on statistics & applied probability, 3rd edn. CRC Press, Boca Raton, Florida
Diggle PJ, Morris SE, Wakefield JC (2000) Point-source modelling using matched case-control data. Biostatistics 1(1):89–105
Diggle PJ, Rowlingson BS (1994) A conditional approach to point process modelling of elevated risk. J R Stat Soc A Stat Soc 157(3):433–440
Hart TC (2020) Hot spots of crime: methods and predictive analytics, geographies of behavioural health, crime, and disorder. Springer, USA, vol 126, pp 87–103
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction, 2nd edn. Springer, New York
Hu Y, Wang F, Guin C, Zhu H (2018) A spatio-temporal kernel density estimation framework for predictive crime hotspot mapping and evaluation. Appl Geogr 99:89–97
Leng J, Li G (2018) Big data-driven predictive policing innovation. In: Proceedings of the 2018 7th international conference on energy, environment and sustainable development (ICEESD 2018). Atlantis Press, pp 123–127
Lundberg SM, Lee S-I (2017) A unified approach to interpreting model predictions. In: Advances in Neural Information Processing Systems, vol 30, pp 4765–4774
Matérn B (1986) Spatial variation. Vol 36 of lecture notes in statistics, Springer-Verlag, Berlin
McAssey MP (2013) An empirical goodness-of-fit test for multivariate distributions. J Appl Stat 40(5):1120–1131
McCallum A, Nigam K et al. (1998) A comparison of event models for naïve Bayes text classification. AAAI-98 workshop on learning for text categorization Vol 752, pp 41–48
McClendon L, Meghanathan N (2015) Using machine learning algorithms to analyze crime data. Machine Learn Appl Int J (MLAIJ) 2(1):1–12
Mohler G (2014) Marked point process hotspot maps for homicide and gun crime prediction in Chicago. Int J Forecast 30(3):491–497
Mohler GO, Short MB, Brantingham PJ, Schoenberg FP, Tita GE (2011) Self-exciting point process modeling of crime. J Am Stat Assoc 106(493):100–108
Mohler GO, Short MB, Malinowski S, Johnson M, Tita GE, Bertozzi AL, Brantingham PJ (2015) Randomized controlled field trials of predictive policing. J Am Stat Assoc 110(512):1399–1411
Mukhopadhyay A, Zhang C, Vorobeychik Y, Tambe M, Pence K, Speer P (2016) Optimal allocation of police patrol resources using a continuous-time crime model. International Conference on decision and game theory for security Springer, pp 139–158
Nadaraya EA (1989) Nonparametric estimation of probability densities and regression curves. Kluwer, Dordrecht
Ramirez-Alcocer UM, Tello-Leal E, Mata-Torres JA (2019) Predicting incidents of crime through LSTM neural networks in smart city domain, SMART 2019: the eighth international conference on smart cities, systems, devices and technologies IARIA pp 32–37
Ratcliffe J (2014) What is the future of predictive policing. Trans Criminol 6(2):4–5
Reinhart A (2018) A review of self-exciting spatio-temporal point processes and their applications. Stat Sci 33(3):299–318
Rish I, et al. (2001) An empirical study of the naïve Bayes classifier, IJCAI 2001 workshop on empirical methods in artificial intelligence Vol 3, pp 41–46
Rodrigues A, Diggle P, Assuncao R (2010) Semiparametric approach to point source modelling in epidemiology and criminology. J Roy Stat Soc: Ser C (Appl Stat) 59(3):533–542
Rodrigues A, Diggle PJ (2012) Bayesian estimation and prediction for inhomogeneous spatiotemporal log-Gaussian Cox processes using low-rank models, with application to criminal surveillance. J Am Stat Assoc 107(497):93–101
Rosser G, Cheng T (2019) Improving the robustness and accuracy of crime prediction with the self-exciting point process through isotropic triggering. Appl Spat Anal Policy 12:5–25
Rummens A, Hardyns W, Pauwels L (2017) The use of predictive analysis in spatiotemporal crime forecasting: building and testing a model in an urban context. Appl Geogr 86:255–261
Shirota S, Gelfand AE (2017) Space and circular time log Gaussian Cox processes with application to crime event data. Annals Appl Stat 11(2):481–503
Silverman BW (1986) Density estimation for statistics and data analysis. CRC Press, Boca Raton, Florida, Chapman & Hall Monographs on Statistics & Applied Probability
Steinwart I, Christmann A (2008) Support vector machines. Springer, USA
Wu S, Wang C, Cao H, Jia X (2020) Crime prediction using data mining and machine learning, in Q. Liu, M. Mısır, X. Wang and W. Liu (eds), The 8th international conference on computer engineering and networks (CENet2018), Springer International Publishing pp 360–375
Yuki JQ, Sakib MMQ, Zamal Z, Habibullah KM, Das AK (2019) Predicting crime using time and location data. Proceedings of the 2019 7th international conference on computer and communications management, ICCCM 2019, Association for computing machinery, pp 124–128
Zhang H (2004) The optimality of naïve Bayes. In: Proceedings of the seventeenth international Florida artificial intelligence research society conference, Miami Beach, Florida, USA, pp 562–567
Zhuang J, Mateu J (2019) A semiparametric spatiotemporal Hawkes-type point process model with periodic background for crime data. J R Stat Soc A Stat Soc 182(3):919–942
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was partially funded by grant PID2019-107392RB-I00 from the Spanish Ministry of Science and Innovation.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Rodrigues, A., González, J.A. & Mateu, J. A conditional machine learning classification approach for spatio-temporal risk assessment of crime data. Stoch Environ Res Risk Assess 37, 2815–2828 (2023). https://doi.org/10.1007/s00477-023-02420-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-023-02420-5