Abstract
The concentration of police resources in conflict zones contributes to the reduction of crime in the region and the optimization of those resources. This paper presents the use of regression techniques to predict the number of criminal acts in Colombian municipalities. To this end, a set of data was generated merging the data from the Guardia Civil with public data on the demographic structure and voting trends in the municipalities. The best regressor obtained (Random Forests) achieves a RRSE (Root Relative Squared Error) of 40.12% and opens the way to keep incorporating public data of another type with greater predictive power. In addition, M5Rules were used to interpret the results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zaharia, M., Xin, R.S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M.J., Ghodsi, A., Gonzalez, J., Shenker, S., Stoica, I.: Apache spark: a unified engine for big data processing. Comm. ACM 59(11), 56–65 (2016)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, pp. 487–499 (1994)
Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)
Hahsler, M., Karpienko, R.: Visualizing association rules in hierarchical groups. J. Bus. Econ. 87, 317–335 (2017)
Alves, L.G.A., Ribeiro, H.V., Rodrigues, F.A.: Crime prediction through urban metrics and statistical learning. Phys. A Stat. Mech. Appl. 505, 435–443 (2018)
Silverstein, C., Brin, S., Motwani, R., Ullman, J.: Scalable techniques for mining causal structures. Data Min. Knowl. Discov. 4(2–3), 163–192 (2000)
Amelec, V., Carmen, V.: Relationship between variables of performance social and financial of microfinance institutions. Adv. Sci. Lett. 21(6), 1931–1934 (2015)
Viloria, A., Lezama, O.B.P.: Improvements for determining the number of clusters in k-means for innovation databases in SMEs. Procedia Comput. Sci. 151, 1201–1206 (2019)
Kamatkar, S.J., Kamble, A., Viloria, A., Hernández-Fernandez, L., Cali, E.G.: Database performance tuning and query optimization. In: International Conference on Data Mining and Big Data, pp. 3–11. Springer, Cham (2018)
Erlandsson, F., Brodka, P., Borg, A., Johnson, H.: Finding influential users in social media using association rule learning. Entropy 18, 164 (2016)
Baculo, M.J.C., Marzan, C.S. de Dios Bulos, R., Ruiz, C.: Geospatial-temporal analysis and classification of criminal data in Manila. In: Proceedings of 2nd IEEE International Conference on Computational Intelligence and Applications, pp. 6–11. IEEE (2017)
Viloria, A., et al.: Integration of data mining techniques to PostgreSQL database manager system. Procedia Comput. Sci. 155, 575–580 (2019)
Clougherty, E., Clougherty, J., Liu, X., Brown, D.: Spatial and temporal analysis of sex crimes in Charlottesville, Virginia. In: Proceedings of IEEE Systems and Information Engineering Design Symposium, pp. 69–74. IEEE (2015)
Pineda, C.J.: Apuntes críticos: Visión Colombia 2019. Institución Universitaria Politécnico Grancolombiano (2016)
Torres, A.X.O.: Los derechos de los colombianos en el extranjero y de los extranjeros en Colombia. En mora de un enfoque integral. Vniversitas 57(117), 357–376 (2008)
Drucker, H.: Improving regressors using boosting techniques. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML ’97, San Francisco, CA, USA, pp. 107–115. Morgan Kaufmann Publishers Inc. (1997)
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)
Kang, H.-W., Kang, H.-B.: Prediction of crime occurrence from multimodal data using deep learning. PLoS ONE 12(4), e0176244 (2017)
Kianmehr, K., Alhajj, R.: Effectiveness of support vector machine for crime hot-spots prediction. Appl. Artif. Intell. 22(5), 433–458 (2008)
Leitão, J.C., Miotto, J.M., Gerlach, M., Altmann, E.G.: Is this scaling nonlinear? R. Soc. Open Sci. 3(7) (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Silva, J. et al. (2020). Algorithms for Crime Prediction in Smart Cities Through Data Mining. In: Rocha, Á., Paredes-Calderón, M., Guarda, T. (eds) Developments and Advances in Defense and Security. MICRADS 2020. Smart Innovation, Systems and Technologies, vol 181. Springer, Singapore. https://doi.org/10.1007/978-981-15-4875-8_45
Download citation
DOI: https://doi.org/10.1007/978-981-15-4875-8_45
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4874-1
Online ISBN: 978-981-15-4875-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)