Abstract
Substantial increase in criminal activities has been directly affecting socio-economical development and quality of life. Predictive crime data analysis is one of the challenging tasks in the field of data analysis, where machine learning algorithm plays a significant role. In this paper, machine learning techniques are applied to a crime dataset for predicting features that affect the pattern of crime occurrence. Decomposition method has used to split the large dataset into sub-dataset in order to increase the accuracy rate. In this work, we used different supervised classification machine learning techniques to predict crime incident by resolution, depending on its feature attributes. Different classification techniques are used like Naïve Bayes, Generalized linear model, Binary Logistic Regression, Decision Tree, Random Forest, Gradient Boost. Results of different algorithms have been compared and effective approach has been discussed. The average prediction accuracy of these machine learning approaches is approximately 86%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akers, R.: Social Learning and Social Structure: A General Theory of Crime and Deviance. Routledge, Abingdon (2017)
Domingos, P.M.: A few useful things to know about machine learning. Commun. ACM 55(10), 78–87 (2012)
https://datasf.org/opendata/. Accessed 27 Apr 2019
Ang, S.T., Wang, W., Chyou, S.: San Francisco Crime Classification. University of California, San Diego (2015)
Wang, X., Gerber, M.S., Brown, D.E.: Automatic crime prediction using events extracted from twitter posts. In: Social Computing, Behavioral-Cultural Modeling and Prediction, pp. 231–238. Springer (2012)
Sadhana, C.S.: Survey on Predicting Crime Using Twitter Sentiment and Weather Data israce 2015 (2015)
Shojaee, S., Mustapha, A., Sidi, F., Jabar, M.A.: A study on classification learning algorithms to predict crime status. Int. J. Digit. Content Technol. Appl. 7(9), 361 (2013)
Yerpude, P., Gudur, V.: Predictive modelling of crime dataset using data mining. Int. J. Data Min. Knowl. Manage. Process (IJDKP) 7(4), 43–58 (2017)
Nyce, C., Cpcu, A.: Predictive analytics white paper. American Institute for CPCU. Insurance Institute of America, pp. 9–10 (2007)
Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, no. 22 (2001)
Nelder, J.A., Wedderburn, R.W.M.: Generalized linear models. J. Roy. Stat. Soc.: Ser. A (General) 135(3), 370–384 (1972)
Hosmer Jr., D.W., Lemeshow, S., Sturdivant, R.X.: Applied Logistic Regression, vol. 398. John Wiley, Hoboken (2013)
Zhou, J., Troyanskaya, O.G.: Predicting effects of noncoding variants with deep learning–based sequence model. Nat. Methods 12(10), 931 (2015)
Bhargava, N., et al.: Decision tree analysis on j48 algorithm for data mining. Proc. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 3(6), 1114–1119 (2013)
Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2(3), 18–22 (2002)
Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Mukherjee, A., Ghosh, A. (2020). Heterogeneous Decomposition of Predictive Modeling Approach on Crime Dataset Using Machine Learning. In: Dawn, S., Balas, V., Esposito, A., Gope, S. (eds) Intelligent Techniques and Applications in Science and Technology. ICIMSAT 2019. Learning and Analytics in Intelligent Systems, vol 12. Springer, Cham. https://doi.org/10.1007/978-3-030-42363-6_116
Download citation
DOI: https://doi.org/10.1007/978-3-030-42363-6_116
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42362-9
Online ISBN: 978-3-030-42363-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)