Skip to main content

Heterogeneous Decomposition of Predictive Modeling Approach on Crime Dataset Using Machine Learning

  • Conference paper
  • First Online:
Intelligent Techniques and Applications in Science and Technology (ICIMSAT 2019)

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 12))

Abstract

Substantial increase in criminal activities has been directly affecting socio-economical development and quality of life. Predictive crime data analysis is one of the challenging tasks in the field of data analysis, where machine learning algorithm plays a significant role. In this paper, machine learning techniques are applied to a crime dataset for predicting features that affect the pattern of crime occurrence. Decomposition method has used to split the large dataset into sub-dataset in order to increase the accuracy rate. In this work, we used different supervised classification machine learning techniques to predict crime incident by resolution, depending on its feature attributes. Different classification techniques are used like Naïve Bayes, Generalized linear model, Binary Logistic Regression, Decision Tree, Random Forest, Gradient Boost. Results of different algorithms have been compared and effective approach has been discussed. The average prediction accuracy of these machine learning approaches is approximately 86%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Akers, R.: Social Learning and Social Structure: A General Theory of Crime and Deviance. Routledge, Abingdon (2017)

    Book  Google Scholar 

  2. Domingos, P.M.: A few useful things to know about machine learning. Commun. ACM 55(10), 78–87 (2012)

    Article  Google Scholar 

  3. https://datasf.org/opendata/. Accessed 27 Apr 2019

  4. Ang, S.T., Wang, W., Chyou, S.: San Francisco Crime Classification. University of California, San Diego (2015)

    Google Scholar 

  5. Wang, X., Gerber, M.S., Brown, D.E.: Automatic crime prediction using events extracted from twitter posts. In: Social Computing, Behavioral-Cultural Modeling and Prediction, pp. 231–238. Springer (2012)

    Google Scholar 

  6. Sadhana, C.S.: Survey on Predicting Crime Using Twitter Sentiment and Weather Data israce 2015 (2015)

    Google Scholar 

  7. Shojaee, S., Mustapha, A., Sidi, F., Jabar, M.A.: A study on classification learning algorithms to predict crime status. Int. J. Digit. Content Technol. Appl. 7(9), 361 (2013)

    Google Scholar 

  8. Yerpude, P., Gudur, V.: Predictive modelling of crime dataset using data mining. Int. J. Data Min. Knowl. Manage. Process (IJDKP) 7(4), 43–58 (2017)

    Article  Google Scholar 

  9. Nyce, C., Cpcu, A.: Predictive analytics white paper. American Institute for CPCU. Insurance Institute of America, pp. 9–10 (2007)

    Google Scholar 

  10. Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, no. 22 (2001)

    Google Scholar 

  11. Nelder, J.A., Wedderburn, R.W.M.: Generalized linear models. J. Roy. Stat. Soc.: Ser. A (General) 135(3), 370–384 (1972)

    Article  Google Scholar 

  12. Hosmer Jr., D.W., Lemeshow, S., Sturdivant, R.X.: Applied Logistic Regression, vol. 398. John Wiley, Hoboken (2013)

    Book  Google Scholar 

  13. Zhou, J., Troyanskaya, O.G.: Predicting effects of noncoding variants with deep learning–based sequence model. Nat. Methods 12(10), 931 (2015)

    Article  Google Scholar 

  14. Bhargava, N., et al.: Decision tree analysis on j48 algorithm for data mining. Proc. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 3(6), 1114–1119 (2013)

    Google Scholar 

  15. Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2(3), 18–22 (2002)

    Google Scholar 

  16. Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anupam Mukherjee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mukherjee, A., Ghosh, A. (2020). Heterogeneous Decomposition of Predictive Modeling Approach on Crime Dataset Using Machine Learning. In: Dawn, S., Balas, V., Esposito, A., Gope, S. (eds) Intelligent Techniques and Applications in Science and Technology. ICIMSAT 2019. Learning and Analytics in Intelligent Systems, vol 12. Springer, Cham. https://doi.org/10.1007/978-3-030-42363-6_116

Download citation

Publish with us

Policies and ethics