Abstract
With the rapid expansion of digital newspapers, readers have an overwhelming amount of news available daily. However, it is difficult to keep track of the news that is only of interest to the reader. Because of this, this research discusses the use of deep learning for the classification of news, especially crime related, published by Mexican digital newspapers as well as an analysis of the predictions obtained through the proposed model. According to the experimental results, the proposed system achieves 98.87% of accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cerviño Beresi, U., GarcÃa Adeva, J.J., Calvo, R.A., Ceccatto, H.A.: Automatic classification of news articles in Spanish. In: Actas del Congreso Argentina de Ciencias de Computación (2004)
Bajaj Mangal, S., Goyal, V.: Text news classification system using Naïve Bayes classifier. Res. Cell Int. J. Eng. Sci. 3, 209–2013 (2014)
Liliana, D.Y., Hardianto, A., Ridok, M.: Indonesian news classification using support vector machine. World Acad. Sci. Eng. Technol. 81, 767–770 (2011)
Krishnalal, G., Rengarajan, S., Srinivasagan, K.: A new text mining approach| based on HMM-SVM for web news classification. Int. J. Comput. Appl. 1, 98–104 (2010)
Dilrukshi, K., De Zoysa, K., Caldera, A.: Twitter news classification using SVM. In: 2013 8th International Conference on Computer Science & Education, Colombo, pp. 287–291 (2013)
Nowak, J., Taspinar, A., Scherer, R.: LSTM recurrent neural networks for short text and sentiment classification. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, Lotfi A., Zurada, Jacek M. (eds.) ICAISC 2017. LNCS (LNAI), vol. 10246, pp. 553–562. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59060-8_50
Cardellino, C.: Spanish Billion Words Corpus and Embeddings, March 2016. https://crscardellino.github.io/SBWCE/
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Hochreiter, S., Bengio, Y., Frasconi, P., Schmidhuber, J.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies (2001)
Gers, F., Schraudolph, N., Schmidhuber, J.: Learning precise timing with LSTM recurrent networks. J. Mach. Learn. Res. 3, 115–143 (2002)
Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. Official J. Int. Neural Netw. Soc. 18(5), 602–610 (2005)
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Zhang, Y., Wallace, B.C.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. In: Proceedings of International Joint Conference Natural Language Processing, pp. 253–263 (2017)
Incidencia delictiva del Fuero Común. https://www.gob.mx/sesnsp/acciones-y-programas/incidencia-delictiva-del-fuero-comun-nueva-metodologia. Accessed 11 Dec 2019
Jing, R.: A self-attention based LSTM network for text classification. J. Phys: Conf. Ser. 1207, 012008 (2019)
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2016)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, vol. 14, pp. 1532–1543 (2014)
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. NAACL-HLT 1, 4171–4186 (2019)
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)
Reyes-Ortiz, J.A., Bravo, M.: Enhancing patterns with linguistic information for criminal event recognition. J. Intell. Fuzzy Syst. 34(5), 3027–3036 (2018)
Fauzi, M.A., Arifin, A.Z., Gosaria, S.C., Prabowo, I.S.: Indonesian news classification using naïve bayes and two-phase feature selection model. Indonesian J. Electr. Eng. Comput. Sci. 8(3), 610–615 (2017)
Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. In: Aggarwal, C., Zhai, C. (eds.) Mining text data, pp. 163–222. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_6
GarcÃa-Mendoza, C.-V., Gambino Juárez, O.: News article classification of Mexican newspapers. In: Mata-Rivera, M.F., Zagal-Flores, R. (eds.) WITCOM 2018. CCIS, vol. 944, pp. 101–109. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03763-5_9
Acknowledgment
This work is supported by the Sectoral Research Fund for Education with the CONACYT project 257357 and partially supported by the VIEP-BUAP project. The authors also would like to thank Universidad Autonoma Metropolitana, unit Azcapotzalco, with the research project SI001-18.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Vidal, M.T., RodrÃguez, E.S., Reyes-Ortiz, J.A. (2020). Classification of Criminal News Over Time Using Bidirectional LSTM. In: Lu, Y., Vincent, N., Yuen, P.C., Zheng, WS., Cheriet, F., Suen, C.Y. (eds) Pattern Recognition and Artificial Intelligence. ICPRAI 2020. Lecture Notes in Computer Science(), vol 12068. Springer, Cham. https://doi.org/10.1007/978-3-030-59830-3_61
Download citation
DOI: https://doi.org/10.1007/978-3-030-59830-3_61
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59829-7
Online ISBN: 978-3-030-59830-3
eBook Packages: Computer ScienceComputer Science (R0)