Abstract
With the increase of text stored in electronic format, it is no longer possible for humans to understand all the incoming data or even categorize it. We need an automatic text classification system in order to classify them into predefined classes and quickly retrieve information. Text classification can be achieved by machine learning, it requires a set of approaches for vectorization and classification. In vectorization phase, this work proposes two approaches (BOW and TF-IDF), but in the classification phase, the algorithms of machine learning used are: RL, SVM and ANN. At the end, a comparison study is given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Duwairi, R.M.: Arabic text categorization. Int. Arab J. Inf. Technol. 4(2), 125–132 (2007)
Mesleh, A.: Support vector machines based arabic language text classification system: feature selection comparative study. In: Sobh, T. (ed.) Advances in Computer and Information Sciences and Engineering, pp. 11–16. Springer, Dordrecht (2007). https://doi.org/10.1007/978-1-4020-8741-7_3
Hrala, M., Král, P.: Evaluation of the document classification approaches. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds.) Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013. Advances in Intelligent Systems and Computing, vol. 226, pp. 877–885. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-319-00969-8_86
Abu-Errub, A.: Arabic text classification algorithm using TFIDF and chi square measurements. Int. J. Comput. Appl. 93, 40–45 (2014). https://doi.org/10.5120/16223-5674
Bahassine, S., Madani, A., Al-Sarem, M., Kissi, M.: Feature selection using an improved Chi-square for Arabic text classification. J. King Saud Univ. Comput. Inf. Sci. 32, 225–231 (2020)
Boukil, S., Biniz, M., El Adnani, F., Cherrat, L., Moutaouakkil, Abd Elmajid El.: Arabic text classification using deep learning technics. Int. J. Grid Distrib. Comput. 11(9), 103–114 (2018)
Simeone, O.: A Brief Introduction to Machine Learning for Engineers, 168083472X, pp. 6–7 (2017). ISBN 9781680834727
Hilbe, J.M.: Practical Guide to Logistic Regression, pp 3–4. Taylor & Francis, Abingdon (2016). ISBN 9781498709576, 1498709575
Caropreso, M., Sebastiani, F., Ricerche, C.: Statistical Phrases in Automated Text Categorization (2001)
Bisong, E.: Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, pp. 247–248. Apress, Berkeley (2019). ISBN 978-1-4842-4469-2, 978-1-4842-4470-8
Tambade, S., Somvanshi, M., Chavan, P., Shinde, S.: SVM based diabetic classification and hospital recommendation. Int. J. Comput. Appl. 167, 40–43 (2017)
Rimouche, N., Hadjira, H.: Amélioration du produit scalaire via les mesures de similarités sémantiques dans le cadre de la catégorisation des textes. Université abou Beker Belkaid Tlemcen (2016)
Mohammed, B., Brahim, B.: L`apprentissage profond (Deep Learning) pour la classification et la recherche d’images par le contenu, UNIVERSITE KASDI MERBAH OUARGLA Faculté des Nouvelles Technologies de l’Information et de la Communication (2017)
Sahin, Ö.: Text Classification (2021). https://doi.org/10.1007/978-1-4842-6421-8_3
Jalam, R.: Apprentissage automatique et catégorisation de textes multilingues, pp. 9–10. Université Lumière Lyon 2 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Jamaleddyn, I., Biniz, M. (2021). Contribution to Arabic Text Classification Using Machine Learning Techniques. In: Fakir, M., Baslam, M., El Ayachi, R. (eds) Business Intelligence. CBI 2021. Lecture Notes in Business Information Processing, vol 416. Springer, Cham. https://doi.org/10.1007/978-3-030-76508-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-76508-8_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-76507-1
Online ISBN: 978-3-030-76508-8
eBook Packages: Computer ScienceComputer Science (R0)