Skip to main content

Contribution to Arabic Text Classification Using Machine Learning Techniques

  • Conference paper
  • First Online:
Business Intelligence (CBI 2021)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 416))

Included in the following conference series:

Abstract

With the increase of text stored in electronic format, it is no longer possible for humans to understand all the incoming data or even categorize it. We need an automatic text classification system in order to classify them into predefined classes and quickly retrieve information. Text classification can be achieved by machine learning, it requires a set of approaches for vectorization and classification. In vectorization phase, this work proposes two approaches (BOW and TF-IDF), but in the classification phase, the algorithms of machine learning used are: RL, SVM and ANN. At the end, a comparison study is given.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Duwairi, R.M.: Arabic text categorization. Int. Arab J. Inf. Technol. 4(2), 125–132 (2007)

    Google Scholar 

  2. Mesleh, A.: Support vector machines based arabic language text classification system: feature selection comparative study. In: Sobh, T. (ed.) Advances in Computer and Information Sciences and Engineering, pp. 11–16. Springer, Dordrecht (2007). https://doi.org/10.1007/978-1-4020-8741-7_3

    Chapter  Google Scholar 

  3. Hrala, M., Král, P.: Evaluation of the document classification approaches. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds.) Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013. Advances in Intelligent Systems and Computing, vol. 226, pp. 877–885. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-319-00969-8_86

  4. Abu-Errub, A.: Arabic text classification algorithm using TFIDF and chi square measurements. Int. J. Comput. Appl. 93, 40–45 (2014). https://doi.org/10.5120/16223-5674

    Article  Google Scholar 

  5. Bahassine, S., Madani, A., Al-Sarem, M., Kissi, M.: Feature selection using an improved Chi-square for Arabic text classification. J. King Saud Univ. Comput. Inf. Sci. 32, 225–231 (2020)

    Google Scholar 

  6. Boukil, S., Biniz, M., El Adnani, F., Cherrat, L., Moutaouakkil, Abd Elmajid El.: Arabic text classification using deep learning technics. Int. J. Grid Distrib. Comput. 11(9), 103–114 (2018)

    Article  Google Scholar 

  7. Simeone, O.: A Brief Introduction to Machine Learning for Engineers, 168083472X, pp. 6–7 (2017). ISBN 9781680834727

    Google Scholar 

  8. Hilbe, J.M.: Practical Guide to Logistic Regression, pp 3–4. Taylor & Francis, Abingdon (2016). ISBN 9781498709576, 1498709575

    Google Scholar 

  9. Caropreso, M., Sebastiani, F., Ricerche, C.: Statistical Phrases in Automated Text Categorization (2001)

    Google Scholar 

  10. Bisong, E.: Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, pp. 247–248. Apress, Berkeley (2019). ISBN 978-1-4842-4469-2, 978-1-4842-4470-8

    Google Scholar 

  11. Tambade, S., Somvanshi, M., Chavan, P., Shinde, S.: SVM based diabetic classification and hospital recommendation. Int. J. Comput. Appl. 167, 40–43 (2017)

    Google Scholar 

  12. Rimouche, N., Hadjira, H.: Amélioration du produit scalaire via les mesures de similarités sémantiques dans le cadre de la catégorisation des textes. Université abou Beker Belkaid Tlemcen (2016)

    Google Scholar 

  13. Mohammed, B., Brahim, B.: L`apprentissage profond (Deep Learning) pour la classification et la recherche d’images par le contenu, UNIVERSITE KASDI MERBAH OUARGLA Faculté des Nouvelles Technologies de l’Information et de la Communication (2017)

    Google Scholar 

  14. Sahin, Ö.: Text Classification (2021). https://doi.org/10.1007/978-1-4842-6421-8_3

  15. Jalam, R.: Apprentissage automatique et catégorisation de textes multilingues, pp. 9–10. Université Lumière Lyon 2 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jamaleddyn, I., Biniz, M. (2021). Contribution to Arabic Text Classification Using Machine Learning Techniques. In: Fakir, M., Baslam, M., El Ayachi, R. (eds) Business Intelligence. CBI 2021. Lecture Notes in Business Information Processing, vol 416. Springer, Cham. https://doi.org/10.1007/978-3-030-76508-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-76508-8_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-76507-1

  • Online ISBN: 978-3-030-76508-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics