Analysis the Arabic Authorship Attribution Using Machine Learning Methods: Application on Islamic Fatwā

  • Conference paper
Recent Trends in Data Science and Soft Computing (IRICT 2018)

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 843)

Abstract

For Arabic, the authorship attribution (AA) problem has received less attention than for other natural languages such as English, Chinese, and Dutch. This paper addresses the attribution problem in the context of Islamic fatwā. To the best of our knowledge, this is the first study to address the problem in this domain. Three machine-learning classifiers are used as attribution methods: the locally weighted learning (LWL) classifier, the C4.5 decision tree, and Random Forest (RF). The experiment is performed with a selected list of stylometric features. To extract the most discriminating features, various feature selection techniques are used. The experimental results show that the classifiers behave differently under each feature reduction technique. Among the classifiers used, C4.5 gives the best accuracy.
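
The abstract does not list the stylometric features that were used. As a purely illustrative sketch, the Python snippet below computes a few common lexical and character-level style measures of the kind that are typically fed to classifiers such as those named above. The function name `stylometric_features`, the four features, and the punctuation set are assumptions made for this example and are not taken from the paper.

```python
import re
import string

# ASCII punctuation plus a few Arabic marks (an illustrative, assumed set).
PUNCTUATION = set(string.punctuation) | {"،", "؛", "؟"}


def stylometric_features(text):
    """Return a small dict of simple style measures for one document.

    Hypothetical helper: the feature list is an assumption, not the
    paper's actual feature set.
    """
    # \w+ matches runs of Unicode letters/digits, so Arabic words are
    # tokenised too (diacritic marks would split tokens; strip them first).
    words = re.findall(r"\w+", text)
    # Split on Latin and Arabic sentence-ending marks, dropping empty pieces.
    sentences = [s for s in re.split(r"[.!?؟]+", text) if s.strip()]

    n_words = len(words) or 1          # avoid division by zero
    n_sentences = len(sentences) or 1
    n_chars = len(text) or 1

    return {
        "avg_word_length": sum(len(w) for w in words) / n_words,
        "type_token_ratio": len(set(words)) / n_words,
        "avg_sentence_length": len(words) / n_sentences,
        "punctuation_ratio": sum(ch in PUNCTUATION for ch in text) / n_chars,
    }


if __name__ == "__main__":
    print(stylometric_features("one two two. three four!"))
```

Each document would yield one such feature vector. A feature selection step and a classifier (for example, the Weka implementations referenced in the Notes) could then be applied to the resulting table.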

Notes

  1. http://www.dar-alifta.org/Foreign/default.aspx?LangID=2&Home=1.

  2. http://www.cs.waikato.ac.nz/ml/weka/.

Author information

Corresponding author

Correspondence to Abdel-Hamid Emara.

Copyright information

© 2019 Springer Nature Switzerland AG

Cite this paper

Al-Sarem, M., Emara, AH. (2019). Analysis the Arabic Authorship Attribution Using Machine Learning Methods: Application on Islamic Fatwā. In: Saeed, F., Gazem, N., Mohammed, F., Busalim, A. (eds) Recent Trends in Data Science and Soft Computing. IRICT 2018. Advances in Intelligent Systems and Computing, vol 843. Springer, Cham. https://doi.org/10.1007/978-3-319-99007-1_21
