Abstract
In context of Arabic, the authorship attribution (AA) problem is not addressed well comparing with other natural languages such English, Chinese and Dutch. This paper addresses the attribution problem in context of Islamic fatwā’. To the best of our knowledge, this is the first study of its kind that addresses this problem in such domain. In term of attribution methods, three machine-learning classifiers namely, the locally weighted learning (LWL) classifier, decision tree C4.5, and Random Forest (RF) are used. The experiment is performed with a selected list of stylomatric features. To extract the most discriminating features, various feature selection techniques are used. The experimental results show that the classifiers have different behaviour respect each feature reduction techniques. Among the used classifiers, the C4.5 method gives the best accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Neal, T., Sundararajan, K., Fatima, A., Yan, Y., Xiang, Y., Woodard, D.: Surveying stylometry techniques and applications. ACM Comput. Surv. (CSUR) 50(6), 86 (2017)
Jockers, M.L., Witten, D.M.: A comparative study of machine-learning methods for authorship attribution. Lit. Linguist. Comput. 25, 215–223 (2010)
Altheneyan, A.S., El Bachir Menai, M.: Naıve Bayes classifier for authorship attribution of Arabic texts. J. King Saud Univ. Comput. Inf. Sci. 26(4), 473–484 (2014)
Labbé, D.: Experiments on authorship attribution by intertextual distance in English. J. Quant. Linguist. 14(1), 33–80 (2017)
Savoy, J., Attribution, A.: A comparative study of three text corpora and three languages. J. Quant. Linguist. 19(2), 132–161 (2012)
Oppliger, R.: Automatic authorship attribution based on character n-grams in Swiss German. In: Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)
Crespo, M., Frías, A.: Stylistic authorship comparison and attribution of Spanish news forum messages based on the Tree Tagger POS Tagger, The Authors. Published by Elsevier, Multimodal Communication in the 21st Century: Professional and Academic Challenges. In: 33rd Conference of the Spanish Association of Applied Linguistics (AESLA), XXXIII AESLA Conference, Madrid, Spain, 16–18 April 2015
Shaker, K., Corne, D.: Authorship attribution in Arabic using a hybrid of evolutionary search and linear discriminant analysis. In: 2010 UK Workshop on Computational Intelligence (UKCI), pp. 1–6 (2010). https://doi.org/10.1109/ukci.2010.5625580
Al-Ayyoub, M., Jararweh, Y., Rababa’ah, A., Aldwairi, M.: Feature extraction and selection for Arabic tweets authorship authentication. J. Ambient Intell. Hum. Comput. 8, 383–393 (2017). https://doi.org/10.1007/s12652-017-0452-1
https://en.wikipedia.org/wiki/Fatwa. Last visit: 22:48. Accessed 18 Jan 2018
Stamatatos, E.: A survey of modern authorship attribution methods. J. Am. Soc. Inf. Sci. Technol. 60(3), 538–556 (2009). https://doi.org/10.1002/asi.21001
Zheng, R., Li, J., Chen, H., Huang, Z.: A framework for authorship identification of online messages: writing-style features and classification techniques. J. Am. Soc. Inf. Sci. Technol. 57(3), 378–393 (2006)
Abbasi, A., Chen, H.: Applying authorship analysis to extremist-group Web forum messages. IEEE Intell. Syst. 20(5), 67–75 (2005). https://doi.org/10.1109/MIS.2005.81
Al-Falahi, A., Ramdani, M., Bellafkih, M., Al-Sarem, M.: Authorship attribution in Arabic poetry’ context using Markov Chain classifier. IEEE (2015)
Hall, M.A.: Correlation-based feature selection for machine learning. Ph.D. thesis, The University of Waikato (1999)
Guetlein, M., Frank, E., Hall, M.: Large scale attribute selection using wrappers. In: Proceedings of IEEE Symposium on Computational Intelligence and Data Mining, pp. 332–339 (2009)
Dunteman, G.H.: Principal Components Analysis, vol. 69. Sage, Thousand Oaks (1989)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Al-Sarem, M., Emara, AH. (2019). Analysis the Arabic Authorship Attribution Using Machine Learning Methods: Application on Islamic Fatwā. In: Saeed, F., Gazem, N., Mohammed, F., Busalim, A. (eds) Recent Trends in Data Science and Soft Computing. IRICT 2018. Advances in Intelligent Systems and Computing, vol 843. Springer, Cham. https://doi.org/10.1007/978-3-319-99007-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-99007-1_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99006-4
Online ISBN: 978-3-319-99007-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)