Skip to main content

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10868))

Abstract

An interesting way to analyse the authorship authenticity of a document, is the use of stylometry. However, the use of conventional features and classifiers has some disadvantages such as the automatic authorship decision, which usually gives us a speechless authorship classification without (often) any way to measure or interpret the consistency of the results.

In this paper, we present a visual analytics based approach for the task of authorship discrimination. A specific application is dedicated to the authorship comparison between two ancient religious books: the Quran and Hadith. In fact, an important raising question is: could these ancient books be written by the same Author?

Thus, seven types of features are combined and normalized by PCA reduction and three visual analytical clustering methods are employed and commented on, namely: Principal Component Analysis, Gaussian Mixture Models and Self Organizing Maps.

The new visual analytical approach appears interesting, since it does not only show the distinction between the author styles, but also sheds light on how consistent was that distinction (i.e. visually).

Concerning the discrimination application on the ancient religious books, the results have shown the appearance of two separated clusters: namely a Quran cluster and Hadith cluster. The clusters distinction corresponds to a clear authorship difference between the two investigated documents, which implies that the two books (i.e. Quran and Hadith) come from two different Authors.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Blascheck, T., John, M., Kurzhals, K., Koch, S., Ertl, T.: VA2: a visual analytics approach for evaluating visual analytics applications. IEEE Trans. Vis. Comput. Graph. 22(1), 61–70 (2016)

    Article  Google Scholar 

  2. Sayoud, H.: Segmental analysis based authorship discrimination between the Holy Quran and Prophet’s statements. Digital Stud. J. 2014–2015 (2015)

    Google Scholar 

  3. Sayoud, H.: A visual analytics based investigation on the authorship of the Holy Quran. In: International Conference on Information Visualization Theory and Applications (IVAPP’2015), 11–14 March 2015, pp. 177–181 (2015)

    Google Scholar 

  4. Ibrahim, I.A.: A brief illustrated guide to understanding Islam. Library of Congress, Darussalam Publishers, Houston. www.islam-guide.com/contents-wide.htm

  5. Sayoud, H.: Author discrimination between the Holy Quran and Prophet’s statements. Literary Linguist. Comput. 27(4), 427–444 (2012)

    Google Scholar 

  6. Norusis, M.: Cluster analysis. In: SPSS 17.0 Statistical Procedures Companion, Marija Norusis, pp. 361–391. Pearson editor (2008). Chap. 16

    Google Scholar 

  7. Ellis, G., Mansmann, F.: VisMaster, Visual Analytics. In: Mastering the Information Age. Scientific Coordinator of VisMaster. Daniel Keim Jörn Kohlhammer (2010). Chap. 2

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Halim Sayoud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sayoud, H. (2018). Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith. In: Mouhoub, M., Sadaoui, S., Ait Mohamed, O., Ali, M. (eds) Recent Trends and Future Technology in Applied Intelligence. IEA/AIE 2018. Lecture Notes in Computer Science(), vol 10868. Springer, Cham. https://doi.org/10.1007/978-3-319-92058-0_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-92058-0_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-92057-3

  • Online ISBN: 978-3-319-92058-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics