Abstract
A novel approach to authorship and style attribution and differentiation on the phonological level has been suggested. Each style is considered a statistical system the elements of which are mean frequencies of groups of consonants chosen as a style attribution and differentiation criterion. Statistical analogues of the phonological subsystems of style systems have been obtained by mathematical statistical methods (the hypothesis, ranking and style distance determination methods). Interrelations of style, language and individual manner of writing factors as well as the style-differentiating capability of eight groups of consonants (labial, forelingual, mediolingual, backlingual, nasal, constrictive, occlusive and sonorant) have been established. The results of the research show that only the three methods combined above allow to fully characterize each style (belles-lettres, colloquial and scientific) under study and establish authorship of a text. The closeness and distance established between the compared styles have been shown in the three models proposed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alamothoda, S.M.: Phonostatistics and phonotactics of the syllable in modern Persian. Series: Studia Orientalia. Finnish Oriental Society, Helsinki (2000)
Altmann, G., Levickij, V., Perebyinis, V.: Problems of Quantitative Linguistics. Ruta, Chernivtsy (2005)
Argamon, S., Koppel, M., Pennebaker, J., Schler, J.: Automatically profiling the author of an anonymous text. Commun. ACM 52(2), 119–123 (2009)
Bisikalo, O.V., Vysotska, V.A.: Sentence syntactic analysis application to keywords identification Ukrainian texts. Radio Electron. Comput. Sci. Control 3(38), 54–65 (2016)
Everitt, B.S.: A Handbook of Statistical Analyses Using R. Chapman and Hall/CRC, London/Boca Raton (2009)
Gomez, P.C.: Statistical methods in language and linguistic research. University of Murcia, Spain (2013)
Gries, Th.S.: Statistics for linguistics with R. Mouton Textbook (2009)
Juala, P.: Authorship attribution, foundations and trends(R) in information retrieval. 1(3), 233–334 (2008)
Kapociute-Dzikiene, J., Utka, F., Sarkute, L.: Authorship attribution and author profiling of Lithuanian literary texts. In: Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, Hissac, Bulgaria, pp. 96–105 (2015)
Khomytska, I., Teslyuk, V.: The method of statistical analysis of the scientific, colloquial, belles-lettres and newspaper styles on the phonological level. In: Advances in Intelligent Systems and Computing, vol. 512, pp. 149–163 (2016)
Khomytska, I., Teslyuk, V.: Modelling of phonostatistical structures of the colloquial and newspaper styles in English sonorant phoneme group. In: Proceedings of the XIIth Scientific and Technical Conference, CSIT, Lviv, pp. 67–70 (2017)
Kingston, J., Baayen, H., Clopper, C.G.: Statistical analyses: statistics in laboratory phonology. Mixed-effects models clustering and classification methods. Oxford (2011)
Koppel, M., Schler, J., Argamon, S.: Computational methods in authorship attribution. J. Assoc. Inf. Sci. Technol. 60(1), 9–26 (2009)
Kornai, A.: A Mathematical Linguistics. Springer, Heidelberg (2008)
Lytvyn, V.: Development of a method for the recognition of author’s style in the Ukrainian language texts based on linguometry, stylemetry and glottochronology. East. Eur. J. Enterp. Technol. 4/2(88), 10–18 (2017)
Mines, M.A., Hanson, B.F., Shoup, J.E.: Frequency of occurrence of phonemes in conversational English. Lang. Speech 21(3), 221–241 (1978)
Roberts, A.H.: A Statistical Linguistic Analysis of American English. Mouton, The Hague (1965)
Shakhovska, N.B., Noha, R.Y.: Methods and tools for text analysis of publications to study the functioning of scientific schools. J. Autom. Inf. Sci. 47(12)
Sovkowiak, W.: On the phonostatistics of English onomatopoeia. Adam Mickiewicz University, Poznan (1990)
Stamatatos, E.: A survey of modern attribution methods. J. Assoc. Inf. Sci. Technol. 60(3), 538–556 (2009)
Zhezhnych, P., Markiv, O.: A linguistic method of web-site content comparison with tourism documentation objects. In: Proceedings of the XIIth Scientific and Technical Conference, CSIT, Lviv, pp. 340–343 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Khomytska, I., Teslyuk, V. (2019). Authorship and Style Attribution by Statistical Methods of Style Differentiation on the Phonological Level. In: Shakhovska, N., Medykovskyy, M. (eds) Advances in Intelligent Systems and Computing III. CSIT 2018. Advances in Intelligent Systems and Computing, vol 871. Springer, Cham. https://doi.org/10.1007/978-3-030-01069-0_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-01069-0_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01068-3
Online ISBN: 978-3-030-01069-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)