Abstract
This paper assesses if text possesses fractal properties, namely if several attributes that characterize sentences are self-similar. In order to do that, seven corpora were analyzed using several statistical tools, so as to determine if the empirical sequences for the attributes were Gaussian and self-similar. The Kolmogorov-Smirnov goodness-of-fit test and two Hurst parameter estimators were employed. The results show that there is a fractal beauty in the text produced by humans and suggest that its quality is directly proportional to the self-similarity degree.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alzahrani, S., Naomie, S., Ajith, A.: Understanding plagiarism linguistic patterns, textual features, and detection methods. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 42(2), 133–149 (2012)
Cordeiro, J., Dias, G., Cleuziou G.: Biology Based Alignments of Paraphrases for Sentence Compression. Workshop on Textual Entailment (ACL-PASCAL) (2007)
Corder, G.W., Foreman, D.I.: Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach. Wiley, New Jersey (2009)
Fernandes, D.A.B., Neto, M., Soares, L.F.B., Freire, M.M., Inácio, P.R.M.: A tool for estimating the hurst parameter and for generating self-similar sequences. In: Proceedings of the 46th Summer Computer Simulation Conference 2014 (SCSC 2014), Monterey, CA, USA (2014)
Hurst, H.: Long-Term Storage Capacity of Reservoirs. Transactions of the American Society of Civil Engineers 116, 770–799 (1951)
Koizumi, R.: Relationships Between Text Length and Lexical Diversity Measures: Can We Use Short Texts of Less than 100 Tokens? Vocabulary Learning and Instruction 1(1), 60–69 (2012)
Malvern, D., Richards, B., Chipere, N., Durán, P.: Lexical diversity and language development: Quantification and assessment. Houndmills, NH (2004)
McCarthy, P., Jarvis, S.: A theoretical and empirical evaluation of vocd. Language Testing 24, 459–488 (2007)
McCarthy, P., Jarvis, S.: MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods 42(2), 381–392 (2010)
Olsson, J., Luchjenbroers, J: Forensic linguistics. A&C Black (2013)
Schler, J., Koppel, M., Argamon, S., Pennebaker, J.W.: Effects of Age and Gender on Blogging. AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs 6, 199–205 (2006)
Stamatatos, E.: A survey of modern authorship attribution methods. Journal of the American Society for Information Science and Tech. 60(3), 538–556 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Cordeiro, J., Inácio, P.R.M., Fernandes, D.A.B. (2015). Fractal Beauty in Text. In: Pereira, F., Machado, P., Costa, E., Cardoso, A. (eds) Progress in Artificial Intelligence. EPIA 2015. Lecture Notes in Computer Science(), vol 9273. Springer, Cham. https://doi.org/10.1007/978-3-319-23485-4_80
Download citation
DOI: https://doi.org/10.1007/978-3-319-23485-4_80
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23484-7
Online ISBN: 978-3-319-23485-4
eBook Packages: Computer ScienceComputer Science (R0)