Network Differences between Normal and Shuffled Texts: Case of Croatian
This paper is an initial attempt to study the properties of the Croatian word order via complex networks. We present network properties of normal and shuffled Croatian texts for different co-occurrence window sizes and different linkage boundaries. The results of network analysis show that the text shuffling causes the decrease of the network diameter, due to the establishment of previously non-existing links. This indicates that the syntax does play a significant role in the Croatian language, although it is a mostly free word-order language.
Keywordscomplex networks linguistic co-occurrence networks Croatian corpus shuffled text randomized text
Unable to display preview. Download preview PDF.
- 1.Meta-net white paper series: Key results and cross-language comparison (2012), http://www.meta-net.eu/whitepapers/key-results-and-cross-language-comparison
- 2.Alstott, J., Bullmore, E., Plenz, D.: Powerlaw: a python package for analysis of heavy-tailed distributions. arXiv preprint arXiv:1305.0215 (2013)Google Scholar
- 3.Ban, K., Martinčić-Ipšić, S., Meštrović, A.: Initial comparison of linguistic networks measures for parallel texts. In: 5th International Conference on Information Technologies and Information Society (ITIS), pp. 97–104 (2013)Google Scholar
- 6.Hagberg, A., Swart, P., Chult, D.: Exploring network structure, dynamics, and function using networkx. Technical report, Los Alamos National Laboratory (LANL) (2008)Google Scholar
- 7.Krishna, M., Hassan, A., Liu, Y., Radev, D.: The effect of linguistic constraints on the large scale organization of language. arXiv preprint arXiv:1102.2831 (2011)Google Scholar
- 10.Margan, D., Martinčić-Ipšić, S., Meštrović, A.: Preliminary report on the structure of Croatian linguistic co-occurrence networks. In: 5th International Conference on Information Technologies and Information Society (ITIS), Slovenia, pp. 89–96 (2013)Google Scholar
- 11.Masucci, A., Rodgers, G.: Network properties of written human language. Physical Review E 74(2), 026102 (2006)Google Scholar