Abstract
We observe the statistical properties of blogs that are expected to reflect social human interaction. Firstly, we introduce a basic normalization preprocess that enables us to evaluate the genuine word frequency in blogs that are independent of external factors such as spam blogs, server-breakdowns, increase in the population of bloggers, and periodic weekly behaviors. After this process, we can confirm that small frequency words clearly follow an independent Poisson process as theoretically expected. Secondly, we focus on each blogger’s basic behaviors. It is found that there are two kinds of behaviors of bloggers. Further, Zipf’s law on word frequency is confirmed to be universally independent of individual activity types.
Similar content being viewed by others
References
Furusawa C, Kaneko K (2003) Zipf’s law in gene expression. Phys Rev Lett 90: 088102
Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L (2009) Detecting influenza epidemics using search engine query data. Nature 457: 1012–1014
Lambiotte R, Ausloos M, Thelwall M (2007) Word statistics in blogs and rss feeds: towards empirical universal evidence. J Infometr 1: 277–286
Menezes MA, Barabási A-L (2004) Separating internal and external dynamics of complex systems. Phys Rev Lett 93: 068702
Ministry of Internal Affairs and Communications (2009) Report of institute for information and communication policy (Japanese)
Narisawa K, Yamada Y, Ikeda D, Takeda M (2006) Detecting blog spams using the vocabulary size of all substrings in their copies. In: Proceedings of the 3rd annual workshop on weblogging ecosystem
Okuyama k, Takayasu H, Takayasu M (1999) Zipf’s law in income distribution of companies. Physica A 269: 125–131
Page L, Brin S, Motwani R, Winograd T (1998) The pageRank citation ranking. Bringing from the standford digital library technologies project
Sato Y, Utsuro T, Fukuhara T, Kawada Y, Murakami Y, Nakagawa H, Kando N (2008) Analysing features of japanese splogs and characteristics of keywords. In: Proceedings of the 4th international workshop on adversarial information retrieval on the web, pp 33–40
The State of the Live Web, April 2007. In: Sifry’s Alerts Available via http://www.sifry.com/alerts/archives/000493.html. Accessed April 30th, 2009
Zipf GK (1949) Human behavior and the principle of least effort. Addison-Wesley, Cambridge
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sano, Y., Takayasu, M. Macroscopic and microscopic statistical properties observed in blog entries. J Econ Interact Coord 5, 221–230 (2010). https://doi.org/10.1007/s11403-010-0065-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11403-010-0065-7