Macroscopic and microscopic statistical properties observed in blog entries

  • Regular Article
  • Journal of Economic Interaction and Coordination

Abstract

We examine statistical properties of blogs, which are expected to reflect human social interaction. First, we introduce a basic normalization preprocess that lets us evaluate the genuine word frequency in blogs, independent of external factors such as spam blogs, server breakdowns, growth in the blogger population, and periodic weekly behavior. After this preprocessing, we confirm that low-frequency words clearly follow an independent Poisson process, as theoretically expected. Second, we focus on the basic behavior of individual bloggers and find that bloggers exhibit two kinds of behavior. Further, Zipf’s law for word frequency is confirmed to hold universally, independent of individual activity type.
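The two statistical checks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not specify the normalization, so dividing a word's daily count by the total number of entries that day is only an assumed stand-in, and the function names (`normalize`, `fano_factor`, `zipf_exponent`) are hypothetical. For a Poisson process the variance of the counts equals the mean, so the variance-to-mean ratio (Fano factor) should be close to 1; under Zipf's law the log-log slope of frequency against rank should be close to -1.

```python
import math
import random
from collections import Counter


def normalize(daily_counts, daily_entries):
    """Assumed normalization: word count per day divided by total entries that day."""
    return [c / n for c, n in zip(daily_counts, daily_entries)]


def fano_factor(counts):
    """Variance-to-mean ratio of a count series; about 1 for Poisson counts."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    return var / mean


def rank_frequency(words):
    """Word frequencies sorted in descending order, so index 0 is rank 1."""
    return sorted(Counter(words).values(), reverse=True)


def zipf_exponent(freqs):
    """Least-squares slope of log(frequency) against log(rank), sign-flipped."""
    xs = [math.log(r) for r in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return -sxy / sxx


def poisson_sample(lam, rng):
    """Draw one Poisson variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic daily counts of a rare word, not data from the paper.
    counts = [poisson_sample(3.0, rng) for _ in range(5000)]
    print("Fano factor:", fano_factor(counts))
    # Synthetic frequencies that follow 1/rank exactly.
    print("Zipf exponent:", zipf_exponent([1000 / r for r in range(1, 51)]))
```

On real blog data, a Fano factor well above 1 for a rare word would suggest that bursts or external factors remain after normalization.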

Author information

Corresponding author

Correspondence to Yukie Sano.

About this article

Cite this article

Sano, Y., Takayasu, M. Macroscopic and microscopic statistical properties observed in blog entries. J Econ Interact Coord 5, 221–230 (2010). https://doi.org/10.1007/s11403-010-0065-7

  • DOI: https://doi.org/10.1007/s11403-010-0065-7
