Skip to main content

The Wisdom of the Audience: An Empirical Study of Social Semantics in Twitter Streams

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNISA,volume 7882)


Interpreting the meaning of a document represents a fundamental challenge for current semantic analysis methods. One interesting aspect mostly neglected by existing methods is that authors of a document usually assume certain background knowledge of their intended audience. Based on this knowledge, authors usually decide what to communicate and how to communicate it. Traditionally, this kind of knowledge has been elusive to semantic analysis methods. However, with the rise of social media such as Twitter, background knowledge of intended audiences (i.e., the community of potential readers) has become explicit to some extents, i.e., it can be modeled and estimated. In this paper, we (i) systematically compare different methods for estimating background knowledge of different audiences on Twitter and (ii) investigate to what extent the background knowledge of audiences is useful for interpreting the meaning of social media messages. We find that estimating the background knowledge of social media audiences may indeed be useful for interpreting the meaning of social media messages, but that its utility depends on manifested structural characteristics of message streams.


  • Background Knowledge
  • Topic Model
  • Latent Dirichlet Allocation
  • Twitter Message
  • Audience User

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  2. Paul Grice, H.: Logic and conversation. In: Cole, P. (ed.) Speech Acts. Syntax and semantics, vol. 3, pp. 41–58. Academic Press, New York (1975)

    Google Scholar 

  3. Hong, L., Davison, B.D.: Empirical study of topic modeling in twitter. In: Proceedings of the IGKDD Workshop on Social Media Analytics (SOMA) (2010)

    Google Scholar 

  4. Hotho, A., Staab, S., Stumme, G.: Wordnet improves text document clustering. In: Proc. of the SIGIR 2003 Semantic Web Workshop, pp. 541–544 (2003)

    Google Scholar 

  5. Laniado, D., Mika, P.: Making sense of twitter. In: Patel-Schneider, P.F., Pan, Y., Hitzler, P., Mika, P., Zhang, L., Pan, J.Z., Horrocks, I., Glimm, B. (eds.) ISWC 2010, Part I. LNCS, vol. 6496, pp. 470–485. Springer, Heidelberg (2010)

    CrossRef  Google Scholar 

  6. Litt, E.: Knock, knock. Who’s there? The imagined audience. Journal of Broadcasting and Electronic Media 56 (2012)

    Google Scholar 

  7. Marwick, A., Boyd, D.: I tweet honestly, i tweet passionately: Twitter users, context collapse, and the imagined audience. New Media and Society (2010)

    Google Scholar 

  8. McCallum, A.K.: Mallet: A machine learning for language toolkit (2002),

  9. Phan, X.-H., Nguyen, L.-M., Horiguchi, S.: Learning to classify short and sparse text & web with hidden topics from large-scale data collections. In: Proceedings of the 17th International Conference on World Wide Web, WWW 2008, pp. 91–100. ACM, New York (2008)

    CrossRef  Google Scholar 

  10. Romero, D.M., Meeder, B., Kleinberg, J.: Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 695–704. ACM, New York (2011)

    Google Scholar 

  11. Searle, J.: A taxonomy of illocutionary acts, pp. 334–369. University of Minnesota Press, Minneapolis (1975)

    Google Scholar 

  12. Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, pp. 841–842. ACM, New York (2010)

    Google Scholar 

  13. Wagner, C., Strohmaier, M.: The wisdom in tweetonomies: Acquiring latent conceptual structures from social awareness streams. In: Semantic Search Workshop at WWW 2010 (2010)

    Google Scholar 

  14. Wallach, H.M.: Structured Topic Models for Language. PhD thesis, University of Cambridge (2008)

    Google Scholar 

  15. Yang, L., Sun, T., Zhang, M., Mei, Q.: We know what @you #tag: does the dual role affect hashtag adoption? In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 261–270. ACM, New York (2012)

    CrossRef  Google Scholar 

Download references

Author information

Authors and Affiliations


Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wagner, C., Singer, P., Posch, L., Strohmaier, M. (2013). The Wisdom of the Audience: An Empirical Study of Social Semantics in Twitter Streams. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds) The Semantic Web: Semantics and Big Data. ESWC 2013. Lecture Notes in Computer Science, vol 7882. Springer, Berlin, Heidelberg.

Download citation

  • DOI:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38287-1

  • Online ISBN: 978-3-642-38288-8

  • eBook Packages: Computer ScienceComputer Science (R0)