Skip to main content

TUCAN: Twitter User Centric ANalyzer

  • Chapter
  • First Online:
Online Social Media Analysis and Visualization

Part of the book series: Lecture Notes in Social Networks ((LNSN))

Abstract

Twitter has attracted millions of users that generate a humongous flow of information at constant pace. The research community has thus started proposing tools to extract meaningful information from tweets. In this paper, we take a different angle from the mainstream of previous work: we explicitly target the analysis of the timeline of tweets from “single users”. We define a framework—named TUCAN—to compare information offered by the target users over time, and to pinpoint recurrent topics or topics of interest. First, tweets belonging to the same time window are aggregated into “bird songs”. Several filtering procedures can be selected to remove stop-words and reduce noise. Then, each pair of bird songs is compared using a similarity score to automatically highlight the most common terms, thus highlighting recurrent or persistent topics. TUCAN can be naturally applied to compare bird song pairs generated from timelines of different users. By showing actual results for both public profiles and anonymous users, we show how TUCAN is useful to highlight meaningful information from a target user’s Twitter timeline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Note that by definition, \(\textit{VS}(u,i) \otimes \textit{VS}(u,i)=1\).

  2. 2.

    https://dev.twitter.com/docs/api.

References

  1. Java A, Song X, Finin T, Tseng B (2007) Why we Twitter: understanding microblogging usage and communities. In: Workshop on web mining and social network, analysis, pp 56–65

    Google Scholar 

  2. Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: WWW, pp 591–600

    Google Scholar 

  3. Alvanaki F, Michel S, Ramamritham K, Weikum G (2012) See what’s enBlogue—real-time emergent topic identification in social media. In: EDBT. ACM, Berlin

    Google Scholar 

  4. Hong L, Davison BD (2010) Empirical study of topic modeling in Twitter. In: Workshop on social media analytics. ACM, New York, pp 80–88

    Google Scholar 

  5. Mathioudakis M, Koudas N (2010) Twittermonitor: trend detection over the Twitter stream. In: SIGMOD’10. ACM, New York, pp 1155–1158

    Google Scholar 

  6. Ramage D, Dumais ST, Liebling DJ (2010) Characterizing microblogs with topic models. In: Cohen WW, Gosling S (eds) ICWSM. The AAAI Press

    Google Scholar 

  7. Salton G, Mcgill MJ (1986) Introduction to modern information retrieval. McGraw-Hill Inc, New York

    Google Scholar 

  8. Grimaudo L, Song H, Baldi M, Mellia M, Munafò M (2013) TUCAN Twitter user centric analyzer. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining

    Google Scholar 

  9. Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 conference on empirical methods in natural language processing: volume 1—volume 1. Stroudsburg, pp 248–256

    Google Scholar 

  10. Chang J, Boyd-Graber J, Blei DM (2009) Connections between the lines: augmenting social networks with text. In: ACM SIGKDD. ACM, New York, pp 169–178

    Google Scholar 

  11. Zhao WX, Jiang J, Weng J, He J, Lim EP, Yan H, Li X (2011) Comparing twitter and traditional media using topic models. In: ECIR’11. Berlin, pp 338–349

    Google Scholar 

  12. Phan X-H, Nguyen L-M, Horiguchi S (2008) Learning to classify short and sparse text and web with hidden topics from large-scale data collections. In: WWW. New York, pp 91–100

    Google Scholar 

  13. Liu Y, Niculescu-Mizil A, Gryc W (2009) Topic-link LDA: joint models of topic and author community. In: Annual international conference on machine learning. ACM, New York, pp 665–672

    Google Scholar 

  14. Blei DM, Ng A, Jordan M (2003) Latent dirichlet allocation. JMLR 3:993–1022

    MATH  Google Scholar 

  15. Rosen-Zvi M, Griffiths T, Steyvers M, Smyth P (2004) The author-topic model for authors and documents. In: UAI. Arlington, pp 487–494

    Google Scholar 

  16. Rosen-Zvi M, Chemudugunta C, Griffiths T, Smyth P, Steyvers M (2010) Learning author-topic models from text corpora, vol 28(1). ACM, New York, pp 4:1–4:38

    Google Scholar 

  17. Das Sarma A, Jain A, Yu C (2011) Dynamic relationship and event discovery. In: WSDM. New York, pp 207–216

    Google Scholar 

  18. Malik S, Smith A, Hawes T, Papadatos P, Dunne C, Shneiderman B (2013) TopicFlow: visualizing topic alignment of twitter data over time. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining

    Google Scholar 

  19. Porter M (1980) An algorithm for suffix stripping. Program 14(3):130–137

    Article  Google Scholar 

  20. Fellbaum C (1998) WordNet: An Electronic Lexical Database. MIT Press, Cambridge, p 422

    MATH  Google Scholar 

  21. Honeycutt C, Herring SC (2009) Beyond microblogging: conversation and collaboration via Twitter. In: HICSS’09, pp 1–10

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luigi Grimaudo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Grimaudo, L., Song, H.H., Baldi, M., Mellia, M., Munafò, M. (2014). TUCAN: Twitter User Centric ANalyzer. In: Kawash, J. (eds) Online Social Media Analysis and Visualization. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-13590-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13590-8_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13589-2

  • Online ISBN: 978-3-319-13590-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics