Skip to main content

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 219))

  • 1089 Accesses

Abstract

This paper proposes an approach of hot topic detection on microblogging, that is, do microblogging texts clustering and heat evaluation so as to find hot topics. For the shortness and sparseness and the existence of synonyms and polysemy of the massive microblogging texts, when modeling the microblogging texts, microblogging text vectors are mapped to low-dimensional feature vector space to achieve the purpose of dimensionally reduction and denoising using Latent Semantic Indexing (LSI). For the shortcomings of traditional single-pass algorithm on topic detection, this paper proposes a two-level Hierarchical Agglomerative Clustering combined single-pass clustering method. Finally, according to text and propagation characteristics of microblogging, this paper proposes the topic heat evaluation model. Experimental studies on real world microblogging data show the method in this paper works well on massive microblogging texts and can effectively dig out hot topics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhang M (2010) Research on algorithm of topic detection and tracking. J Beijing Jiaotong Univ 6:22–27

    Google Scholar 

  2. Li B, Yu S (2003) Research on topic detection and tracking. Comput Eng Appl 39(17):6–10

    Google Scholar 

  3. Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R (2011) Indexing by latent semantic analysis. J Am Soc Inf Sci 41(6):88–94

    Google Scholar 

  4. Yin F, Xiao W (2011) Incremental algorithm for clustering texts in internet-oriented topic detection. Appl Res Comp 28(1):228–234

    Google Scholar 

  5. Xiao F (2003) The 2003 topic detection and tracking (TDT2003) task definition and evaluation plan. Eng Appl Res Comp 4:59–64

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Changchun Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this paper

Cite this paper

Yang, C., Yang, J., Ding, H., Xue, H. (2013). A Hot Topic Detection Approach on Chinese Microblogging. In: Zhong, Z. (eds) Proceedings of the International Conference on Information Engineering and Applications (IEA) 2012. Lecture Notes in Electrical Engineering, vol 219. Springer, London. https://doi.org/10.1007/978-1-4471-4853-1_52

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4853-1_52

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4852-4

  • Online ISBN: 978-1-4471-4853-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics