Abstract
This paper proposes an approach of hot topic detection on microblogging, that is, do microblogging texts clustering and heat evaluation so as to find hot topics. For the shortness and sparseness and the existence of synonyms and polysemy of the massive microblogging texts, when modeling the microblogging texts, microblogging text vectors are mapped to low-dimensional feature vector space to achieve the purpose of dimensionally reduction and denoising using Latent Semantic Indexing (LSI). For the shortcomings of traditional single-pass algorithm on topic detection, this paper proposes a two-level Hierarchical Agglomerative Clustering combined single-pass clustering method. Finally, according to text and propagation characteristics of microblogging, this paper proposes the topic heat evaluation model. Experimental studies on real world microblogging data show the method in this paper works well on massive microblogging texts and can effectively dig out hot topics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang M (2010) Research on algorithm of topic detection and tracking. J Beijing Jiaotong Univ 6:22–27
Li B, Yu S (2003) Research on topic detection and tracking. Comput Eng Appl 39(17):6–10
Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R (2011) Indexing by latent semantic analysis. J Am Soc Inf Sci 41(6):88–94
Yin F, Xiao W (2011) Incremental algorithm for clustering texts in internet-oriented topic detection. Appl Res Comp 28(1):228–234
Xiao F (2003) The 2003 topic detection and tracking (TDT2003) task definition and evaluation plan. Eng Appl Res Comp 4:59–64
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this paper
Cite this paper
Yang, C., Yang, J., Ding, H., Xue, H. (2013). A Hot Topic Detection Approach on Chinese Microblogging. In: Zhong, Z. (eds) Proceedings of the International Conference on Information Engineering and Applications (IEA) 2012. Lecture Notes in Electrical Engineering, vol 219. Springer, London. https://doi.org/10.1007/978-1-4471-4853-1_52
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4853-1_52
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4852-4
Online ISBN: 978-1-4471-4853-1
eBook Packages: EngineeringEngineering (R0)