Abstract
Publishing review comments of micro-blog hot topic is becoming the daily round in modern smart city. With the in-depth discussion about the micro-blog topic, it is a challenge for users to understand quickly the basic content of the topic due to its gradually complex inner relationship. To solve the problem, this paper presents the construction of micro-blog topic summarization. First, the keywords are extracted based on the lexical features of words occurred in comments published by users. And then the association rules between keywords are extracted by Apriori. Second, the associated semantic network for micro-blog topic (ASN-MT), the visualization of micro-blog topic summarization, is constructed according to the extracted keywords and association rules. Third, ASN-MT is optimized by reducing its scale based on the confidence threshold. And then its optimal value is selected according to the integrity of ASN-MT. The experimental results show that the proposed algorithm can construct accurately and quickly a concise and complete micro-blog topic summarization.
Similar content being viewed by others
References
Alemi M, Haghighi H, Shahrivari S (2017) Ccfinder: using spark to find clustering coefficient in big graphs. J Supercomput 73(11):4683–4710
Belyi E, Giabbanelli PJ, Patel I, Balabhadrapathruni NH, Abdallah AB, Hameed W, Mago VK (2016) Combining association rule mining and network analysis for pharmaco-surveillance. J Supercomput 72(5):2014–2034
Bicalho P, Pita M, Pedrosa G, Lacerda A, Pappa GL (2017) A general framework to expand short text for topic modeling. Inf Sci 393:66–81
Blanco I (2008) Using association rules to mine for strong approximate dependencies. Data Min Knowl Disc 16(3):313–348
Boldi P, Santini M, Vigna S (2009) PageRank. ACM Trans Inform Syst 27(4):1–23
Chen Z, Lin F, Liu H, Liu Y, Ma WY, Liu W (2002) User intention modeling in web applications using data mining. World Wide Web internet Web Inform Syst 5(3):181–191
Ercan G, Cicekli I (2007) Using lexical chains for keyword extraction. Inf Process Manage 43(6):1705–1714
Erra U, Senatore S, Minnella F, Caggianese G (2015) Approximate TF–IDF based on topic extraction from massive message stream using the GPU. Inf Sci 292:143–161
Feng F, Cho J, Pedrycz W, Fujita H, Herawan T (2016) Soft set based association rule mining. Knowl Based Syst 111:268–282
Han X, Clemmensen L (2016) Regularized generalized eigen-decomposition with applications to sparse supervised feature extraction and sparse discriminant analysis. Pattern Recogn 49:43–54
Hong T-P, Lin C-W, Yang K-T, Wang S-L (2012) Using TF-IDF to hide sensitive itemsets. Appl Intell 38(4):502–510
Hur J, Özgür A, He Y (2018) Ontology-based literature mining and class effect analysis of adverse drug reactions associated with neuropathy-inducing drugs. J Biomed Semant 17:1–10
Jiang D, Leung KW-T, Yang L, Ng W (2015) TEII: Topic enhanced inverted index for top-k document retrieval. Knowl Based Syst 89:346–358
Jiao H, Liu Q, Jia HB (2007) Chinese Keyword Extraction Based on N-Gram and Word Co-occurrence. In: 2007 International Conference on Computational Intelligence and Security Workshops (CISW 2007). Harbin, pp 152–155
Li P, Wang B, Shi Z, Cui Y, Li H (2012) Tag-TextRank: a webpage keyword extraction method based on tags. J Comput Res Dev 49(11):2344–2351
Litvak M, Last M, Kandel A (2013) Degext: a language-independent keyphrase extractor. J Ambient Intell Hum Comput 4(3):377–387
Liu H, Guan H, Jian J, Liu X, Pei Y (2018) Clustering based on words distances. Cluster Comput 21:945–953
Ma H-F, Wang S, Li M, Li N (2019) Enhancing graph-based keywords extraction with node association. Knowl Sci Eng Manag 1:497–510
Martín D, Alcalá-Fdez J, Rosete A, Herrera F (2016) Nicgar: a niching genetic algorithm to mine a diverse set of interesting quantitative association rules. Inf Sci 355–356:208–228
Martinez-Romo J, Araujo L, Fernandez AD (2016) Semgraph: extracting keyphrases following a novel semantic graph-based approach. J Assoc Inform Sci Technol 67(1):71–82
Nemet S, Kukolj D, Ostojic G, Stankovski S, Jovanovic D (2019) Aggregation framework for TSK fuzzy and association rules: interpretability improvement on a traffic accidents case. Appl Intell 49(11):3909–3922
Pasquier N, Bastide Y, Taouil R, Lakhal L (1999) Efficient mining of association rules using closed itemset lattices. Inf Syst 24(1):25–46
Plaza L, Díaz A, Gervás P (2011) A semantic graph-based approach to biomedical summarisation. Artif Intell Med 53(1):1–14
Qin P, Xu W, Guo J (2016) A novel negative sampling based on TFIDF for learning word representation. Neurocomputing 177:257–265
Rafiei-Asl J, Nickabadi A (2017) Tsake: a topical and structural automatic keyphrase extractor. Appl Soft Comput 58:620–630
Shao ZZ, Li YC, Wang X, Zhao XC, Guo YH (2020) Research on a new automatic generation algorithm of concept map based on text analysis and association rules mining. J Ambient Intell Hum Comput 11(2):539–551
Thabtah F (2007) A review of associative classification mining. Knowl Eng Rev 22(1):37–65
Tîrnauca C, Balcázar J-L, Gómez-Pérez D (2020) Closed-set-based discovery of representative association rules. In J Found Comput Sci 31(1):143–156
Turney PD (2000) Learning algorithms for keyphrase extraction. Inf Retr 2(4):303–336
Wu X, Du Z-K, Guo Y-K (2018) A visual attention-based keyword extraction for document classification. Multimed Tools Appl 77(19):25355–25367
Xu Z, Chen H (2015) The semantic analysis of knowledge map for the traffic violations from the surveillance video big data. Comput Syst Sci Eng 30(5):403–410
Yoo I, Hu X, Song IY (2007) A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method. BMC Bioinform 8(Suppl 9):1–15
Zhang SX, Luo XF, Xuan JY, Chen X, Xu WM (2014) Discovering small-world in association link networks for association learning. World Wide Web Internet Web Inform Syst 17(2):229–254
Zhang SX, Wang Y, Zhang SY, Zhu GL (2016) Building associated semantic representation model for the ultra-short microblog text jumping in big data. Cluster Comput 19(3):1–12
Acknowledgements
This research work was supported in part by the Anhui Provincial Natural Science Foundation Project (No.: 19808085 MF189) and the Anhui University Top Talent Cultivation Project (No. gxbjZD15).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Cai, J., Zhang, S., Zhu, H. et al. Building the summarization model of micro-blog topic. J Ambient Intell Human Comput 12, 797–809 (2021). https://doi.org/10.1007/s12652-020-02078-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-020-02078-9