Multimedia Tools and Applications, Volume 77, Issue 4, pp 4339–4353

A news-topic recommender system based on keywords extraction

  • Zihuan Wang
  • Kyusup Hahn
  • Youngsam Kim
  • Sanghyup Song
  • Jong-Mo Seo


In recent years, internet news has become one of the most important channels for acquiring information, as more and more people read news on internet-connected computers, tablets, and smartphones. Because news is constantly reproduced, the number of online media outlets has increased dramatically and the volume of news has expanded rapidly. Consequently, extracting the primary information from the internet is of great interest. This paper presents a news-topic recommender system based on keywords extraction. The proposed system is shown to be effective in acquiring specific topics within any specified period of time.


Internet news; Recommender system; Keywords extraction; Topic extraction



This work was supported by Seoul National University Big Data Institute through the Data Science Research Project 2015.



Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2017

Authors and Affiliations

  • Zihuan Wang (1)
  • Kyusup Hahn (2)
  • Youngsam Kim (3)
  • Sanghyup Song (4)
  • Jong-Mo Seo (1)

  1. Department of Electrical and Computer Engineering, Seoul National University, Gwanak-gu, South Korea
  2. Department of Communication, Seoul National University, Gwanak-gu, South Korea
  3. Department of Linguistics, Seoul National University, Gwanak-gu, South Korea
  4. Big Data Institute, Seoul National University, Gwanak-gu, South Korea
