Trend Analysis of Machine Learning Research Using Topic Network Analysis

  • Deepak Sharma
  • Bijendra Kumar
  • Satish Chand
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 799)


This paper proposes a topic network analysis approach that integrates topic modeling and social network analysis. We collected 16,855 scientific papers published from 1997 to 2016 in six top machine learning journals and analyzed them with the topic network. The dataset is divided into four intervals to identify topic trends, and a time-series analysis of the topic network is performed. Our experimental results show that the centralization of the topic network is highest from 2002 to 2006, decreases over the next five years, and then increases again. In the last five years, the centralization of degree centrality and closeness centrality increases, while the centralization of betweenness centrality decreases again. Data analytics and computer vision are identified as the most interrelated topics. The topic with the highest degree centrality changes over time from component analysis to text mining, biometrics, and computer vision. Our approach extracts the interrelationships among topics, which conventional topic modeling approaches cannot detect, and reveals topical trends in machine learning research.
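To make the centrality terms in the abstract concrete, here is a minimal sketch of degree centrality and Freeman's degree centralization on a small topic network. The abstract does not describe how the network is built, so the construction rule (linking two topics when both exceed a weight threshold in the same document), the threshold value, the toy document-topic weights, and all function names are assumptions made for illustration only, not the authors' method.

```python
from itertools import combinations


def build_topic_network(doc_topic, threshold=0.2):
    """Link two topics when both reach `threshold` in the same document.

    `doc_topic` is a list of per-document topic weights, such as the output
    of a topic model. The linking rule is an assumption for illustration.
    """
    n_topics = len(doc_topic[0])
    adjacency = {t: set() for t in range(n_topics)}
    for weights in doc_topic:
        active = [t for t, w in enumerate(weights) if w >= threshold]
        for a, b in combinations(active, 2):
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


def degree_centrality(adjacency):
    """Normalized degree: number of neighbors divided by (n - 1)."""
    n = len(adjacency)
    return {t: len(nbrs) / (n - 1) for t, nbrs in adjacency.items()}


def degree_centralization(adjacency):
    """Freeman's degree centralization for an undirected simple graph.

    Sums the gaps between the largest degree and every node's degree, then
    divides by the maximum possible sum, (n - 1)(n - 2), which a star graph
    attains. Requires at least three nodes.
    """
    n = len(adjacency)
    degrees = [len(nbrs) for nbrs in adjacency.values()]
    max_degree = max(degrees)
    return sum(max_degree - d for d in degrees) / ((n - 1) * (n - 2))


# Hypothetical weights: 3 documents over 4 topics; topic 0 co-occurs with
# every other topic, so the resulting network is a star centered on topic 0.
doc_topic = [
    [0.5, 0.5, 0.0, 0.0],
    [0.5, 0.0, 0.5, 0.0],
    [0.5, 0.0, 0.0, 0.5],
]
network = build_topic_network(doc_topic)
centrality = degree_centrality(network)
centralization = degree_centralization(network)
```

Under these assumptions, computing the centralization once per time interval would give a series of the kind the abstract tracks. Closeness and betweenness centralization follow the same pattern with a different per-node score.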


Keywords: Topic network analysis, Social network analysis, Topic modeling, Latent Dirichlet Allocation, Research trend analysis



Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. Department of Computer Engineering, Netaji Subash Institute of Technology, New Delhi, India
  2. School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi, India
