Detection of Embryonic Research Topics by Analysing Semantic Topic Networks

  • Angelo Antonio SalatinoEmail author
  • Enrico Motta
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9792)


Being aware of new research topics is an important asset for anybody involved in the research environment, including researchers, academic publishers and institutional funding bodies. In recent years, the amount of scholarly data available on the web has increased steadily, allowing the development of several approaches for detecting emerging research topics and assessing their trends. However, current methods focus on the detection of topics which are already associated with a label or a substantial number of documents. In this paper, we address instead the issue of detecting embryonic topics, which do not possess these characteristics yet. We suggest that it is possible to forecast the emergence of novel research topics even at such early stage and demonstrate that the emergence of a new topic can be anticipated by analysing the dynamics of pre-existing topics. We present an approach to evaluate such dynamics and an experiment on a sample of 3 million research papers, which confirms our hypothesis. In particular, we found that the pace of collaboration in sub-graphs of topics that will give rise to novel topics is significantly higher than the one in the control group.


Ontology Research trend detection Scholarly data Semantic web Topic discovery Topic emergence detection 



We would like to thank Springer Nature ( for partially funding this research and Elsevier B.V. ( for providing us with access to their large repositories of scholarly data.


  1. 1.
    Becher, T., Trowler, P.: Academic Tribes and Territories: Intellectual Enquiry and the Culture of Disciplines. McGraw-Hill Education, New York (2001)Google Scholar
  2. 2.
    Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Sci. Am. 284, 28–37 (2001)CrossRefGoogle Scholar
  3. 3.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)zbMATHGoogle Scholar
  4. 4.
    Bolelli, L., Ertekin, Ş., Giles, C.L.: Topic and trend detection in text collections using latent dirichlet allocation. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 776–780. Springer, Heidelberg (2009). doi: 10.1007/978-3-642-00958-7_84 CrossRefGoogle Scholar
  5. 5.
    Decker, S.L., Aleman-Meza, B., Cameron, D., Arpinar, I.B.: Detection of bursty and emerging trends towards identification of researchers at the early stage of trends. University of Georgia (2007)Google Scholar
  6. 6.
    Duvvuru, A., Kamarthi, S., Sultornsanee, S.: Undercovering research trends: network analysis of keywords in scholarly articles. In: 2012 International Joint Conference on Computer Science and Software Engineering (JCSSE), pp. 265–270 (2012)Google Scholar
  7. 7.
    Duvvuru, A., Radhakrishnan, S., More, D., Kamarthi, S., Sultornsanee, S.: Analyzing structural & temporal characteristics of keyword system in academic research articles. Procedia Comput. Sci. 20, 439–445 (2013)CrossRefGoogle Scholar
  8. 8.
    Erten, C., Harding, P.J., Kobourov, S.G., Wampler, K., Yee, G.: Exploring the computing literature using temporal graph visualization. In: Electronic Imaging 2004, pp. 45–56 (2004)Google Scholar
  9. 9.
    Gruhl, D., Guha, R., Liben-Nowell, D., Tomkins, A.: Information diffusion through blogspace. In: Proceedings of the 13th International Conference on World Wide Web, pp. 491–501 (2004)Google Scholar
  10. 10.
    He, Q., Chen, B., Pei, J., Qiu, B., Mitra, P., Giles, L.: Detecting topic evolution in scientific literature: how can citations help? In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 957–966 (2009)Google Scholar
  11. 11.
    Jo, Y., Lagoze, C., Giles, C.L.: Detecting research topics via the correlation between graphs and texts. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 370–379 (2007)Google Scholar
  12. 12.
    Kuhn, T.S.: The Structure of Scientific Revolutions. University of Chicago Press, Chicago (2012)CrossRefGoogle Scholar
  13. 13.
    Luce, R.D., Perry, A.D.: A method of matrix analysis of group structure. Psychometrika 14, 95–116 (1949)MathSciNetCrossRefGoogle Scholar
  14. 14.
    Lv, P.H., Wang, G.-F., Wan, Y., Liu, J., Liu, Q., Ma, F.-C.: Bibliometric trend analysis on global graphene research. Scientometrics 88, 399–419 (2011)CrossRefGoogle Scholar
  15. 15.
    Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 1155–1158 (2010)Google Scholar
  16. 16.
    Morinaga, S., Yamanishi, K.: Tracking dynamics of topic trends using a finite mixture model. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 811–816 (2004)Google Scholar
  17. 17.
    Osborne, F., Motta, E., Mulholland, P.: Exploring scholarly data with rexplore. In: Alani, H., et al. (eds.) ISWC 2013. LNCS, vol. 8218, pp. 460–477. Springer, Heidelberg (2013). doi: 10.1007/978-3-642-41335-3_29 CrossRefGoogle Scholar
  18. 18.
    Osborne, F., Motta, E.: Klink-2: integrating multiple web sources to generate semantic topic networks. In: Arenas, M., et al. (eds.) ISWC 2015. LNCS, vol. 9366, pp. 408–424. Springer, Cham (2015). doi: 10.1007/978-3-319-25007-6_24 CrossRefGoogle Scholar
  19. 19.
    Osborne, F., Motta, E.: Mining semantic relations between research areas. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7649, pp. 410–426. Springer, Heidelberg (2012). doi: 10.1007/978-3-642-35176-1_26 CrossRefGoogle Scholar
  20. 20.
    Osborne, F., Scavo, G., Motta, E.: A hybrid semantic approach to building dynamic maps of research communities. In: Janowicz, K., Schlobach, S., Lambrix, P., Hyvönen, E. (eds.) EKAW 2014. LNCS (LNAI), vol. 8876, pp. 356–372. Springer, Cham (2014). doi: 10.1007/978-3-319-13704-9_28 Google Scholar
  21. 21.
    Rosen-Zvi, M., Griffiths, T., Steyvers, M., Smyth, P.: The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 487–494 (2004)Google Scholar
  22. 22.
    Salatino, A.: Early detection and forecasting of research trends (2015)Google Scholar
  23. 23.
    Sun, X., Kaur, J., Milojevi, S., Flammini, A., Menczer, F.: Social dynamics of science. Sci. Rep. 3, 1069 (2013)CrossRefGoogle Scholar
  24. 24.
    Tseng, Y.-H., Lin, Y.-I., Lee, Y.-Y., Hung, W.-C., Lee, C.-H.: A comparison of methods for detecting hot topics. Scientometrics 81, 73–90 (2009)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.Knowledge Media InstituteThe Open UniversityMilton KeynesUK

Personalised recommendations