, Volume 93, Issue 1, pp 151–166 | Cite as

A new approach for automatizing the analysis of research topics dynamics: application to optoelectronics research



The objective of this paper is to propose a new unsupervised incremental approach in order to follow the evolution of research themes for a given scientific discipline in terms of emergence or decline. Such behaviors are detectable by various methods of filtering. However, our choice is made on the exploitation of neural clustering methods in a multi-view context. This new approach makes it possible to take into account the incremental and chronological aspects of information by opening the way to the detection of convergences and divergences of research themes at a large scale.


Diachronic analysis Clustering Multiple viewpoint analysis Unsupervised learning Bayesian reasoning Neural networks 



The author wishes to thanks Pascal Cuxac (INIST-CNRS) for his valuable help in the results validation task.


  1. Al Shehabi, S., Lamirel, J.-C. (2004). Inference Bayesian Network for Multi-topographic neural network communication: A case study in documentary data. In Proceedings of ICTTA, Damas, Syria, April 2004.Google Scholar
  2. Al Shehabi, S., Lamirel, J.-C. (2006). Evaluation of collaboration between European universities using dynamic interaction between multiple sources. Journal of Information Management and Scientometrics, 1(3).Google Scholar
  3. Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang, Y. (1998). Topic detection and tracking pilot study, final report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia.Google Scholar
  4. Attik, M., Lamirel, J.-C., Al Shehabi, S. (2006). Clustering analysis for data with multiple labels. In Proceedings of the The IASTED International Conference on Databases and Applications (DBA), Innsbruck, Austria, February 2006.Google Scholar
  5. Davies, D., & Bouldin, W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1, 224–227.CrossRefGoogle Scholar
  6. Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood for incomplete data via the EM algorithm. Journal of the Royal Statistical Society, B39, 1–38.MathSciNetGoogle Scholar
  7. François, C., Hoffmann, M., Lamirel, J.-C., Polanco, X. (2003). Artificial Neural Network mapping experiments. EICSTES (IST-1999-20350) Final Report (WP 9.4), September 2003.Google Scholar
  8. Frizke, B. (1995). A growing neural gas network learns topologies. In G Tesauro, D. S Touretzky, T. K leen (Eds.), Advances in neural Information processing Systems 7 (pp. 625–632). Cambridge: MIT Press.Google Scholar
  9. Gaber, M., Zaslavsky, A., Krishnaswamy, S. (2005). Mining data streams: A review. SIGMOD Record, 34(2).Google Scholar
  10. Ghribi, M., Cuxac, P., Lamirel, J. C., Lelu, A. (2010). Mesures de qualité de clustering de documents: Prise en compte de la distribution des mots-clés. In EvalECD’2010 Workshop, Hamamet, Tunisia.Google Scholar
  11. Glanzel, W., & Thijs, B. (2010). Using ‘core documents’ for the representation of clusters and topics. Scientometrics, 88(1), 297–309.CrossRefGoogle Scholar
  12. Lamirel, J.-C., & Al Shehabi, S. (2004b). Comparison of unsupervised neural clustering methods for mining Web and textual data. In SCI 2004, Orlando, FL, USA, July 2004.Google Scholar
  13. Lamirel, J.-C., Créhange, M. (1994). Application of a symbolico-connectionist approach for the design of a highly interactive documentary database interrogation system with on-line learning capabilities. In Proceedings ACM-CIKM 94, Gaitherburg, MD, USA, November 1994.Google Scholar
  14. Lamirel, J.-C., Al-Shehabi, S., François, C., & Hoffmann, M. (2004). New classification quality estimators for analysis of documentary information: Application to patent analysis and web mapping. Scientometrics, 60(3), 445–462.CrossRefGoogle Scholar
  15. Lamirel, J.-C., Ta, A. P., & Attik M. (2008). Novel labeling strategies for hierarchical representation of multidimensional data analysis results. In IASTED International Conference on Artificial Intelligence and Applications (AIA), Innsbruck, Austria, February 2008.Google Scholar
  16. Lamirel, J.-C., Boulila, Z., Ghribi, M., Cuxac, P. (2010). A new incremental growing neural gas algorithm based on clusters labeling maximization: application to clustering of heterogeneous textual data. In Proceedings of IEA-AIE 2010, Cordoba, Spain, June 2010.Google Scholar
  17. Lamirel, J.-C., Mall, R., Cuxac, P., Safi, G. (2011). Variations to incremental growing neural gas algorithm based on label maximization. In Proceedings of IJCNN 2011, San José, CA, USA, August 2011.Google Scholar
  18. MacQueen, J. B. (1967). Some methods of classification and analysis of multivariate observations. In L. Le Cam & J. Neyman (Eds.), Proceedings 5th Berkeley Symposium in Mathematics, Statistics and Probability (Vol 1, pp. 281–297), University of California, Berkeley, USA, 1967.Google Scholar
  19. Robertson, S. E., & Sparck Jones, K. (1976). Relevance weighting of search terms. Journal of the American Society for Information Science, 27, 129–146.CrossRefGoogle Scholar
  20. Schiebel, E., Hörlesberger, Roche, I., François, C., & Besagni, D. (2010). An advanced diffusion model to identify emergent research issues: The case of optoelectronic devices. Scientometrics, 83(3), 765–781.CrossRefGoogle Scholar
  21. Thijs, B., Glänzel, W. (2010). A new hybrid approach for bibliometrics aided retrieval. In Sixth International Conference on Webometrics, Informetrics & Scientometrics, and 11th COLLNET Meeting, Mysore, India, October 2010.Google Scholar
  22. Voorhees, E. M. (1986). Implementing agglomerative hierarchical clustering algorithms for use in document retrieval. Information Processing and Management, 22, 465–476.CrossRefGoogle Scholar

Copyright information

© Akadémiai Kiadó, Budapest, Hungary 2012

Authors and Affiliations

  1. 1.LORIA, INRIA-TALARIS ProjectVillers-lès-NancyFrance

Personalised recommendations