, Volume 88, Issue 1, pp 297–309 | Cite as

Using ‘core documents’ for the representation of clusters and topics

  • Wolfgang GlänzelEmail author
  • Bart Thijs


The notion of ‘core documents’, first introduced in the context of co-citation analysis and later re-introduced for bibliographic coupling, refers to the representation of the core of a publication set according to given criteria. In the present study, the notion of core documents is extended to the combination of citation-based and textual links. It is shown that core documents defined this way can be used to represent and describe document clusters and topics at different levels of aggregation. Methodology is illustrated using the example of two ISI Subject Categories selected from applied and social sciences.


Core documents Cluster analysis Hybrid clustering Bibliographic coupling Text mining 



Methodology has partially been developed in the context of the ERACEP project within the Coordination and Support Actions (CSAs) of the ERC work programme. The authors wish to acknowledge this support.


  1. Batagelj, V., & Mrvar, A. (2003). Pajek—analysis and visualization of large networks. In M. Jünger & P. Mutzel (Eds.), Graph drawing software (pp. 77–103). New York: Springer.Google Scholar
  2. Boyack, K.W., & Klavans, R. (2010). Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? Journal of the American Society for Information Science and Technology, 61(12), 2389–2404.CrossRefGoogle Scholar
  3. Braam, R. R., Moed, H. F., & van Raan, A. F. J. (1991a). Mapping of science by combined cocitation and word analysis, Part 1: Structural aspects. Journal of the American Society for Information Science, 42(4), 233–251.CrossRefGoogle Scholar
  4. Braam, R. R., Moed, H. F., & van Raan, A. F. J. (1991b). Mapping of science by combined cocitation and word analysis, Part II: Dynamical aspects. Journal of the American Society for Information Science, 42(4), 252–266.CrossRefGoogle Scholar
  5. Glänzel, W., & Czerwon, H. J. (1996). A new methodological approach to bibliographic coupling and its application to the national, regional and institutional level. Scientometrics, 37, 195–221.CrossRefGoogle Scholar
  6. Glenisson, P., Glänzel, W., Janssens, F., & de Moor, B. (2005). Combining full text and bibliometric information in mapping scientific disciplines. Information Processing & Management, 41(6), 1548–1572.CrossRefGoogle Scholar
  7. Janssens, F., Zhang, L., & Glänzel, W. (2009). Hybrid clustering for validation and improvement of subject-classification schemes. Information Processing & Management, 45(6), 683–702.CrossRefGoogle Scholar
  8. Lamirel, J.C., Ta A.P., & Attik, M. (2008), Novel labeling strategies for hierarchical representation of multidimensional data analysis results. In: A. Gammerman (Ed.), Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications (Track 595-138, pp. 169–174), 11–13 Feb 2008, Innsbruck, Austria. Anaheim, CA: ACTA Press.Google Scholar
  9. Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65.zbMATHCrossRefGoogle Scholar
  10. Sen, S. K., & Gan, S. K. (1983). A mathematical extension of the idea of bibliographic coupling and its applications. Annals of Library Science and Documentation, 30, 78–82.Google Scholar
  11. Small, H. (1973). Cocitation in scientific literature—new measure of relationship between 2 documents. JASIS, 24(4), 265–269.CrossRefGoogle Scholar
  12. Zhang, L., Glänzel, W., & Liang, L. (2009). Tracing the role of individual journals in a cross-citation network based on different indicators. Scientometrics, 81(3), 821–838.CrossRefGoogle Scholar
  13. Zitt, M., & Bassecoulard, E. (1994). Development of a method for detection and trend analysis of research fronts built by lexical or cocitation analysis. Scientometrics, 30(1), 333–351.CrossRefGoogle Scholar

Copyright information

© Akadémiai Kiadó, Budapest, Hungary 2011

Authors and Affiliations

  1. 1.Centre for R&D Monitoring (ECOOM) and Department of MSIK.U. LeuvenLeuvenBelgium
  2. 2.Institute for Research Policy Studies (IRPS)Hungarian Academy of SciencesBudapestHungary

Personalised recommendations