Automatically Detecting and Organizing Documents into Topic Hierarchies: A Neural Network Based Approach to Bookshelf Creation and Arrangement

  • Andreas Rauber
  • Michael Dittenbach
  • Dieter Merkl
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1923)


With the increasing amount of information available in el- ectronic document collections, methods for organizing these collections to allow topic-oriented browsing and orientation gain importance. The SOMLib Digital Library System provides such an organization based on the self-organizing map, a popular neural network model. In this pa- per, we present the GHSOM, which, based on the same concepts, allows an automatic hierarchical decomposition and organization of documents, which very intuitively reflects the organization typically found in (ma- nually organized) conventional libraries. We present a case study based on a 3-month article collection from an Austrian daily newspaper.


Digital Library Document Collection Organize Document Hierarchical Feature Topic Hierarchy 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    M. Dittenbach, D. Merkl, and A. Rauber. Using growing hierarchical self-organizing maps for document classification. In Proc. European Symp. on Artificial Neural Networks (ESANN00), Bruges, Belgium, 2000.Google Scholar
  2. 2.
    B. Fritzke. Growing grid-a self-organizing network with constant neighborhood range and adaption strength. Neural Processing Letters, 2, No. 5:1–5, 1995.CrossRefGoogle Scholar
  3. 3.
    T. Kohonen. Self-Organizing Maps. Springer Verlag, Berlin, Germany, 1995.CrossRefGoogle Scholar
  4. 4.
    R. Miikkulainen. Script recognition with hierarchical feature maps. Connection Science, 2:83–101, 1990.CrossRefGoogle Scholar
  5. 5.
    A. Rauber and D. Merkl. The SOMLib Digital Library System. In Proc. Europ. Conf. on Research and Advanced Technology for Digital Libraries (ECDL99), Paris, France, 1999. LNCS, Springer Verlag.CrossRefGoogle Scholar
  6. 6.
    A. Rauber and D. Merkl. Providing topically sorted access to subsequently released newspaper editions or: How to build your private digital library. In Proc. 11th Int’l Conf. on Database and Expert Systems Applications (DEXA00), Greenwich, UK, 2000.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Andreas Rauber
    • 1
  • Michael Dittenbach
    • 1
  • Dieter Merkl
    • 1
  1. 1.Department of Software TechnologyVienna University of TechnologyWienAustria

Personalised recommendations