Automatically Detecting and Organizing Documents into Topic Hierarchies: A Neural Network Based Approach to Bookshelf Creation and Arrangement
With the increasing amount of information available in el- ectronic document collections, methods for organizing these collections to allow topic-oriented browsing and orientation gain importance. The SOMLib Digital Library System provides such an organization based on the self-organizing map, a popular neural network model. In this pa- per, we present the GHSOM, which, based on the same concepts, allows an automatic hierarchical decomposition and organization of documents, which very intuitively reflects the organization typically found in (ma- nually organized) conventional libraries. We present a case study based on a 3-month article collection from an Austrian daily newspaper.
KeywordsDigital Library Document Collection Organize Document Hierarchical Feature Topic Hierarchy
Unable to display preview. Download preview PDF.
- 1.M. Dittenbach, D. Merkl, and A. Rauber. Using growing hierarchical self-organizing maps for document classification. In Proc. European Symp. on Artificial Neural Networks (ESANN00), Bruges, Belgium, 2000.Google Scholar
- 6.A. Rauber and D. Merkl. Providing topically sorted access to subsequently released newspaper editions or: How to build your private digital library. In Proc. 11th Int’l Conf. on Database and Expert Systems Applications (DEXA00), Greenwich, UK, 2000.Google Scholar