Advertisement

International Journal on Digital Libraries

, Volume 2, Issue 2–3, pp 111–123 | Cite as

Lexically-generated subject hierarchies for browsing large collections

  • Craig G. Nevill-Manning
  • Ian H. Witten
  • Gordon W. Paynter
Original article: Browsing and searching

Abstract

Developing intuition for the content of a digital collection is difficult. Hierarchies of subject terms allow users to explore the space of topics that a collection covers, to form and specialize useful query terms, and to directly identify interesting documents. We describe two interfaces for navigating such hierarchies, and present a technique for inferring hierarchies automatically from large corpora. We also discuss scalability issues for the techniques involved, and our solutions to these problems.

Key words: Browsing – Hierarchies – Inference – Scalability – Intuition 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag 1999

Authors and Affiliations

  • Craig G. Nevill-Manning
    • 1
  • Ian H. Witten
    • 2
  • Gordon W. Paynter
    • 2
  1. 1.Department of Computer Science, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA; E-mail: nevill@cs.rutgers.eduUSA
  2. 2.Department of Computer Science, University of Waikato, Hamilton, New Zealand; E-mail: {ihw,gwp}@cs.waikato.ac.nzNew Zealand

Personalised recommendations