User Modelling for Interactive User-Adaptive Collection Structuring

  • Andreas Nürnberger
  • Sebastian Stober
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4918)

Abstract

Automatic structuring is one means to ease access to document collections, be it for organization or for exploration. Of even greater help would be a presentation that adapts to the user’s way of structuring and thus is intuitively understandable. We extend an existing user-adaptive prototype system that is based on a growing self-organizing map and that learns a feature weighting scheme from a user’s interaction with the system resulting in a personalized similarity measure. The proposed approach for adapting the feature weights targets certain problems of previously used heuristics. The revised adaptation method is based on quadratic optimization and thus we are able to pose certain contraints on the derived weighting scheme. Moreover, thus it is guaranteed that an optimal weighting scheme is found if one exists. The proposed approach is evaluated by simulating user interaction with the system on two text datasets: one artificial data set that is used to analyze the performance for different user types and a real world data set – a subset of the banksearch dataset – containing additional class information.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Greiff, W.R.: A theory of term weighting based on exploratory data analysis. In: 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, New York, NY (1998)Google Scholar
  2. 2.
    Hotho, A., Nürnberger, A., Paaß, G.: A brief survey of text mining. GLDV-Journal for Computational Linguistics and Language Technology 20(1), 19–62 (2005)Google Scholar
  3. 3.
    Klose, A., Nürnberger, A., Kruse, R., Hartmann, G.K., Richards, M.: Interactive text retrieval based on document similarities. Physics and Chemistry of the Earth, Part A: Solid Earth and Geodesy 25(8), 649–654 (2000)CrossRefGoogle Scholar
  4. 4.
    Lochbaum, K.E., Streeter, L.A.: Combining and comparing the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval. Information Processing and Management 25(6), 665–676 (1989)CrossRefGoogle Scholar
  5. 5.
    Nürnberger, A., Detyniecki, M.: Weighted self-organizing maps - incorporating user feedback. In: Artificial Neural Networks and Neural Information Processing - ICANN/ICONIP 2003, Proc. of the joined 13th Int. Conf. (2003)Google Scholar
  6. 6.
    Nürnberger, A., Detyniecki, M.: Externally growing self-organizing maps and its application to e-mail database visualization and exploration. Applied Soft Computing 6(4), 357–371 (2006)CrossRefGoogle Scholar
  7. 7.
    Nürnberger, A., Klose, A.: Improving clustering and visualization of multimedia data using interactive user feedback. In: Proc. of the 9th Int. Conf. on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2002) (2002)Google Scholar
  8. 8.
    Porter, M.: An algorithm for suffix stripping. Program, 130–137 (1980)Google Scholar
  9. 9.
    Salton, G., Allan, J., Buckley, C.: Automatic structuring and retrieval of large text files. Communications of the ACM 37(2), 97–108 (1994)CrossRefGoogle Scholar
  10. 10.
    Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)CrossRefGoogle Scholar
  11. 11.
    Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975) (see also TR74-218, Cornell University, NY, USA) Google Scholar
  12. 12.
    Sinka, M., Corne, D.: A large benchmark dataset for web document clustering. In: Soft Computing Systems: Design, Management and Applications. Frontiers in Artificial Intelligence and Applications, vol. 87, pp. 881–890 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Andreas Nürnberger
    • 1
  • Sebastian Stober
    • 1
  1. 1.Institute for Knowledge and Language Engineering, Faculty of Computer ScienceOtto-von-Guericke-University MagdeburgMagdeburgGermany

Personalised recommendations