Automatic Labelling of References for Internet Information Systems

  • A. Geyer-Schulz
  • M. Hahsler
Conference paper
Part of the Studies in Classification, Data Analysis, and Knowledge Organization book series (STUDIES CLASS)


Today users of Internet information services like e.g. Yahoo! or AltaVista often experience high search costs. One important reason for this is the necessity to browse long reference lists manually, because of the well-known problems of relevance ranking. A possible remedy is to complement the references with automatically generated labels which provide valuable information about the referenced information source. Presenting suitably labelled lists of references to users aims at improving the clarity and thus comprehensibility of the information offered and at reducing the search cost. In the following we survey several dimensions for labelling (time, frequency of usage, region, language, subject, industry, and preferences) and the corresponding classification problems. To solve these problems automatically we sketch for each problem a pragmatic mix of machine learning methods and report selected results.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Cohen, Rr. and Feigenbaum, E.A. (Eds.) (1982): The Handbook of Artificial Intelligence, Vol. 3, Chapter XIII: Vision, William Kaufmann, Los Altos, California.Google Scholar
  2. Compaq (1999): AltaVista. Online: Scholar
  3. Dublin Core (1999): Dublin Core Metadata Initiative. Online:
  4. Dunning, T. (1994): Statistical Identification of Language. Technical report CRL MCCS-94-273, Computing Research Lab, New Mexico State University.Google Scholar
  5. Fast (1999): Fast MP3 Search. Online: Scholar
  6. Iso 3166 (1997): Codes for the Representation of Names of Countries and their Subdivisions. International Organization for Standardization.Google Scholar
  7. Leung, M.K. and Yang, Y.-H. (1995): First Sight: A Human Body Outline Labeling System, IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(4), 359–377.CrossRefGoogle Scholar
  8. Salton, G. and Mcgill, M.J. (1983): Introduction to Modern Information Retrieval. McGraw-Hill, New York.Google Scholar
  9. Sutton, R. S. (1992): Introduction: The Challenge of Reinforcement Learning. Machine Learning, 8(3/4), 225–227.CrossRefGoogle Scholar
  10. Van-Rijsbergen, C.J. (1979). Information Retrieval. Butterworths, London. 2nd edition.Google Scholar
  11. Vu (1999): Living Lectures — Virtual University Project. Online: Scholar
  12. W3C (1999): Resource Description Framework (RDF) Model and Syntax Specification. W3C Proposed Recommendation. Online:

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • A. Geyer-Schulz
    • 1
  • M. Hahsler
    • 1
  1. 1.Abteilung für Angewandte Informatik insbesondere BetriebsinformatikWirtschaftsuniversität WienViennaAustria

Personalised recommendations