Optimizing knowledge discovery over the WWW

  • Matthew Montebello
Regular Papers Knowledge Discovery and the Web
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1475)


The rapid growth in data volume, user base, and data diversity render Internet-accessible information increasingly difficult to be used effectively. In this paper we discuss the issues involved with knowledge discovery in knowledge bases, in particular the WWW, by presenting a general architecture and describing how it has been instantiated in a functional system we developed. The system attempts to concurrently maximize and optimize the resource/knowledge discovery, and custimize the information to individual users. A number of machine learning techniques have been employed in the development of the system for comparative reasons — results are presented and discussed.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    T Berners-Lee, R Caillian, A Luotonen, HF Nielsen, and A Secret. The World-Wide Web. Communications of the ACM, 37(8):76–82, 1994.CrossRefGoogle Scholar
  2. 2.
    Digital Equipment Corp. AltaVista. http://altavista.digital.com/.Google Scholar
  3. 3.
    Excite Inc. Excite. http://www.excite.com/.Google Scholar
  4. 4.
    H Berghel. Cyberspace 2000: Dealing with information overload. Communications of the ACM, 40(2):19–25, 1997.CrossRefGoogle Scholar
  5. 5.
    H Chen, C Schuffels, and R Orwig. Internet categorization and search: A self-organizing approach. Journal of Visual Communication and Image Representation, 7(1):88–102, 1996.CrossRefGoogle Scholar
  6. 6.
    C Knoblock and Levy (eds). Agent-based knowledge discovery. AAAI Spring Symposium on Information Gathering, 1995.Google Scholar
  7. 7.
    B Krulwich. Learning user interests across heterogeneous document databases. AAAI Spring Symposium on Information Gathering, 1995.Google Scholar
  8. 8.
    W H E Davies and P Edwards. Distributed learning: An agent-based approach to data-mining. In ML95 — workshop on agents that learn from other agents, 1995.Google Scholar
  9. 9.
    G Piatetsky-Shapiro and W J Frawley. Knowledge Discovery in Databases. MIT press, 1991.Google Scholar
  10. 10.
    D Bayer. A learning agent for resource discovery on the world wide web. Master’s thesis, University of Aberdeen, 1995.Google Scholar
  11. 11.
    C L Green and P Edwards. Using Machine Learning to enhance software tools for internet information management. In A Franz and H Kitamo, editors, AAAI-96, Workshop on Internet-Based Information Systems, pages 48–55. AAAI Press, 1996.Google Scholar
  12. 12.
    G Salton and M J McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Matthew Montebello
    • 1
  1. 1.Computer Science DepartmentCardiff UniversityWales

Personalised recommendations