Optimizing knowledge discovery over the WWW
The rapid growth in data volume, user base, and data diversity render Internet-accessible information increasingly difficult to be used effectively. In this paper we discuss the issues involved with knowledge discovery in knowledge bases, in particular the WWW, by presenting a general architecture and describing how it has been instantiated in a functional system we developed. The system attempts to concurrently maximize and optimize the resource/knowledge discovery, and custimize the information to individual users. A number of machine learning techniques have been employed in the development of the system for comparative reasons — results are presented and discussed.
Unable to display preview. Download preview PDF.
- 2.Digital Equipment Corp. AltaVista. http://altavista.digital.com/.Google Scholar
- 3.Excite Inc. Excite. http://www.excite.com/.Google Scholar
- 6.C Knoblock and Levy (eds). Agent-based knowledge discovery. AAAI Spring Symposium on Information Gathering, 1995.Google Scholar
- 7.B Krulwich. Learning user interests across heterogeneous document databases. AAAI Spring Symposium on Information Gathering, 1995.Google Scholar
- 8.W H E Davies and P Edwards. Distributed learning: An agent-based approach to data-mining. In ML95 — workshop on agents that learn from other agents, 1995.Google Scholar
- 9.G Piatetsky-Shapiro and W J Frawley. Knowledge Discovery in Databases. MIT press, 1991.Google Scholar
- 10.D Bayer. A learning agent for resource discovery on the world wide web. Master’s thesis, University of Aberdeen, 1995.Google Scholar
- 11.C L Green and P Edwards. Using Machine Learning to enhance software tools for internet information management. In A Franz and H Kitamo, editors, AAAI-96, Workshop on Internet-Based Information Systems, pages 48–55. AAAI Press, 1996.Google Scholar
- 12.G Salton and M J McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.Google Scholar