Abstract
This paper introduces and evaluates a new paradigm, called Knowledge Agents, that incorporates agent technology into the process of domain-specific Web search. An agent is situated between the user and a search engine. It specializes in a specific domain by extracting characteristic information from search results. Domains are thus user-defined and can be of any granularity and specialty. The extracted information is saved in a knowledge base and used in future searches. The agent refines queries based on its domain-specific knowledge and sends the refined queries to general-purpose search engines. The search results are ranked based on the agent's domain-specific knowledge, thus filtering out pages that match the query but are irrelevant to the domain. A topological search of the Web for additional relevant sites is conducted from a domain-specific perspective. The combination of a broad search of the entire Web with domain-specific textual and topological scoring of results enables the knowledge agent to find the most relevant documents for a given query within a domain of interest. The knowledge acquired by the agent is continuously updated and persistently stored, so users can benefit from the search results of others in common domains.
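The refine-then-rank flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `KnowledgeAgentSketch`, its methods, and the scoring rule (average document-frequency weight of a page's terms) are all assumptions made for illustration, and the topological (link-based) search is omitted entirely.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class KnowledgeAgentSketch:
    """Hypothetical agent holding a domain-specific knowledge base of term weights."""

    def __init__(self):
        # Knowledge base (assumed form): term -> number of relevant pages containing it.
        self.term_weights = Counter()

    def learn(self, relevant_texts):
        """Extract characteristic terms from pages judged relevant to the domain."""
        for text in relevant_texts:
            self.term_weights.update(set(tokenize(text)))

    def refine_query(self, query, extra_terms=2):
        """Append the heaviest domain terms that the query does not already contain."""
        query_terms = tokenize(query)
        # Sort by descending weight, then alphabetically, so ties are deterministic.
        candidates = sorted(self.term_weights.items(), key=lambda kv: (-kv[1], kv[0]))
        additions = [t for t, _ in candidates if t not in query_terms][:extra_terms]
        return " ".join(query_terms + additions)

    def score(self, text):
        """Average domain weight of the tokens in a result (assumed scoring rule)."""
        tokens = tokenize(text)
        if not tokens:
            return 0.0
        return sum(self.term_weights[t] for t in tokens) / len(tokens)

    def rank(self, results):
        """Order search results from most to least domain-relevant."""
        return sorted(results, key=self.score, reverse=True)
```

In this sketch, a refined query would be sent to a general-purpose search engine, and the returned pages would then be passed to `rank`, so that a page matching the query terms but not the domain vocabulary sinks toward the bottom of the list.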
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aridor, Y., Carmel, D., Lempel, R., Soffer, A., Maarek, Y.S. (2000). Knowledge Agents on the Web. In: Klusch, M., Kerschberg, L. (eds) Cooperative Information Agents IV - The Future of Information Agents in Cyberspace. CIA 2000. Lecture Notes in Computer Science, vol 1860. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45012-2_3
DOI: https://doi.org/10.1007/978-3-540-45012-2_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67703-1
Online ISBN: 978-3-540-45012-2
eBook Packages: Springer Book Archive