Abstract
Searching for information on the Internet remains a difficult task, despite considerable progress in search engines, such as Google. One difficulty for many users is to formulate a suitable search query. In this paper we propose a new interactive query refinement process that helps the user to articulate their information needs by supporting the word sense disambiguation of search terms as well as by dynamically generating potentially new relationships among several search terms, based on analysing the retrieved documents.
The main functionality of our system, called WebConceptualizer, presented in this paper support the following: 1) the user’s awareness of the different word senses of query terms. 2) to visualise a network of concepts surrounding the query terms in a graph structure that allows the user to operate at the conceptual level rather than the term level when articulating their information need. 3) the identification of documents whose content reflects a particular word sense rather than just the query words as such.
Our initial experiments with the implemented prototype shows a good accuracy in recognising the correct word sense of the main topic of the document. The user interface has not been systematically evaluated as yet. However, the usability seems rather good based on a verbal reports from a few test users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Google (2003), http://www.google.com/
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. JASIS 41, 391–407 (1990)
Yahoo (2003), http://www.yahoo.com/
Open directory project (2003), http://dmoz.org/
Scatter/gather (2000), http://www.sims.berkeley.edu/~hearst/sg-overview.html
Wisenut (2003), http://www.wisenut.com/
Vivisimo (2003), http://vivisimo.com/
Northern light (2002), http://www.northernlight.com/
Zamir, O., Etzioni, O.: Grouper: A dynamic clustering interface to web search results. Computer Networks 31, 1361–1374 (1999)
Grouper (2000), http://www.cs.washington.edu/research/projects/WebWare1/www/metacrawler/
Bollmann-Sdorra, P., Raghavan, V.V.: On the necessity of term dependence in a query space for weighted retrieval. JASIS 49, 1161–1168 (1998)
Billhardt, H., Borrajo, D., Maojo, V.: A context vector model for information retrieval. JASIST 53, 236–249 (2002)
Schütze, H.: Dimensions of meaning. In: Proceedings of Supercomputing 1992, pp. 787–796. IEEE, Los Alamitos (1992)
Rungsawang, A.: DSIR: the First TREC-7 Attempt. In: Voorhees, E., Harman, D. (eds.) Proceedings of the Seventh Text REtrieval Conference (TREC 1998), Department of Commerce, National Institute of Standards and Technology, pp. 366–372 (1998)
Wordnet (2003) http://www.cogsci.princeton.edu/wn/
Fellbaum, C. (ed.): WordNet - An electronic lexical database. MIT Press, Cambridge (1998)
Wille, R.: Restructuring lattice theory: An approach based on hierarchies of concepts. In: Ordered Sets. Series C., NATO Advanced Study Institute, vol. 83, pp. 445–470 (1982)
Wille, R., Ganter, B.: Formal Concept Analysis: mathematical foundations. Springer, Heidelberg (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoo, S.Y., Hoffmann, A. (2003). A New Approach for Concept-Based Web Search. In: Gedeon, T.(.D., Fung, L.C.C. (eds) AI 2003: Advances in Artificial Intelligence. AI 2003. Lecture Notes in Computer Science(), vol 2903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24581-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-24581-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20646-0
Online ISBN: 978-3-540-24581-0
eBook Packages: Springer Book Archive