Applications of a Lightweight, Web-Based Retrieval, Clustering, and Visualisation Framework
Today’s web search engines return very large result sets for query formulations consisting of few specific keywords. Results are presented as ranked lists containing textual description of found items. Such representations do not allow identification of topical clusters, and consequentially make it difficult for users to refine queries efficiently. In this paper, we present WebRat, a framework for web-based retrieval, clustering and visualisation which enables parallel querying of multiple search engines, merging of retrieved result sets, automatic identification of topical clusters and interactive visualisation of the result sets and clusters for query refinement. This framework is lightweight in the sense that it consists of a small, platform-independent component which can be easily integrated into exisiting Internet or Intranet search forms without requiring specific system environments, server resources or precalculation efforts.
The WebRat system extends existing approaches to web search result visualisation in many aspects: Found results are added incrementally as they arrive, labelling is performed in 2-dimensional space on clusters the user can see and rendering is optimised to provide sufficient performance on standard office machines.
The WebRat framework has been used to implement a variety of applications: We have provided enhanced web search capabilities for users doing scientific research. Overview and refinement capabilities have been implemented for the environmental domain. Finally, abstracts generated on the fly by a knowledge management system have been used to provide topical navigation capabilities to developers searching for technical information in mailing list archives.
- 1.Andrews K., Gütl C., Moser J., Sabol V., Lackner W.: Search Result Visualisation with xFIND In Proceedings of UIDIS 2001, Zurich, Switzerland, May 2001. pp. 50–58Google Scholar
- 2.Boerner K., Chen C., Boyack K.W. (2003): Visualizing Knowledge Domains Annual Review of Information Science and Technology 37.Google Scholar
- 3.Cavnar, W.B., Trenkle, J. M. (1994): n-Gram based text categorization. In Symposion on Document Analysis and Information Retrieval, p161–176, University of Nevada, Las Vegas.Google Scholar
- 4.Chalmers M. (1993): Using a landscape methaphor to represent a corpus of documents. In Proceedings European Conference on Spatial Information Theory, COSIT 93, pages 337–390, Elba, September 1993.Google Scholar
- 5.Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A. (1999): Self organization of a massive text document collection. In Oja, E. and Kaski, S., editors, Kohonen Maps pages 171–182, Elsevier, Amsterdam, 1999Google Scholar
- 6.Leuski, A., Allan, J. (2000): Lighthouse: Showing theWay to Relevant Information. In the Proceedings of IEEE Symposium on Information Visualization 2000 (Info-Vis2000), Salt Lake City, Utah, October 2000. pp. 125–130.Google Scholar
- 8.Reiter H., Muβler G., Mann T., Handschuh S.: Insyder — An Information Assistant for Business Intelligence In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Developement in Information Retrieval, July 14–18, 2000Athens, GreeceGoogle Scholar
- 9.Sabol V. (2001): Visualisation Islands: Interactive Visualisation and Clustering of Search Result Sets. Master’s Thesis at IICM, Graz University of TechnologyGoogle Scholar
- 11.Thomas, J., et al. (2001): Visual Text analysis-SPIRE Technical flier from the Pacific Northwest National Laboratory. http://www.pnl.gov/infoviz/spire.pdf
- 12.Tochtermann, K., Sabol, V., Kienreich, W., Granitzer, M. and Becker, J. (2002): Intelligent Maps and Information Landscapes: Two new Approaches to support Search and Retrieval of Environmental Information Objects. In Proceedings of 16th International Conference on Informatics for Environmental Protection, Vienna University of Technology, September 2002Google Scholar
- 13.Wise, J., Thomas, J., Pennock, K., Lantrip, D., Pottier, M., Schur, A. (1995): Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In Proceedings of the Information Visualization Symposium 95, p51–58. IEEE Computer Society PressGoogle Scholar