Exploiting Rich Context: An Incremental Approach to Context-Based Web Search

  • David Leake
  • Ana Maguitman
  • Thomas Reichherzer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3554)

Abstract

Proactive retrieval systems monitor a user’s task context and automatically provide the user with related resources. The effectiveness of such systems depends on their ability to perform context-based retrieval, generating queries which return context-relevant results. Two factors make this task especially challenging for Web-based retrieval. First, the quality of Web retrieval can be strongly affected by the vocabulary used to generate the queries. If the system’s vocabulary for describing the context differs from the vocabulary used in the resources themselves, relevant resources may be missed. Second, search engine restrictions on query length may make it difficult to include sufficient contextual information in a single query. This paper presents an algorithm, IACS (Incremental Algorithm for Context-Based Search), which addresses these problems by building up, applying, and refining partial context descriptions incrementally. In IACS, an initial term-based context description is the starting point for a cycle of mining search engines, performing context-based filtering of results, and refining context descriptions to generate new rounds of queries in an expanded vocabulary. IACS has been applied in a system for proactively supporting concept-map-based knowledge modeling, by retrieving resources relevant to target concepts in the context of the rich information provided by “in progress” concept maps. An evaluation of the system shows that it provides significant improvements over a baseline for retrieving context-relevant resources. We expect the algorithm to have broad applicability to context-based Web retrieval for rich contexts.

References

  1. 1.
    Rhodes, B., Starner, T.: The remembrance agent: A continuously running automated information retrieval system. In: The Proceedings of The First International Conference on The Practical Application of Intelligent Agents and Multi Agent Technology (PAAM 1996), London, UK, pp. 487–495 (1996)Google Scholar
  2. 2.
    Budzik, J., Hammond, K.J., Birnbaum, L.: Information access in context. Knowledge based systems 14, 37–53 (2001)CrossRefGoogle Scholar
  3. 3.
    Lawrence, S.: Context in Web search. IEEE Data Engineering Bulletin 23, 25–32 (2000)Google Scholar
  4. 4.
    Cañas, A.J., Hill, G., Carff, R., Suri, N., Lott, J., Eskridge, T., Gómez, G., Arroyo, M., Carvajal, R.: CmapTools: A knowledge modeling and sharing environment. In: Cañas, A.J., Novak, J.D., González, F. (eds.) Concept Maps: Theory, Methodology, Technology. Proceedings of the First International Conference on Concept Mapping. (2004)Google Scholar
  5. 5.
    Cañas, A., Carvalho, M., Arguedas, M., Eskridge, T., Leake, D., Maguitman, A., Reichherzer, T.: Mining the web to suggest concepts during concept map construction. In: Cañas, A.J., Novak, J.D., González, F. (eds.) Concept Maps: Theory, Methodology, Technology. Proceedings of the First International Conference on Concept Mapping (2004)Google Scholar
  6. 6.
    Leake, D., Maguitman, A., Reichherzer, T., Cañas, A., Carvalho, M., Arguedas, M., Brenes, S., Eskridge, T.: Aiding knowledge capture by searching for extensions of knowledge models. In: Proceedings of the Second International Conference on Knowledge Capture (K-CAP), pp. 44–53. ACM Press, New York (2003)CrossRefGoogle Scholar
  7. 7.
    Leake, D., Maguitman, A., Reichherzer, T., Cañas, A., Carvalho, M., Arguedas, M., Eskridge, T.: Googling” from a concept map: Towards automatic concept-map-based query formation. In: Cañas, A.J., Novak, J.D., González, F. (eds.) Concept Maps: Theory, Methodology, Technology. Proceedings of the First International Conference on Concept Mapping (2004)Google Scholar
  8. 8.
    Novak, J.: A Theory of Education. Cornell University Press, Ithaca (1977)Google Scholar
  9. 9.
    Novak, J., Gowin, D.B.: Learning How to Learn. Cambridge University Press, Cambridge (1984)Google Scholar
  10. 10.
    Briggs, G., Shamma, D., Cañas, C.R., Scargle, J., Novak, J.D.: Concept maps applied to Mars exploration public outreach. In: Cañas, A.J., Novak, J.D., González, F. (eds.) Concept Maps: Theory, Methodology, Technology. Proceedings of the First International Conference on Concept Mapping, pp. 125–133 (2004)Google Scholar
  11. 11.
    Leake, D., Maguitman, A., Reichherzer, T., Cañas, A., Carvalho, M., Arguedas, M., Brenes, S., Eskridge, T.: Aiding knowledge capture by searching for extensions of knowledge models. In: Proceedings of KCAP 2003. ACM Press, New York (2003)Google Scholar
  12. 12.
    Leake, D., Maguitman, A., Reichherzer, T.: Understanding knowledge models: Modeling assessment of concept importance in concept maps. In: Proceedings of CogSci 2004 (2004)Google Scholar
  13. 13.
    Maguitman, A., Leake, D., Reichherzer, T., Menczer, F.: Dynamic extraction of topic descriptors and discriminators: Towards automatic context-based topic search. In: Proceedings of the Thirteenth Conference on Information and Knowledge Management (CIKM), pp. 463–472. ACM Press, New York (2004)CrossRefGoogle Scholar
  14. 14.
    Maguitman, A., Leake, D., Reichherzer, T.: Suggesting novel but related topics: Towards context-based support for knowledge model extension. In: Proceedings of the 2005 International Conference on Intelligent User Interfaces, pp. 207–214 (2005)Google Scholar
  15. 15.
    Maguitman, A.: Intelligent Support for Knowledge Capture and Construction. PhD thesis, Indiana University (2005)Google Scholar
  16. 16.
    Lieberman, H.: Letizia: An agent that assists Web browsing. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI 1995), pp. 924–929. Morgan Kaufmann, San Mateo (1995)Google Scholar
  17. 17.
    Armstrong, R., Freitag, D., Joachims, T., Mitchell, T.: WebWatcher: A learning apprentice for the World Wide Web. In: AAAI Spring Symposium on Information Gathering, pp. 6–12 (1995)Google Scholar
  18. 18.
    Leake, D.B., Bauer, T., Maguitman, A., Wilson, D.C.: Capture, storage and reuse of lessons about information resources: Supporting task-based information search. In: Proceedings of the AAAI 2000 Workshop on Intelligent Lessons Learned Systems, pp. 33–37. AAAI Press, Austin (2000)Google Scholar
  19. 19.
    Bauer, T., Leake, D.: WordSieve: A method for real-time context extraction. In: Akman, V., Bouquet, P., Thomason, R.H., Young, R.A. (eds.) CONTEXT 2001. LNCS (LNAI), vol. 2116, p. 30. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  20. 20.
    Baldonado, M.Q.W., Winograd, T.: SenseMaker: an information-exploration interface supporting the contextual evolution of a user’s interests. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 11–18. ACM Press, New York (1997)CrossRefGoogle Scholar
  21. 21.
    Maglio, P.P., Barrett, R., Campbell, C.S., Selker, T.: SUITOR: an attentive information system. In: Proceedings of the 5th international conference on Intelligent user interfaces, pp. 169–176. ACM Press, New York (2000)CrossRefGoogle Scholar
  22. 22.
    Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to map between ontologies on the semantic web. In: Proceedings of the Eleventh International WWW Conference. ACM Press, New York (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • David Leake
    • 1
  • Ana Maguitman
    • 1
  • Thomas Reichherzer
    • 1
  1. 1.Computer Science DepartmentIndiana UniversityBloomingtonU.S.A

Personalised recommendations