Structural Feedback for Keyword-Based XML Retrieval

  • Ralf Schenkel
  • Martin Theobald
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3936)

Abstract

Keyword-based queries are an important means to retrieve information from XML collections with unknown or complex schemas. Relevance Feedback integrates relevance information provided by a user to enhance retrieval quality. For keyword-based XML queries, feedback engines usually generate an expanded keyword query from the content of elements marked as relevant or nonrelevant. This approach that is inspired by text-based IR completely ignores the semistructured nature of XML. This paper makes the important step from pure content-based to structural feedback. It presents a framework that expands a keyword query into a full-fledged content-and-structure query. Extensive experiments with the established INEX benchmark and our TopX search engine show the feasibility of our approach.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amati, G., Carpineto, C., Romano, G.: Merging XML indices. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 253–260. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  2. 2.
    Balmin, A., et al.: A system for keyword proximity search on XML databases. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 1069–1072. Springer, Heidelberg (2004)Google Scholar
  3. 3.
    Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.): Intelligent Search on XML Data. LNCS, vol. 2818. Springer, Heidelberg (2003)MATHGoogle Scholar
  4. 4.
    Crouch, C.: Relevance feedback at the INEX 2004 workshop. In: INEX 2004 (2005)Google Scholar
  5. 5.
    Crouch, C.J., Mahajan, A., Bellamkonda, A.: Flexible XML retrieval based on the extended vector model. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 149–153. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  6. 6.
    Efthimiadis, E.N.: A user-centred evaluation of ranking algorithms for interactive query expansion. In: SIGIR Forum (USA), special issue, pp. 146–159 (1993)Google Scholar
  7. 7.
    Grust, T.: Accelerating XPath location steps. In: SIGMOD 2002, pp. 109–120 (2002)Google Scholar
  8. 8.
    Guo, L., et al.: XRANK: ranked keyword search over XML documents. In: SIGMOD 2003, pp. 16–27 (2003)Google Scholar
  9. 9.
    Hlaoua, L., Boughanem, M.: Towards context and structural relevance feedback in XML retrieval. In: Workshop on Open Source Web Information Retrieval, OSWIR (2005), http://www.emse.fr/OSWIR05/
  10. 10.
    Hsu, W., Lee, M.L., Wu, X.: Path-augmented keyword search for XML documents. In: ICTAI 2004, pp. 526–530 (2004)Google Scholar
  11. 11.
  12. 12.
    Kazai, G., et al.: The INEX evaluation initiative. In: Blanken, et al. (eds.) [3], pp. 279–293Google Scholar
  13. 13.
    Liu, S., Zou, Q., Chu, W.: Configurable indexing and ranking for XML information retrieval. In: SIGIR 2004, pp. 88–95 (2004)Google Scholar
  14. 14.
    Mass, Y., Mandelbrod, M.: Relevance feedback for XML retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 154–157. Springer, Heidelberg (2005)Google Scholar
  15. 15.
    Mihajlovic̀, V., et al.: TIJAH at INEX 2004 modeling phrases and relevance feedback. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 141–148. Springer, Heidelberg (2005)Google Scholar
  16. 16.
    Pan, H., Theobald, A., Schenkel, R.: Query refinement by relevance feedback in an XML retrieval system. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 854–855. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  17. 17.
    Porter, M., Galpin, V.: Relevance fedback in a public access catalogue for a research library: Muscat at the Scott Polar Research Institute. Program 22(1), 1–20 (1988)CrossRefGoogle Scholar
  18. 18.
    Ramirez, G., Westerveld, T., de Vries, A.: Structural features in content oriented xml retrieval. Technical Report INS-E0508, CWI, Centre for Mathematics and Computer Science (2005)Google Scholar
  19. 19.
    Ramírez, G., Westerveld, T., de Vries, A.P.: Structural features in content oriented XML retrieval. In: CIKM 2005 (2005)Google Scholar
  20. 20.
    Robertson, S.: On term selection for query expansion. Journal of Documentation 46, 359–364 (1990)CrossRefGoogle Scholar
  21. 21.
    Robertson, S., Sparck-Jones, K.: Relevance weighting of search terms. Journal of the American Society of Information Science 27, 129–146 (1976)CrossRefGoogle Scholar
  22. 22.
    Rocchio Jr., J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, ch. 14, pp. 313–323. Prentice Hall, Englewood Cliffs (1971)Google Scholar
  23. 23.
    Ruthven, I., Lalmas, M.: A survey on the use of relevance feedback for information access systems. Knowledge Engineering Review 18(1) (2003)Google Scholar
  24. 24.
    Sigurbjörnsson, B., Kamps, J., de Rijke, M.: The University of Amsterdam at INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 104–109. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  25. 25.
    Theobald, M., Schenkel, R., Weikum, G.: An efficient and versatile query engine for TopX search. In: VLDB 2005, pp. 625–636 (2005)Google Scholar
  26. 26.
    Theobald, M., Schenkel, R., Weikum, G.: TopX & XXL at INEX 2005. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 282–295. Springer, Heidelberg (2006)Google Scholar
  27. 27.
    Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I, NEXI (2004), Available at http://www.cs.otago.ac.nz/postgrads/andrew/2004-4.pdf
  28. 28.
    van Rijsbergen, C., Harper, D., Porter, M.: The selection of good search terms. Information Processing and Management 17(2), 77–91 (1981)CrossRefGoogle Scholar
  29. 29.
    Vittaut, J.-N., Piwowarski, B., Gallinari, P.: An algebra for structured queries in bayesian networks. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 58–64. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  30. 30.
    Weber, R.: Using relevance feedback in XML retrieval. In: Blanken, et al. (eds.) [3], pp. 133–143Google Scholar
  31. 31.
    Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD 2005, pp. 537–538 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Ralf Schenkel
    • 1
  • Martin Theobald
    • 1
  1. 1.Max-Planck-Institut für InformatikSaarbrückenGermany

Personalised recommendations