Advertisement

Why Using Structural Hints in XML Retrieval?

  • Karen Sauvagnat
  • Mohand Boughanem
  • Claude Chrisment
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4027)

Abstract

When querying XML collections, users cannot always express their need in a precise way. Systems should therefore support vagueness at both the content and structural level of queries. This paper present a relevance-oriented method for ranking XML components. The aim here is to evaluate whether structural hints help to better answer the user needs. We experiment (within the INEX framework) with users needs expressed in a flexible way (i.e with ou without structural hints). Results show that they clearly improve performance, even if they are expressed in an ”artificial way”. Relevance seems therefore to be closely linked to structure. Moreover, too complex structural hints do not lead to better results.

Keywords

Leaf Node Query Processing Vector Space Model Relevance Score Quantisation Function 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abolhassani, M., Fuhr, N.: Applying the divergence from randomness approach for content-only search in XML documents. In: McDonald, S., Tait, J.I. (eds.) ECIR 2004. LNCS, vol. 2997, pp. 409–419. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  2. 2.
    Anh, V.N., Moffat, A.: Compression and an IR approach to XML retrieval. In: Proceedings of INEX 2002 Workshop, Dagstuhl, Germany (2002)Google Scholar
  3. 3.
    Fernandez, M., Malhotra, A., Marsh, J., Nagy, M., Walsh, N.: XQuery 1.0 and XPath 2.0 data model. Technical report, World Wide Web Consortium (W3C), W3C Working Draft (May 2003)Google Scholar
  4. 4.
    Fuhr, N., Lalmas, M., Malik, S.: INEX 2003 workshop proceedings (2003)Google Scholar
  5. 5.
    Fuhr, N., Lalmas, M., Malik, S., Kazai, G.: INEX 2005 workshop pre-proceedings (2005)Google Scholar
  6. 6.
    Fuhr, N., Malik, S., Lalmas, M.: Overview of the initiative for the evaluation of XML retrieval (INEX) 2003. In: Proceedings of INEX 2003 Workshop, Dagstuhl, Germany (December 2003)Google Scholar
  7. 7.
    Gövert, N., Abolhassani, M., Fuhr, N., Grossjohann, K.: Content-oriented XML retrieval with hyrex. In: Proceedings of the first INEX Workshop, Dagstuhl, Germany (2002)Google Scholar
  8. 8.
    Grabs, T., Scheck, H.-J.: Flexible information retrieval from xml with PowerDB XML. In: Proceedings in the First Annual Workshop for the Evaluation of XML Retrieval (INEX), December 2002, pp. 26–32 (2002)Google Scholar
  9. 9.
    Kakade, V., Raghavan, P.: Encoding XML in vector spaces. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 96–111. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  10. 10.
    Kamps, J., de Rijke, M., Sigurbjornsson, B.: Length normalization in XML retrieval. In: Proceedings of SIGIR 2004, Sheffield, England, pp. 80–87 (2004)Google Scholar
  11. 11.
    Kamps, J., Marx, M., de Rijke, M., Sigurbjornsson, B.: Structured queries in XML retrieval. In: Proceedings of CIKM 2005, Bremen, Germany (2005)Google Scholar
  12. 12.
    Karen Sauvagnat, M.B., Hlaoua, L.: XFIRM at INEX 2005: Ad-hoc and relevance feedback tracks. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 88–103. Springer, Heidelberg (2006)Google Scholar
  13. 13.
    Kazai, G., Lalmas, M.: INEX 2005 evaluation measures. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 16–29. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  14. 14.
    Kazai, G., Lalmas, M., de Vries, A.P.: The overlap problem in content-oriented XML retrieval evaluation. In: Proceedings of SIGIR 2004, Sheffield, England, July 2004, pp. 72–79 (2004)Google Scholar
  15. 15.
    Lalmas, M., Rölleke, T.: Modelling vague content and structure querying in XML retrieval with a probabilistic object-relational framework. In: Christiansen, H., Hacid, M.-S., Andreasen, T., Larsen, H.L. (eds.) FQAS 2004. LNCS, vol. 3055, pp. 432–445. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  16. 16.
    Mass, Y., Mandelbrod, M.: Component ranking and automatic query refinement for XML retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 73–84. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  17. 17.
    Mass, Y., Mandelbrod, M.: Experimenting various user models for xml retrieval. In: Pre-Proceedings of INEX 2005, Dagstuhl, Germany (2005)Google Scholar
  18. 18.
    Ogilvie, P., Callan, J.: Using language models for flat text queries in XML retrieval. In: Proceedings of INEX 2003 Workshop, Dagstuhl, Germany, December 2003, pp. 12–18 (2003)Google Scholar
  19. 19.
    Rölleke, T., Lalmas, M., Kazai, G., Ruthven, I., Quicker, S.: The accessibility dimension for structured document retrieval. In: Crestani, F., Girolami, M., van Rijsbergen, C.J.K. (eds.) ECIR 2002. LNCS, vol. 2291, p. 284. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  20. 20.
    Sauvagnat, K., Boughanem, M., Chrisment, C.: Answering content-and-structure-based queries on XML documents using relevance propagation. In: Information Systems - Special Issue SPIRE 2004. Elsevier, Amsterdam (2006)Google Scholar
  21. 21.
    Sauvagnat, K., Hlaoua, L., Boughanem, M.: XML retrieval: what about using contextual relevance?. In: ACM Symposium on Applied Computing (SAC) - IAR (Information Access and Retrieval) Dijon (April 2006)Google Scholar
  22. 22.
    Sigurbjörnsson, B., Kamps, J., de Rijke, M.: The university of Amsterdam at INEX 2005: Adhoc track. In: Pre-Proceedings of INEX 2005 workshop, Dagstuhl, Germany (November 2005)Google Scholar
  23. 23.
    W3C. XQuery and XPath full-text use cases. Technical report, World Wide Web Consortium (W3C), W3C working draft (February 2003)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Karen Sauvagnat
    • 1
  • Mohand Boughanem
    • 1
  • Claude Chrisment
    • 1
  1. 1.IRIT – SIGToulouse

Personalised recommendations