ArHeX: An Approximate Retrieval System for Highly Heterogeneous XML Document Collections

  • Ismael Sanz
  • Marco Mesiti
  • Giovanna Guerrini
  • Rafael Berlanga Llavori
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3896)

Abstract

Handling heterogeneity in the structure and/or content of XML documents for information retrieval is an active field of research. Many efforts are currently devoted to identifying approximate answers to queries that require relaxing conditions on both the structure and the content of XML documents [1,2,4,5]. Results are ranked by score functions that measure their quality and relevance, and only the top-k are returned.

References

  1. Amer-Yahia, S., Cho, S., Srivastava, D.: Tree Pattern Relaxation. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 496–513. Springer, Heidelberg (2002)
  2. Amer-Yahia, S., Koudas, N., Marian, A., Srivastava, D., Toman, D.: Structure and Content Scoring for XML. In: VLDB, pp. 361–372 (2005)
  3. Guerrini, G., Mesiti, M., Sanz, I.: An Overview of Similarity Measures for Clustering XML Documents. In: Vakali, A., Pallis, G. (eds.) Web Data Management Practices: Emerging Techniques and Technologies, Idea Group, USA
  4. Marian, A., Amer-Yahia, S., Koudas, N., Srivastava, D.: Adaptive Processing of Top-k Queries in XML. In: ICDE, pp. 162–173 (2005)
  5. Nierman, A., Jagadish, H.V.: Evaluating Structural Similarity in XML Documents. In: WebDB, pp. 61–66 (2002)
  6. Sanz, I., Mesiti, M., Guerrini, G., Llavori, R.B.: Approximate Subtree Identification in Heterogeneous XML Documents Collections. In: Bressan, S., Ceri, S., Hunt, E., Ives, Z.G., Bellahsène, Z., Rys, M., Unland, R. (eds.) XSym 2005. LNCS, vol. 3671, pp. 192–206. Springer, Heidelberg (2005)
  7. Sanz, I., Mesiti, M., Guerrini, G., Berlanga Llavori, R.: Approximate Retrieval of Highly Heterogeneous XML Documents. Tech. report. University of Milano (2005)
  8. Theobald, A., Weikum, G.: The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 477–495. Springer, Heidelberg (2002)

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Ismael Sanz (1)
  • Marco Mesiti (2)
  • Giovanna Guerrini (3)
  • Rafael Berlanga Llavori (1)

  1. Universitat Jaume I, Castellón, Spain
  2. Università di Milano, Italy
  3. Università di Genova, Italy
