Flexible Querying of XML Documents

  • Krishnaprasad Thirunarayan
  • Trivikram Immaneni
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4203)

Abstract

Text search engines are inadequate for indexing and searching XML documents because they ignore metadata and aggregation structure implicit in the XML documents. On the other hand, the query languages supported by specialized XML search engines are very complex. In this paper, we present a simple yet flexible query language, and develop its semantics to enable intuitively appealing extraction of relevant fragments of information while simultaneously falling back on retrieval through plain text search if necessary. We also present a simple yet robust relevance ranking for heterogeneous document-centric XML.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Krishnaprasad Thirunarayan (1)
  • Trivikram Immaneni (1)

  1. Department of Computer Science and Engineering, Wright State University, Dayton, USA
