Improving the Efficiency of XPath Execution on Relational Systems

  • Haris Georgiadis
  • Vasilis Vassalos
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3896)

Abstract

This work describes a method for processing XPath on a relational back-end that significantly limits the number of SQL joins required, takes advantage of the strengths of modern SQL query processors, exploits XML schema information, and has low implementation complexity. The method splits XPath expressions into Primary Path Fragments (PPFs) and then combines them using an efficient structural join method; it is applicable to all XPath axes. A detailed description of the method is followed by an experimental study showing that our technique yields significant efficiency improvements over other XPath processing techniques and systems.

Keywords

Regular Expression; Simple Path; XPath Query; XPath Expression; Prominent Relation
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Haris Georgiadis (1)
  • Vasilis Vassalos (1)
  1. Department of Informatics, Athens University of Economics and Business, Athens, Greece