Skip to main content

A Glance on Current XML Twig Pattern Matching Algorithms

  • Conference paper
Computational Science and Its Applications – ICCSA 2008 (ICCSA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5073))

Included in the following conference series:

  • 1606 Accesses

Abstract

Extensible Markup Language (XML) has become a de facto standard for information representation and exchange over the Internet. The core operation of XML Query Processing is twig pattern matching, that is, finding in an XML document tree D all matches of a given twig query Q. Modern twig query matching algorithms often first decompose individual path matches and then merge them to form twig matches. In this paper, we review and evaluate two current efficient algorithms, namely HolisticTwigStack and TwigList. The HolisticTwigStack algorithm, which based on TwigStack, a notable and effective algorithm, uses a stack structure to avoid the storage of intermediate matches and the process of merging matches, whereas the TwigList algorithm use simple lists. Our analysis on experimental results show that the advantages and disadvantages of both HolisticTwigStack and TwigList for efficient XML Twig Pattern Matching in term of processing time and memory usage.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berglund, A., Boag, S., Chamberlin, D., Fernadez, M.F., Kay, M., Robie, J., Simeon, J.: Xml path language (xpath). version 2.0 (2007), http://www.w3.org/TR?Xpath20/ , http://www.w3.org/TR/1999/REC-xpath-19991116

  2. Boag, S., Chamberlin, D., Fernandez, M.F., Florescu, D., Robie, J., Simeon, J.: Xquery 1.0: An xml query language. W3C Recommendation (January 2007), http://www.w3.org/TR/2007/REC-xquery-20070123/

  3. Qin, L., Yu, J.X., Ding, B.: Twiglist: Make twig pattern matching fast. In: Kotagiri, R., et al. (eds.) DASFAA 2007. LNCS, vol. 4443, pp. 850–862. Springer, Heidelberg (2007)

    Google Scholar 

  4. Chen, S., Li, H.-G., Tatemura, J., Hsiung, W.-P., Agrawal, D., Candan, K.S.: Twig2stack: Bottom-up processing of generalizedtree pattern queries over xml documents. In: VLDB 2006, September 2006, pp. 283–294. VLDB Endowment, Seoul (2006)

    Google Scholar 

  5. Jiang, Z., Luo, C., Hou, W.-C.: An efficient one-phase holistic twig join algorithm for xml data. In: CIKM 2006, pp. 786–787. ACM, New York (2006)

    Chapter  Google Scholar 

  6. Bruno, N., Koudas, N., Srivastav, D.: Holistic twig joins: Optimal xml pattern matching. In: Proceedings of the 21st ACM SIGMOD International Conference Management of Data (SIGMOD 2002), pp. 310–321 (2002)

    Google Scholar 

  7. Jiang, H., Wang, W., Lu, H., Yu, J.X.: Holistic twig joins on indexed xml documents. In: Proceedings of the 29th VLDB Conference, Germany, VLDB (2003)

    Google Scholar 

  8. Jiang, Z., Luo, C., Hou, W.-C., Zhu, Q., Che, D.: Effiecient processing of xml twig pattern: A novel one-phase holistic solution. In: Wagner, R., Revell, N., Pernul, G. (eds.) DEXA 2007. LNCS, vol. 4653, pp. 87–97. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  9. Choi, B., Mahoui, M., Wood, D.: On the optimal of holistic algorithms for twig queries. In: Mařík, V., Štěpánková, O., Retschitzegger, W. (eds.) DEXA 2003. LNCS, vol. 2736, pp. 28–37. Springer, Heidelberg (2003)

    Google Scholar 

  10. Aghili, S.A., Li, H.-G., Agrawal, D., Abbadi, A.E.: Twix: Twig structure and content matching of selective queries using binary labeling. In: Proceedings of the First International Conference on Scalable Information Systems (INFORSCALE 2006), Hong Kong, ACM, New York (2006)

    Google Scholar 

  11. Al-Khalifa, S., Jagadish, H.V., Koudas, N., Patel, J.M., Srivastava, D., Wu, Y.: Structural joins: A primitive for efficient xml query pattern matching. In: Proceedings of the 18th International Conference on Data Engineering (ICDE 2002) (2002)

    Google Scholar 

  12. Lu, J., Ling, T., Chen, T.: Tjfast: Effective processing of xml twig pattern matching. In: Proceedings of the 14th International Conference on World Wide Web (WWW), pp. 1118–1119 (2005)

    Google Scholar 

  13. Lu, J., Chen, T., Ling, T.W.: Efficient processing of xml twig patterns with parent child edges: A look-ahead approach. In: Grossman, D., Gravano, L., Zhai, C., Herzog, O., Evans, D.A. (eds.) Proceedings of the ACM Thirteenth Conference on Information and Knowledge Management (CIKM 2004), pp. 533–542. ACM, New York (2004)

    Chapter  Google Scholar 

  14. Zhang, C., Naughton, J., DeWitt, D., Luo, Q., Lohman, G.: On supporting containment queries in relational database management systems. In: Proceedings of ACM SIGMOSD 2001, pp. 425–436. ACM, New York (2001)

    Chapter  Google Scholar 

  15. Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton, J.: Relational databases for query xml documents: Limitations and opportunities. In: Proceedings of the 25th VLDB Conference, pp. 302–314 (1999)

    Google Scholar 

  16. Wu, Y., Patel, J.M., Jagadish, H.V.: Structural join order selection for xml query optimization. In: Proceedings of the 19th International Conference on Data Engineering (ICDE 2003), pp. 443–454. IEEE, Los Alamitos (2003)

    Google Scholar 

  17. Chen, T., Lu, J., Ling, T.W.: On boosting holism in xml twig pattern matching using structural indexing techniques, Baltimore, Maryland, USA, pp. 455–466 (2005)

    Google Scholar 

  18. Chien, S.-Y., Vagena, Z., Zhang, D., Tsotras, V.J.: Efficient structural joins on indexed xml documents. In: Proceedings of the 28th International Conference on Very Large Data Bases (VLDB 2002), pp. 263–274 (2002)

    Google Scholar 

  19. Jiang, H., Lu, H., Wang, W., Ooi, B.C.: Xr-tree: Indexing xml data for efficient structural joins. In: Proceedings of the 19th International Conference on Data Engineering (ICDE 2003), pp. 253–264. IEEE, Los Alamitos (2003)

    Google Scholar 

  20. Lu, J., Ling, T.W., Chan, C.-Y., Chen, T.: From region encoding to extended dewey: On efficient processing of xml twig pattern matching. In: Proceedings of the 31st VLDB Conference, Norway, pp. 193–204. VLDB (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Osvaldo Gervasi Beniamino Murgante Antonio Laganà David Taniar Youngsong Mun Marina L. Gavrilova

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dao, DB., Cao, J. (2008). A Glance on Current XML Twig Pattern Matching Algorithms. In: Gervasi, O., Murgante, B., Laganà, A., Taniar, D., Mun, Y., Gavrilova, M.L. (eds) Computational Science and Its Applications – ICCSA 2008. ICCSA 2008. Lecture Notes in Computer Science, vol 5073. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69848-7_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69848-7_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69840-1

  • Online ISBN: 978-3-540-69848-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics