
Query Processing and Optimization Techniques over Streamed Fragmented XML

World Wide Web

Abstract

With the extensive use of XML in Web applications, efficient query processing over streaming XML has become a core challenge, because the data must be processed in a single pass with limited resources. Building on the Hole-Filler model for XML fragments, this paper proposes a hybrid structure (FQ-Index) that covers both the queries and the fragments, together with an XML fragment processing algorithm that evaluates forward XPath queries over streamed XML fragments. Two optimization rules, dependence pruning and prefix pruning, are also developed. Dependence pruning removes the dependent operations caused by fragmentation and transforms queries over XML tags into queries over XML fragments, while prefix pruning removes the “redundant” prefix of a path according to the tag structure. A detailed set of experiments illustrates the effectiveness of these techniques.
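To make the Hole-Filler idea concrete, the following is a minimal sketch and not the paper's algorithm. It assumes each streamed fragment arrives as a `<filler id="...">` wrapper around one payload element, and that a `<hole id="..."/>` inside a payload marks where the filler with the same id belongs. These element and attribute names, the sample catalog data, and the helper names `parse_fillers` and `fill` are illustrative assumptions. The sketch buffers all fragments and rebuilds the whole document before querying. That full reassembly is shown only to illustrate how holes and fillers relate; it does not implement FQ-Index or either pruning rule.

```python
import xml.etree.ElementTree as ET


def parse_fillers(stream):
    """Map filler id -> payload element (one payload per filler is assumed)."""
    fillers = {}
    for text in stream:
        wrapper = ET.fromstring(text)
        fillers[wrapper.get("id")] = wrapper[0]
    return fillers


def fill(elem, fillers):
    """Return a copy of elem in which every hole is replaced by its filler."""
    out = ET.Element(elem.tag, dict(elem.attrib))
    out.text = elem.text
    out.tail = elem.tail
    for child in elem:
        if child.tag == "hole":
            # Substitute the fragment whose filler id matches the hole id.
            out.append(fill(fillers[child.get("id")], fillers))
        else:
            # Ordinary element: copy it, filling any holes nested inside.
            out.append(fill(child, fillers))
    return out


# Fragments may arrive in any order; filler "0" is assumed to be the root.
stream = [
    '<filler id="2"><item><name>pen</name></item></filler>',
    '<filler id="0"><catalog><hole id="1"/><hole id="2"/></catalog></filler>',
    '<filler id="1"><item><name>ink</name></item></filler>',
]

fillers = parse_fillers(stream)
doc = fill(fillers["0"], fillers)
result = ET.tostring(doc, encoding="unicode")

# A simple forward path query over the reassembled document.
names = [n.text for n in doc.findall("./item/name")]
print(result)
print(names)
```

Buffering every fragment, as this sketch does, is the cost that a streaming evaluator tries to avoid, which is the motivation for evaluating queries directly over fragments.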



Corresponding author

Correspondence to Guoren Wang.

About this article

Cite this article

Wang, G., Huo, H., Han, D. et al. Query Processing and Optimization Techniques over Streamed Fragmented XML. World Wide Web 11, 339–359 (2008). https://doi.org/10.1007/s11280-007-0041-x


  • DOI: https://doi.org/10.1007/s11280-007-0041-x
