Abstract
With the extensive use of XML in applications over the Web, efficient query processing over streaming XML has become a core challenge because of the one-pass processing requirement and limited resources. Taking advantage of the Hole-Filler model for XML fragments, this paper proposes a hybrid structure (FQ-Index) for both queries and fragments, together with an XML fragment processing algorithm that evaluates forward XPath queries over streamed XML fragments. Two optimization rules, dependence pruning and prefix pruning, are also developed. The dependence pruning scheme prunes the dependent operations caused by fragmentation and transforms queries over XML tags into queries over XML fragments, while the prefix pruning scheme prunes the “redundant” prefix along the path according to the tag structure. The effectiveness of the proposed techniques is illustrated with a detailed set of experiments.
Cite this article
Wang, G., Huo, H., Han, D. et al. Query Processing and Optimization Techniques over Streamed Fragmented XML. World Wide Web 11, 339–359 (2008). https://doi.org/10.1007/s11280-007-0041-x