Abstract
XML publishing has been an emerging technique for transforming (portions of) a relational database into an XML document, for example, to facilitate interoperability between heterogeneous applications. Such applications may update the XML document and the source relational database must be updated accordingly. In this paper, we consider such XML documents as (possibly) recursively defined XML views of relations. We propose new optimization techniques to efficiently support XML view updates specified via an XPATH expression with recursion and complex filters. The main novelties of our techniques are: (1) we propose a space-efficient relational encoding of recursive XML views; and (2) we push the bulk of update processing inside a relational database. Specifically, a compressed representation of the XML views is stored as extended shared-inlining relations. A space-efficient and updatable 2-hop index is used to optimize XPATH evaluation on XML views. Updates of the XML views are evaluated on these relations and index. View update translation is handled by a heuristic procedure inside a relational database, as opposed to previous middleware approaches. We present an experimental study to demonstrate the effectiveness of our proposed techniques.
Similar content being viewed by others
References
Agrawal, R., Borgida, A., Jagadish, H.V.: Efficient management of transitive relationships in large data and knowledge bases. In: SIGMOD (1989)
Agrawal, R., Jagadish, H.V.: Hybrid transitive closure algorithms. In: VLDB, pp. 326–334 (1990)
Al-Khalifa, S., Jagadish, H.V., Patel, J.M., Wu, Y., Koudas, N., Srivastava, D.: Structural joins: a primitive for efficient xml query pattern matching. In: ICDE, pp. 141–152 (2002)
Batagelj, V., Mrvar, A.: Pajek datasets (2006). http://vlado.fmf.uni-lj.si/pub/networks/data/
Benedikt, M., Chan, C.Y., Fan, W., Rastogi, R., Zheng, S., Zhou, A.: DTD-directed publishing with attribute translation grammars. In: VLDB (2002)
Bohannon, P., Choi, B., Fan, W.: Incremental evaluation of schema-directed XML publishing. In: SIGMOD (2004)
Bohannon, P., Freire, J., Roy, P., Simeon, J.: From XML schema to relations: a cost-based approach to XML storage. In: ICDE, pp. 64–75 (2002)
Braganholo, V.P., Davidson, S.B., Heuser, C.A.: From XML view updates to relational view updates: old solutions to a new problem. In: VLDB (2004)
Buneman, P., Choi, B., Fan, W., Hutchison, R., Mann, R., Viglas, S.: Vectorizing and querying large xml repositories. In: ICDE, pp. 261–272 (2005)
Buneman, P., Grohe, M., Koch, C.: Path Queries on Compressed XML. In: VLDB, pp. 436–445 (2003)
Carey, M.J., Kiernan, J., Shanmugasundaram, J., Shekita, E.J., Subramanian, S.N.: XPERANTO: Middleware for publishing object-relational data as XML documents. In: VLDB (2000)
Cheng, J., Yu, J.X., Lin, X., Wang, H., Yu, P.S.: Fast computation of reachability labeling for large graphs. In: EDBT, pp. 961–979 (2006)
Cheng, J., Yu, J.X., Tang, N.: Fast reachability query processing. In: DASFAA, pp. 674–688 (2006)
Chien, S.-Y., Vagena, Z., Zhang, D., Tsotras, V.J., Zaniolo, C.: Efficient structural joins on indexed xml documents. In: VLDB, pp. 263–274 (2002)
Choi, B.: What are real DTDs like. In: WebDB (2002)
Choi B., Cong G., Fan W., Viglas S.: Updating recursive xml views of relations. J. Comput. Sci. Technol. 23(4), 516–537 (2008)
Choi, B., Fan, W., Jia, X., Kasprzyk, A.: A uniform system for publishing and maintaining xml data. In: VLDB, pp. 1301–1304 (2004)
Cohen E., Halperin E., Kaplan H., Zwick U.: Reachability and distance queries via 2-hop labels. SIAM J. Comput. 32(5), 1338–1355 (2003)
Cong, G., Fan, W., Geerts, F.: Annotation propagation revisited for key preserving views. In: CIKM, pp. 632–641 (2006)
Davidson, S., Kosky, A.: WOL: a language for database transformations and constraints. In: ICDE (1997)
Dayal, U., Bernstein, P.A.: On the correct translation of update operations on relational views. TODS, 7(3), (1982)
EBI. Gene ontology. http://www.geneontology.org/
Fan, W.: Xml publishing: bridging theory and practice. In: DBPL, pp. 1–16 (2007)
Fernandez M.F., Kadiyska Y., Suciu D., Morishima A., Tan W.C.: SilkRoute: A framework for publishing relational data in XML. TODS 27(4), 438–493 (2002)
Florescu D., Kossmann D.: Storing and Querying XML Data using an RDMBS. IEEE Data Eng. Bull. 22(3), 27–34 (1999)
IBM. IBM DB2 Universal Database SQL Reference. http://www.ibm.com/software/data/db2/
Ingber L.: Simulated annealing: practice versus theory. Math. Comput. Model. 18(11), 29–57 (1993)
Jiang, H., Lu, H., Wang, W., Ooi, B.: Xr-tree: Indexing xml data for efficient structural join. In: ICDE (2003)
Keller, A.: Algorithms for translating view updates to database updates for views involving selections, projections, and joins. In: PODS (1985)
Kirkpatrick, S., Gelatt, C.D., Vecchi, M.P.: Optimization by simulated annealing. Science, Number 4598, 13 May 1983, 220, 4598:671–680 (1983)
Laux, A., Martin, L.: XUpdate—XML Update Language (2000). http://www.xmldb.org/xupdate/xupdate-wd.html
Lu H., Yu J.X., Wang G., Zheng S., Jiang H., Yu G., Zhou A.: What makes the differences: benchmarking xml database implementations. ACM Trans. Internet Techn. 5(1), 154–194 (2005)
McMillan, K.L.: Symbolic model checking (1993)
Mitchell, D.G., Selman, B., Levesque, H.J.: Hard and easy distributions for SAT problems. In: AAAI-92, pp. 459–465 (1992)
Oracle. SQL Reference. http://www.oracle.com/technology/documentation/database10g.html
Schenkel, R., Theobald, A., Weikum, G.: Efficient creation and incremental maintenance of the hopi index for complex xml document collections. In: ICDE, pp. 360–371 (2005)
Selman, B., Kautz, H.: Walksat home page (2004). http://www.cs.washington.edu/homes/kautz/walksat/
Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational databases for querying xml documents: Limitations and opportunities. In: VLDB, pp. 302–314 (1999)
Sleepycat. Berkeley DB XML. http://sleepycat2.inetu.net/products/bdbxml.html
Software AG. Tamino XML Server. http://www.softwareag.com/tamino/details.htm
SQL server. MSDN Library. http://msdn.microsoft.com/library
Sur, G., Hammer, J., Simeon, J.: An XQuery-based language for processing updates in XML. In: PLAN-X (2004)
Tarjan R.: Depth-first search and linear graph algorithms. SIAM J. Comput. 1(2), 146–160 (1972)
Tolani, P.M., Haritsa, J.R.: Xgrind: a query-friendly xml compressor. In: ICDE, p. 225 (2002)
Wang, H., He2, H., Yang, J., Yu, P.S., Yu, J.X.: Dual labeling: Answering graph reachability queries in constant time. In: ICDE, p. 75, Washington, DC, USA, 2006. IEEE Computer Society, Washington
Wang, L., Mulchandani, M., Rundensteiner, E.: Updating XQuery views published over relational data: a round-trip case study. In: Xsym (2003)
Wang L., Rundensteiner E.A., Mani M.: Updating XML views published over relational databases: Towards the existence of a correct update mapping. DKE 58(3), 263–298 (2006)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bramandia, R., Cheng, J., Choi, B. et al. Optimizing updates of recursive XML views of relations. The VLDB Journal 18, 1313–1333 (2009). https://doi.org/10.1007/s00778-009-0141-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00778-009-0141-6