A Content-Aware Adaptive Storage Approach for XML in PXRDB

  • Xue Wang
  • Xiao Zhang
  • Xiaoyong Du
  • Shan Wang
  • Kuicheng Liu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6637)

Abstract

In many cases, it is pretty difficult to choose an efficient storage method, such as native, xml-enabled or hybrid, for storing XML documents in a relational database. We provide multiple storage approaches for XML documents in our hybrid XML-relational database PXRDB( Pure XML-Relational DataBase). Further, another problem is how to automatically choose storage method for a given XML document and whether different documents in same column can be stored in different formats. In this paper, we provide a content-aware adaptive storage approach for XML in PXRDB. This novel storage approach automatically selects one better storage scheme for a specific XML document from three candidate schemata, i.e., native storage, flat stream and multi-relations after fast-checking its content. Our approach frees end-users or administrators from either having no choice or having to specify the specific storage scheme for large number of XML documents manually. It also allows different XML documents in same relational column to be stored in different formats while being accessed indistinctively. By providing unified access interfaces, new storage approaches can be easily registered in our system. The performance evaluation illustrates our approach is feasible and effective.

Keywords

Content-aware Adaptive Storage Approach XML-Relational Database 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Rys, M.: XML and Relational Database Management Systems: inside Microsoft SQL Server 2005. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 958–962 (2005)Google Scholar
  2. 2.
    Microsoft. White Paper: What’s New for XML in SQL Server, White Paper (2008)Google Scholar
  3. 3.
    Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton, J.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: VLDB 1999 (1999)Google Scholar
  4. 4.
    Tian, F., DeWitt, D., Chen, J., Zhang, C.: The Design and Performance Evaluation of Alternative XML Storage Strategies. ACM Sigmod Record 31(1) (March 2002)Google Scholar
  5. 5.
    Florescu, D., Kossmann, D.: A Performance Evaluation of Alter native mapping Schemas for Storing XML Data in a Relational Database. In: Proc. of the VLDB 1999 (1999)Google Scholar
  6. 6.
    Shanmugasundaram, J., Tufte, K., He, G., et al.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: VLDB 1999 (1999)Google Scholar
  7. 7.
    Boncz, P.A., Grust, T., Keulen, M., Manegold, S., Rittinger, J., Teubner, J.: MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In: SIGMOD Conference 2006, pp. 479–490 (2006)Google Scholar
  8. 8.
    Ennser, L., Delporte, C., Oba, M., Sunil, K.: Integrating XML with DB2 XML Extender and DB2 Text Extender. IBM Redbooks (2001)Google Scholar
  9. 9.
    Profressional XML, Wrox Press (2000) Google Scholar
  10. 10.
    DB2 goes hybrid Integrating native XML and XQuery with relational data and SQL 2006 (2006)Google Scholar
  11. 11.
    Liu, Z.H., Chandrasekar, S., Baby, T., Chang, H.J.: Towards a Physical XML independent XQuery/SQL/XML Engine. In: Proc. 34th Int. Conf. on Very Large Data Bases, pp. 1356–1367 (2008)Google Scholar
  12. 12.
    Chebotko, A.: Reconstructing XML subtrees from Relational Storage of XML documents. In: ICDE 2004 (2004)Google Scholar
  13. 13.
    Florescu, D., Kossmann, D.: A Performance Evaluation of Alternative Mapping Schemes for Storing XML in A Relational Database. Technical Report 3680, INRIA (1999)Google Scholar
  14. 14.
    Kanne, C.-C., Moerkotte, G.: A Linear Time Algorithm for Optimal Tree Sibling Partitioning and Approximation Algorithms in Natix. In: Proc. 32th Int. Conf. on Very Large Data Bases, (September 2006)Google Scholar
  15. 15.
    Fiebig, T., Helmer, S., Kanne, C.-C., Mildenberger, J., Moerkotte, G., Schiele, R., Westmann, T.: Anatomy of a Native XML Base Management System. The VLDB Journal 11(4), 292–314 (2002)CrossRefMATHGoogle Scholar
  16. 16.
    Kanne, C.-C., Moerkotte, G.: Efficient Storage of XML Data. In: Proc. 16th Int. Conf. on Data Engineering, pp. 198–209 (2000)Google Scholar
  17. 17.
    Beyer, K.S., Cochrane, R., Josifovski, V., Kleewein, J., Lapis, G., Lohman, G.M., Lyle, B., Ozcan, F., Pirahesh, H., Seemann, N., Truong, T.C., der Linden, B.V., Vickery, B., Zhang, C.: System RX: One Part Relational, One Part XML. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 347–358 (2005)Google Scholar
  18. 18.
    Funderburk, J.E., Kiernan, G., Shanmugasundaram, J., Shekita, E., Wei, C.: XTABLES: Bridging relational technology and XML. IBM Systems Journal 41(4), 616–641 (2002)CrossRefGoogle Scholar
  19. 19.
    Zhang, N., Agarwal, N., Chandrasekar, S.: Sum Idicula.: Binary XML Storage and Query Processing in Oracle 11g. In: Proc. 35th Int. Conf. on Very Large Data Bases (June 2009)Google Scholar
  20. 20.
    Jagadish, H.V., Al-Khalifa, S., Chapman, A., Lakshmanan, L.V.S., Nierman, A., Paparizos, S., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB Journal 11(1), 274–291 (2002)CrossRefMATHGoogle Scholar
  21. 21.
    Meier, W.: eXist: An Open Source Native XML Database, http://exist-db.org

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Xue Wang
    • 1
  • Xiao Zhang
    • 1
  • Xiaoyong Du
    • 1
  • Shan Wang
    • 1
  • Kuicheng Liu
    • 1
  1. 1.Key Laboratory of Data Engineering and Knowledge Engineering, MOERenmin University of ChinaBeijingChina

Personalised recommendations