Skip to main content

Xandy: Detecting Changes on Large Unordered XML Documents Using Relational Databases

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNISA,volume 3453)

Abstract

Previous works in change detection on XML documents are not suitable for detecting the changes to large XML documents as it requires a lot of memory to keep the two versions of XML documents in the memory. In this paper, we take a more conservative yet novel approach of using traditional relational database engines for detecting the changes to large unordered XML documents. We elaborate how we detect the changes on unordered XML documents by using relational database. To this end, we have implemented a prototype system called Xandy that converts XML documents into relational tuples and detects the changes from these tuples by using SQL queries. Our experimental results show that the relational approach has better scalability compared to published algorithms like X-Diff. The result quality of our approach is comparable to the one of X-Diff.

Keywords

  • Execution Time
  • Leaf Node
  • Similarity Score
  • Relational Database
  • Internal Node

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, Y., Madria, S., Bhowmick, S.S.: DiffXML: Change Detection in XML Data. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 289–301. Springer, Heidelberg (2004)

    CrossRef  Google Scholar 

  2. Curbera, D.A.E.: Fast Difference and Update of XML Documents. In: XTech 1999, San Jose (1999)

    Google Scholar 

  3. Cobena, G., Abiteboul, S., Marian, A.: Detecting Changes in XML Documents. In: ICDE 2002, San Jose (2002)

    Google Scholar 

  4. Jiang, H., Lu, H., Wang, W., Xu Yu, J.: Path Materialization Revisited: An Efficient Storage Model for XML Data. In: Australasian Database Conference, Melbourne, Australia (2002)

    Google Scholar 

  5. Leonardi, E., Bhowmick, S.S., Madria, S.: Detecting Content Changes on Ordered XML Documents Using Relational Databases. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 580–590. Springer, Heidelberg (2004)

    CrossRef  Google Scholar 

  6. Leonardi, E., Bhowmick, S.S.: Xandy: Detecting Changes on Large Unordered XML Documents Using Relational Database. Technical Report, Center for Advanced Information System, Nanyang Technological University, Singapore (2004), http://www.cais.ntu.edu.sg/~erwin/docs/

  7. Prakash, S., Bhowmick, S.S., Mardia, S.: SUCXENT: An Efficient Path-based Approach to Store and Query XML Documents. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 285–295. Springer, Heidelberg (2004)

    CrossRef  Google Scholar 

  8. Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational Databases for Querying XML Documents: Limitations and Opportunities. The VLDB Journal (1999)

    Google Scholar 

  9. Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. Journal Molecular Biology 147, 195–197 (1981)

    CrossRef  Google Scholar 

  10. Wang, Y., DeWitt, D.J., Cai, J.: X-Diff: An Effective Change Detection Algorithm for XML Documents. In: ICDE 2003, Bangalore (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leonardi, E., Bhowmick, S.S., Madria, S. (2005). Xandy: Detecting Changes on Large Unordered XML Documents Using Relational Databases. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_65

Download citation

  • DOI: https://doi.org/10.1007/11408079_65

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25334-1

  • Online ISBN: 978-3-540-32005-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics