Skip to main content

Online Integration of Fragmented XML Documents

  • Conference paper
  • First Online:
  • 1811 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10191))

Abstract

Online data integration of large XML documents provides the most up-to-date results from the processing of user requests issued at a central site of heterogeneous multi-database system. The fragments of large XML documents received from the remote sites are continuously combined with the most current state of integrated documents. Online integration of fragmented XML documents has a positive impact on performance of entire online data integration system.

This paper presents the online integration procedures for the fragments of large XML documents. We propose a new model of data for fragmented XML documents and we define a set of operations to manipulate the fragments. A new optimisation procedure presented in the paper finds the smallest core of each new fragment that can be integrated with the documents available at a central site. We show that processing of the smallest cores of XML fragments significantly reduces overall processing time.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bose, S., Fegaras, L.: Data stream management for historical XML data. SIGMOD 99(3), 403–422 (2004)

    Google Scholar 

  2. Bose, S., Fegaras, L., Levine, D., Chaluvadi, V.: A query algebra for fragmented XML stream data. In: Lausen, G., Suciu, D. (eds.) DBPL 2003. LNCS, vol. 2921, pp. 195–215. Springer, Heidelberg (2004). doi:10.1007/978-3-540-24607-7_13

    Chapter  Google Scholar 

  3. Braganholo, V., Mattoso, M.: A survey on XML fragmentation. SIGMOD Rec. 43(3), 24–35 (2014)

    Article  Google Scholar 

  4. Fegaras, L.: Incremental query processing on Big Data streams. CoRR, abs/1511.07846 (2015)

    Google Scholar 

  5. Handoko, Getta, J.R.: Dynamic query scheduling for online integration of semistructured data. In: 2015 IEEE 39th Annual Computer Software and Applications Conference (COMPSAC), vol. 3, pp. 375–380, July 2015

    Google Scholar 

  6. Ma, H., Schewe, K.-D.: Fragmentation of XML documents. J. Inf. Data Manage. 1(1), 21–33 (2010)

    Google Scholar 

  7. Özsu, T.M., Valduriez, P.: Principles of Distributed Database Systems, 3rd edn. Springer, Heidelberg (2011)

    Google Scholar 

  8. Wang, G., Huo, H., Han, D., Hui, X.: Query processing and optimization techniques over streamed fragmented XML. World Wide Web 11(3), 339–359 (2008)

    Article  Google Scholar 

  9. Wu, X., Theodoratos, D.: A survey on XML streaming evaluation techniques. VLDB J. 22(2), 177–202 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Handoko .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Handoko, Getta, J.R. (2017). Online Integration of Fragmented XML Documents. In: Nguyen, N., Tojo, S., Nguyen, L., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2017. Lecture Notes in Computer Science(), vol 10191. Springer, Cham. https://doi.org/10.1007/978-3-319-54472-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54472-4_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54471-7

  • Online ISBN: 978-3-319-54472-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics