Online Integration of Fragmented XML Documents

  • Handoko
  • Janusz R. Getta
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10191)


Online data integration of large XML documents provides the most up-to-date results from the processing of user requests issued at a central site of a heterogeneous multi-database system. The fragments of large XML documents received from the remote sites are continuously combined with the most current state of the integrated documents. Online integration of fragmented XML documents has a positive impact on the performance of the entire online data integration system.

This paper presents online integration procedures for the fragments of large XML documents. We propose a new data model for fragmented XML documents and define a set of operations to manipulate the fragments. A new optimisation procedure presented in the paper finds the smallest core of each new fragment that can be integrated with the documents available at the central site. We show that processing the smallest cores of XML fragments significantly reduces overall processing time.


Data integration, Online algorithm, Fragmented XML documents, Semistructured data



Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. Electronic and Computer Engineering Department, Satya Wacana Christian University, Salatiga, Indonesia
  2. School of Computer Science and Software Engineering, University of Wollongong, Wollongong, Australia
