Online Integration of Fragmented XML Documents
Online data integration of large XML documents provides the most up-to-date results from the processing of user requests issued at a central site of a heterogeneous multi-database system. The fragments of large XML documents received from the remote sites are continuously combined with the most current state of the integrated documents. Online integration of fragmented XML documents has a positive impact on the performance of the entire online data integration system.
This paper presents online integration procedures for the fragments of large XML documents. We propose a new data model for fragmented XML documents and define a set of operations to manipulate the fragments. A new optimisation procedure presented in the paper finds the smallest core of each new fragment that can be integrated with the documents available at the central site. We show that processing the smallest cores of XML fragments significantly reduces overall processing time.
Keywords: Data integration, Online algorithm, Fragmented XML documents, Semistructured data