Abstract
With the ever-increasing connectivity among XML information systems on the Web, users can obtain integrated sources of XML information cooperatively, for example by developing an XML mediator schema or by using Extensible Stylesheet Language Transformations (XSLT). However, evaluating the quality of such merged XML data is not trivial, even when the underlying XML data sources are known. We present a unifying framework for merging XML data and study the quality issues of merged XML information. Two metrics of the merged data, Information Completeness (IC) and Data Complexity (DC), capture the coverage of the object sources and the structural diversity of the XML data objects, respectively.
Keywords
- Unify Framework
- Core Node
- Density Score
- Coverage Score
- Merge Result
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lau, HL., Ng, W. (2005). A Unifying Framework for Merging and Evaluating XML Information. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_10
DOI: https://doi.org/10.1007/11408079_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25334-1
Online ISBN: 978-3-540-32005-0
eBook Packages: Computer Science (R0)
