Skip to main content

A Unifying Framework for Merging and Evaluating XML Information

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNISA,volume 3453)

Abstract

With the ever increasing connection between XML information systems over the Web, users are able to obtain integrated sources of XML information in a cooperative manner, such as developing an XML mediator schema or using eXtensible Stylesheet Language Transformation (XSLT). However, it is not trivial to evaluate the quality of such merged XML data, even when we have the knowledge of the involved XML data sources. Herein, we present a unifying framework for merging XML data and study the quality issues of merged XML information. We capture the coverage of the object sources as well as the structural diversity of XML data objects, respectively, by the two metrics of Information Completeness (IC) and Data Complexity (DC) of the merged data.

Keywords

  • Unify Framework
  • Core Node
  • Density Score
  • Coverage Score
  • Merge Result

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bertino, E., Ferrari, E.: XML and Data Integration. IEEE Internet Computing. 5(6), 75–76 (2001)

    CrossRef  Google Scholar 

  2. Christophides, V., Cluet, S., Simeon, J.: On Wrapping Query Language and Efficient XML Integration. In: Proc. of SIGMOD Conference (2000)

    Google Scholar 

  3. Ives, Z.G., et al.: An Adaptive Query Execution System for Data Integration. In: Proc. of SIGMOD (1999)

    Google Scholar 

  4. Florescu, D., Koller, D., Levy, A.: Using probabilistic information in data integration. In: Proc. of VLDB (1997)

    Google Scholar 

  5. Lim, E., Srivastava, J., Prabhakar, S., Richardson, J.: Entity Identification in Database Integration. In: Proc. of ICDE (1993)

    Google Scholar 

  6. Motro, A., Rakov, I.: Estimating the quality of databases. In: Andreasen, T., Christiansen, H., Larsen, H.L. (eds.) FQAS 1998. LNCS (LNAI), vol. 1495, p. 298. Springer, Heidelberg (1998)

    CrossRef  Google Scholar 

  7. Naumann, F., Freytag, J.C., Leser, U.: Completeness of Information Sources. In: Proc. of DQCIS (2003)

    Google Scholar 

  8. Elmasri, Navathe: Fundamentals of Database Systems., 3rd edn. Addison Wesley, Reading (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lau, HL., Ng, W. (2005). A Unifying Framework for Merging and Evaluating XML Information. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_10

Download citation

  • DOI: https://doi.org/10.1007/11408079_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25334-1

  • Online ISBN: 978-3-540-32005-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics