Repairing Inconsistent XML Documents

  • Zijing Tan
  • Wei Wang
  • JianJun Xu
  • Baile Shi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4092)


XML document may contain inconsistencies that violate predefined integrity constraints, and there are two basic concepts for this problem: Repair is the data consistent with the integrity constraints, and also minimally differs from the original one. Consistent data is the data common for every possible repair. In this paper, first we give a general constraint model for XML, which can express functional dependencies, keys and multivalued dependencies. Next we provide a repair framework for inconsistent XML document with three basic update operations: node insertion, node deletion and value modification. Following this approach, we introduce the concept of repair for inconsistent XML document, discuss the chase process to generate repairs, and prove some important properties of the chase process. Finally we give a method to obtain the greatest lower bound of all possible repairs, which is sufficient for consistent data.


Child Node Symbol Mapping Consistent Data Integrity Constraint Element Node 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Abiteboul, S., Segoufin, L., Vianu, V.: Representing and Querying XML with Incomplete Information. In: PODS, pp. 35–47 (2001)Google Scholar
  2. 2.
    Arenas, M., Bertossi, L.E., Chomick, J.: Consistent Query Answers in Inconsistent Databases. In: PODS, pp. 68–79 (1999)Google Scholar
  3. 3.
    Arenas, M., Libkin, L.: A Normal Form for XML Documents. TODS 29(1), 195–232 (2004)CrossRefGoogle Scholar
  4. 4.
    Arenas, M., Libkin, L.: XML Data Exchange: Consistency and Query Answering. In: PODS, pp. 13–24 (2005)Google Scholar
  5. 5.
    Bohannon, P., Fan, W.F., Flaster, M., Rastogi, R.: A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification. In: SIGMOD, pp. 143–154 (2005)Google Scholar
  6. 6.
    Buneman, P., Davidson, S., Fan, W., Hara, C., Tan, W.: Reasoning about Keys for XML. In: Database Programming Languages, pp. 133–148 (2002)Google Scholar
  7. 7.
    Bravo, L., Bertossi, L.: Logic programs for consistently querying data integration systems. In: IJCAI, pp. 10–15 (2003)Google Scholar
  8. 8.
    Chomicki, J., Marcinkowski, J.: Minimal-Change Integrity Maintenance Using Tuple Deletions. Information and Computation 197(1-2), 90–121 (2005)zbMATHCrossRefMathSciNetGoogle Scholar
  9. 9.
    Greco, G., Greco, S., Zumpano, E.: A logical framework for querying and repairing inconsistent databases. IEEE Transaction on Knowledge and Data Engineering 15(6), 1389–1408 (2003)CrossRefGoogle Scholar
  10. 10.
    Ng, W.: Repairing Inconsistent Merged XML Data. In: Mařík, V., Štěpánková, O., Retschitzegger, W. (eds.) DEXA 2003. LNCS, vol. 2736, pp. 244–255. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  11. 11.
    Vincent, M.W., Liu, J.: Multivalued Dependencies and a 4NF for XML. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, pp. 14–29. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  12. 12.
    Wijsen, J.: Database Repairing Using Updates. TODS 30(3), 722–768 (2005)CrossRefGoogle Scholar
  13. 13.
    Extensible Markup Language (XML) 1.0, 2nd edn. W3C Recommendation (October 2000),
  14. 14.
    XML Schema Part 1: Structures. W3C Recommendation (May 2001),

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Zijing Tan
    • 1
  • Wei Wang
    • 1
  • JianJun Xu
    • 1
  • Baile Shi
    • 1
  1. 1.Department of Computing and Information TechnologyUniversity of FudanShanghaiChina

Personalised recommendations