Chapter

Database and Expert Systems Applications

Volume 3588 of the series Lecture Notes in Computer Science pp 576-585

Searching Multi-hierarchical XML Documents: The Case of Fragmentation

  • Alex DekhtyarAffiliated withDepartment of Computer Science, University of Kentucky
  • , Ionut E. IacobAffiliated withDepartment of Computer Science, University of Kentucky
  • , Srikanth MethukuAffiliated withDepartment of Computer Science, University of Kentucky

Abstract

To properly encode properties of textual documents using XML, multiple markup hierarchies must be used, often leading to conflicting markup in encodings. Text Encoding Initiative (TEI) Guidelines [1] recognize this problem and suggest a number of ways to incorporate multiple hierarchies in a single well-formed XML document. In this paper, we present a framework for processing XPath queries over multi-hierarchical XML documents represented using fragmentation, one of the TEI-suggested techniques. We define the semantics of XPath over DOM trees of fragmented XML, extend the path expression language to cover overlap in markup, and describe FragXPath, our implementation of the proposed XPath semantics over fragmented markup.