Advances in Web-Age Information Management

Volume 4016 of the series Lecture Notes in Computer Science pp 1-12

On-Demand Index for Efficient Structural Joins

  • Kun-Lung WuAffiliated withIBM T.J. Watson Research Center
  • , Shyh-Kwei ChenAffiliated withIBM T.J. Watson Research Center
  • , Philip S. YuAffiliated withIBM T.J. Watson Research Center

* Final gross prices may vary according to local VAT.

Get Access


A structural join finds all occurrences of structural, or containment, relationship between two sets of XML node elements: ancestor and descendant. Prior approaches to structural joins mostly focus on maintaining offline indexes on disks or requiring the elements in both sets to be sorted. However, either one can be expensive. More important, not all node elements are beforehand indexed or sorted. We present an on-demand, in-memory indexing approach to performing structural joins. There is no need to sort the elements. We discover that there are similarities between the problems of structural joins and stabbing queries. However, previous work on stabbing queries, although efficient in search time, is not directly applicable to structural joins because of high storage costs. We develop two storage reduction techniques to alleviate the problem of high storage costs. Simulations show that our new method outperforms prior approaches.