Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Structural Indexing

  • Mariano P. Consens
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_376

Synonyms

Dataguide; Path index; Sketch; Structural index; Structural summary; Synopsis

Definition

Structure indexing creates summaries of the structure present in semi-structured data collections by grouping data items with similar structure, providing a mechanism to index such items. Since semi-structured data models are commonly represented by labeled graphs or trees (the XML data model being a prime example), structural indexes or summaries are naturally described as graphs where nodes represent sets of data items (called extents), and where edges represent structural relationships between the corresponding extents derived from the instance data. A concrete physical index can be created by selecting appropriate data structures to store the graph and the extents.

Structure indexing helps to find data items that satisfy structural constraints in queries by locating nodes in the structural summary graph that satisfy the query conditions (expecting far less summary nodes than data...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Buneman P, Choi B, Fan W, Hutchison R, Mann R, Viglas S. Vectorizing and querying large XML repositories. In: Proceedings of the 21st International Conference on Data Engineering; 2005. p. 261–72.Google Scholar
  2. 2.
    Chung C-W, Min J-K, Shim K. APEX: an adaptive path index for XML data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 121–32.Google Scholar
  3. 3.
    Consens MP, Milo T. Optimizing queries on files. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1994. p. 301–12.Google Scholar
  4. 4.
    Consens MP, Rizzolo F, Vaisman AA. AxPRE summaries: exploring the (semi-)structure of XML web collections. In: Proceedings of the 24th International Conference on Data Engineering; 2008.p. 1519–21.Google Scholar
  5. 5.
    Freire J, Haritsa JR, Ramanath M, Roy P, Simeon J. StatiX: making XML count. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 181–91.Google Scholar
  6. 6.
    Goldman R, Widom J. Dataguides: enabling query formulation and optimization in semistructured databases. In: Proceedings of the 23th International Conference on Very Large Data Bases; 1997. p. 436–45.Google Scholar
  7. 7.
    Kaushik R, Bohannon P, Naughton JF, Korth HF. Covering indexes for branching path queries. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 133–44.Google Scholar
  8. 8.
    Kaushik R, Shenoy P, Bohannon P, Gudes E. Exploiting local similarity for indexing paths in graph-structured data. In: Proceedings of the 18th International Conference on Data Engineering; 2002. p. 129–40.Google Scholar
  9. 9.
    Milo T, Suciu D. Index structures for path expressions. In: Proceedings of the 7th International Conference on Database Theory; 1999. p. 277–95.Google Scholar
  10. 10.
    Nestorov S, Ullman JD, Wiener JL, Chawathe SS. Representative objects: concise representations of semistructured, hierarchial data. In: Proceedings of the 13th International Conference on Data Engineering; 1997. p. 79–90.Google Scholar
  11. 11.
    Polyzotis N, Garofalakis MN. XSketch synopses for XML data graphs. ACM Trans Database Syst. 2006;31(3):1014–63.CrossRefGoogle Scholar
  12. 12.
    Qun C, Lim A, Ong KW. D(k)-index: an adaptive structural summary for graph-structured data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003. p. 134–44.Google Scholar
  13. 13.
    Rizzolo F, Mendelzon AO. Indexing XML data with ToXin. In: Proceedings of the 4th International Workshop on the Web and Databases; 2001. p. 49–54.Google Scholar
  14. 14.
    Young-Lai M, Tompa FW. One-pass evaluation of region algebra expressions. Inform Syst. 2003;28(3):159–68.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.University of TorontoTorontoCanada

Section editors and affiliations

  • Frank Tompa
    • 1
  1. 1.David R. Cheriton School of Computer ScienceUniv. of WaterlooWaterlooCanada