Entity-Oriented Queries for Semistructured Data

  • Keishi Tajima
Part of the The Springer International Series in Engineering and Computer Science book series (SECS, volume 579)

Abstract

In this paper, we propose an entity-oriented style of queries for semistructured data. A semistructured data represented by a large graph usually consists of subgraphs corresponding to real-world entities, in other words, subgraphs representing composite objects in object-oriented database terminology. To detect such composite objects in semistructured data, we use the exclusiveness of references. If a reference is exclusive, then we regard it as a composite link, i.e. a link connecting parts of one composite object. Then we extend usual semistructured query languages by introducing new language constructs for path expressions that match only with composite links or non-composite links. By using those constructs in combination with wild cards, we can specify queries of a form like “retrieve all entities including these data items,” which we call entity-oriented style queries. This style of query is useful especially when one does not have enough knowledge on the structure of the database.

Keywords

semistructured data query entity composite object structure discovery path expressions 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S. (1997). Querying semi-structured data. In Proc. of ICDT, volume 1186 of LNCS, pages 1–18. Springer-Verlag.Google Scholar
  2. Abiteboul, S., Quass, D., McHugh, J., Widom, J., and Wiener, J. L. (1997). The Lorel query language for semistructured data. International Journal of Digital Libraries, l(l):68–88.Google Scholar
  3. Botafogo, R. A. and Shneiderman, B. (1991). Identifying aggregates in hypertext structures. In Proc. of ACM Hypertext, pages 63–74.Google Scholar
  4. Buneman, P. (1997). Semistructured data. In Proc. of ACM PODS, pages 117–121.CrossRefGoogle Scholar
  5. Buneman, P., Davidson, S., Hillebrand, G., and Suciu, D. (1996). A query language and optimization techniques for unstructured data. In Proc. of ACM SIGMOD, pages 505–516.Google Scholar
  6. Huffman, S. and Baudin, C. (1996). Notes explorer: Entity-based retrieval in shared, semi-structured information spaces. In Proc. of ACM CIKM, pages 99–106.Google Scholar
  7. Kim, W., Bertino, E., and Garza, J. F. (1989). Composite objects revisited. In Proc. of ACM SIGMOD, pages 337–347.Google Scholar
  8. Mendelzon, A. O. and Milo, T. (1997). Formal models of Web queries. In Proc. of ACM PODS, pages 134–143.CrossRefGoogle Scholar
  9. Nestorov, S., Abiteboul, S., and Motwani, R. (1997). Inferring structure in semistructured data. In Proc. of Workshop on Management of Semistructured Data (in Conjunction with PODS/SIGMOD).Google Scholar
  10. Nestorov, S., Abiteboul, S., and Motwani, R. (1998). Extracting schema from semistructured data. In Proc. of ACM SIGMOD, pages 295–306.Google Scholar
  11. Papakonstantinou, Y., Garcia-Molina, H., and Widom, J. (1995). Object exchange across heterogeneous information sources. In Proc. of IEEE ICDE, pages 251–260.Google Scholar
  12. Seo, D.-Y., Lee, D.-H., Lee, K.-M., and Lee, J.-Y. (1997). Discovery of schema information from a forest of selectively labeled ordered trees. In Proc. of Workshop on Management of Semistructured Data (in Conjunction with PODS/SIGMOD).Google Scholar
  13. Tajima, K., Mizuuchi, Y., Kitagawa, M., and Tanaka, K. (1998). Cut as a querying unit for WWW, Netnews, and E-mail. In Proc. of ACM Hypertext, pages 235–244.Google Scholar

Copyright information

© Springer Science+Business Media New York 2000

Authors and Affiliations

  • Keishi Tajima
    • 1
  1. 1.Department of Computer and Systems EngineeringKobe UniversityJapan

Personalised recommendations