Generating Relations from XML Documents

  • Sara Cohen
  • Yaron Kanza
  • Yehoshua Sagiv
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2572)

Abstract

This paper discusses several mechanisms for creating relations out of XML documents. A relation generator consists of two parts: (1) a tuple of path expressions and (2) an index indicating which path expressions may not be assigned the null value. Evaluating a relation generator involves finding tuples of nodes that satisfy the path expressions and are related to one another in a meaningful fashion. Different semantics for evaluation are given that take into account the possible presence of incomplete information. The complexity of generating relations from documents is analyzed and evaluation algorithms are described.
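
The two parts of a relation generator can be illustrated with a minimal sketch. It does not reproduce the semantics or algorithms of the paper. It assumes, purely for illustration, that nodes count as related when they lie under the same anchor element, and that each path expression is a simple ElementTree path evaluated relative to that anchor. The function name `generate_relation`, the `anchor_path` parameter, and the sample document are all hypothetical.

```python
import xml.etree.ElementTree as ET
from itertools import product

# Hypothetical sample document (not taken from the paper).
DOC = """
<library>
  <book><title>A</title><author>X</author><author>Y</author><year>2001</year></book>
  <book><title>B</title><author>Z</author></book>
  <book><author>W</author><year>1999</year></book>
</library>
"""


def generate_relation(root, anchor_path, paths, required):
    """Build rows from a tuple of path expressions.

    paths    -- tuple of path expressions, one per column
    required -- set of column indexes that may not be null

    Assumption: two nodes are "related" if they share the same anchor
    element. The paper defines its own notions of relatedness.
    """
    rows = []
    for anchor in root.findall(anchor_path):
        columns = []
        for i, path in enumerate(paths):
            values = [(node.text or "").strip() for node in anchor.findall(path)]
            if not values:
                if i in required:
                    break  # a non-nullable column has no match: skip this anchor
                values = [None]  # nullable column: pad with a null
            columns.append(values)
        else:
            # Every column has at least one value: emit all combinations.
            rows.extend(product(*columns))
    return rows


root = ET.fromstring(DOC.strip())
rows = generate_relation(root, "book", ("title", "author", "year"), required={0})
for row in rows:
    print(row)
```

With `required={0}`, a book without a title contributes no rows, while a book without a year still contributes a row whose last column is `None`. This is one simple way of handling incomplete information, chosen here only to show the role of the index.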

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Sara Cohen (1)
  • Yaron Kanza (1)
  • Yehoshua Sagiv (1)
  1. School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel