Skip to main content

Evaluating Queries on Structure with eXtended Access Support Relations

  • Conference paper
  • First Online:

Part of the Lecture Notes in Computer Science book series (LNCS,volume 1997)

Abstract

There are three common design decisions taken by today’s search engines. First, they do not replicate the data found on the Web. Second, they rely on full-text indexes instead. Third, they do not support the querying of document structure. The main reason for the latter is that HTML’s ability to express semantics with syntactic structure is very limited. This is different for XML since it allows for self-describing data. Due to its flexibility by inventing arbitrary new element and attribute names, XML allows to encode semantics within syntax. The consequence is that search engines for XML should support the querying of structure. In our current work on search engines for XML data on the Web, we want to keep the first two design decisions of traditional search engines but modify the last one according to the new requirements implied by the necessity to query structure. Since our search engine accepts queries with structural information, a full-text index does not suffice any longer. What is needed is a scalable index structure that allows to answer queries over the structure of XML documents. One possible index structure called eXtended Access Support Relation (XASR) is introduced. Further, we report on a search engine for XML data called Mumpits. Due to its prototypical character, we intentionally kept the design and implementation of Mumpits very simple. Its design is centered around a single XASR and its implementation heavily builds on a commercial relational database management system.

Keywords

  • Search Engine
  • Query Language
  • Path Expression
  • Path Connector
  • Semistructured Data

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/3-540-45271-0_8
  • Chapter length: 12 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   74.99
Price excludes VAT (USA)
  • ISBN: 978-3-540-45271-3
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   99.00
Price excludes VAT (USA)

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. The BIOpolymer Markup Language (BIOML) Home Page.http://www.bioml.com/BIOML/index.html.

    Google Scholar 

  2. S. Abiteboul, P. Buneman, and D. Suciu. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufman, 1999.

    Google Scholar 

  3. E. Bertino and W. Kim. Indexing techniques for queries on nested objects. IEEE Trans. on Knowledge and Data Engineering, 1(2):196–214, Jun 1989.

    CrossRef  Google Scholar 

  4. T. Bray, J. Paoli, and C. M. Sperberg-McQueen. Extensible markup language (xml) 1.0. Technical report, World Wide Web Consortium, 1998. W3C Recommendation 10-Feb-98.

    Google Scholar 

  5. J. Chen, D. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query system for internet databases. In Proc. of the ACM SIGMOD Conf. on Management of Data, 2000. to appear.

    Google Scholar 

  6. J. Clark and S. DeRose. XML path language (XPath) version 1.0. Technical report, World Wide Web Consortium, 1999. W3C Recommendation 16 Nov. 1999.

    Google Scholar 

  7. T. Cormen, C. Leiserson, and R. Rivest. Introduction to Algorithms. MIT Press, 1989.

    Google Scholar 

  8. A. Deutsch, M. Fernandez, and D. Suciu. Storing semistructured data with STORED. In Proc. of the ACM SIGMOD Conf. on Management of Data, 1999.

    Google Scholar 

  9. D. Florescu and D. Kossmann. Storing and querying XML data using and RDBMS. IEEE Data Engineering Bulletin, 22(3):27–34, 1999.

    Google Scholar 

  10. R. Goldman, J. McHugh, and J. Widom. From semistructured data to XML: Migrating the Lore data model and query language. In ACM SIGMOD Workshop on the Web and Databases (WebDB), 1999.

    Google Scholar 

  11. A. Kemper and G. Moerkotte. Access support in object bases. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 364–374, 1990.

    Google Scholar 

  12. A. Kemper and G. Moerkotte. Advanced query processing in object bases using access support relations. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 294–305, 1990.

    Google Scholar 

  13. D. Maier and J. Stein. Indexing in an object-oriented DBMS. In Proc. IEEE Intl. Workshop on Object-Oriented Database Systems, Asilomar, Pacific Grove, CA, pages 171–182. IEEE Computer Society Press, 1986.

    Google Scholar 

  14. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. ACM SIGMOD Record, 26(3):54–66, 1997.

    CrossRef  Google Scholar 

  15. J. Shanmugasundaram, H. Gang, K. Tufte, C. Yhang, D. J. DeWitt, and J. Naughton. Relational databases for querying xml documents: Limitations and opportunities. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 302–314, 1999.

    Google Scholar 

  16. B. Shidlowsky and E. Bertino. A graph-theoretic approach to indexing in object-oriented databases. In Proc. IEEE Conference on Data Engineering, pages 230–237, 1996.

    Google Scholar 

  17. R. Van Zwol, P. Apers, and A. Wilschut. Modeling and querying semistructured data with MOA. In ICDT’99 Workshop on Query Processing for semistructured data, 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fiebig, T., Moerkotte, G. (2001). Evaluating Queries on Structure with eXtended Access Support Relations. In: Goos, G., Hartmanis, J., van Leeuwen, J., Suciu, D., Vossen, G. (eds) The World Wide Web and Databases. WebDB 2000. Lecture Notes in Computer Science, vol 1997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45271-0_8

Download citation

  • DOI: https://doi.org/10.1007/3-540-45271-0_8

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41826-9

  • Online ISBN: 978-3-540-45271-3

  • eBook Packages: Springer Book Archive

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.