Refining OEM to Improve Features of Query Languages for Semistructured Data

  • Pavel Hlousek
  • Jaroslav Pokorny

Abstract

Semistructured data is often described as “schemaless” or “self-describing”, meaning that there is no separate description of the type or structure of the data. This contrasts with structured approaches, such as relational databases, where the data structure is usually designed first and described as a database schema. Semistructured data is data whose structure is irregular, heterogeneous, and partial, that has no fixed format, and that evolves quickly. These characteristics are typical of data available on the Web (HTML pages, e-mail message bases, bookmark collections, etc.). In the late 1990s, research on semistructured data aimed at extending database management techniques to such data (Suciu, 1998).

Keywords

Query Language; Data Graph; Path Expression; Wild Card; Semistructured Data

References

  1. Abiteboul, S., Quass, D., McHugh, J., Widom, J., and Wiener, J., 1996, The Lorel query language for semistructured data, International Journal on Digital Libraries, 1(1), pp. 68–88.
  2. Abiteboul, S., and Suciu, D., 2000, Data on the Web: From Relations to Semistructured Data and XML, Data Management Systems, 1st edition, Morgan Kaufmann.
  3. Bourret, R., 2001, XML and Databases; http://www.rpbourret.com/xml/XMLAndDatabases.htm.
  4. Buneman, P., Davidson, S., Hillebrand, G., and Suciu, D., 1996, A query language and optimization techniques for unstructured data, (Jagadish, H.V. and Mumick, I.S. Eds.), SIGMOD, pp. 505–516. ACM Press.
  5. Bray, T., Paoli, J., and Sperberg-McQueen, C. M., 1998, Extensible Markup Language (XML) 1.0, February 1998; http://www.w3.org/TR/1998/REC-xml-19980210.
  6. Cattell, R.G.G. et al., 2000, The Object Database Standard: ODMG 3.0, Morgan Kaufmann Publishers, Inc.
  7. Deutsch, A., Fernandez, M., Florescu, D., Levy, A., and Suciu, D., 1998, XML-QL: A query language for XML; http://www.w3.org/TR/1998/NOTE-xml-ql-19980819.html.
  8. Hlousek, P., 2000, MailQL, query language for an email message base, Master’s thesis, Charles University, Prague. In Czech.
  9. Papakonstantinou, Y., Garcia-Molina, H., and Widom, J., 1995, Object exchange across heterogeneous information sources, in Proceedings of the Eleventh International Conference on Data Engineering, Yu, Ph. S. and Chen, A.L.P. eds., pp. 251–260, IEEE Comp. Soc.
  10. Pokorny, J., 2001, XML: a challenge for databases? Chap. 13 in: Contemporary Trends in Systems Development, Sein, M. et al, eds., Kluwer Academic Publishers, Boston, pp. 147–164.
  11. Suciu, D., 1998, An Overview of Semistructured Data, SIGACTN: SIGACT News (ACM Special Interest Group on Automata and Computability Theory), 29.
  12. W3C, 1998, Extensible Markup Language (XML) 1.0; http://www.w3.org/TR/REC-xml.
  13. W3C, 2001, XQuery 1.0: An XML Query Language, W3C Working Draft 07; http://www.w3.org/TR/xquery/.

Copyright information

© Springer Science+Business Media New York 2002

Authors and Affiliations

  • Pavel Hlousek (1)
  • Jaroslav Pokorny (1)
  1. Dept. of Software Engineering, Faculty of Mathematics and Physics, Charles University, Praha 1, Czech Republic
