Skip to main content

SemiLog: A Logic-Based Query Language for Hierarchical Data in Web Documents

  • Conference paper
Internet Applications (ICSC 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1749))

Included in the following conference series:

  • 333 Accesses

Abstract

Most of the textual information posted on the Web are in documents which conform to the HTML [12] or recently emerging XML [4] specification. In the past, a number of query languages have been proposed for querying data in Web documents. We notice that these query languages are incapable of inferring hierarchically structured data from linked Web documents as well as within a Web document itself. In this paper, we propose a logic-based query language, called SemiLog, for retrieving data in Web documents that are hierarchically structured. SemiLog is capable of handling recursive queries, which infer data that are not explicitly presented in hierarchically structured Web documents, and processing partial knowledge of data in Web documents with irregular structure to answer a given query.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Quass, D., McHugh, J., Widom, J., Wiener, J.: The Lorel Query Language for Semistructured Data. Journal on Digital Libraries 1(1), 68–88 (1997)

    Google Scholar 

  2. Abiteboul, S., Cluet, S., Christophides, V., Milo, T., Moerkotte, G., Simeon, J.: Querying Documents in Object Databases. Int. J. on Digital Libraries, 5–19 (1997)

    Google Scholar 

  3. Arocena, G.O., Mendelzon, A.O.: WebOQL: Restructuring Documents, Databases and Webs. In: Proceedings of the 14th Intl. Conf. on Data Engineering, pp. 24–33 (1998)

    Google Scholar 

  4. Bray, T., Paoli, J., Sperberg-McQueen, C.: Extensible Markup Language (XML) 1.0 W3C Recommendation (February 10 1998), http://www.w3.org/TR/1998/RECxml-19980210

  5. Florescu, D., Levy, A., Mendelzon, A.O.: Database Techniques for the World-Wide Web: A Survey. SIGMOD Record 27(3), 59–74 (1998)

    Article  Google Scholar 

  6. Lakshmanan, L.V.S., Sadri, F., Subramanian, I.N.: A Declarative Language for Querying and Restructuring the Web. In: Post-ICDE IEEE Workshop on Research Issues in Data Engineering (February 1996)

    Google Scholar 

  7. Lim, S.-J., Ng, Y.-K.: WebView: A Tool for Retrieving Internal Structures and Extracting Information from HTML Documents. In: Proceedings of the 6th International Conference on Database Systems for Advanced Applications, pp. 71–80 (April 1999).

    Google Scholar 

  8. Lim, S.-J., Ng, Y.-K.: SemiLog: A Logic-Based Query Language for Hierarchical Data in Web Documents, http://lunar.cs.byu.edu/papers.html/semi.ps

  9. Lloyd, J.W.: Foundations of Logic Programming, 2nd edn. Springer, New York (1993) (extended edition)

    MATH  Google Scholar 

  10. Mendelzon, A.O., Mihaila, G., Milo, T.: Querying the World Wide Web. In: Proceedings of the Conf. on Parallel and Distributed Information Systems, pp. 80–91 (1996)

    Google Scholar 

  11. Papakonstantinou, Y., Abiteboul, S., Garcia-Molina, H.: Object Fusion in Mediator Systems. In: Proceedings of the 22nd Intl. Conf. on VLDB, pp. 413–424 (1996)

    Google Scholar 

  12. Raggett, D., Hors, A.L., Jacobs, I.: HTML 4.0 Specification - W3C Recommendation (April 1998), http://www.w3.org/TR/REC-html40

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lim, SJ., Ng, YK. (1999). SemiLog: A Logic-Based Query Language for Hierarchical Data in Web Documents. In: Hui, L.C.K., Lee, DL. (eds) Internet Applications. ICSC 1999. Lecture Notes in Computer Science, vol 1749. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-46652-9_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-46652-9_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66903-6

  • Online ISBN: 978-3-540-46652-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics