Amalgamating SGML documents and databases

  • Masatoshi Yoshikawa
  • Osamu Ichikawa
  • Shunsuke Uemura
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1057)


We propose a uniform and flexible mechanism to make reference links from SGML documents to database objects. In addition to typical document logical structures such as sections and paragraphs, our mechanism allows arbitrary character strings in documents as source of these links. By using this mechanism, SGML attributes and their values of marked-up words can be transparently stored as database attributes, and we can establish hyperlinks between keywords in documents, which reflect relationships between the corresponding database objects. Also, we present a query language to retrieve SGML documents which are coupled with databases in this manner. The query language does not assume a particular database schema; instead, it utilizes DTD graphs, representing element structures of DTDs, as virtual schemas.


Logical Structure Query Language Database Schema Character String Dynamic Link 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    G. E. Blake, M. P. Consens, I. J. Davis, P. Kilpeläinen, E. Kuikka, P.-Å. Larson, T. Snider, and F. W. Tompa. Text/Relational Database Management Systems: Overview and Proposed SQL Extensions. Technical Report CS-95-25, UW Centre for the New OED and Text Research, Department of Computer Science, University of Waterloo, June 1995.Google Scholar
  2. 2.
    V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl. From Structured Documents to Novel Query Facilities. In Proc. ACM SIGMOD International Conference on Management of Data, pages 313–324, May 1994.Google Scholar
  3. 3.
    V. Christophides and A. Rizk. Querying Structured Documents with Hypertext Links using OODBMS. In Proc. of ACM European Conference on Hypermedia Technology (ECHT'94), pages 186–197, September 1994.Google Scholar
  4. 4.
    Charles F. Goldfarb. The SGML Handbook. Clarendon Press, Oxford, 1990.Google Scholar
  5. 5.
    Gaston H. Gonnet. Tutorial: Text Dominated Databases, Theory Practice and Experience. In Proc. ACM Symp. on Principles of Database Systems, pages 301–302, May 1994.Google Scholar
  6. 6.
    ISO 8879: 1986. Information Processing — Text and Office System — Standard Generalized Markup Language (SGML), Oct. 15 1986.Google Scholar
  7. 7.
    ISO/IEC 10744: 1992. Hypermedia/Time-based Structuring Language (Hy Time), 1992.Google Scholar
  8. 8.
    I. A. Macleod. A Query Language for Retrieving Information from Hierarchic Text Structures. The Computer Journal, 34(3):254–264, 1991.Google Scholar
  9. 9.
    Ian A. Macleod. Storage and Retrieval of Structured Documents. Information Processing & Management, 26(2):197–208, 1990.Google Scholar
  10. 10.
    Ron Sacks-Davis, Timothy Arnold-Moore, and Justin Zobel. Database Systems for Structured Documents. In Proc. of the International Symposium on Advanced Database Technologies and Their Integration, pages 272–283, October 1994.Google Scholar
  11. 11.
    Eric van Herwijnen. Practical SGML. Kluwer Academic, 2nd edition, 1994.Google Scholar
  12. 12.
    Tak W. Yan and Jurgen Annevelink. Integrating a Structured-Text Retrieval System with an Object-Oriented Database System. In Proceedings of the Twentieth International Conference on Very Large Databases, pages 740–749, Santiago, Chile, 1994. Industrial Case.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1996

Authors and Affiliations

  • Masatoshi Yoshikawa
    • 1
  • Osamu Ichikawa
    • 2
  • Shunsuke Uemura
    • 1
  1. 1.Graduate School of Information ScienceNara Institute of Science and TechnologyNaraJapan
  2. 2.Semiconductor Research CenterMatsushita Electric Industrial Co., Ltd.OsakaJapan

Personalised recommendations