Adding semantics to SGML databases

  • Subhasish Mazumdar
  • Gary Yuan
  • Weifeng Bao
  • Jonathan Price
Part III: EP'98 Hypertext Creation and Management
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1375)


Huge collections of linked documents can now be efficiently stored. However, full online access and electronic publishing through reuse of document parts require sophistication and precision in queries. Such a query facility is only possible through the inclusion of appropriate semantic information. Manually adding such information to multigigabyte document sources is daunting for technical writers. Our approach aims at making this task feasible by exploiting a conceptual schema of the enterprise. The result is an integrated schema — one that covers the traditional information system of the enterprise as well as the information that exists solely in the world of documents.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    B. Amann and M. Scholl. GRAM: A Graph Data Model and Query Language. In Proceedings of the Fourth European Conference on Hypertext and Hypermedia ECHT'92., pages 201–211, 1992.Google Scholar
  2. 2.
    C. Batini, S. Ceri, and S. Navathe. Conceptual Database Design. Benjamin/Cummings, 1992.Google Scholar
  3. 3.
    G. Blake, M. Consens, P. Kilpeläeinen, P.-å. Larson, T. Snider, and F. Tompa. Text/Relational Database Management Systems: Harmonizing SQL and SGML. In W. Litwin and T. Risch, editors, Applications of Databases, Proceedings of the First International Conference ADB-94, pages 267–280. Springer-Verlag, 1994.Google Scholar
  4. 4.
    J. Callan, B. Croft, and S. Harding. The INQUERY Retrieval System. In Proceedings of the Third International Conference on Database and Expert Systems Application, pages 78–83, 1992.Google Scholar
  5. 5.
    V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl. From Structured Documents to Novel Query Facilities. In Proceedings of the ACM-SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, pages 313–324, 1994.Google Scholar
  6. 6.
    W. B. Croft and D. W. Stemple. Supporting Office Document Architectures with Constrained Types. In Proceedings of the ACM-SIGMOD International Conference on Management of Data, San Francisco, California, pages 504–509, 1987.Google Scholar
  7. 7.
    S. Dobson and V. Burrill. Lightweight Databases. In Proceedings of the Third International World Wide Web Conference, 1995.Google Scholar
  8. 8.
    C. Goldfarb. The SGML Handbook. Clarendon Press, Oxford, 1990.Google Scholar
  9. 9.
    R. H. Guting, R. Zicari, and D. Choy. An Algebra for Structured Office Documents. ACM Transactions on Office Information Systems, 7(4):123–157, April 1989.CrossRefGoogle Scholar
  10. 10.
    M. Gyssens, J. Paredaens, and D. Van Gucht. A Grammar-based Approach towards Unifying Hierarchical Data Models. In Proceedings of the ACM-SIGMOD International Conference on Management of Data, Portland, Oregon, pages 263–272, 1989.Google Scholar
  11. 11.
    ISO8879:1986. Information Processing — Text and Office System — Standard Generalized Markup Language (SGML), 1986.Google Scholar
  12. 12.
    W. Kim, H. Chou, and J. Banerjee. Operations and Implementation of Complex Objects. In Proceedings of the IEEE Third International Conference on Data Engineering, 1987.Google Scholar
  13. 13.
    A. Müller and U. Thiel. Query Expansion in an Abductive Information Retrieval System. In Proceedings of the RIAO'94. New York., pages 461–480, 1994.Google Scholar
  14. 14.
    J. Price. Introduction: Special Issue on Structuring Complex Information for Electronic Publication. IEEE Transactions on Professional Communication, 40(2):1–9, June 1997.MathSciNetCrossRefGoogle Scholar
  15. 15.
    C. Tattersall and A. Cole. Modelling the Content of Technical Documentation. In Proceedings of Electronic Publishing EP-92, pages 223–232, 1992.Google Scholar
  16. 16.
    E. van Herwijnen. Practical SGML. Kluwer Academic, 1994.Google Scholar
  17. 17.
    C. van Rijsbergen. Towards an Information Logic. In Proceedings of the Twelfth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1989.Google Scholar
  18. 18.
    M. Yoshikawa, O. Ichikawa, and S. Uemura. Amalgamating SGML Documents and Databases. In Proceedings of the 5th International Conference on Extending Database Technology, pages 259–274, 1996.Google Scholar
  19. 19.
    M. Zloof. Query By Example: a Data Base Language. IBM Systems Journal, 16(4):324–343, 1977.CrossRefGoogle Scholar
  20. 20.
    J. Zobel, A. Moffat, and R. Sacks-Davis. An Efficient Indexing Technique for Full Text Databases. In Proceedings of the Eighteenth International Conference on Very Large Databases, pages 352–362, 1992.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Subhasish Mazumdar
    • 1
  • Gary Yuan
    • 1
  • Weifeng Bao
    • 1
  • Jonathan Price
    • 2
  1. 1.Department of Computer ScienceSocorroUSA
  2. 2.Technical Communication ProgramNew Mexico Institute of Mining and TechnologySocorroUSA

Personalised recommendations