Representing and reasoning on SGML documents

  • Diego Calvanese
  • Giuseppe De Giacomo
  • Maurizio Lenzerini
Communications, Session 7A: Intelligent Information Systems
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1325)


In this paper, we address the issue of representing and reasoning about documents for which an explicit structure is provided. Specifically, we devise a framework where Document Type Definitions (DTDs) expressed in the Standard Generalized Markup Language (SGML) are formalized in an expressive Description Logic equipped with sound, complete, and terminating inference procedures. In this way, we provide a general reasoning mechanism that supports various reasoning tasks on DTDs, including the verification of typical forms of equivalence between DTDs, such as strong equivalence and structural equivalence, as well as parametric versions of these equivalences. Notably, this general reasoning mechanism allows structural equivalence to be verified in worst-case deterministic exponential time, in contrast to the known algorithms, which are doubly exponential. As a whole, the study in this paper provides some of the fundamental building blocks for developing articulated inference systems that support tasks involving the intelligent navigation of large document databases such as the World Wide Web.
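
The two equivalences named in the abstract can be illustrated with a small brute-force sketch. It is not the Description Logic procedure of the paper. It assumes the usual informal reading of the terms, which the abstract itself does not spell out: two DTDs are strongly equivalent if they admit the same marked-up documents, and structurally equivalent if they admit the same documents once tag names are erased. The dictionary encoding of a DTD, the placeholder string "text" for character data, and all function names are hypothetical and chosen only for this example. The check is bounded by nesting depth, so it can refute equivalence but can confirm it only up to that depth.

```python
from itertools import product

PCDATA = "#PCDATA"  # marker for character data (assumed encoding)


def documents(dtd, element, depth, erase_tags=False):
    """Return every document rooted at `element` with nesting depth <= depth.

    `dtd` maps an element name to a list of alternatives; each alternative
    is a tuple of child element names (or PCDATA).
    """
    if element == PCDATA:
        return {"text"}  # a single placeholder stands for any character data
    if depth == 0:
        return set()
    name = "" if erase_tags else element
    result = set()
    for alternative in dtd[element]:
        child_sets = [
            documents(dtd, child, depth - 1, erase_tags) for child in alternative
        ]
        # Combine one document per child, in order.
        for combo in product(*child_sets):
            result.add(f"<{name}>{''.join(combo)}</{name}>")
    return result


def bounded_strong_equivalent(dtd1, root1, dtd2, root2, depth):
    """Same marked-up documents up to the given depth."""
    return documents(dtd1, root1, depth) == documents(dtd2, root2, depth)


def bounded_structural_equivalent(dtd1, root1, dtd2, root2, depth):
    """Same documents up to the given depth once tag names are erased."""
    return documents(dtd1, root1, depth, erase_tags=True) == documents(
        dtd2, root2, depth, erase_tags=True
    )


# Two toy DTDs with the same shape but different element names.
dtd_a = {"note": [("title", "body")], "title": [(PCDATA,)], "body": [(PCDATA,)]}
dtd_b = {"memo": [("head", "para")], "head": [(PCDATA,)], "para": [(PCDATA,)]}

print(bounded_strong_equivalent(dtd_a, "note", dtd_b, "memo", 3))
print(bounded_structural_equivalent(dtd_a, "note", dtd_b, "memo", 3))
```

In this toy case the two DTDs differ only in their element names, so the strong check fails while the structural check succeeds. Exhaustive enumeration of this kind grows quickly with depth and cannot settle equivalence for recursive DTDs, which is the kind of limitation a terminating inference procedure is meant to remove.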


Description Logic; Reasoning Task; Document Type Definition; Terminal Symbol; Nonterminal Symbol





Copyright information

© Springer-Verlag Berlin Heidelberg 1997

Authors and Affiliations

  • Diego Calvanese (1)
  • Giuseppe De Giacomo (1)
  • Maurizio Lenzerini (1)
  1. Dipartimento di Informatica e Sistemistica, Università di Roma “La Sapienza”, Roma, Italy
