Abstract
In this paper, we address the issue of representing and reasoning about documents for which an explicit structure is provided. Specifically, we devise a framework where Document Type Definitions (DTDs) expressed in the Standard Generalized Markup Language (SGML) are formalized in an expressive Description Logic equipped with sound, complete, and terminating inference procedures. In this way, we provide a general reasoning mechanism that enables various reasoning tasks on DTDs, including the verification of typical forms of equivalence between DTDs, such as strong equivalence and structural equivalence, as well as parametric versions of these equivalences. Notably, this general reasoning mechanism allows for verifying structural equivalence in worst-case deterministic exponential time, in contrast to the known algorithms, which are double exponential. Overall, this study provides some of the fundamental building blocks for developing articulated inference systems that support tasks involving the intelligent navigation of large document databases such as the World Wide Web.
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Calvanese, D., De Giacomo, G., Lenzerini, M. (1997). Representing and reasoning on SGML documents. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1997. Lecture Notes in Computer Science, vol 1325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63614-5_52
DOI: https://doi.org/10.1007/3-540-63614-5_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63614-4
Online ISBN: 978-3-540-69612-4
eBook Packages: Springer Book Archive