Abstract
The paper argues for the use of general and intuitive knowledge representation languages for indexing the content of Web documents and representing knowledge within them. We believe these languages have advantages over metadata languages based on the Extensible Mark-up Language (XML). Indeed, the representation and retrieval of precise information is better supported by languages designed to represent semantic content and support logical inference, and the readability of such a language eases its exploitation, presentation and direct insertion within a document. To further ease the representation process, we propose techniques allowing users to leave some knowledge terms undeclared. We illustrate these ideas with WebKB1, a precision-oriented information retrieval/annotation tool, and show how lexical, structural and knowledge-based techniques may be combined to retrieve or generate knowledge or Web documents. Finally, to overcome the scalability problems of storing knowledge within Web documents, we propose some ideas for scalable and cooperatively built knowledge repositories.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Decker, S., Fensel, D.: Ontobroker: Ontology Based Access to Distributed and Semi-Structured Information. In: Meersman, R. (eds.), Semantic Issues in Multimedia Systems, Kluwer Academic Publisher, Boston (1999)
Ellis, G.: Managing Complex Objects. Ph.D thesis, Queensland University, Australia (1995)
Haemmerlé, O.: CoGITo: une plate-forme de développement de logiciels sur les graphes conceptuels. Ph.D thesis, Montpellier II University, France (1995)
Martin, Ph.: Using the WordNet Concept Catalog and a Relation Hierarchy for Knowledge Acquisition. In: Peirce’95, 4th Peirce workshop, California (1995) http://www.inria.fr/acacia/Publications/1995/peirce95phm.ps.Z
Martin, Ph.: Exploitation de graphes conceptuels et de documents structurés et hypertextes pour I’acquisition de connaissances et la recherche d’informations, PhD Thesis, University of Nice — Sophia Antipolis, France (1996)
Martin, Ph., Eklund, P.: WWW Indexation and Document Navigation Using Conceptual Structures. In: ICIPS’98, 2nd IEEE International Conference on Intelligent Processing Systems, IEEE Press (1998) 217–221
Martin, Ph., Eklund, P.: Embedding Knowledge in Web Documents. In: WWW8, 8th International World Wide Web Conference, Toronto, Canada (1999)
Nanard, J., Nanard, M., Massotte, A-M., Djemaa, A., Joubert, A., Betaille, H., Chauch, J.: Integrating Knowledge-based Hypertext and Database for Task-oriented Access to Documents. In: DEXA’93, LNCS Vol. 720, Springer-Verlag, Prague (1993) 721–732
Puder, A., Romer, K.: Generic Trading Service in Telecommunication Platforms. In: ICCS’97, 5th International Conference on Conceptual Structures, LNAI 1257 Springer Verlag (1997) 551–565.
Sowa, J.F.: Conceptual Structures: Information Processing in Mind and Machine. Addison-Wesley, Reading, MA (1984)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martin, P., Eklund, P. (1999). Embedding Knowledge in Web Documents: CGs versus XML-based Metadata Languages. In: Tepfenhart, W.M., Cyre, W. (eds) Conceptual Structures: Standards and Practices. ICCS 1999. Lecture Notes in Computer Science(), vol 1640. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48659-3_15
Download citation
DOI: https://doi.org/10.1007/3-540-48659-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66223-5
Online ISBN: 978-3-540-48659-6
eBook Packages: Springer Book Archive