Abstract
We are experimenting with the representation of a DTD and associated documents (i.e., documents conformant to the DTD) in a knowledge representation (KR) system, in order to provide more sophisticated query and retrieval from TEI documents than current systems provide. We are using CLASSIC, a frame-based representation system developed at AT&T Bell Laboratories. Like many KR systems, CLASSIC enables the definition of structured concepts/frames, their organization into taxonomies, the creation and manipulation of individual instances of such concepts, and inference such as inheritance, relation transitivity, inverses, etc. In addition, CLASSIC provides for the key inferences of subsumption and classification. By representing a document as an individual instance of a hierarchy of concepts derived from the DTD, and by allowing the creation of additional user-defined concepts and relations, sophisticated query and retrieval operations can be performed. This paper describes CLASSIC and the formalism of description logic that underlies it, and demonstrates how it can be used for enhanced retrieval from richly encoded documents.
Similar content being viewed by others
References
Artale, A., E. Franconi, N. Guarino and L. Pazzi. “Part-Whole Relations in Object-Centered Systems: An Overview”. Data and Knowledge Engineering Journal, 20 (1996), 347–383. Elsevier.
Blake, G.E., M. Consens, I.J. Davis, P. Kilpelainen, E. Kuikka, P-A. Larson, T. Snider and F.W. Tompa. Text/Relational Database Management Systems: Overview and Proposed SQL Extensions. Available at http://solo.uwaterloo.ca/trdbms/, 1997.
Borgida, A. “On the Relative Expressiveness of Description Logics and Predicate Logics”. To appear, Artificial Intelligence Journal. Available at ftp://cs.rutgers.edu/pub/borgida/dl-vsfol.dvi.Z, 1998.
Brachman, R. “What Is-a Is and Isn't”. IEEE Computer, October (1983), 30–36.
Brachman, R. and J. Schmolze. “An Overview of the KL-ONE Knowledge Representation System”. Cognitive Science, 9(2) (1985), 171–216.
Brachman, R, A. Borgida, D. McGuinness and L. Resnick. “The CLASSIC Knowledge Representation System (1989)”. Proceedings of the 11th International Joint Conference on Artificial Intelligence (IJCAI), Morgan-Kaufman, 1989.
Chesnutt, D. “The Model Editions Partnership”. D-Lib Magazine. November (1995). Available at http://www.dlib.org/.
Flanders, J. The Brown University Womens Writers Project. http://www.wwp.brown.edu/, 1998.
Harie, S., N. Ide, J. Le Maitre, E. Murisasco and J. Véronis. “SgmlQL — An SGML Query Language”. Proceedings of SGML'96, 127 (1996).
Ide, N. “Corpus Encoding Standard: SGML Guidelines for Encoding Linguistic Corpora”. Proceedings of the First International Language Resources and Evaluation Conference (LREC), Granada, Spain, 1998a, pp. 463–470. CES Documentation and DTDs available at http://www.cs.vassar.edu/CES/.
Ide, N. “Encoding Linguistic Corpora”. Proceedings of the Sixth Workshop on Very Large Corpora (WVLC6), Montréal, Canada, 1998b, pp. 9–17.
Minsky, M. “A Framework for Representing Knowledge”. Mind Design. MIT Press, 1981, pp. 95–128.
Patel-Schneider, P. and B. Swartout. Description Logic Knowledge Representation System Specification. Fromthe KRSS group of the ARPA Knowledge Sharing Effort, available at http://dl.kr.org/dl/, 1993.
Simons, G. “Using architectural forms to map TEI data into an object-oriented database”. Proceedings of TEI-10, 1997.
Welty, C. “Intelligent Assistance for Navigating the Web”. Proceedings of the 1996 Florida AI Research Symposium. AAAI Press, 1996.
Welty, C. “The Ontological Nature of Subject Taxonomies”. Proceedings of the 1998 International Conference on Formal Ontology in Information Systems. IOS Press, “Frontiers in Artificial Intelligence and Applications” series, 1998.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Welty, C., Ide, N. Using the Right Tools: Enhancing Retrieval from Marked-up Documents. Computers and the Humanities 33, 59–84 (1999). https://doi.org/10.1023/A:1001800717376
Issue Date:
DOI: https://doi.org/10.1023/A:1001800717376