Abstract
This paper reports part of the work carried out in the ESPRIT II SIMPR project, which aims to research and develop methods for the effective retrieval and manipulation of textual information using automatic natural language processing techniques. It describes the language processing techniques developed in SIMPR and how they have been used to build a method for retrieving and matching textual information. Input texts are analysed automatically at the morpho-syntactic level, and from this analysis tree-structured internal representations are generated and matched to perform retrieval.
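To make the idea of matching tree-structured representations concrete, the following is a minimal sketch only. It is not the SIMPR matching method, which the abstract does not specify. The `Node` class, the `head_modifier_pairs` and `match_score` functions, and the edge-overlap scoring rule are all assumptions introduced for illustration, and the example trees are built by hand rather than produced by a morpho-syntactic analyser.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple


@dataclass
class Node:
    """One node of a hypothetical tree-structured text representation."""
    lemma: str  # base form, as a morphological analysis might supply
    children: List["Node"] = field(default_factory=list)


def head_modifier_pairs(node: Node) -> Set[Tuple[str, str]]:
    """Collect every (parent lemma, child lemma) edge in the tree."""
    pairs: Set[Tuple[str, str]] = set()
    for child in node.children:
        pairs.add((node.lemma, child.lemma))
        pairs |= head_modifier_pairs(child)
    return pairs


def match_score(query: Node, text: Node) -> float:
    """Fraction of the query's edges that also occur in the text tree.

    This scoring rule is an illustrative assumption, not SIMPR's.
    """
    query_pairs = head_modifier_pairs(query)
    if not query_pairs:
        return 0.0
    return len(query_pairs & head_modifier_pairs(text)) / len(query_pairs)


# Hand-built tree for "retrieval of textual information"
query = Node("retrieval", [Node("information", [Node("textual")])])

# Hand-built tree for "effective retrieval of textual information"
text = Node(
    "retrieval",
    [Node("effective"), Node("information", [Node("textual")])],
)

print(match_score(query, text))  # every query edge is found in the text
print(match_score(text, query))  # the "effective" edge is missing from the query
```

Comparing edges rather than isolated words means that two texts match only when the same words stand in the same structural relation, which is the general motivation for using syntactic structure in retrieval.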
© 1990 ECSC, EEC, EAEC, Brussels and Luxembourg
About this paper
Cite this paper
Smeaton, A.F., Voutilainen, A., Sheridan, P. (1990). The Application of Morpho-Syntactic Language Processing to Effective Text Retrieval. In: ESPRIT ’90. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-0705-8_44
DOI: https://doi.org/10.1007/978-94-009-0705-8_44
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-6803-1
Online ISBN: 978-94-009-0705-8