Computers and the Humanities, Volume 25, Issue 2–3, pp 103–114

Linguistically based functions in information retrieval: PADOK and the German Patent Information System

  • Jürgen Krause
  • Christa Womser-Hacker
Article

Abstract

This paper reports on methodological considerations and the results of the Information Retrieval (IR) project PADOK I and II. PADOK has been carried out by the Linguistic Information Science Group of the University of Regensburg (LIR) since November 1984 and has been sponsored by the German Ministry for Research and Technology. The long-term objective is to integrate artificial intelligence topics with the methods of information retrieval research without neglecting traditional IR methodology. In PADOK we consider a type of mass data IR system which indexes its documents rather shallowly (freetext or morphological components) and adds an intelligent information retrieval component to this kernel system. So far, on the basis of two large-scale retrieval tests of the German Patent Information System, we have obtained results that show how the linguistically based functions of an indexing system contribute to its performance and indicate the most reasonable basic content analysis program for a German Patent Information System. This paper focuses on the general principles and aims of PADOK I and PADOK II and on the statistical evaluation of the retrieval tests.

Christa Womser-Hacker has a Ph.D. in Linguistic Information Science. From 1985 until 1990 she was involved in several LIR projects concerning text processing, evaluation of the German Patent Information System, man-machine interaction, and intelligent interfaces for databases. Since May 1990 she has been an LIR staff member. She is interested in information retrieval, (statistical) evaluation methods of man-machine interaction, and intelligent interfaces. She has published Der PADOK-Retrievaltest (1989) and “Die statistische Auswertung des Retrievaltests” (1990).

Key Words

information retrieval, intelligent information retrieval, evaluation, mass data, patent information system, statistical measurement, indexing system, protocol analysis

Copyright information

© Kluwer Academic Publishers 1991

Authors and Affiliations

  • Jürgen Krause (1)
  • Christa Womser-Hacker (1)
  1. Linguistische Informationswissenschaft, Universität Regensburg, Regensburg, Germany
