Crosslingual Interrogation of Multilingual Catalogs
In this paper, we describe a crosslingual Information Retrieval System (IRS), which makes the interrogation of multilingual databases possible. Indeed, the CEA needs to be able to process an important amount of multilingual databases and documents, and so we had to adapt the IRS we use, SPIRIT (which relies on linguistical and statistical processing) to this situation. We have thus set up a crosslingual interrogation system based on the indexation of documents containing parts written in different languages and on the bilingual reformulation of the query. The latter tries all the possible translations for every significant word of the query and the documents are used as filters in case of uncertainty or ambiguity. The answers to a query are given in the form of a list of classes of documents ranked according to their relevance. This paper describes the application of these techniques to crosslingual access to catalogs and bibliographic databases.
KeywordsRelevant Document Machine Translation Dependency Relation Bibliographic Database Information Retrieval System
Unable to display preview. Download preview PDF.
- 2.Automatic text processing. The transformation, analysis and retrieval of information by computer. G. Salton (1989) Addison-Wesley publishing company 1989. 530pGoogle Scholar
- 3."Fully automatic cross-language document retrieval using latent semantic indexing." T.K. Landauer, and M.L. Littman In Proceedings of the Sixth Annual Conference of the UW Centre for the New Oxford English Dictionary and Text Research. Waterloo Ontario 1990.Google Scholar
- 4.EMIR Final report, Christian Fluhr, Patrick Mordini, André Moulin, Erwin Stegentritt, ESPRIT project 5312, DG III, Commission of the European Union, october 1994Google Scholar
- 5.Multilingual database and crosslingual interrogation in a real Internet application, C. Fluhr, D. Schmit, F. Elkateb, K. Gurtner, Workshop "Cross-language Text and Speech retrieval" dans "AAAI 1997 Spring Symposium Series", 24–26 mars 1997, Stanford University, Californie.Google Scholar
- 6.SPIRIT-W3, A distributed crosslingual indexing and retrieval engine, C. Fluhr, D. Schmit, Ph. Ortet, F. Elkateb, K. Gurtner, INET’97, Kuala Lumpur, Juin 1997.Google Scholar
- 7.EMIR at the crosslingual track of TREC-6, F. Elkateb and C. Fluhr, TREC-6 Conference, 19–21 Novembre 1997, Gaithersburg, MarylandGoogle Scholar
- 8.Cross-language information retrieval, Grefenstette, G. and al., Boston: Kluwer Academic Publishers, 1998.Google Scholar
- 9.EMIR at the crosslingual track of TREC-7, F. Bisson, J. Charron, C. Fluhr, D. Schmit, TREC-6 Conference, 9–11 Novembre 1998, Gaithersburg, MarylandGoogle Scholar
- 10.Parallel Text Processing, Jean Véronis scientific editor, Kluwer Academic Publishers, Text Speech and Language technology series, to be published in 1999.Google Scholar