Query and Document Translation for English-Indonesian Cross Language IR

  • Herika Hayurani
  • Syandra Sari
  • Mirna Adriani
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4730)

Abstract

We report on our participation in the Indonesian-English ad hoc bilingual task of the 2006 Cross-Language Evaluation Forum (CLEF). This year we compare several language resources for translating Indonesian queries into English. We used several machine-readable dictionaries to perform the translation, and we also used two machine translation techniques to translate the Indonesian queries. In addition to translating an Indonesian query set into English, we translated English documents into Indonesian using the machine-readable dictionaries and a commercial machine translation tool. The results show that translating the queries is more effective than translating the documents. Combining several dictionaries produced better results than using only one dictionary. However, the query expansion that we applied to the queries translated with the dictionaries reduced their retrieval effectiveness.
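As a rough illustration of dictionary-based query translation with several dictionaries combined, the sketch below takes the union of the translations each dictionary offers for a query term. The abstract does not say how the dictionaries were combined, so the union strategy, the function name `translate_query`, and the toy dictionaries `dict_a` and `dict_b` are all assumptions made for this example and not the authors' implementation.

```python
from typing import Dict, List

# A bilingual dictionary maps a source term to its candidate translations.
Dictionary = Dict[str, List[str]]


def translate_query(query: str, dictionaries: List[Dictionary]) -> List[str]:
    """Translate a query term by term (hypothetical helper).

    For each term, collect the translations found in every dictionary,
    dropping duplicates while keeping first-seen order. A term that no
    dictionary covers is kept untranslated, which is a common fallback
    for proper names.
    """
    translated: List[str] = []
    for term in query.lower().split():
        candidates: List[str] = []
        for dictionary in dictionaries:
            for translation in dictionary.get(term, []):
                if translation not in candidates:
                    candidates.append(translation)
        translated.extend(candidates if candidates else [term])
    return translated


# Toy Indonesian-English dictionaries, invented for illustration only.
dict_a: Dictionary = {"gempa": ["earthquake"], "bumi": ["earth"]}
dict_b: Dictionary = {"gempa": ["quake", "earthquake"], "korban": ["victim"]}

print(translate_query("korban gempa Jakarta", [dict_a, dict_b]))
# prints ['victim', 'earthquake', 'quake', 'jakarta']
```

With only `dict_a`, the term "korban" would pass through untranslated; adding `dict_b` both covers it and contributes an extra candidate for "gempa", which is one plausible reason combined dictionaries could help retrieval.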

Keywords

Machine Translation; Retrieval Performance; Query Expansion; Parallel Corpus; English Document
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Herika Hayurani (1)
  • Syandra Sari (1)
  • Mirna Adriani (1)
  1. Faculty of Computer Science, University of Indonesia, Depok 16424, Indonesia
