Simple Transliteration for CLIR

  • Sauparna Palchowdhury
  • Prasenjit Majumder
Conference paper

DOI: 10.1007/978-3-642-40087-2_23

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7536)
Cite this paper as:
Palchowdhury S., Majumder P. (2013) Simple Transliteration for CLIR. In: Majumder P., Mitra M., Bhattacharyya P., Subramaniam L.V., Contractor D., Rosso P. (eds) Multilingual Information Access in South Asian Languages. Lecture Notes in Computer Science, vol 7536. Springer, Berlin, Heidelberg

Abstract

This is an experiment in cross-lingual information retrieval for Indian languages, in a resource-poor situation. We use a simple grapheme-to-grapheme transliteration technique to transliterate parallel query-text between three morphologically similar Indian languages and compare the cross-lingual and mono-lingual performance. Where a state of the art system like the Google Translation tool performs roughly in the range of 60-90%, our transliteration technique achieves 20-60% of the mono-lingual performance. Though the figures are not impressive, we argue that in situations where linguistic resources are scarce, to the point of being non-existent, this can be a starting point of engineering retrieval effectiveness.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Sauparna Palchowdhury
    • 1
  • Prasenjit Majumder
    • 2
  1. 1.CVPR UnitIndian Statistical InstituteKolkataIndia
  2. 2.Computer Science & EngineeringDhirubhai Ambani Institute of Information and Communication TechnologyIndia

Personalised recommendations