Comparison of Statistical Approaches for Tamil to English Translation

  • R. Rajkiran
  • S. Prashanth
  • K. Amarnath Keshav
  • Sridhar Rajeswari
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 32)


This work proposes a machine translation system from Tamil to English using a statistical approach. Statistical machine translation (SMT) is a machine translation paradigm in which translations are generated from statistical models whose parameters are derived from the analysis of bilingual text corpora. It is the most widely used machine translation paradigm because of its tradeoff between efficiency and implementation feasibility and its partial language independence. In the syntax-based approach, a phrase table is created that identifies the most probable English translation of each Tamil phrase in the input sentence. In the hierarchical phrase-based approach, a rule table is used to reduce the input Tamil sentence to the output English sentence. We evaluated the two approaches while varying parameters such as corpus size and the n-gram order of the language model, and achieved a BLEU score of 0.26.
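
To make the reported metric concrete, the following is a minimal sketch of how a BLEU score can be computed for a single candidate translation against a single reference. It is not the authors' evaluation code: the function names `ngram_counts` and `sentence_bleu`, the whitespace tokenisation, the uniform weights over 1- to 4-grams and the absence of smoothing are all assumptions made for illustration. Published BLEU figures are normally computed over a whole test corpus, so this sentence-level version only shows the mechanics.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a token list (hypothetical helper)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Unsmoothed single-reference BLEU with uniform n-gram weights.

    Assumption: both inputs are plain strings tokenised on whitespace.
    """
    cand = candidate.split()
    ref = reference.split()
    if not cand:
        return 0.0

    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        cand_counts = ngram_counts(cand, n)
        ref_counts = ngram_counts(ref, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(count, ref_counts[gram])
                      for gram, count in cand_counts.items())
        if total == 0 or clipped == 0:
            # Without smoothing, one zero precision makes the score zero.
            return 0.0
        log_precision_sum += math.log(clipped / total) / max_n

    # Brevity penalty: penalise candidates shorter than the reference.
    if len(cand) > len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1.0 - len(ref) / len(cand))
    return brevity_penalty * math.exp(log_precision_sum)


if __name__ == "__main__":
    print(sentence_bleu("the cat sat on the mat", "the cat sat on the mat"))
    print(sentence_bleu("the the the", "the cat", max_n=1))
```

A candidate identical to its reference scores 1, while a candidate that repeats one matching word is held down by the clipped counts. A score such as 0.26 therefore indicates partial n-gram overlap with the reference translations.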


Statistical machine translation · Syntax based · Hierarchical phrase based · BLEU score



Copyright information

© Springer India 2015

Authors and Affiliations

  • R. Rajkiran (1)
  • S. Prashanth (1)
  • K. Amarnath Keshav (1)
  • Sridhar Rajeswari (1)
  1. Department of Computer Science and Engineering, Anna University, College of Engineering, Guindy, Chennai, India
