Advertisement

Quality Estimation of MT-Engine Output Using Language Models for Post-editing and Their Comparative Study

  • Kuldeep Kumar Yogi
  • Nishith Joshi
  • Chandra Kumar Jha
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 340)

Abstract

Machine Translation (MT) systems do not have real-world knowledge or contextual awareness. MT errors are possible at any level: lexical, grammatical, syntactic, etc., MT systems give 10–70 % accurate output, so human post-editing(HPE) is required for final output. But HPE is very expensive and slow, if we can filter out good translations out of all translations, those can make correct via miner edits then our HPE would be fast and less expensive. We can estimate good quality of a sentence using language model (LM). There are different LMs available. We showed in our experiment that Kneser-Ney smoothing LM is the right choice for measuring MT-Engine-output’s quality for the post-editing.

Keywords

Machine translation Smoothing Postediting Quality-estimation Good translation MT-engine 

References

  1. 1.
    Lagarda, A.-L., Alabau, V., Casacuberta, F., Silva, R., Diaz-de-Liano, E.: Statistical post-editing of a rule-based machine translation system. In: Proceedings of NAACL HLT 2009, ACL, pp. 217–220 (2009)Google Scholar
  2. 2.
    NAACL (2012) Shared task: quality estimation. In: Seventh Workshop on Statistical Machine Translation, Montreal, Quebec, Canada, 7–8 June 2012Google Scholar
  3. 3.
    Knight, K., Chander, I.: Automated post-editing of documents. In: Proceedings of the Twelfth National (1994)Google Scholar
  4. 4.
    Simard, M., Goutte, C., Isabelle, P.: Statistical phrase-based post-editing. In: Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, Rochester, New York, pp. 508–515. Association for Computational Linguistics, April 2007Google Scholar
  5. 5.
    Allen, J.: Post-editing. In: Harold, S. (ed.) Benjamins Translation Library, p. 35, (2003)Google Scholar
  6. 6.
    Suzuki, H.: Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation. In: Proceedings of the Machine Translation Summit XIII, pp. 156-163 (2011)Google Scholar
  7. 7.
    Bahl, L.R., Jelinek, F., Mercer, L.R.: A maximum likelihood approach to continuous speech recognition. IEEE Trans. Patt. Anal. Mach. Intell. 5, 179–190 (1983)CrossRefGoogle Scholar
  8. 8.
    Brown, P.F., Cocke, J., Della Pietra, S.A., Della Pietra, V.J., Jelinek, F., Lafferty, J.D., Mercer, R.L., Roossin,P.S.: A statistical approach to machine translation. Comput. Linguist. 16, 79–85 (1990)Google Scholar
  9. 9.
    Dugast, L., Senellart, J., Koehn, P.: Statistical post-editing on Systran’s rule-based translation system. In: WMT, pp. 220–223 (2007)Google Scholar
  10. 10.
    Katz, S.M.: Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans. Acoust. Speech Signal Process. (ASSP) 35(3), 400–401 (1987)Google Scholar
  11. 11.
    Kneser, R., Ney, H.: Improved backing-off for m-gram language modeling. In: International Conference on Acoustics, Speech and Signal Processing, pp. 181–184 (1995)Google Scholar
  12. 12.
    de Ilarraza, A.D., Labaka, G., Sarasola, K.: Statistical postediting: a valuable method in domain adaptation of RBMT systems for less-resourced languages. In: MATMT, pp. 35–40 (2008)Google Scholar
  13. 13.
    Dugast, L., Senellart, J., Koehn, P.: Statistical post editing and dictionary extraction: Systran/Edinburgh submissions for ACL-WMT2009. In: WMT, pp. 110–114 (2009)Google Scholar
  14. 14.
    Chen, S.F., Goodman, J.T.: An empirical study of smoothing techniques for language modeling. Comput. Speech Lang. 13(4), 359–393 (1999)CrossRefGoogle Scholar
  15. 15.
    Allen, J., Hogan, C.: Toward the development of a post editing module for raw machine translation output: a controlled language perspective. In: CLAW, pp. 62–71 (2000)Google Scholar
  16. 16.
    Béchara, H., Ma, Y., van Genabith, J.: Statistical post-editing for a statistical MT system. In: MT Summit XIII, pp. 308–315 (2011)Google Scholar

Copyright information

© Springer India 2015

Authors and Affiliations

  • Kuldeep Kumar Yogi
    • 1
  • Nishith Joshi
    • 1
  • Chandra Kumar Jha
    • 1
  1. 1.Banasthali UniversityRajasthanIndia

Personalised recommendations